0% found this document useful (0 votes)

63 views13 pages

Data Warehouse and Data Mining

The document discusses data warehouses, including what they are, their key characteristics, and advantages. Specifically: 1) A data warehouse is a central repository that integrates historical and current data from across an organization for analysis and decision making. Unlike operational databases, it focuses on trends, patterns and relationships. 2) Key characteristics of data warehouses include being subject-oriented, integrated, time-variant, non-volatile, ensuring data quality, and being accessible and flexible. 3) Advantages include improved decision making, enhanced operational efficiency, competitive advantages from deeper insights, and increased revenue and profitability. Data warehouses unlock the power of historical and integrated data to drive business success.

Uploaded by

hkushwaha1011

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views13 pages

Data Warehouse and Data Mining

Uploaded by

hkushwaha1011

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Data Warehouse and Data Mining (102)

1: What is Data Warehouse? Discuss the characteristics of data warehouse.

Data Warehouse: A Treasure Chest of Insights

Imagine a vast and organized storehouse, brimming with information about your entire
organization – customer demographics, sales figures, product trends, and so much more. This
isn't just a dusty archive; it's a vibrant hub where data transforms into actionable insights, guiding
strategic decisions and propelling business growth. This, in essence, is the power of a Data
Warehouse.
What is a Data Warehouse?
A Data Warehouse is a central repository of integrated historical and current data, meticulously
curated from diverse operational systems across an organization. It's not just a bigger, fancier
database; it's a purpose-built system designed for analysis and decision support. Unlike
transaction-oriented operational databases, data warehouses prioritize historical trends,
patterns, and relationships, enabling users to explore questions and gain deeper understanding.
Characteristics of a Data Warehouse:
1. Subject-Oriented: Data warehouses are not merely data dumps; they are organized
around specific business subjects like marketing, finance, or customer service. This
subject-oriented structure aligns with the needs of decision-makers, making it easier to
retrieve and analyze relevant data.
2. Integrated: Data from disparate sources, often with different formats and structures, is
meticulously integrated into a single, unified schema. This ensures consistency and
facilitates cross-functional analysis, revealing hidden connections and patterns that
individual systems might miss.
3. Time-Variant: Data warehouses hold historical data, allowing users to track trends,
analyze seasonality, and compare year-on-year performance. This temporal dimension
enables informed decision-making based on past experiences and future projections.
4. Non-Volatile: Unlike operational databases constantly updated with real-time
transactions, data warehouses are relatively stable. Once loaded, data undergoes minimal
updates, ensuring reliable historical analysis without the constant churn of operational
systems.
5. Data Quality: Data in a warehouse undergoes rigorous cleaning and transformation
processes to ensure accuracy, consistency, and completeness. This ensures reliable
insights and avoids misleading conclusions based on erroneous or incomplete data.
6. Accessibility: Data warehouses cater to diverse users with varying technical expertise.
Intuitive user interfaces, data visualization tools, and reporting capabilities make it easy
for business analysts, managers, and even executives to access and interpret insights.
7. Flexibility: Data warehouses are designed to adapt to evolving business needs. They can
accommodate new data sources, integrate with advanced analytics tools, and support
diverse data exploration methods to ensure long-term value and adaptability.
Benefits of a Data Warehouse:
• Improved Decision-Making: Data warehouses provide a single source of truth for
informed decision-making, leading to better resource allocation, strategic planning, and
product development.
• Enhanced Operational Efficiency: By identifying trends and inefficiencies, data
warehouses enable process optimization, cost reduction, and improved customer
experience.
• Competitive Advantage: Deeper insights into market trends, customer behavior, and
competitor strategies can empower businesses to stay ahead of the curve and gain a
competitive edge.
• Increased Revenue and Profitability: Data-driven insights can lead to targeted marketing
campaigns, improved customer segmentation, and more effective pricing strategies,
ultimately boosting revenue and profitability.
Conclusion:
In today's data-driven world, a Data Warehouse is not just a luxury; it's a strategic imperative. By
unlocking the power of historical and integrated data, businesses can gain invaluable insights,
make informed decisions, and navigate the ever-changing market landscape with confidence. The
characteristics discussed above ensure that this "treasure chest of insights" remains valuable,
accessible, and adaptable, driving success in the years to come.

2. (a) How operational and informational data differ to each other?

(b) Write the advantages of data warehousing.

(a) Operational vs. Informational Data:

Both operational and informational data are vital for organizations, but they serve distinct
purposes and have key differences:
Operational Data:
• Focus: Supports the day-to-day running of an organization.
• Examples: Transaction records, customer orders, inventory levels, machine sensor data.
• Characteristics:
o High volume and velocity (frequent updates).
o Volatile and perishable (may lose relevance quickly).
o Granular (detailed, individual level).
o Read-write access required for updates.
o Stored in operational systems (e.g., ERP, CRM) for real-time processing.
Informational Data:
• Focus: Supports decision-making and strategic planning.
• Examples: Trends, reports, analysis results, historical data.
• Characteristics:
o Lower volume and velocity (periodic updates).
o Stable and durable (retains value over time).
o Aggregated (summarized and grouped).
o Read-only access for analysis.
o Stored in data warehouses for historical analysis and reporting.
Key Differences:

Feature Operational Data Informational Data

Purpose Run daily operations Inform decision-making

High volume & velocity, Lower volume & velocity, stable,

Characteristics
volatile, granular aggregated

Access Read-write Read-only

Storage Operational systems Data warehouses

Understanding the differences is crucial:

• Operational data drives daily activities, ensuring smooth, efficient processes.
• Informational data drives insights and strategy, enabling informed decision-making and
optimizing performance.
(b) Advantages of Data Warehousing:
A data warehouse is a central repository of historical and integrated data from various
operational systems. Storing and analyzing data in a dedicated environment offers several
advantages:
1. Improved Decision-Making:
• Unified data access: Data from diverse sources is consolidated and standardized,
providing a single source of truth for analysis.
• Historical analysis: Historical data trends and patterns can be identified, enabling better
predictions and informed future decisions.
• Drill-down capabilities: Users can analyze data at different levels of detail, from overall
trends to specific customer transactions.
2. Enhanced Operational Efficiency:
• Data quality improvement: Data cleansing and transformation in the warehouse improve
data quality across the organization.
• Reduced reporting time: Pre-computed data and reports in the warehouse reduce the
time needed to generate reports for operational insights.
• Resource optimization: Analyzing data in a centralized location frees up resources in
operational systems.
3. Increased Business Intelligence:
• Data discovery: Unforeseen patterns and relationships can be discovered through data
mining and analytics within the warehouse.
• Improved customer understanding: Customer behavior and preferences can be analyzed
to personalize marketing and enhance customer relationships.
• Competitive advantage: Data-driven insights can guide product development, market
strategies, and optimize resource allocation.
4. Scalability and Flexibility:
• Modular architecture: Data warehouses can be easily expanded to accommodate future
data growth and new data sources.
• Flexible access: Different user groups can access relevant data and reports based on their
roles and permissions.
• Integration with other tools: Data warehouses can be integrated with business
intelligence tools and dashboards for advanced visualization and analysis.
In conclusion, data warehousing offers significant advantages for organizations by providing a
central repository for historical and integrated data, enabling improved decision-making,
enhanced operational efficiency, increased business intelligence, and enhanced scalability and
flexibility.

3: Discuss the types of benefits of data warehouse.

The Multifaceted Gems of Data Warehouses: Benefits Across the Spectrum

In the age of information overload, data warehouses emerge as gleaming oases, centralizing and
organizing data to unlock a treasure trove of benefits. For businesses today, a well-implemented
data warehouse is more than just a storage solution; it's a catalyst for improved decision-making,
enhanced efficiency, and ultimately, a competitive edge. Let's delve into the rich tapestry of
benefits woven by data warehouses, examining their impact across various levels:
1. Strategic Benefits:
• Data-driven decision-making: Data warehouses transform raw data into actionable
insights, empowering leadership to make informed strategic decisions. By analyzing
historical trends, customer behavior, and market dynamics, businesses can anticipate
challenges, identify opportunities, and optimize resource allocation. Imagine a retail chain
using its data warehouse to pinpoint profitable product lines, tailor marketing campaigns,
and predict seasonal demand fluctuations – all leading to strategic growth.
• Improved business intelligence: Data warehouses enable organizations to build
comprehensive BI dashboards and reports, providing a crystal-clear view of critical
performance indicators (KPIs). Sales performance, marketing ROI, customer churn rates,
and operational efficiency become readily available, allowing leadership to track
progress, measure the impact of initiatives, and course-correct strategies as needed.
Imagine a healthcare provider analyzing trends in patient demographics, treatment
outcomes, and resource utilization to identify areas for improvement and optimize
patient care.
• Enhanced risk management: Data warehouses can be valuable tools for identifying and
mitigating business risks. By analyzing patterns in operational data, financial transactions,
and customer behavior, businesses can detect potential fraud, anticipate market
downturns, and proactively address compliance issues. Imagine a financial institution
using its data warehouse to identify suspicious transactions, assess creditworthiness of
potential borrowers, and ensure regulatory compliance, minimizing financial and
reputational risks.
2. Operational Benefits:
• Streamlined data access and analysis: Data warehouses consolidate data from disparate
sources into a single repository, eliminating the need to sift through siloed systems. This
centralizes access for analysts, data scientists, and business users, saving time and effort
while simplifying data exploration and analysis. Imagine a marketing team analyzing
customer data from web traffic, social media, and point-of-sale systems within one
platform, gaining a holistic understanding of customer behavior and preferences.
• Improved data quality and consistency: Data warehouses often incorporate data
cleansing and transformation processes, ensuring high-quality, consistent data for
analysis. This eliminates data discrepancies and inaccuracies that can plague operational
systems, leading to more reliable insights and improved decision-making. Imagine a
manufacturing company using its data warehouse to ensure accurate inventory levels,
production schedules, and quality control data, reducing operational inefficiencies and
waste.
• Enhanced collaboration and communication: Data warehouses provide a common
platform for stakeholders across departments to access and analyze the same data. This
fosters cross-functional collaboration, improves communication, and aligns efforts
towards shared goals. Imagine a hospital where doctors, nurses, and administrators can
access patient data from the data warehouse, leading to better-coordinated care and
improved patient outcomes.
3. Financial Benefits:
• Reduced data storage and management costs: Data warehouses can alleviate the burden
of managing data sprawl across multiple systems. By consolidating data into a single
platform, businesses can reduce hardware and software costs, streamline data
maintenance, and optimize IT resources. Imagine a large media company consolidating
data from various platforms into a central data warehouse, leading to significant savings
in storage and data management costs.
• Improved operational efficiency: Data-driven insights from the warehouse can empower
businesses to optimize processes, reduce waste, and identify areas for cost reduction.
Analyzing production processes, supply chains, and marketing campaigns can lead to
operational efficiencies, resource optimization, and increased profitability. Imagine a
logistics company using its data warehouse to optimize delivery routes, reduce fuel
consumption, and improve service levels, leading to cost savings and improved customer
satisfaction.
• Enhanced competitive advantage: The ability to harness data insights for strategic
decision-making, operational efficiency, and risk management gives businesses a
significant competitive edge. In a data-driven world, data warehouses provide the
ammunition for businesses to outmaneuver rivals, develop innovative products and
services, and stay ahead of the curve. Imagine a startup using its data warehouse to
personalize customer experiences, identify new market opportunities, and adapt to
changing market dynamics, ensuring sustainable growth and market leadership.
In conclusion, data warehouses are not mere data repositories; they are veritable treasure troves
of opportunities. Their benefits span the strategic, operational, and financial spectrum,
empowering businesses to make informed decisions, improve efficiency, mitigate risks, and
ultimately, thrive in the dynamic world of data. As you prepare for your university exam,
remember that understanding the multifaceted benefits of data warehouses will not only give
you a strong foundation in data analytics but also equip you to navigate the ever-evolving world
of business in the information age.

4. (a) Write the various components of data warehouse architecture and its purpose.
(b) What is architectural difference between two- tier and multi-tiered data warehouse?

(a) Components of Data Warehouse Architecture and their Purpose:

A data warehouse is a central repository of historical and integrated data from various
operational systems, designed to support decision-making and analysis. Its architecture consists
of several key components, each serving a specific purpose:
1. Source Layer:
• Purpose: Collects data from various operational systems like CRM, ERP, and financial
systems.
• Components: Extractors, connectors, staging area.
• Activities: Data extraction, transformation, and loading (ETL) processes.
2. Data Integration and Transformation Layer:
• Purpose: Cleanses, transforms, and integrates data from diverse sources into a consistent
format.
• Components: Data cleansing tools, transformation engines, data quality tools.
• Activities: Data validation, standardization, dimension modeling, data mapping.
3. Data Warehouse Layer:
• Purpose: Stores integrated and transformed data for historical analysis and reporting.
• Components: Data warehouse database (typically relational or columnar), data marts
(subset of data for specific departments).
• Activities: Data storage, indexing, partitioning, optimization.
4. Data Access and Analysis Layer:
• Purpose: Provides access to data for analysis and reporting.
• Components: Business intelligence (BI) tools, reporting tools, data visualization tools.
• Activities: Querying, data analysis, report generation, dashboards.
5. Metadata Layer:
• Purpose: Provides information about data warehouse objects, their relationships, and
definitions.
• Components: Metadata repository, data dictionary.
• Activities: Data lineage tracking, documentation, managing data access and security.
6. Management Layer:
• Purpose: Oversees the entire data warehouse lifecycle, including performance
monitoring, security, and maintenance.
• Components: Data warehouse management tools, monitoring tools, security tools.
• Activities: Performance optimization, backup and recovery, user management, audit
trails.
(b) Architectural Differences between Two-Tier and Multi-Tier Data Warehouses:
1. Two-Tier Architecture:
• Simplest architecture with only two layers: source and data warehouse/data mart.
• Data is extracted, transformed, and loaded directly into the data warehouse.
• Suitable for small data volumes and straightforward analysis needs.
• Advantages: Simple and cost-effective, easy to implement and manage.
• Disadvantages: Limited scalability, performance bottlenecks with large data volumes,
increased complexity for complex transformations.
2. Multi-Tier Architecture:
• More complex architecture with additional layers between source and data warehouse.
• Data passes through multiple stages of transformation and integration before reaching
the warehouse.
• Suitable for large data volumes and complex analysis needs.
• Advantages: Scalable and flexible, improves performance and data quality, simplifies
complex transformations.
• Disadvantages: More complex and expensive to implement and manage, requires skilled
personnel.
Decision Factors:
• Data Volume: Two-tier for small volumes, multi-tier for large volumes.
• Complexity of Analysis: Two-tier for simple analysis, multi-tier for complex analysis.
• Budget and Resources: Two-tier for lower budgets, multi-tier requires more resources.
• Scalability and Performance Needs: Two-tier less scalable, multi-tier offers better
performance.
Choosing the right architecture depends on the specific needs and constraints of the
organization.
Additional Notes:
• Hybrid architectures combining elements of both two-tier and multi-tier are also
becoming increasingly popular.
• Cloud-based data warehouses offer flexibility and scalability for modern data warehouse
deployments.

5. (a) Differentiate between host based and master- slave processing.

(b) Explain the terms of the following:
(i) Meta data
(ii) Data mart

(a) Differentiating Host-Based and Master-Slave Processing:

Both host-based and master-slave processing are strategies for distributing workload among
multiple processors, but they differ in their control structure and efficiency.
Host-Based Processing:
• Centralized Control: A single host processor manages the entire workload.
• Task Distribution: The host breaks down the problem into smaller tasks and distributes
them to individual processors.
• Independent Processing: Each processor works on its assigned task independently, with
minimal communication with the host or other processors.
• Scalability: Limited scalability as the host becomes a bottleneck with increasing tasks and
processors.
• Suitable for: Simple problems with little inter-task dependencies.
Master-Slave Processing:
• Hierarchical Control: A designated "master" processor controls the workload and
distributes tasks to other "slave" processors.
• Task Delegation: The master assigns specific tasks to slaves, often with dependencies and
synchronized execution.
• Slave Dependence: Slaves rely on the master for instructions and data, with frequent
communication.
• Scalability: Highly scalable as additional slaves can be added to handle increased
workload.
• Suitable for: Complex problems with interdependent tasks requiring coordinated
execution.
Key Differences:

Feature Host-Based Master-Slave

Control Structure Centralized Hierarchical

Task Distribution Independent Controlled by Master

Inter-Processor
Minimal Frequent
Communication

Scalability Limited High

Simple Complex, Interdependent

Suitable for
Problems Problems

In essence, host-based processing is like having a single manager assigning tasks to individual
workers, while master-slave processing is like a team leader coordinating tasks among specialized
members.
(b) Explaining Terms:
i. Meta Data:
Meta data is "data about data." It provides information about a piece of data, such as its format,
source, creation date, and author. It helps to organize, manage, and understand data effectively.
Think of it as a label on a box describing what's inside.
• Examples: File name, file size, date modified, author name, keyword tags, data dictionary
entries.
• Benefits: Improved data organization, retrieval, and analysis, enhanced data quality and
consistency, facilitates data sharing and collaboration.
ii. Data Mart:
A data mart is a subject-specific subset of a larger data warehouse. It focuses on a particular
department, business unit, or function within an organization. Think of it as a smaller store within
a larger mall, catering to specific needs.
• Characteristics:
o Subject-oriented: Tailored to a specific area of interest.
o Integrated data: Combines data from relevant sources.
o Timely and accurate: Provides up-to-date information.
o User-friendly: Designed for easy access and analysis by specific users.
• Benefits:
o Improved decision-making within specific departments.
o Faster and easier data access and analysis.
o Reduced costs compared to maintaining a large data warehouse.
o

6. (a) What is star schema and multi-star schema ?

(b) Discuss the server functions.

(a) Star Schema and Multi-Star Schema:

Both star schema and multi-star schema are data warehouse design techniques used to model
multidimensional data for efficient querying and analysis. They differ in their approach to
dimensional hierarchies and data redundancy.
Star Schema:
• Structure: A star schema resembles a star, with a central fact table surrounded by
dimension tables. The fact table contains measurements and key columns to join with
dimension tables. Dimension tables hold descriptive attributes about the dimensions
(e.g., product category, customer demographics).
• Characteristics:
o Simple and easy to understand, making it ideal for beginners and OLAP queries.
o Denormalized to minimize joins and optimize query performance.
o May lead to data redundancy for shared dimensions across fact tables.
• Example: An e-commerce data warehouse with a "Sales" fact table (product, customer,
date, amount) and dimension tables for product, customer, and date/time.
Multi-Star Schema:
• Structure: Combines multiple star schemas, each focusing on a specific business process
or subject area. Fact tables share dimensions but are separate, minimizing redundancy.
Dimension tables can have higher levels of hierarchy and relationships.
• Characteristics:
o Reduces data redundancy compared to a single star schema.
o More complex to design and maintain due to multiple fact tables and potential
join complexity.
o Offers greater flexibility for analyzing different business processes.
• Example: A multi-star schema for an online store might have one star schema for sales,
another for marketing campaigns, and a third for customer service interactions. Each with
relevant fact and dimension tables, but sharing dimensions like product and customer
across them.
Choosing between Star and Multi-Star Schema:
• Use a star schema:
o For simple analyses involving one fact table and its dimensions.
o When query performance is critical and data redundancy is acceptable.
o For beginners or less complex data models.
• Use a multi-star schema:
o When different business processes require separate analyses with minimal
redundancy.
o For complex data models with deep hierarchies and relationships.
o When data governance and maintainability are priorities.
(b) Server Functions:
Server functions are programs executed within a database server to extend its capabilities and
perform specific tasks. They can process data, manipulate structures, and enhance security,
offering various benefits:
• Extend processing power: Offload complex calculations from the client application to the
server, improving performance and scalability.
• Reduce network traffic: Perform data transformations and aggregations on the server
before sending results to the client.
• Simplify client coding: Encapsulate complex logic within functions, reducing client-side
code complexity and maintenance.
• Enforce data integrity: Implement business rules and validation logic on the server to
ensure data consistency and accuracy.
• Improve security: Perform encryption, access control, and other security tasks within the
database environment.
Types of Server Functions:
• Scalar functions: Return single values based on input parameters.
• Aggregate functions: Perform calculations on groups of data (e.g., SUM, AVG).
• Window functions: Operate on groups of rows within a defined window (e.g., moving
averages).
• User-defined functions (UDFs): Custom functions written in specific languages (e.g., SQL,
Python) to extend server capabilities.
Considerations for Server Functions:
• Performance impact: Complex functions can strain server resources and affect query
performance.
• Security vulnerabilities: Ensure functions don't introduce exploitable vulnerabilities or
unauthorized data access.
• Maintainability: Document and test functions carefully to facilitate future maintenance
and updates.
Examples of Server Functions:
• Calculate shipping costs based on product weight and destination.
• Generate unique identifiers for newly created records.
• Validate credit card numbers before processing payments.
• Aggregate sales data by product category and region.
Server functions offer powerful capabilities to enhance data processing, security, and application
efficiency. However, careful selection, design, and security measures are crucial to avoid
performance bottlenecks and vulnerabilities.

Data Is A Collection of Facts, Such As Numbers, Words, Measurements, Observations or Just Descriptions of
No ratings yet
Data Is A Collection of Facts, Such As Numbers, Words, Measurements, Observations or Just Descriptions of
31 pages
Data Warehousing Module 1
No ratings yet
Data Warehousing Module 1
58 pages
Presentation - Data Warehouse Overview - 20250805 - 230751 - 0000
No ratings yet
Presentation - Data Warehouse Overview - 20250805 - 230751 - 0000
7 pages
Concept of Data Warehouse
No ratings yet
Concept of Data Warehouse
4 pages
Presentation - Data Warehouse Overview - 20250805 - 230751 - 0000
No ratings yet
Presentation - Data Warehouse Overview - 20250805 - 230751 - 0000
7 pages
Data Warehousing Unit 1
No ratings yet
Data Warehousing Unit 1
18 pages
Data Warehousing
No ratings yet
Data Warehousing
4 pages
Features of Data Warehousing
No ratings yet
Features of Data Warehousing
2 pages
Data Mining and Data Warehousing
No ratings yet
Data Mining and Data Warehousing
92 pages
Data Wareousing and Mining-Notes
No ratings yet
Data Wareousing and Mining-Notes
37 pages
Data Warehousing
No ratings yet
Data Warehousing
2 pages
Unit Ii
No ratings yet
Unit Ii
45 pages
Data Mining
No ratings yet
Data Mining
3 pages
Unit 1
No ratings yet
Unit 1
20 pages
KM 2
No ratings yet
KM 2
7 pages
Unit-1 Data Warehousing
No ratings yet
Unit-1 Data Warehousing
17 pages
Introduction to Data Warehousing
No ratings yet
Introduction to Data Warehousing
17 pages
Data Warehouse
No ratings yet
Data Warehouse
22 pages
KM Secb
No ratings yet
KM Secb
18 pages
Characteristics of Data Warehousing
No ratings yet
Characteristics of Data Warehousing
5 pages
Unit I DWDM
No ratings yet
Unit I DWDM
67 pages
Data Warehouse
No ratings yet
Data Warehouse
16 pages
What Is Data Warehouse
No ratings yet
What Is Data Warehouse
19 pages
DWDM
No ratings yet
DWDM
12 pages
Data Mining Warehousing I & II
No ratings yet
Data Mining Warehousing I & II
7 pages
DWM Gufran Notes
No ratings yet
DWM Gufran Notes
318 pages
WA Data Warehouse
No ratings yet
WA Data Warehouse
16 pages
DWH Fundamentals (Training Material)
No ratings yet
DWH Fundamentals (Training Material)
21 pages
Data Warehousing Basics & Components
No ratings yet
Data Warehousing Basics & Components
37 pages
Advanced Database Presentation
No ratings yet
Advanced Database Presentation
11 pages
Data Warehousing-Notes (Module - I & II)
No ratings yet
Data Warehousing-Notes (Module - I & II)
32 pages
Unit 3 Data Warehousing and OLAP
No ratings yet
Unit 3 Data Warehousing and OLAP
26 pages
Data Warehousing Essentials Guide
100% (1)
Data Warehousing Essentials Guide
19 pages
A Complete Notes
No ratings yet
A Complete Notes
10 pages
Data Warehousing
No ratings yet
Data Warehousing
33 pages
DMW Unit 1
No ratings yet
DMW Unit 1
56 pages
Data Warehousing Fundamentals
No ratings yet
Data Warehousing Fundamentals
108 pages
Why We Need Data Warehouse?
No ratings yet
Why We Need Data Warehouse?
4 pages
Data Warehousing and Data Mining
No ratings yet
Data Warehousing and Data Mining
135 pages
DM Unit V
No ratings yet
DM Unit V
50 pages
Data Warehousing and Mining Guide
No ratings yet
Data Warehousing and Mining Guide
46 pages
Datawarehouse Unit2
No ratings yet
Datawarehouse Unit2
75 pages
Data Warehousing - Quick Guide
No ratings yet
Data Warehousing - Quick Guide
70 pages
Data Warehouse: Key Features Explained
No ratings yet
Data Warehouse: Key Features Explained
2 pages
Presentation Prepared By:: Aqsa Ashfaq
No ratings yet
Presentation Prepared By:: Aqsa Ashfaq
22 pages
Warehousing
No ratings yet
Warehousing
15 pages
Unit 3 Introduction To Data Warehousing: Structure Page Nos
No ratings yet
Unit 3 Introduction To Data Warehousing: Structure Page Nos
21 pages
Introduction To Warehousing
No ratings yet
Introduction To Warehousing
21 pages
Kalyani
No ratings yet
Kalyani
8 pages
Datawarehouse Unit-2
No ratings yet
Datawarehouse Unit-2
59 pages
Data 20warehouse 20week 201 281 29
No ratings yet
Data 20warehouse 20week 201 281 29
27 pages
DWDM Unit 1
No ratings yet
DWDM Unit 1
122 pages
DM - MOD - 2 Part - I
No ratings yet
DM - MOD - 2 Part - I
19 pages
Data Mining & Business Intelligence (MU-Sem 6-1T) (Data Warehouse Mining) ... Page
No ratings yet
Data Mining & Business Intelligence (MU-Sem 6-1T) (Data Warehouse Mining) ... Page
25 pages
Data Warehouse Unit 1
No ratings yet
Data Warehouse Unit 1
7 pages
General Studies
No ratings yet
General Studies
40 pages
Introduction To The Gospels
No ratings yet
Introduction To The Gospels
11 pages
SAP Hybris V6.2 Certified Development Professional Study Guide - Quuth5ootaip
No ratings yet
SAP Hybris V6.2 Certified Development Professional Study Guide - Quuth5ootaip
260 pages
Music of Southeast Asian: Lesson
No ratings yet
Music of Southeast Asian: Lesson
22 pages
Articles: I/ The Indefinite Articles "A" and "An"
No ratings yet
Articles: I/ The Indefinite Articles "A" and "An"
2 pages
Present Perfect
No ratings yet
Present Perfect
20 pages
C Program for LL(1) Parsing Table
No ratings yet
C Program for LL(1) Parsing Table
25 pages
Obedience Cost Us Something.
No ratings yet
Obedience Cost Us Something.
15 pages
Langham DevotionalBooklet Who Is Jesus
No ratings yet
Langham DevotionalBooklet Who Is Jesus
68 pages
Final Report
No ratings yet
Final Report
37 pages
Play - Definition
No ratings yet
Play - Definition
5 pages
1400 GMAT Vocabulary Flashcards
No ratings yet
1400 GMAT Vocabulary Flashcards
100 pages
Assessment Brief For Assesment Point 2 - Reflective Writing
No ratings yet
Assessment Brief For Assesment Point 2 - Reflective Writing
7 pages
Google - Professional Machine Learning Engineer.v2024 10 23.q109
No ratings yet
Google - Professional Machine Learning Engineer.v2024 10 23.q109
120 pages
CMIS320-Project 2-DatabaseDesign
No ratings yet
CMIS320-Project 2-DatabaseDesign
5 pages
Modal Verbs: Exercises: 1. Complete Using Can, Could, Must, Have To, Should and Might
No ratings yet
Modal Verbs: Exercises: 1. Complete Using Can, Could, Must, Have To, Should and Might
3 pages
Signs of Social Media Addiction
No ratings yet
Signs of Social Media Addiction
1 page
4th Grade Music Rhythm Lesson
No ratings yet
4th Grade Music Rhythm Lesson
2 pages
Linux OS for Computer Engineering Students
No ratings yet
Linux OS for Computer Engineering Students
3 pages
Finding MR Wright Leaning N Ranch Series Book 2 Ba Tortuga Download
No ratings yet
Finding MR Wright Leaning N Ranch Series Book 2 Ba Tortuga Download
34 pages
Basic Math & Pre-Algebra All-in-One For Dummies 1st Edition Mark Zegarelli Download
No ratings yet
Basic Math & Pre-Algebra All-in-One For Dummies 1st Edition Mark Zegarelli Download
118 pages
Connecting Specific Planned Supports
No ratings yet
Connecting Specific Planned Supports
5 pages
Types of Vestments
No ratings yet
Types of Vestments
11 pages
Ebooks File Introductory Statistics For Data Analysis Warren J. Ewens All Chapters
No ratings yet
Ebooks File Introductory Statistics For Data Analysis Warren J. Ewens All Chapters
49 pages
HPE AI-ML Accelerated With HPE Proliant
No ratings yet
HPE AI-ML Accelerated With HPE Proliant
33 pages
Evaluasi Bahasa Inggris Bab 4
No ratings yet
Evaluasi Bahasa Inggris Bab 4
3 pages
Image Segmentation Techniques
No ratings yet
Image Segmentation Techniques
66 pages
Programación Didáctica: Inglés
0% (1)
Programación Didáctica: Inglés
14 pages
Grammar Translation Method Guide
No ratings yet
Grammar Translation Method Guide
16 pages
Advanced Function Analysis
No ratings yet
Advanced Function Analysis
10 pages

Data Warehouse and Data Mining

Uploaded by

Data Warehouse and Data Mining

Uploaded by

Data Warehouse and Data Mining (102)

1: What is Data Warehouse? Discuss the characteristics of data warehouse.

Data Warehouse: A Treasure Chest of Insights

2. (a) How operational and informational data differ to each other?

(a) Operational vs. Informational Data:

Feature Operational Data Informational Data

Purpose Run daily operations Inform decision-making

High volume & velocity, Lower volume & velocity, stable,

Access Read-write Read-only

Storage Operational systems Data warehouses

Understanding the differences is crucial:

3: Discuss the types of benefits of data warehouse.

The Multifaceted Gems of Data Warehouses: Benefits Across the Spectrum

(a) Components of Data Warehouse Architecture and their Purpose:

5. (a) Differentiate between host based and master- slave processing.

(a) Differentiating Host-Based and Master-Slave Processing:

Feature Host-Based Master-Slave

Control Structure Centralized Hierarchical

Task Distribution Independent Controlled by Master

Scalability Limited High

Simple Complex, Interdependent

6. (a) What is star schema and multi-star schema ?

(a) Star Schema and Multi-Star Schema:

You might also like