Advanced Database Management System Mod13
Advanced Database Management System Mod13
Systems:
Design,
Implementation,
and
Management,
14e
Module 13: Business
Intelligence and Data
Warehouses
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 1
whole or in part.
Chapter Objectives (1 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 2
whole or in part.
Chapter Objectives (2 of 2)
7. Describe the role and functions of data analytics and data mining
8. Explain how SQL analytic functions are used to support data analytics
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 3
whole or in part.
The Need for Data Analysis
• Organizations tend to grow and prosper as they gain a better understanding of their
environment
− Managers need to evaluate how the business is doing through tracking daily
transactions and analyzing company data
• Companies and software vendors addressed these multilevel decision support needs
by creating autonomous applications for particular groups of users
− This more comprehensive and integrated decision support framework within
organizations became known as business intelligence
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 4
whole or in part.
Business Intelligence (1 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 5
whole or in part.
Business Intelligence (2 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 6
whole or in part.
Business Intelligence Architecture (1
of 3)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 7
whole or in part.
Business Intelligence Architecture (2
of 3)
• Master data management (MDM) is a collection of concepts, techniques, and
processes for identification, definition, and management of data elements
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 8
whole or in part.
Business Intelligence Architecture (3
of 3)
• A BI system’s advanced decision support functions come to life via its intuitive and
informational user interface, and reporting capabilities
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 9
whole or in part.
Business Intelligence Benefits
• Improved decision making is the main goal of BI, but BI also provides the following
benefits:
− Integrating architecture
− Common user interface for data reporting and analysis
− Common data repository fosters single version of company data
− Improved organizational performance
• Achieving all these benefits takes a lot of human, financial, technological resources,
and time
− BI benefits are the result of a focused company-wide effort that could take a long
time
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 10
whole or in part.
Business Intelligence Evolution
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 11
whole or in part.
Business Intelligence Technology
Trends
• The following technological advances are driving the growth of business intelligence
technologies:
− Data storage improvements
− Business intelligence appliances
− Business intelligence as a service
− Big Data analytics
− Personal analytics
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 12
whole or in part.
Knowledge Check Activity 13-1
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 13
whole or in part.
Knowledge Check Activity 13-1:
Answer
• What are decision support systems, and what role do they
play in the business environment?
• Answer: Decision Support Systems (DSS) are based on computerized
tools that are used to enhance managerial decision-making. Because
complex data and the proper analysis of such data are crucial to
strategic and tactical decision making, DSS are essential to the well-
being and even survival of businesses that must compete in a global
market place.
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 14
whole or in part.
Operational Data versus Decision
Support Data (1 of 2)
• Although BI is used at the strategic and tactical managerial levels within
organizations, its effectiveness depends on the quality of data gathered at the
operational level
− Operational data is seldom well suited to decision support tasks
• Decision support data gives tactical and strategic business meaning to the
operational data
• Decision support data differs from operational data in three main areas
− Time span
− Granularity (level of aggregation)
− Dimensionality
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 15
whole or in part.
Operational Data versus Decision
Support Data (2 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 16
whole or in part.
Decision Support Database
Requirements
• The decision support database schema must support complex, non-normalized data
representations
− It must contain data that is aggregated and summarized and the queries must
be able to extract multidimensional time slices
• Data extraction capabilities should allow batch and scheduled data extraction and
should support different data sources and check for inconsistent data or data
validation rules
• Database size – the DBMS must be capable of supporting very large databases
(VLDBs)
− To support a VLDB, the DBMS might be required to support advanced storage
technologies and multiple-processor technologies
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 17
whole or in part.
The Data Warehouse (1 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 18
whole or in part.
The Data Warehouse (2 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 19
whole or in part.
Data Marts
• A data mart is a small, single-subject data warehouse subset that provides decision
support to a small group of people
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 20
whole or in part.
Twelve Rules That Define a Data
Warehouse
Table 13.9 Twelve Rules for a Data Warehouse
Rule No. Description
1 The data warehouse and operational environments are separated.
2 The data warehouse data is integrated.
3 The data warehouse contains historical data over a long time.
4 The data warehouse data is snapshot data captured at a given point in time.
5 The data warehouse data is subject oriented.
6 The data warehouse data is mainly read-only with periodic batch updates from operational data.
7 The data warehouse development life cycle differs from classical systems development.
8 The data warehouse contains data with several levels of detail: current detail data, old detail data, lightly summarized
data, and highly summarized data
9 The data warehouse environment is characterized by read-only transactions to very large data sets.
10 The data warehouse environment has a system that traces data sources, transformations, and storage.
11 The data warehouse’s metadata is a critical component of this environment.
12 The data warehouse contains a chargeback mechanism for resource usage that enforces optimal use of the data by end
users.
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 21
whole or in part.
Star Schemas (1 of 5)
• Fact and dimension tables are related by foreign keys and are subject to primary
and foreign key constraints
• The primary key of a fact table is a composite primary key because the fact table is
related to many dimension tables
• The fact table’s primary key is always formed by combining the foreign keys
pointing to the related dimension tables
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 23
whole or in part.
Star Schemas (3 of 5)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 24
whole or in part.
Star Schemas (4 of 5)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 25
whole or in part.
Star Schemas (5 of 5)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 26
whole or in part.
Knowledge Check Activity 13-2
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 27
whole or in part.
Knowledge Check Activity 13-2:
Answer
• What is a data warehouse, and what are its main
characteristics?
• Answer: A data warehouse is an integrated, subject-oriented, time-
variant and non-volatile database that provides support for decision-
making. (See section 13-4 for an in-depth discussion about the main
characteristics.) The data warehouse is usually a read-only database
optimized for data analysis and query processing.
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 28
whole or in part.
Online Analytical Processing
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 29
whole or in part.
Multidimensional Data Analysis
Techniques
• A characteristic of modern OLAP tools is their capacity for data to be processed and
viewed as part of a multidimensional structure
− This type of data analysis is particularly attractive to business decision makers
who tend to view business data as being related to other business data
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 30
whole or in part.
Advanced Database Support
• OLAP tools must have the following features to deliver efficient decision support:
− Access to many different kinds of DBMSs, flat files, and internal and external
data sources
− Access to aggregated data warehouse data and operational database detail data
− Advanced data navigation features
− Rapid and consistent query response times
− The ability to map end-user requests
− Support for very large databases
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 31
whole or in part.
Easy-to-Use End-User Interfaces
• Advanced OLAP features are more useful when access is kept simple
• Many interface features are “borrowed” from previous generations of data analysis
tools
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 32
whole or in part.
OLAP Architecture (1 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 33
whole or in part.
OLAP Architecture (2 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 34
whole or in part.
Relational OLAP
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 35
whole or in part.
Multidimensional OLAP
• To speed data access, data cubes are normally held in memory in the cube cache
• Sparsity measures the density of the data held in the data cube
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 36
whole or in part.
Relational versus Multidimensional
OLAP
Table 13.12 Relational vs. Multidimensional OLAP
Characteristic Rolap Molap
Schema Uses star schema Uses data cubes
Additional dimensions can be added dynamically Multidimensional arrays, row stores, column stores
Additional dimensions require re-creation of the data
cube
Database size Medium to large Large
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 37
whole or in part.
Data Analytics
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 38
whole or in part.
Data Mining (1 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 40
whole or in part.
Predictive Analytics
• Most predictive analytics models are used in areas such as healthcare services,
customer relationships, customer service, customer retention, fraud detection,
targeted marketing, and optimized pricing
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 41
whole or in part.
SQL Analytic Functions (1 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 42
whole or in part.
SQL Analytic Functions (2 of 2)
• Materialized Views
− A materialized view is a dynamic table that contains SQL query command to
generate rows and stores the actual rows
− It is created the first time the query is run and the summary rows are stored in
the table
− The materialized view rows are automatically updated when the base tables are
updated
− To create materialized views, you must have specified privileges and you must
complete specified prerequisite steps
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 43
whole or in part.
Data Visualization (1 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 44
whole or in part.
Data Visualization (2 of 2)
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 45
whole or in part.
Data Lake
• A data lake is a mega data repository that stores all company data (structured,
semi-structured, and unstructured) in its raw/natural format
• The idea of the data lake is to store the data in its raw/natural state before it is
processed
• The data lake can also serve as the source data for data scientists to “fish” for data
relationships and patterns in the raw data before it is processed
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 46
whole or in part.
Knowledge Check Activity 13-3
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 47
whole or in part.
Knowledge Check Activity 13-3:
Answer
• What is data analytics?
• Answer: Data analytics is a subset of BI functionality that
encompasses a wide range of mathematical, statistical, and modeling
techniques with the purpose of extracting knowledge from data. Data
analytics is used at all levels within the BI framework, including
queries and reporting, monitoring and alerting, and data visualization.
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 48
whole or in part.
Summary (1 of 2)
Now that the lesson has ended, you should be able to:
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 49
whole or in part.
Summary (2 of 2)
Now that the lesson has ended, you should be able to (continued):
7. Describe the role and functions of data analytics and data mining
8. Explain how SQL analytic functions are used to support data analytics
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023
Cengage. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in 50
whole or in part.