DBS101
Introduction to Database Systems
REFERENCE TEXT:
CHA PT ER 0 1 : DATABA SE SYSTEMS: DES I GN, I M P L EMENTATIO N, A N D
M A N AGEMENT
Coronel, Carlos and Morris, Steven, Database Systems: Design, Implementation, and Management, 14 Edition. © 2023 Cengage. All Rights Reserved. May
not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Learning Objectives
The difference between data, information and knowledge
What a database is, the various types of databases, and why they are valuable assets for decision
making ???
The importance of database design
How modern databases evolved from file systems
The main components of the database system
The main functions of a database management system (DBMS)
Why Database ??????(1)
Figure 1.1 The Pervasive
Nature of Databases
Why Database ??????(2)
We need system to process and store data.
All these systems are responsible for collection, storage, aggregation,
manipulation , dissemination and management of data.
Size of data : We are looking from few megabytes to millions of terabytes.
Database are specialized structures that allow computer-based systems to store,
manage and retrieve data very quickly.
Data vs. Information(1)
Data are raw facts.
Information is the result of processing raw data to reveal meaning.
Information requires context to reveal meaning.
Raw data must be formatted for storage, processing, and presentation.
Data are the foundation of information, which is the bedrock of knowledge.
Data vs. Information (2)
Data: building blocks of information
Information produced by processing data
Information used to reveal meaning in data
Accurate, relevant, timely information is the key to good decision making
Good decision making is the key to organizational survival
Data management: focuses on proper generation, storage, and retrieval of data
Data vs. Information (3)
Figure 1.2 Transforming Raw
Data into Information
Information Vs Knowledge
Generated information is useful only if it can be applied to create knowledge in the
organization.
Do I need to open another section for DBS101?
Would the house prices go up in the next 6 months ?
Introduction to Database
A database is a shared, integrated computer structure that stores a collection of the
following:
End user data
Meta data: Data about the data, used to integrated and manage the end user
data.
In Short
¯ Database produces information by processing data.
¯ Information is used by organizations to make good decisions.
¯ Good decision making is the key to survival in co-operate world.
Database Management System
The DBMS presents the end user with a single, integrated view of the data in the database.
A DBMS provides the following advantages:
Improved data sharing
Improved data security
Better data integration
Minimized data inconsistency
Improved data access
Improved decision making
Increased end-user productivity
DBMS
Figure 1.4 The DBMS
Manages the Interaction
Between the End User and
the Database
Role of DBMS
DBMS is the arbitrator between user and database.
The database itself is stored as a collection of files.
DBMS provides an integrated, shared view of the data.
It takes requests from the user, converts them into complex queries and execute
them .
It hides all the internal details from the end user.
It communicates with different application written by programmer in different
languages like C#, Java or VB.Net
Types Of Databases(1)
Single-user Database: Supports only one user at a time.
Multiuser Database: Supports multiple user at the same time.
Workgroup Database: Relatively small number ( < 50 )of user working within the same
organization.
Enterprise Database: More than 50 users working in different departments of an organization
or even different organizations.
Centralized Database supports data located at a single location.
Distributed Database supports data stored at different locations.
Cloud database is created and maintained using cloud data services (most famous ones:
Microsoft Azure , Amazon AWS).
Types of Databases (2)
Classification by data type :
—General-purpose databases contain a wide variety of data used in multiple disciplines.
—Discipline-specific databases contain data focused on specific subject areas.
—An operational database is designed to support a company’s day-to-day operations.
—An analytical database stores historical data and business metrics used exclusively for
tactical or strategic decision making and is comprised of two main components:
—The data warehouse stores data in a format optimized for decision support.
—Online analytical processing (OLAP) is a set of tools for retrieving, processing.
Types Of Databases (3)
Business intelligence describes a comprehensive approach to capture and process business
data to generate information that support decision making.
Databases can be classified to reflect the degree to which the data is structured
Unstructured data exists in its original (raw) state
Structured data is the result of formatting unstructured data to facilitate storage and use
Semistructured data has already been processed to some extent
Extensible Markup Language (XML) is a language used to represent data elements in textual
format
An XML database supports the storage and management of unstructured XML data
Types Of Databases (4)
Social media refers to web and mobile technologies that enable “anywhere,
anytime, always on” human interaction.
◦ Data is captured about end users and consumers that requires the use of
specialized database systems
The term NoSQL (Not only SQL) is a new generation of DBMS that is not based
on the traditional relational database model.
◦ These databases are designed to handle an unprecedented volume of data,
variety of data types and structures, and velocity of data operations of new
business requirements.
Database Design
Focuses on the design of the database structure that will be used to store and
manage end-user data.
Designer must identify database’s expected use.
Well-designed database:
Facilitates data management.
Generates accurate and valuable information.
Poorly designed database causes difficult-to-trace errors.
Why Database Design Is Important ? (1)
Figure 1.5 Employee Skills
Certification in a Poor
Design
Why Database Design Is Important ? (2)
Figure 1.6 Employee Skills Certification
in a Good Design
Evolution of File System Data Processing
Manual File Systems
Accomplished through a system of file folders and filing cabinets
Computerized File Systems
Data processing (DP) specialist: Created a computer-based system that would track data and
produce required reports
File System Redux: Modern End-User Productivity Tools
Includes spreadsheet programs such as Microsoft Excel
Basic File Terminology
A Simple File System
Problems with File System Data Processing
Lengthy development times
Difficulty of getting quick answers
Complex system administration
Lack of security and limited data sharing
Extensive programming
Structural Dependence
Structural dependence:
• Access to a file is dependent on its own structure.
• All file system programs are modified to conform to a new file structure.
Structural independence:
• File structure is changed without affecting the application’s ability to
access the data.
Data Dependence
Data dependence:
◦ Data access changes when data storage characteristics change.
Data independence:
◦ Data storage characteristics is changed without affecting the program’s
ability to access the data.
Practical significance of data dependence is difference between logical (how
humans view the data) and physical (how computer must work with the data)
format.
Data Redundancy
Unnecessarily storing same data at different places.
Islands of information: Scattered data locations.
Increases the probability of having different versions of the same data.
Data Redundancy Implications
Poor data security
Data inconsistency
Increased likelihood of data-entry errors when complex entries are made in
different files.
Data anomaly: Develops when not all the required changes in the redundant
data are made successfully.
Types of Data Anomaly
Update Anomalies
Insertion Anomalies
Deletion Anomalies
Lack of Design and Data-Modeling Skills
Most users lack the skill to properly design databases.
Despite multiple personal productivity tools being available.
Data-modeling skills is vital in the data design process.
Good data modeling facilitates communication between the designer, user, and
the developer.
Database Systems
Logically related data stored in a single logical data repository.
Physically distributed among multiple storage facilities.
DBMS eliminates most of file system’s problems.
Current generation DBMS software:
Stores data structures, relationships between structures, and access
paths.
Defines, stores, and manages all access paths and components.
Contrasting Database and File Systems
Figure 1.9 Contrasting
Database and File Systems
Database System Environment
The term database system refers to an organization of components that define and
regulate the collection, storage, management, and use of data within a database
environment.
The database system is composed of the following five components:
◦ Hardware
◦ Software
◦ People
◦ Procedures
◦ Data
Database solutions must be cost-effective as well as tactically and strategically effective.
The Database System Environment
Figure 1.10 The Database
System Environment
DBMS Functions (1)
Data dictionary management
• Data dictionary: Stores definitions of the data elements and their relationships
Data storage management
• Performance tuning: Ensures efficient performance of the database in terms of
storage and access speed
Data transformation and presentation
• Transforms entered data to conform to required data structures
Security management
• Enforces user security and data privacy
DBMS Functions (2)
Multiuser access control
• Sophisticated algorithms ensure that multiple users can access the
database concurrently without compromising its integrity
Backup and recovery management
• Enables recovery of the database after a failure
Data integrity management
• Minimizes redundancy and maximizes consistency
DBMS Functions (3)
Database access languages and application programming
interfaces
• Query language: Lets the user specify what must be done without having
to specify how
• Structured Query Language (SQL): De facto query language and data
access standard supported by the majority of DBMS vendors
Database communication interfaces
• Accept end-user requests via multiple, different network environments
Disadvantages of Database Systems
Increased costs
Management complexity
Maintaining currency
Vendor dependence
Frequent upgrade/replacement cycles
Professional Careers In Database
Key Terms
1. Data 6. Knowledge
2. Information 7. Data management
3. Field 8. Data integrity
4. File 9. DBMS
5. Metadata 10. Database
Review Questions
What are the potential costs of implementing a database system ?
What are the main components of a database design ?
Write the functions of DBMS.
Explain the difference between data, information and database.
List and describe the different types of database systems ?
What are basic database functions that a spreadsheet can not perform ?