Database Systems
Unit – 1
Introduction
Asif Sohail
University of the Punjab
Punjab University College of Information Technology (PUCIT)
Database Systems – Asif Sohail Slide 1
Text & Reference Books
• Database Systems – Design, Implementation and Management:
Carlos Coronel, Steven Morris (13th Ed.)
• Modern Database Management: Jeffery. A. Hoffer (12th Ed.)
• Database Systems: Thomas M. Connolly (6th Ed.)
• Fundamentals of Database Systems: Elmasri (7th Ed.)
• An Introduction to Database Systems: C.J. Date (8th Ed.)
• Introduction to PL\SQL: Oracle Press
Database Systems – Asif Sohail Slide 2
Course Information
• Resources:
\\printsrv\\teacher data\\Asif Sohail\\Database Systems
https://classroom.google.com/c/NDY4MTQ0ODk3NTcz?cjc=uwc3vvt
• Office: (Ground Floor) Graduate Building
• Students Counseling hours :
• Mon, Wed: 11:00 – 12:30 P.M
• email: asif@pucit.edu.pk
Assessment Category Marks
Quizzes 12
Assignments 6
Presentations 2
Project 5
Database Systems – Asif Sohail Slide 3
Course Overview
• Introduction
• File Processing System (FPS)
• Database Architecture and Components
• SQL – SELECT Statement
• Database Development and Data Modeling
• Relational Data Model and Relational Keys
• Integrity Constraints and Rules
• SQL Functions
• Entity-Relationship (E-R) Model
• SQL Joins
• Subqueries
• Creating Database Tables (DDL)
Database Systems – Asif Sohail Slide 4
Course Overview
• Normalization
• SQL – Data Manipulation (DML)
• Database Objects (Views, Indexes, …)
• Transaction Management
• Controlling User Rights & Access (DCL)
• Concurrency Control
• Procedural SQL (PL/SQL)
• Distributed Databases
• Stored Procedures and Triggers
• Big Data and NoSQL
• Data Quality and Integration
• Future Trends, Revision
Database Systems – Asif Sohail Slide 5
Outline
• Introduction
• Data? Information? Database?
• File Processing System (FPS)
• Drawbacks of FPS
• Database System or Integrated Database Environment(IDE)
• Advantages of IDE
• Types of Databases
Database Systems – Asif Sohail Slide 6
Introduction
Dunn's Book of Life (1946)
• “Each person in the world creates a Book of Life. This Book
starts with birth and ends with death. Its pages are made up of
the records of the principal events in life. Database/Record
linkage is the name given to the process of assembling the
pages of this Book into a volume.”
Database Systems – Asif Sohail Slide 7
Introduction
• Increasing need of database systems
• The Model of Generating/Consuming Data has Changed
Old Model: Few companies are generating data, all others are consuming data
New Model: all of us are generating data, and all of us are consuming data
8
Database Systems – Asif Sohail Slide 9
Exponential Data Growth
Structured data is highly specific and is stored in a
predefined format, where unstructured data is a
compilation of many varied types of data that are
stored in their native formats.
Database Systems – Asif Sohail Slide 10
Data
• Data refers to the raw facts & figures concerning:
✓ PEOPLE
Data refers to raw facts, figures, or symbols without context or
✓ EVENT interpretation. It's the unprocessed and unorganized representation of
information.
✓ ORGANISATION Information, on the other hand, is processed and organized data that has
✓ OBJECT meaning and context. It provides knowledge or insights, helping to make
informed decisions.
• The amount of available data in the modern computer world is literally
exploding (3 Vs: Volume, Velocity, Variety)
• The modern data sources/types include multimedia data, sensor’s data,
data on web and so on.
• To get most out of data, we require tools than can simplify the task of
managing the data and extract useful information out of it in timely fashion.
Database Systems – Asif Sohail Slide 11
Data
• Large volume of facts, difficult to interpret or make decisions based on
Database Systems – Asif Sohail Slide 12
Data vs Information
• Information is derived from data OR Information is the processed data
• Information may be presented either Textually or Graphically or both
What is the difference bw data
and info?
• The representation and amount of information depends upon the
requirements of a user.
• When data is processed and organized into a form needed for its application,
it is called an Information Product (IP).
Database Systems – Asif Sohail Slide 13
Data vs Information
• Accurate, reliable, relevant and timely information
is key to good decision making.
• Information is now increasingly viewed as an
organizational asset (Redman, 2008) which can
be used to improve organizational performance
and help an organization gain a competitive
advantage in the marketplace.
Database Systems – Asif Sohail Slide 14
Data Hierarchy
Field (Data Item)
Record
File
Database
Database Systems – Asif Sohail Slide 15
A database is a structured collection of data, typically stored electronically. It
can include tables, records, fields, and other elements to organize and
represent information
Database
• Database System can simply be regarded as a computerized
record keeping system.
• Database System is a system whose overall purpose is to
maintain data and make the information available to its users as
per their requirements.
• The user of the system can perform the operations like:
- Defining structure and types of data
- Data manipulation (Insert, Delete, Update, Query etc.)
- Enforcing security restrictions
• A Database Management System (DBMS) is a collection of
programs that manages the database structure and controls
access to the data stored in the database.
A Database Management System (DBMS) is software that provides an interface to
interact with the database. It manages the storage, retrieval, and manipulation of data
within the database
Database Systems – Asif Sohail Slide 16
Database
• Database is a shared collection of logically related data, and a
description of this data, designed to meet the information
requirements of multiple users in an organization.
• Database is an organized collection of data (and metadata) about
entities and the relationships among these entities.
• Metadata: Data that describes properties or characteristics of other
data OR it is “data about data”.
• The description of data is also known as system catalog or data
dictionary.
• Metadata allows database designers & users to understand what
data exists and what the data mean?
• Metadata describes the domain (set of possible values) and the
constraints for a data item.
Database Systems – Asif Sohail Slide 17
Metadata Example
Data Item Value
Name Type Length Min Max Description
Course Alphanumeric 30 Course ID & Name
Section Integer 1 1 9 Section Number
Semester Alphanumeric 10 1 8 Semester & Year
Name Alphanumeric 30 Student name
ID Integer 9 Student ID
Major Alphanumeric 4 Student Major
GPA Decimal 3 0.0 4.0 Student GPA
Database Systems – Asif Sohail Slide 18
Examples of Database Applications
➢ Databases play a critical role in almost all areas where
computers are used.
➢ Databases are everywhere …
• Student / Employee’s Information System
• Library System
• Hotel / Airline Reservation System
• Billing System
• Stock / Inventory System
• Payroll System
• Geographical Information Systems (GISs)
• Data Warehouses
Database Systems – Asif Sohail Slide 19
Record Keeping Techniques
1. Manual Record Keeping.
2. Computerized Record Keeping
a) File Processing System.
b) Database System or Integrated Database
Environment. (The paradigm shift)
Database Systems – Asif Sohail Slide 20
Case Study
• Consider a College System consisting of the following offices:
• Admission Office
• Academics Office
• Exam Office
• Each Office is maintaining its own set of files for its day to day
operations.
• Some of the files used in the system are Student’s File, Faculty File,
Course File, Correspondence File etc.
Database Systems – Asif Sohail Slide 21
Manual System - Drawbacks
• High data volume
• Not reliable A reliable system or information source can
be trusted to perform consistently under
• Inefficient various conditions
• Duplication of data
• Inconsistency
• A lot of data movement is required
• The System can’t answer complex queries involving multiple
departments.
Database Systems – Asif Sohail Slide 22
File Processing System
• File-based systems were an early attempt to computerize the
manual filing system.
• The manual files were replaced by computer files.
• A person responsible for developing and managing
computerized file processing system was called Data Processing
(DP) specialist.
• In the traditional file processing system, the existing manual
system is automated by focusing on the data processing needs of
the individual departments instead of treating the organization
as whole.
• Each application will have its own set of Private Files designed to
meet the needs of a particular department.
Database Systems – Asif Sohail Slide 23
File Processing System
App – 1
File – 1 (operations + data descriptions)
App – 2
File – 2 (operations + data descriptions)
App – n
File – n (operations + data descriptions)
Database Systems – Asif Sohail Slide 24
File Processing System
Duplicate
Data
Database Systems – Asif Sohail Slide 25
Drawbacks of File System
1. Redundancy of Data
2. Inconsistent Data
3. Poor Enforcement of Standards
4. Data Dependence
5. Limited Data Sharing
6. Extensive Development & Maintenance Effort
7. Limited Support for Queries
8. Inflexible
9. Lack of provision for security
10. Limited recovery from failure
Database Systems – Asif Sohail Slide 26
Drawbacks of File System
1. Redundancy of Data
• Redundancy means duplication of data.
• Since applications in FPS are independent so unplanned
duplicate data is there.
• Same information is needed to be kept at different
places. It costs time and money.
• Same data may have to be input several times to
update all the occurrences.
• Redundancy results in data anomalies (Insert, Update,
Delete)
Database Systems – Asif Sohail Slide 27
Data inconsistency occurs when there are variations or
discrepancies in the same piece of information stored in different
Drawbacks of File System parts
2. Data Inconsistencies or Data Anomalies
• Redundancy leads to data inconsistency or data anomalies.
• A data anomaly occurs if an operation (update, insert, delete) has not yet
been performed against all the occurrences. Consequently, same data
stored at different places will disagree with each other
• The use of different programming environment and platforms lead to
Heterogeneous Environment.
3. Poor Enforcement of Standards
• Organization wide enforcement of standards is poor. This leads to the
following types of inconsistencies:
a) Synonym: Using different names for same data item
Example: Stu-Id, Reg-No.
b) Homonym: Using same name for different data items.
Example: “course” for a single subject or entire program.
Database Systems – Asif Sohail Slide 28
Drawbacks of File System
4. Data Dependence
• The definition of data is embedded in the application
programs, rather than being stored separately and
independently.
• The applications are constrained to work only with the given file
description. Any change in the file structure or data requires changes
to all the applications using that file. Such applications are called
Data Dependent Applications. jo filed depend kr rhi aik dusre pe
• Data dependence leads to excessive program maintenance.
• Often difficult to locate all programs affected by change
• Errors are often introduced when making changes.
• Incompatible file structures (use of different programming
languages) makes the situation even worse.
Database Systems – Asif Sohail Slide 29
Drawbacks of File System
5. Limited Data Sharing
• As each application has its own private files so little
opportunity to share data with others.
• The filing system works well when we have to interact with
each file separately. However, it breaks down when we have to
cross-reference or process the information in the files. This
makes it hard to answer queries involving multiple files.
6. Extensive Development & Maintenance Effort
7. Limited Support for Queries FPS me recovery sirf tb hoti agr backup kiya gya ho
8. Inflexible
9. Lack of provision for security File processing systems are often considered inflexible
because they lack the ability to manage data in a dynamic or
structured manner. They typically rely on fixed formats,
10. Limited recovery from failure making it challenging to adapt to changing data need
Database Systems – Asif Sohail Slide 30
Integrated Database Environment(IDE)
• The limitations of file-based approach can be attributed to the following
factors:
• Redundancy of data
• Data dependence
• In contrast to file processing system, the IDE has a single large repository of
data, where data definition is separated from application programs.
• Emphasizes the integration & sharing of data throughout the organization.
• The organization wide requirements are analyzed as a whole and there is no
longer concept of MY FILE or Private Files.
• A Database Management System (DBMS) is a software designed to assist in
maintaining and utilizing large collections of data.
• By storing data in a DBMS rather than as a collection of files, we achieve
enormous advantages.
Database Systems – Asif Sohail Slide 31
Database System
D App – 1 (Operations)
B
App – 2 (Operations)
Database M
(Data + Data Descriptions)
S App – n (Operations)
DBMS manages data resources like
an operating system manages
hardware resources
Database Systems – Asif Sohail Slide 32
Database Systems – Asif Sohail Slide 33
Contrasting Database and File System
File System Database System
Adapted from Lecture slides of Werner Nutt
Database Systems – Asif Sohail Slide 34
Advantages of Database Approach
1.Reduced or Controlled Data Redundancy
2.Improved Data Consistency
3.Enforcement of Standards
4.Reduced Program Maintenance
5.Data Sharing
6.Data Integrity (Improved Data Quality)
7.Improved Security Restrictions
8.Improved Accessibility & Responsiveness
9.Improved Decision Making
10. Efficient Query Processing
11. Backup & Recovery
12.Data Independence
Database Systems – Asif Sohail Slide 35
Advantages of Database Approach
1. Reduced or Controlled Data Redundancy
• Since all the data items are stored in a single database, therefore
Redundancy is minimized.
• Separate data files are integrated into a single, logical structure to reduce
redundancy
• 100% elimination of redundancy is not possible in order keep the logical
connections among the data items.
2. Improved Data Consistency
• Reduction in redundancy automatically avoids inconsistency.
• Wang and Wand describes data consistency as “a data value can only be
expected to be same for the same situation”.
• If a data item appears only once, any change to its value needs to be
performed only for once and the database will always be in some consistent
/ correct state.
Database Systems – Asif Sohail Slide 36
Advantages of Database Approach
3. Enforcement of Standards
• This is possible as database is designed to meet the organization
wide requirements.
• The standards can be name of data items and their format, data
codes, documentation standards, operation standards, security
policies etc.
4. Reduced Program Maintenance & Development Cost
• When all the organization’s requirements are satisfied by one
database instead of many separate files, the maintenance and
development cost automatically reduces.
Database Systems – Asif Sohail Slide 37
Advantages of Database Approach
5. Data Sharing
• Sharing means that the same data source is used by multiple
applications.
• Data is centralized and hence can be shared not only by the
existing applications but also new applications can be
developed to operate against the same data.
• The applications that can access same data simultaneously are
called Online Transaction Processing (OLTP) applications and
DBMS uses Concurrency Control Mechanism to ensure that
multiple users can access and update data correctly.
Database Systems – Asif Sohail Slide 38
Advantages of Database Approach
6. Data Integrity (Improved Data Quality)
• Data integrity refers to the correctness of data.
• Integrity constraints or rules ensure that the data stored in the
database is purified and accurate.
7. Improved Security Restrictions
• Database security is the protection of database from
unauthorized disclosure, alteration or destruction.
• DBMS provides strong security measures against such threats.
Some of them are:
• Password checks
• User defined procedures
• Defining user privileges
• Audit trial system
• Data encryption
Database Systems – Asif Sohail Slide 39
Advantages of Database Approach
8. Improved Accessibility & Responsiveness.
• In File Processing System, data accessibility is quite difficult as
it is procedural based. You should know the detailed
procedural steps. HOW TO DO?
• On the other hand, accessing data is lot easier in DBMS with
the help of a non procedural language – SQL. You only need
to know the simple commands or in other words WHAT TO
DO?
9. Improved Decision Making
• Consolidated reports
Ad-hoc queries in database management systems (DBMS) refer to
• Ad-hoc queries queries that are executed on-demand by users without prior
preparation or plannin
• Multiple views of data
Database Systems – Asif Sohail Slide 40
Advantages of Database Approach
10. Efficient Query Processing
• Specialized data structures and search techniques
• Indexing
Buffering is a technique used in databases to store copies
• Buffering or caching module of database blocks in a temporary area of memory. The
purpose of buffering is to reduce the number of times a
database needs to access secondary memory.
• Query processing and optimization
11. Backup & Recovery
• The backup and recovery subsystem of DBMS provides
efficient techniques for database recovery.
Database Systems – Asif Sohail Slide 41
Advantages of Database Approach
12. Data Independence.
• The separation of data descriptions from the applications using the
data is called data independence.
• Data Independence can also be defined as: “The immunity of an
application to change in storage structure – which implies that the
applications don’t depend on any particular storage structure”.
• Allows change & evolution of database systems without changing
the application programs
• Without data independence, it is must to understand the
arrangements of data elements in order to perform an operation
against them.
• Different data arrangements will need different algorithms even for
the same operation.
Database Systems – Asif Sohail Slide 42
Advantages of Database Approach
12. Data Independence.
• Clearly, data dependence is not desired because of the
following reasons:
a) Each user should be able to access data in a customized way.
b) Users should not have to deal directly with the physical
database storage details.
c) Different application may need different physical structure for
the efficiency of their operations.
d) The DBA should have freedom to change the storage structure
or access technique or both in response to changing
requirements without much disturbing the existing
applications.
Database Systems – Asif Sohail Slide 43
Cost & Risk of Database Approach
1. Need for new specialized personnel
2. Cost of DBMS and additional hardware cost
3. Installation cost & complexity
4. Conversion from legacy system to modern database system
5. Organizational conflict
6. Higher impact of failure
7. Not suitable for applications with tight real-time constraints
Database Systems – Asif Sohail Slide 44
Types of Databases
• Databases can be categories on the basis of the following criterions:
Criteria Types
1. Number of users • Single User Database
• Multi-user Database
2. Data location • Centralized Database
• Distributed Database
3. Data usage • Operational (OLTP) Operational data is used for
• Analytical (OLAP) day-to-day transactions and routine
business operations.
4. Form of data
Structured data is data that fits neatly into
• Unstructured Analytical data is used for analyzing
and gaining insights from historical
such as numbers, short text, and dates. •
data tables and includes discrete data types
Structured and aggregated data.
Unstructured data doesn't fit neatly into a
data table because its size or nature: for • Semi-structured (XML)
example, audio and video files and large
text documents
Database Systems – Asif Sohail Slide 45
What is a semi-structured data example? Semi-structured data sources
include emails, XML and other markup languages, TCP/IP packets, binary
executables, zipped files, data integrated from different sources, and web
pages
Types of Databases
Single user with Mainframe computer
desktop computer with thousands of users
• Categories are:
• Personal computer databases(Single User)
• Workgroup databases(Multi-user)
• Department databases (Multi-user)
• Enterprise database (Multi-user)
Database Systems – Asif Sohail Slide 46
Types of Databases
Personal Computer Databases
• Support one user with a standalone PC
• E.g. a student’s own database or a sales person’s simple database
Workgroup Databases
• Workgroup: relatively small group of people who collaborate on
same project/application.
• A workgroup DB supports the collaborative efforts of a workgroup.
• Allows data sharing.
• Its model is shown on the next fig:
Database Systems – Asif Sohail Slide 47
Continued…
LAN
DB Server
Workgroup
DB
Workgroup DB on LAN
(Method of Data Sharing)
Database Systems – Asif Sohail Slide 48
Types of Databases
Department Databases
• Department: Functional unit within an organization
• Department DBs support function & activities of a department.
• E.g. personnel DB to track employees, jobs, skills etc.
Enterprise Databases
• DB scope is the entire organization
• To support organization-wide operations & decision making.
Database Systems – Asif Sohail Slide 49
Continued…
Hospital
Medical
Clinic
Corporate
Hospital Office
Nursing Data
Home Warehouse
Medical
Clinic
Database Systems – Asif Sohail Slide 50
The internet is a public network accessible to anyone, while
Types of Databases intranets are private networks accessible only to authorized
users within an organization. Extranets are private networks
that allow external parties to access certain parts of an
organization's intranet.
Internet, Intranet & Extranet Databases
• Internet: A worldwide network that connects users of multiple platforms
easily through an interface known as web browser
• Extranet: Use of Internet protocols to establish limited access to company
data & information by the company’s customers & suppliers
• Intranet: Use of internet protocol to establish access to company data and
information that is limited to the organization
Centralized & Distributed Databases
• Centralized database system – supports a database located at a single site
• Distributed database system – supports a database distributed across
several different sites
Database Systems – Asif Sohail Slide 51
Types of Databases
• The past few years have produced advances in technology leading to exciting
new applications of database systems such as:
• Multimedia databases
• Geographic information systems
• Data warehouses
• Multimedia databases – store pictures, video clips, and sound messages
• Geographic information systems (GIS) – store and analyze maps, weather
data, and satellite images
• Data warehouses & on-line analytical processing (OLAP) is used to extract and
analyze useful information from very large databases for decision-making.
Database Systems – Asif Sohail Slide 52
Reading Material
• Text Book: Chapter 1 (1.1 to 1.6)
• Ref. Book-A: Chapter 1 (page 2 to page 15)
• Ref. Book-B: Chapter 1 (1.1 to 1.6)
• Ref. Book-C: Chapter 1 (1.1 to 1.3, 1.6)
Database Systems – Asif Sohail Slide 53