0% found this document useful (0 votes)

13 views43 pages

Rdbms Unit - II

Uploaded by

geetha jeevanandham

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views43 pages

Rdbms Unit - II

Uploaded by

geetha jeevanandham

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 43

Unit- II: Relational, ER Models and Normalization

Relational, ER Models and Normalization: Data Models - Relational Model – Domains -

Tuple and Relation - Super keys - Candidate keys - Primary keys and foreign keys for the
Relations - Relational Constraints - Domain Constraint - Key Constraint - Integrity Constraint
- Entity Relationship (ER) Model – Entities – Attributes – Relationships - Defining
Relationship for College Database - E-R Diagram - Conversion of E-R Diagram to Relational
Database. Functional Dependencies and Normalization for Relational Database: Informal
Design Guidelines for Relational schemas, Functional Dependencies, First Normal Form,
Second Normal form, Third Normal form, Boyce-Codd Normal Form

Data Models:
A Data Model in Database Management System (DBMS) is the concept of tools that are
developed to summarize the description of the database. Data Models provide us with a
transparent picture of data which helps us in creating an actual database. It shows us from the
design of the data to its proper implementation of data.
Types of Relational Models
1. Conceptual Data Model
2. Representational Data Model
3. Physical Data Model
It is basically classified into 3 types: -

1. Conceptual Data Model

The conceptual data model describes the database at a very high level and is useful to
understand the needs or requirements of the database. It is this model, that is used in the
requirement-gathering process i.e., before the Database Designers start making a particular
database. One such popular model is the entity/relationship model (ER model). The E/R
model specializes in entities, relationships, and even attributes that are used by database
designers. In terms of this concept, a discussion can be made even with non-computer
science(non-technical) users and stakeholders, and their requirements can be understood.
Entity-Relationship Model (ER Model): It is a high-level data model which is used to
define the data and the relationships between them. It is basically a conceptual design of any
database which is easy to design the view of data.
Components of ER Model:
1. Entity: An entity is referred to as a real-world object. It can be a name, place, object,
class, etc. These are represented by a rectangle in an ER Diagram.
2. Attributes: An attribute can be defined as the description of the entity. These are
represented by Eclipse in an ER Diagram. It can be Age, Roll Number, or Marks for a
Student.
3. Relationship: Relationships are used to define relations among different entities.
Diamonds and Rhombus are used to show Relationships.
Characteristics of a conceptual data model
 Offers Organization-wide coverage of the business concepts.
 This type of Data Models is designed and developed for a business audience.
 The conceptual model is developed independently of hardware specifications like data
storage capacity, location or software specifications like DBMS vendor and
technology. The focus is to represent data as a user will see it in the “real world.”
Conceptual data models known as Domain models create a common vocabulary for all
stakeholders by establishing basic concepts and scope.
2. Representational Data Model
This type of data model is used to represent only the logical part of the database and does not
represent the physical structure of the database. The representational data model allows us to
focus primarily on the design part of the database. A popular representational model is
a Relational model. The relational Model consists of Relational Algebra and Relational
Calculus. In the Relational Model, we basically use tables to represent our data and the
relationships between them. It is a theoretical concept whose practical implementation is
done in Physical Data Model.
The advantage of using a Representational data model is to provide a foundation to form the
base for the Physical model.
3. Physical Data Model
The physical Data Model is used to practically implement Relational Data Model.
Ultimately, all data in a database is stored physically on a secondary storage device such as
discs and tapes. This is stored in the form of files, records, and certain other data structures. It
has all the information on the format in which the files are present and the structure of the
databases, the presence of external data structures, and their relation to each other. Here, we
basically save tables in memory so they can be accessed efficiently. In order to come up with
a good physical model, we have to work on the relational model in a better way. Structured
Query Language (SQL) is used to practically implement Relational Algebra.
This Data Model describes HOW the system will be implemented using a specific DBMS
system. This model is typically created by DBA and developers. The purpose is actual
implementation of the database.
Characteristics of a physical data model:
 The physical data model describes data need for a single project or application though
it may be integrated with other physical data models based on project scope.
 Data Model contains relationships between tables that which addresses cardinality and
nullability of the relationships.
 Developed for a specific version of a DBMS, location, data storage or technology to
be used in the project.
 Columns should have exact datatypes, lengths assigned and default values.
 Primary and Foreign keys, views, indexes, access profiles, and authorizations, etc. are
defined.
Some Other Data Models
1. Hierarchical Model
The hierarchical Model is one of the oldest models in the data model which was developed by
IBM, in the 1950s. In a hierarchical model, data are viewed as a collection of tables, or we
can say segments that form a hierarchical relation. In this, the data is organized into a tree-
like structure where each record consists of one parent record and many children. Even if the
segments are connected as a chain-like structure by logical associations, then the instant
structure can be a fan structure with multiple branches. We call the illogical associations as
directional associations.
2. Network Model
The Network Model was formalized by the Database Task group in the 1960s. This model is
the generalization of the hierarchical model. This model can consist of multiple parent
segments and these segments are grouped as levels but there exists a logical association
between the segments belonging to any level. Mostly, there exists a many-to-many logical
association between any of the two segments.
3. Object-Oriented Data Model
In the Object-Oriented Data Model, data and their relationships are contained in a single
structure which is referred to as an object in this data model. In this, real-world problems are
represented as objects with different attributes. All objects have multiple relationships
between them. Basically, it is a combination of Object-Oriented programming and a
Relational Database Model.
4. Float Data Model
The float data model basically consists of a two-dimensional array of data models that do not
contain any duplicate elements in the array. This data model has one drawback it cannot store
a large amount of data that is the tables cannot be of large size.
5. Context Data Model
The Context data model is simply a data model which consists of more than one data model.
For example, the Context data model consists of ER Model, Object-Oriented Data Model,
etc. This model allows users to do more than one thing which each individual data model can
do.
6. Semi-Structured Data Model
Semi-Structured data models deal with the data in a flexible way. Some entities may have
extra attributes and some entities may have some missing attributes. Basically, you can
represent data here in a flexible way.
Advantages of Data Models
1. Data Models help us in representing data accurately.
2. It helps us in finding the missing data and also in minimizing Data Redundancy.
3. Data Model provides data security in a better way.
4. The data model should be detailed enough to be used for building the physical
database.
5. The information in the data model can be used for defining the relationship between
tables, primary and foreign keys, and stored procedures.
Disadvantages of Data Models
1. In the case of a vast database, sometimes it becomes difficult to understand the data
model.
2. You must have the proper knowledge of SQL to use physical models.
3. Even smaller change made in structure require modification in the entire application.
4. There is no set data manipulation language in DBMS.
5. To develop Data model, one should know physical data stored characteristics.

Relational Model in DBMS

E.F. Codd proposed the relational Model to model data in the form of relations or tables.
After designing the conceptual model of the Database using ER diagram, we need to convert
the conceptual model into a relational model which can be implemented using
any RDBMS language like Oracle SQL, MySQL, etc. So, we will see what the Relational
Model is.
What is the Relational Model?
The relational model represents how data is stored in Relational Databases. A relational
database consists of a collection of tables, each of which is assigned a unique name. Consider
a relation STUDENT with attributes ROLL_NO, NAME, ADDRESS, PHONE, and AGE
shown in the table.
Table Student

ROLL_NO NAME ADDRESS PHONE AGE

1 RAM DELHI 9455123451 18

2 RAMESH GURGAON 9652431543 18

3 SUJIT ROHTAK 9156253131 20

4 SURESH DELHI 18

Important Terminologies
 Attribute: Attributes are the properties that define an entity.
e.g., ROLL_NO, NAME, ADDRESS
 Relation Schema: A relation schema defines the structure of the relation and
represents the name of the relation with its attributes. e.g., STUDENT (ROLL_NO,
NAME, ADDRESS, PHONE, and AGE) is the relation schema for STUDENT. If a
schema has more than 1 relation, it is called Relational Schema.
 Tuple: Each row in the relation is known as a tuple. The above relation contains 4
tuples, one of which is shown as:

RA
1 DELHI 9455123451 18
M

 Relation Instance: The set of tuples of a relation at a particular instance of time is

called a relation instance. Table 1 shows the relation instance of STUDENT at a
particular time. It can change whenever there is an insertion, deletion, or update in the
database.
 Degree: The number of attributes in the relation is known as the degree of the
relation. The STUDENT relation defined above has degree 5.
 Cardinality: The number of tuples in a relation is known as cardinality.
The STUDENT relation defined above has cardinality 4.
 Column: The column represents the set of values for a particular attribute. The
column ROLL_NO is extracted from the relation STUDENT.
ROLL_NO

 NULL Values: The value which is not known or unavailable is called a NULL value.
It is represented by blank space. e.g., PHONE of STUDENT having ROLL_NO 4 is
NULL.
 Relation Key: These are basically the keys that are used to identify the rows uniquely
or also help in identifying tables. These are of the following types.
 Primary Key
 Candidate Key
 Super Key
 Foreign Key
 Alternate Key
 Composite Key
Constraints in Relational Model
While designing the Relational Model, we define some conditions which must hold for data
present in the database are called Constraints. These constraints are checked before
performing any operation (insertion, deletion, and updation ) in the database. If there is a
violation of any of the constraints, the operation will fail.
Domain Constraints
These are attribute-level constraints. An attribute can only take values that lie inside the
domain range. e.g., If a constraint AGE>0 is applied to STUDENT relation, inserting a
negative value of AGE will result in failure.
Key Integrity
Every relation in the database should have at least one set of attributes that defines a tuple
uniquely. Those set of attributes is called keys. e.g., ROLL_NO in STUDENT is key. No two
students can have the same roll number. So, a key has two properties:
 It should be unique for all tuples.
 It can’t have NULL values.
Referential Integrity
When one attribute of a relation can only take values from another attribute of the same
relation or any other relation, it is called referential integrity. Let us suppose we have 2
relations.
Table Student

ROLL_NO NAME ADDRESS PHONE AGE BRANCH_CODE

1 RAM DELHI 9455123451 18 CS

2 RAMESH GURGAON 9652431543 18 CS

3 SUJIT ROHTAK 9156253131 20 ECE

4 SURESH DELHI 18 IT

Table Branch

BRANCH_CODE BRANCH_NAME

CS COMPUTER SCIENCE

IT INFORMATION TECHNOLOGY

ECE ELECTRONICS AND COMMUNICATION ENGINEERING

CV CIVIL ENGINEERING

BRANCH_CODE of STUDENT can only take the values which are present in
BRANCH_CODE of BRANCH which is called referential integrity constraint. The relation
which is referencing another relation is called REFERENCING RELATION (STUDENT in
this case) and the relation to which other relations refer is called REFERENCED RELATION
(BRANCH in this case).
Anomalies in the Relational Model
An anomaly is an irregularity or something which deviates from the expected or normal state.
When designing databases, we identify three types of anomalies: Insert, Update, and Delete.
Insertion Anomaly in Referencing Relation
We can’t insert a row in REFERENCING RELATION if referencing attribute’s value is not
present in the referenced attribute value. e.g., Insertion of a student with BRANCH_CODE
‘ME’ in STUDENT relation will result in an error because ‘ME’ is not present in
BRANCH_CODE of BRANCH.
Deletion/ Updation Anomaly in Referenced Relation:
We can’t delete or update a row from REFERENCED RELATION if the value of
REFERENCED ATTRIBUTE is used in the value of REFERENCING ATTRIBUTE. e.g; if
we try to delete a tuple from BRANCH having BRANCH_CODE ‘CS’, it will result in an
error because ‘CS’ is referenced by BRANCH_CODE of STUDENT, but if we try to delete
the row from BRANCH with BRANCH_CODE CV, it will be deleted as the value is not used
by referencing relation. It can be handled by the following method:
On Delete Cascade
It would delete the tuples from REFERENCING RELATION if the value used by
REFERENCING ATTRIBUTE is deleted from REFERENCED RELATION. e.g., For, if we
delete a row from BRANCH with BRANCH_CODE ‘CS’, the rows in STUDENT relation
with BRANCH_CODE CS (ROLL_NO 1 and 2 in this case) will be deleted.
On Update Cascade
It will update the REFERENCING ATTRIBUTE in REFERENCING RELATION if the
attribute value used by REFERENCING ATTRIBUTE is updated in REFERENCED
RELATION. e.g; if we update a row from BRANCH with BRANCH_CODE ‘CS’ to ‘CSE’,
the rows in STUDENT relation with BRANCH_CODE CS (ROLL_NO 1 and 2 in this case)
will be updated with BRANCH_CODE ‘CSE’.
Super Keys
Any set of attributes that allows us to identify unique rows (tuples) in a given relationship is
known as super keys. Out of these super keys, we can always choose a proper subset among
these that can be used as a primary key. Such keys are known as Candidate keys. If there is a
combination of two or more attributes that are being used as the primary key, then we call it a
Composite key.
Codd Rules in Relational Model
Edgar F Codd proposed the relational database model where he stated rules. Now these are
known as Codd’s Rules. For any database to be the perfect one, it has to follow the rules.
Advantages of the Relational Model
 Simple model: Relational Model is simple and easy to use in comparison to other
languages.
 Flexible: Relational Model is more flexible than any other relational model present.
 Secure: Relational Model is more secure than any other relational model.
 Data Accuracy: Data is more accurate in the relational data model.
 Data Integrity: The integrity of the data is maintained in the relational model.
 Operations can be Applied Easily: It is better to perform operations in the relational
model.
Disadvantages of the Relational Model
 Relational Database Model is not very good for large databases.
 Sometimes, it becomes difficult to find the relation between tables.
 Because of the complex structure, the response time for queries is high.
Characteristics of the Relational Model
 Data is represented in rows and columns called relations.
 Data is stored in tables having relationships between them called the Relational
model.
 The relational model supports the operations like Data definition, Data manipulation,
and Transaction management.
 Each column has a distinct name, and they are representing attributes.
 Each row represents a single entity.
Keys are one of the basic requirements of a relational database model. It is widely used to
identify the tuples(rows) uniquely in the table. We also use keys to set up relations amongst
various columns and tables of a relational database.
DOMAIN - TUPLE and Relation:
A domain is the set of possible values for a given attribute, and it can be considered as a
constraint on the value of the attribute, such as value of an attribute marks of a subject must
be integer value like 56 and it should be within the range of 0 to full marks.
A tuple is a complete row of the table. It is also called a record. It is data sets representing
attributes of a single item. A data model define how data are organized in database and how
the data can be accessed.

Different Types of Keys in the Relational Model

1. Candidate Key
2. Primary Key
3. Super Key
4. Alternate Key
5. Foreign Key
6. Composite Key
1. Candidate Key: The minimal set of attributes that can uniquely identify a tuple is known
as a candidate key. For Example, STUD_NO in STUDENT relation.
 It is a minimal super key.
 It is a super key with no repeated data is called a candidate key.
 The minimal set of attributes that can uniquely identify a record.
 It must contain unique values.
 It can contain NULL values.
 Every table must have at least a single candidate key.
 A table can have multiple candidate keys but only one primary key.
 The value of the Candidate Key is unique and may be null for a tuple.
 There can be more than one candidate key in a relationship.
Example:
STUD_NO is the candidate key for relation STUDENT.
Table STUDENT

STUD_NO SNAME ADDRESS PHONE

1 Shyam Delhi 123456789

2 Rakesh Kolkata 223365796

3 Suraj Delhi 175468965

 The candidate key can be simple (having only one attribute) or composite as well.
Example:
{STUD_NO, COURSE_NO} is a composite
candidate key for relation STUDENT_COURSE.

Table STUDENT_COURSE
STUD_NO TEACHER_NO COURSE_NO

1 001 C001

2 056 C005

Note: In SQL Server a unique constraint that has a nullable column, allows the value ‘null‘in
that column only once. That’s why the STUD_PHONE attribute is a candidate here but
cannot be a ‘null’ value in the primary key attribute.
2. Primary Key: There can be more than one candidate key in relation out of which one can
be chosen as the primary key. For Example, STUD_NO, as well as STUD_PHONE, are
candidate keys for relation STUDENT but STUD_NO can be chosen as the primary key
(only one out of many candidate keys).
 It is a unique key.
 It can identify only one tuple (a record) at a time.
 It has no duplicate values; it has unique values.
 It cannot be NULL.
 Primary keys are not necessarily to be a single column; more than one column can
also be a primary key for a table.
Example:
STUDENT table -> Student (STUD_NO, SNAME,
ADDRESS, PHONE), STUD_NO is a primary key
Table STUDENT

STUD_NO SNAME ADDRESS PHONE

1 Shyam Delhi 123456789

2 Rakesh Kolkata 223365796

3 Suraj Delhi 175468965

3. Super Key: The set of attributes that can uniquely identify a tuple is known as Super Key.
For Example, STUD_NO, (STUD_NO, STUD_NAME), etc. A super key is a group of single
or multiple keys that identifies rows in a table. It supports NULL values.
 Adding zero or more attributes to the candidate key generates the super key.
 A candidate key is a super key but vice versa is not true.
 Super Key values may also be NULL.
Example:
Consider the table shown above.
STUD_NO+PHONE is a super key.

Relation between Primary Key, Candidate Key, and Super Key

4. Alternate Key: The candidate key other than the primary key is called an alternate key.
 All the keys which are not primary keys are called alternate keys.
 It is a secondary key.
 It contains two or more fields to identify two or more records.
 These values are repeated.
 Eg: - SNAME, and ADDRESS is Alternate keys
Example:
Consider the table shown above.
STUD_NO, as well as PHONE both,
are candidate keys for relation STUDENT but
PHONE will be an alternate key
(only one out of many candidate keys).
Primary Key, Candidate Key, and Alternate Key
5. Foreign Key: If an attribute can only take the values which are present as values of some
other attribute, it will be a foreign key to the attribute to which it refers. The relation which is
being referenced is called referenced relation and the corresponding attribute is called
referenced attribute the relation which refers to the referenced relation is called referencing
relation and the corresponding attribute is called referencing attribute. The referenced
attribute of the referenced relation should be the primary key to it.
 It is a key it acts as a primary key in one table, and it acts as
secondary key in another table.
 It combines two or more relations (tables) at a time.
 They act as a cross-reference between the tables.
 For example, DNO is a primary key in the DEPT table and a non-key in EMP.
Example:
Refer Table STUDENT shown above.
STUD_NO in STUDENT_COURSE is a
foreign key to STUD_NO in STUDENT relation.
Table STUDENT_COURSE

STUD_NO TEACHER_NO COURSE_NO

1 005 C001

2 056 C005

It may be worth noting that, unlike the Primary Key of any given relation, Foreign Key can
be NULL as well as may contain duplicate tuples i.e., it need not follow uniqueness
constraint. For Example, STUD_NO in the STUDENT_COURSE relation is not unique. It
has been repeated for the first and third tuples. However, the STUD_NO in STUDENT
relation is a primary key and it needs to be always unique, and it cannot be null.

Relation between Primary Key and Foreign Key

6. Composite Key: Sometimes, a table might not have a single column/attribute that
uniquely identifies all the records of a table. To uniquely identify rows of a table, a
combination of two or more columns/attributes can be used. It still can give duplicate values
in rare cases. So, we need to find the optimal set of attributes that can uniquely identify rows
in a table.
 It acts as a primary key if there is no primary key in a table.
 Two or more attributes are used together to make a composite key.
 Different combinations of attributes may give different accuracy in terms of
identifying the rows uniquely.
Example:
FULLNAME + DOB can be combined
together to access the details of a student.

Different Types of Keys

Relational Constraints
These are the restrictions or sets of rules imposed on the database contents. It validates the
quality of the database. It validates the various operations like data insertion, updating, and
other processes which must be performed without affecting the integrity of the data. It
protects us against threats/damages to the database. Mainly Constraints on the relational
database are of 4 types.
1. Domain constraints
2. Key constraints or Uniqueness Constraints
3. Entity Integrity constraints
4. Referential integrity constraints
Types of Relational Constraints
Let’s discuss each of the above constraints in detail.
1. Domain Constraints
1. Every domain must contain atomic values (smallest indivisible units) which means
composite and multi-valued attributes are not allowed.
2. We perform a datatype check here, which means when we assign a data type to a
column, we limit the values that it can contain. Eg. If we assign the datatype of
attribute age as int, we can’t give it values other than int datatype.
Example:

EID Name Phone

123456789
01 Bikash Dutta
234456678

Explanation: In the above relation, Name is a composite attribute and Phone is a multi-
values attribute, so it is violating domain constraint.
2. Key Constraints or Uniqueness Constraints
1. These are called uniqueness constraints since it ensures that every tuple in the relation
should be unique.
2. A relation can have multiple keys or candidate keys(minimal super key), out of which
we choose one of the keys as the primary key, we don’t have any restriction on
choosing the primary key out of candidate keys, but it is suggested to go with
the candidate key with less number of attributes.
3. Null values are not allowed in the primary key, hence Not Null constraint is also part
of the key constraint.
Example:
EID Name Phone

01 Bikash 6000000009

02 Paul 9000090009

01 Tuhin 9234567892

Explanation: In the above table, EID is the primary key, and the first and the last tuple have
the same value in EID ie 01, so it is violating the key constraint.
3. Entity Integrity Constraints:
1. Entity Integrity constraints say that no primary key can take a NULL value, since
using the primary key we identify each tuple uniquely in a relation.
Example:

EID Name Phone

01 Bikash 9000900099

02 Paul 600000009

NULL Sony 9234567892

Explanation: In the above relation, EID is made the primary key, and the primary key can’t
take NULL values but in the third tuple, the primary key is null, so it is violating Entity
Integrity constraints.
4. Referential Integrity Constraints
1. The Referential integrity constraint is specified between two relations or tables and
used to maintain the consistency among the tuples in two relations.
2. This constraint is enforced through a foreign key, when an attribute in the foreign
key of relation R1 has the same domain(s) as the primary key of relation R2, then the
foreign key of R1 is said to reference or refer to the primary key of relation R2.
3. The values of the foreign key in a tuple of relation R1 can either take the values of the
primary key for some tuple in relation R2, or can take NULL values, but can’t be
empty.
Example:

EID Name DNO

01 Divine 12

02 Dino 22

04 Vivian 14

DNO Place

12 Jaipur

13 Mumbai

14 Delhi

Explanation: In the above tables, the DNO of Table 1 is the foreign key, and DNO in Table 2
is the primary key. DNO = 22 in the foreign key of Table 1 is not allowed because DNO =
22 is not defined in the primary key of table 2. Therefore, Referential integrity constraints are
violated here.
Advantages of Relational Database Model
 It is simpler than the hierarchical model and network model.
 It is easy and simple to understand.
 Its structure can be changed anytime upon requirement.
 Data Integrity: The relational database model enforces data integrity through various
constraints such as primary keys, foreign keys, and unique constraints. This ensures
that the data in the database is accurate, consistent, and valid.
 Flexibility: The relational database model is highly flexible and can handle a wide
range of data types and structures. It also allows for easy modification and updating of
the data without affecting other parts of the database.
 Scalability: The relational database model can scale to handle large amounts of data
by adding more tables, indexes, or partitions to the database. This allows for better
performance and faster query response times.
 Security: The relational database model provides robust security features to protect
the data in the database. These include user authentication, authorization, and
encryption of sensitive data.
 Data consistency: The relational database model ensures that the data in the database
is consistent across all tables. This means that if a change is made to one table, the
corresponding changes will be made to all related tables.
 Query Optimization: The relational database model provides a query optimizer that
can analyze and optimize SQL queries to improve their performance. This allows for
faster query response times and better scalability.
Disadvantages of the Relational Model
 Few database relations have certain limits which can’t be expanded further.
 It can be complex, and it becomes hard to use.
 Complexity: The relational model can be complex and difficult to understand,
particularly for users who are not familiar with SQL and database design principles.
This can make it challenging to set up and maintain a relational database.
 Performance: The relational model can suffer from performance issues when dealing
with large data sets or complex queries. Joins between tables can be slow, and
indexing strategies can be difficult to optimize.
 Scalability: While the relational model is generally scalable, it can become difficult
to manage as the database grows. Adding new tables or indexes can be time-
consuming, and managing relationships between tables can become complex.
 Cost: Relational databases can be expensive to license and maintain, particularly for
large-scale deployments. Additionally, relational databases often require dedicated
hardware and specialized software to run, which can add to the cost.
 Limited flexibility: The relational model is designed to work with tables that have
predefined structures and relationships. This can make it difficult to work with data
that does not fit neatly into a table-based format, such as unstructured or semi-
structured data.
 Data redundancy: In some cases, the relational model can lead to data redundancy,
where the same data is stored in multiple tables. This can lead to inefficiencies and
can make it difficult to ensure data consistency across the database.
Entity Relationship (ER) Model:
The Entity Relational Model is a model for identifying entities to be represented in the
database and representation of how those entities are related. The ER data model specifies
enterprise schema that represents the overall logical structure of a database graphically.
The Entity Relationship Diagram explains the relationship among the entities present in the
database. ER models are used to model real-world objects like a person, a car, or a company
and the relation between these real-world objects. In short, the ER Diagram is the structural
format of the database.
Why Use ER Diagrams In DBMS?
 ER diagrams are used to represent the E-R model in a database, which makes them
easy to be converted into relations (tables).
 ER diagrams provide the purpose of real-world modeling of objects which makes
them intently useful.
 ER diagrams require no technical knowledge and no hardware support.
 These diagrams are very easy to understand and easy to create even for a naive user.
 It gives a standard solution for visualizing the data logically.
Symbols Used in ER Model
ER Model is used to model the logical view of the system from a data perspective which
consists of these symbols:
 Rectangles: Rectangles represent Entities in the ER Model.
 Ellipses: Ellipses represent Attributes in the ER Model.
 Diamond: Diamonds represent Relationships among Entities.
 Lines: Lines represent attributes to entities and entities sets with other relationship
types.
 Double Ellipse: Double Ellipses represent Multi-Valued Attributes.
 Double Rectangle: Double Rectangle represents a Weak Entity.
Symbols used in ER Diagram
Components of ER Diagram
ER Model consists of Entities, Attributes, and Relationships among Entities in a Database
System.

Components of ER Diagram
Entity
An Entity may be an object with a physical existence – a particular person, car, house, or
employee – or it may be an object with a conceptual existence – a company, a job, or a
university course.
Entity Set: An Entity is an object of Entity Type, and a set of all entities is called an entity
set. For Example, E1 is an entity having Entity Type Student and the set of all students is
called Entity Set. In ER diagram, Entity Type is represented as:

Entity Set
1. Strong Entity
A Strong Entity is a type of entity that has a key Attribute. Strong Entity does not depend on
other Entity in the Schema. It has a primary key, that helps in identifying it uniquely, and it is
represented by a rectangle. These are called Strong Entity Types.
2. Weak Entity
An Entity type has a key attribute that uniquely identifies each entity in the entity set. But
some entity type exists for which key attributes can’t be defined. These are called Weak
Entity types.
For Example, A company may store the information of dependents (Parents, Children,
Spouse) of an Employee. But the dependents don’t have existed without the employee. So
Dependent will be a Weak Entity Type and Employee will be Identifying Entity type for
Dependent, which means it is Strong Entity Type.
A weak entity type is represented by a Double Rectangle. The participation of weak entity
types is always total. The relationship between the weak entity type and its identifying strong
entity type is called identifying relationship and it is represented by a double diamond.

Strong Entity and Weak Entity

Attributes
Attributes are the properties that define the entity type. For example, Roll_No, Name, DOB,
Age, Address, and Mobile_No are the attributes that define entity type Student. In ER
diagram, the attribute is represented by an oval.

Attribute
1. Key Attribute
The attribute which uniquely identifies each entity in the entity set is called the key
attribute. For example, Roll_No will be unique for each student. In ER diagram, the key
attribute is represented by an oval with underlying lines.
Key Attribute
2. Composite Attribute
An attribute composed of many other attributes is called a composite attribute. For
example, the Address attribute of the student Entity type consists of Street, City, State, and
Country. In ER diagram, the composite attribute is represented by an oval comprising of
ovals.

Composite Attribute
3. Multivalued Attribute
An attribute consisting of more than one value for a given entity. For example, Phone_No
(can be more than one for a given student). In ER diagram, a multivalued attribute is
represented by a double oval.

Multivalued Attribute
4. Derived Attribute
An attribute that can be derived from other attributes of the entity type is known as a derived
attribute. e.g., Age (can be derived from DOB). In ER diagram, the derived attribute is
represented by a dashed oval.

Derived Attribute
The Complete Entity Type Student with its Attributes can be represented as:
Entity and Attributes
Relationship Type
A Relationship Type represents the association between entity types. For example, ‘Enrolled
in’ is a relationship type that exists between entity type Student and Course. In ER diagram,
the relationship type is represented by a diamond and connecting the entities with lines.

1. Unary Relationship: When there is only ONE entity set participating in a relation, the
relationship is called a unary relationship. For example, one person is married to only one
person.

Unary Relationship
2. Binary Relationship: When there are TWO entities set participating in a relationship, the
relationship is called a binary relationship. For example, a Student is enrolled in a Course.

Binary Relationship
3. n-ary Relationship: When there are n entities set participating in a relation, the
relationship is called an n-ary relationship.
Cardinality
The number of times an entity of an entity set participates in a relationship set is known
as cardinality. Cardinality can be of different types:
1. One-to-One: When each entity in each entity set can take part only once in the
relationship, the cardinality is one-to-one. Let us assume that a male can marry one female
and a female can marry one male. So, the relationship will be one-to-one.
the total number of tables that can be used in this is 2.

one to one cardinality

2. One-to-Many: In one-to-many mapping as well where each entity can be related to more
than one relationship and the total number of tables that can be used in this is 2. Let us
assume that one surgeon deparment can accomodate many doctors. So the Cardinality will be
1 to M. It means one deparment has many Doctors.
total number of tables that can used is 3.

one to many cardinality

3. Many-to-One: When entities in one entity set can take part only once in the relationship
set and entities in other entity sets can take part more than once in the relationship set,
cardinality is many to one. Let us assume that a student can take only one course, but one
course can be taken by many students. So the cardinality will be n to 1. It means that for one
course there can be n students but for one student, there will be only one course.
The total number of tables that can be used in this is 3.

many to one cardinality

4. Many-to-Many: When entities in all entity sets can take part more than once in the
relationship cardinality is many to many. Let us assume that a student can take more than one
course and one course can be taken by many students. So the relationship will be many to
many.
the total number of tables that can be used in this is 3.

many to many cardinality

How to Draw ER Diagram?
 The very first step is Identifying all the Entities, and place them in a Rectangle, and
labeling them accordingly.
 The next step is to identify the relationship between them and pace them accordingly
using the Diamond, and make sure that, Relationships are not connected to each other.
 Attach attributes to the entities properly.
 Remove redundant entities and relationships.
 Add proper colors to highlight the data present in the database.

Example 1: How to Convert ER Diagram to Relational Database

The ER Model is intended as a description of real-world entities. Although it is constructed in
such a way as to allow easy translation to the relational schema model, this is not an entirely
trivial process. The ER diagram represents the conceptual level of database design meanwhile
the relational schema is the logical level for the database design. We will be following the
simple rules:
1. Entities and Simple Attributes:
An entity type within ER diagram is turned into a table. You may preferably keep the same
name for the entity or give it a sensible name but avoid DBMS reserved words as well as
avoid the use of special characters.
Each attribute turns into a column (attribute) in the table. The key attribute of the entity is the
primary key of the table which is usually underlined. It can be composite if required but can
never be null.
[info]It is highly recommended that every table should start with its primary key attribute
conventionally named as TablenameID.[/info]
Taking the following simple ER diagram:

The initial relational schema is expressed in the following format writing the table names
with the attributes list inside a parentheses as shown below for
Persons (personid, name, lastname, email)
Persons and Phones are Tables. name, lastname, are Table Columns (Attributes).
[info]personid is the primary key for the table: Person[/info]
2. Multi-Valued Attributes
A multi-valued attribute is usually represented with a double-line oval.

If you have a multi-valued attribute, take the attribute, and turn it into a new entity or table of
its own. Then make a 1: N relationship between the new entity and the existing one. In simple
words. 1. Create a table for the attribute. 2. Add the primary (id) column of the parent entity
as a foreign key within the new table as shown below:
Persons (personid , name, lastname, email )
Phones ( phoneid , personid, phone )
[info]personid within the table Phones is a foreign key referring to the personid of
Persons[/info]
3. 1:1 Relationships
To keep it simple and even for better performances at data retrieval, I would personally
recommend using attributes to represent such relationship. For instance, let us consider the
case where the Person has or optionally has one wife. You can place the primary key of the
wife within the table of the Persons which we call in this case foreign key as shown below.
Persons (personid , name, lastname, email , wifeid )
Wife ( wifeid , name )
Or vice versa to put the personid as a foreign key within the Wife table as shown below:
Persons (personid , name, lastname, email )
Wife ( wifeid , name , personid)
[info]For cases when the Person is not married i.e., has no wifeID, the attribute can set to
NULL[/info]
4. 1: N Relationships
This is the tricky part! For simplicity, use attributes in the same way as 1:1 relationship but
we have only one choice as opposed to two choices. For instance, the Person can have
a House from zero to many, but a House can have only one Person. To represent such
relationship the personid as the Parent node must be placed within the Child table as a
foreign key but not the other way around as shown next:

It should convert to:

Persons (personid, name, lastname, email)
House ( houseid , num , address, personid)
5. N: N Relationships
We normally use tables to express such type of relationship. This is the same for N − ary
relationship of ER diagrams. For instance, The Person can live or work in many countries.
Also, a country can have many people. To express this relationship within a relational schema
we use a separate table as shown below:

It should convert into:

Persons (personid, name, lastname, email)
Countries (countryid , name, code)
HasRelat ( hasrelatid , personid , countryid)
Relationship with attributes:
It is recommended to use table to represent them to keep the design tidy and clean regardless
of the cardinality of the relationship.
Case Study
For the sake of simplicity, we will be producing the relational schema for the following ER
diagram:

The relational schema for the ER Diagram is given below as:

Company (CompanyID , name , address )
Staff( StaffID , dob , address , WifeID)
Child( ChildID , name , StaffID )
Wife ( WifeID , name )
Phone(PhoneID , phoneNumber , StaffID)
Task ( TaskID , description)
Work(WorkID , CompanyID , StaffID , since )
Perform(PerformID , StaffID , TaskID )

Example 2: Reduction of ER diagram to Table

The database can be represented using the notations, and these notations can be reduced to a
collection of tables.
In the database, every entity set, or relationship set can be represented in tabular form.
The ER diagram is given below:

There are some points for converting the ER diagram to the table:
Backward Skip 10sPlay VideoForward Skip 10s
o Entity type becomes a table.
In the given ER diagram, LECTURE, STUDENT, SUBJECT and COURSE forms individual
tables.
o All single-valued attribute becomes a column for the table.
In the STUDENT entity, STUDENT_NAME and STUDENT_ID form the column of
STUDENT table. Similarly, COURSE_NAME and COURSE_ID form the column of
COURSE table and so on.
o A key attribute of the entity type represented by the primary key.
In the given ER diagram, COURSE_ID, STUDENT_ID, SUBJECT_ID, and LECTURE_ID
are the key attribute of the entity.
o The multivalued attribute is represented by a separate table.
In the student table, a hobby is a multivalued attribute. So, it is not possible to represent
multiple values in a single column of STUDENT table. Hence, we create a table
STUD_HOBBY with column name STUDENT_ID and HOBBY. Using both the column, we
create a composite key.
o Composite attribute represented by components.
In the given ER diagram, student address is a composite attribute. It contains CITY, PIN,
DOOR#, STREET, and STATE. In the STUDENT table, these attributes can merge as an
individual column.
o Derived attributes are not considered in the table.
In the STUDENT table, Age is the derived attribute. It can be calculated at any point of time
by calculating the difference between current date and Date of Birth.
Using these rules, you can convert the ER diagram to tables and columns and assign the
mapping between the tables. Table structure for the given ER diagram is as below:

Figure: Table structure

Functional Dependency:
The functional dependency is a relationship that exists between two attributes. It typically
exists between the primary key and non-key attribute within a table.
1. X → Y
The left side of FD is known as a determinant, the right side of the production is known as a
dependent.
For example:
Assume we have an employee table with attributes: Emp_Id, Emp_Name, Emp_Address.
Here Emp_Id attribute can uniquely identify the Emp_Name attribute of employee table
because if we know the Emp_Id, we can tell that employee name associated with it.
Functional dependency can be written as:
1. Emp_Id → Emp_Name
We can say that Emp_Name is functionally dependent on Emp_Id.
Types of Functional Dependencies in DBMS
1. Trivial functional dependency
2. Non-Trivial functional dependency
3. Multivalued functional dependency
4. Transitive functional dependency
5. Fully functional dependency
6. Partial functional dependency
1. Trivial Functional Dependency
In Trivial Functional Dependency, a dependent is always a subset of the
determinant. i.e. If X → Y and Y is the subset of X, then it is called trivial functional
dependency
Example:

roll_no name age

42 abc 17

43 pqr 18

44 xyz 18
roll_no name age

Here, {roll_no, name} → name is a trivial functional dependency, since the

dependent name is a subset of determinant set {roll_no, name}. Similarly, roll_no →
roll_no is also an example of trivial functional dependency.
2. Non-trivial Functional Dependency
In Non-trivial functional dependency, the dependent is strictly not a subset of the
determinant. i.e., If X → Y and Y is not a subset of X, then it is called Non-trivial functional
dependency.
Example:

roll_no name age

42 abc 17

43 pqr 18

44 xyz 18

Here, roll_no → name is a non-trivial functional dependency, since the

dependent name is not a subset of determinant roll_no. Similarly, {roll_no, name} → age is
also a non-trivial functional dependency, since age is not a subset of {roll_no, name}
3. Multivalued Functional Dependency
In Multivalued functional dependency, entities of the dependent set are not
dependent on each other. i.e., If a → {b, c} and there exists no functional
dependency between b and c, then it is called a multivalued functional dependency.
For example,

roll_no name age

42 abc 17

43 pqr 18
roll_no name age

44 xyz 18

45 abc 19

Here, roll_no → {name, age} is a multivalued functional dependency, since the

dependents name & age are not dependent on each other(i.e. name → age or age → name
doesn’t exist !)
4. Transitive Functional Dependency
In transitive functional dependency, dependent is indirectly dependent on determinant.
i.e. If a → b & b → c, then according to axiom of transitivity, a → c. This is a transitive
functional dependency.
For example,

enrol_n
o name dept building_no

42 abc CO 4

43 pqr EC 2

44 xyz IT 1

45 abc EC 2

Here, enrol_no → dept and dept → building_no. Hence, according to the axiom of
transitivity, enrol_no → building_no is a valid functional dependency. This is an indirect
functional dependency, hence called Transitive functional dependency.
5. Fully Functional Dependency
In full functional dependency an attribute or a set of attributes uniquely determines
another attribute or set of attributes. If a relation R has attributes X, Y, Z with the
dependencies X->Y and X->Z which states that those dependencies are fully functional.
6. Partial Functional Dependency
In partial functional dependency a non-key attribute depends on a part of the
composite key, rather than the whole key. If a relation R has attributes X, Y, Z where X and Y
are the composite key and Z is non key attribute. Then X->Z is a partial functional
dependency in RBDMS.

Normalization:
-is the process of minimizing redundancy from a relation or set of relations. Redundancy in
relation may cause insertion, deletion, and update anomalies. So, it helps to minimize the
redundancy in relations. Normal forms are used to eliminate or reduce redundancy in
database tables.
What is Database Normalization?
In database management systems (DBMS), normal forms are a series of guidelines that help
to ensure that the design of a database is efficient, organized, and free from data anomalies.
There are several levels of normalization, each with its own set of guidelines, known as
normal forms.
Important Points Regarding Normal Forms in DBMS
 First Normal Form (1NF): This is the most basic level of normalization. In 1NF,
each table cell should contain only a single value, and each column should have a
unique name. The first normal form helps to eliminate duplicate data and simplify
queries.
 Second Normal Form (2NF): 2NF eliminates redundant data by requiring that each
non-key attribute be dependent on the primary key. This means that each column
should be directly related to the primary key, and not to other columns.
 Third Normal Form (3NF): 3NF builds on 2NF by requiring that all non-key
attributes are independent of each other. This means that each column should be
directly related to the primary key, and not to any other columns in the same table.
 Boyce-Codd Normal Form (BCNF): BCNF is a stricter form of 3NF that ensures
that each determinant in a table is a candidate key. In other words, BCNF ensures that
each non-key attribute is dependent only on the candidate key.
Normal forms help to reduce data redundancy, increase data consistency, and improve
database performance. However, higher levels of normalization can lead to more complex
database designs and queries. It is important to strike a balance between normalization and
practicality when designing a database.
Advantages of Normal Form:
 Reduced data redundancy: Normalization helps to eliminate duplicate data in
tables, reducing the amount of storage space needed and improving database
efficiency.
 Improved data consistency: Normalization ensures that data is stored in a consistent
and organized manner, reducing the risk of data inconsistencies and errors.
 Simplified database design: Normalization provides guidelines for organizing tables
and data relationships, making it easier to design and maintain a database.
 Improved query performance: Normalized tables are typically easier to search and
retrieve data from, resulting in faster query performance.
 Easier database maintenance: Normalization reduces the complexity of a database
by breaking it down into smaller, more manageable tables, making it easier to add,
modify, and delete data.
Overall, using normal forms in DBMS helps to improve data quality, increase database
efficiency, and simplify database design and maintenance.
First Normal Form:
If a relation contains composite or multi-valued attribute, it violates first normal form, or a
relation is in first normal form if it does not contain any composite or multi-valued attribute.
A relation is in first normal form if every attribute in that relation is singled valued attribute.
 Example 1 – Relation STUDENT in table 1 is not in 1NF because of multi-valued
attribute STUD_PHONE. Its decomposition into 1NF has been shown in table
2.

 Example 2 –
ID Name Courses
------------------
1 A c1, c2
2 E c3
3 M C2, c3
 In the above table Course is a multi-valued attribute so it is not in 1NF. Below Table is
in 1NF as there is no multi-valued attribute
ID Name Course
------------------
1 A c1
1 A c2
2 E c3
3 M c2
3 M c3
Second Normal Form:
To be in second normal form, a relation must be in first normal form and relation must not
contain any partial dependency. A relation is in 2NF if it has No Partial Dependency, i.e., no
non-prime attribute (attributes which are not part of any candidate key) is dependent on any
proper subset of any candidate key of the table. Partial Dependency – If the proper subset of
candidate key determines non-prime attribute, it is called partial dependency.
 Example 1 – Consider table-3 as following below.
STUD_NO COURSE_NO COURSE_FEE
1 C1 1000
2 C2 1500
1 C4 2000
4 C3 1000
4 C1 1000
2 C5 2000
 {Note that, there are many courses having the same course fee} Here, COURSE_FEE
cannot alone decide the value of COURSE_NO or STUD_NO; COURSE_FEE
together with STUD_NO cannot decide the value of COURSE_NO; COURSE_FEE
together with COURSE_NO cannot decide the value of STUD_NO; Hence,
COURSE_FEE would be a non-prime attribute, as it does not belong to the one only
candidate key {STUD_NO, COURSE_NO} ; But, COURSE_NO -> COURSE_FEE,
i.e., COURSE_FEE is dependent on COURSE_NO, which is a proper subset of the
candidate key. Non-prime attribute COURSE_FEE is dependent on a proper subset of
the candidate key, which is a partial dependency and so this relation is not in 2NF. To
convert the above relation to 2NF, we need to split the table into two tables such as:
Table 1: STUD_NO, COURSE_NO Table 2: COURSE_NO, COURSE_FEE
Table 1 Table 2
STUD_NO COURSE_NO COURSE_NO COURSE_FEE
1 C1 C1 1000
2 C2 C2 1500
1 C4 C3 1000
4 C3 C4 2000
4 C1 C5 2000
 NOTE: 2NF tries to reduce the redundant data getting stored in memory. For
instance, if there are 100 students taking C1 course, we don’t need to store its Fee as
1000 for all the 100 records, instead, once we can store it in the second table as the
course fee for C1 is 1000.
 Example 2 – Consider following functional dependencies in relation R (A, B , C, D )
AB -> C [A and B together determine C]
BC -> D [B and C together determine D]
In the above relation, AB is the only candidate key and there is no partial dependency, i.e.,
any proper subset of AB doesn’t determine any non-prime attribute.
X is a super key.
Y is a prime attribute (each element of Y is part of some candidate key).
Example 1:
In relation STUDENT given in Table 4, FD set: {STUD_NO -> STUD_NAME, STUD_NO -
> STUD_STATE, STUD_STATE -> STUD_COUNTRY, STUD_NO -> STUD_AGE}
Candidate Key: {STUD_NO}
For this relation in table 4, STUD_NO -> STUD_STATE and STUD_STATE ->
STUD_COUNTRY are true.
So, STUD_COUNTRY is transitively dependent on STUD_NO. It violates the third normal
form.
To convert it in third normal form, we will decompose the relation STUDENT (STUD_NO,
STUD_NAME, STUD_PHONE, STUD_STATE, STUD_COUNTRY_STUD_AGE) as:
STUDENT (STUD_NO, STUD_NAME, STUD_PHONE, STUD_STATE, STUD_AGE)
STATE_COUNTRY (STATE, COUNTRY)
Consider relation R (A, B, C, D, E) A -> BC, CD -> E, B -> D, E -> A All possible candidate
keys in above relation are {A, E, CD, BC} All attributes are on right sides of all functional
dependencies are prime.
Example 2: Find the highest normal form of a relation R (A, B, C, D, E) with FD set as {BC-
>D, AC->BE, B->E}
Step 1: As we can see, (AC)+ = {A, C, B, E, D} but none of its subset can determine all
attribute of relation, So AC will be candidate key. A or C can’t be derived from any other
attribute of the relation, so there will be only 1 candidate key {AC}.
Step 2: Prime attributes are those attributes that are part of candidate key {A, C} in this
example and others will be non-prime {B, D, E} in this example.
Step 3: The relation R is in 1st normal form as a relational DBMS does not allow multi-
valued or composite attribute. The relation is in 2nd normal form because BC->D is in 2nd
normal form (BC is not a proper subset of candidate key AC) and AC->BE is in 2nd normal
form (AC is candidate key) and B->E is in 2nd normal form (B is not a proper subset of
candidate key AC).
The relation is not in 3rd normal form because in BC->D (neither BC is a super key nor D is
a prime attribute) and in B->E (neither B is a super key nor E is a prime attribute) but to
satisfy 3rd normal for, either LHS of an FD should be super key or RHS should be prime
attribute. So, the highest normal form of relation will be 2nd Normal form.
For example, consider relation R (A, B, C) A -> BC, B -> A and B both are super keys so
above relation is in BCNF.
Third Normal Form
A relation is said to be in third normal form, if we did not have any transitive dependency for
non-prime attributes. The basic condition with the Third Normal Form is that the relation
must be in Second Normal Form.
Below mentioned is the basic condition that must be hold in the non-trivial functional
dependency X -> Y:
 X is a Super Key.
 Y is a Prime Attribute (this means that element of Y is some part of Candidate Key).
In other words,
Note – If A->B and B->C are two FDs then A->C is called transitive dependency.
The normalization of 2NF relations to 3NF involves the removal of transitive
dependencies. If a transitive dependency exists, we remove the transitively dependent
attribute(s) from the relation by placing the attribute(s) in a new relation along with a
copy of the determinant.
Consider the examples given below.
Example-1:
In relation STUDENT given in Table 4,

FD set:
{STUD_NO -> STUD_NAME, STUD_NO -> STUD_STATE, STUD_STATE ->
STUD_COUNTRY, STUD_NO -> STUD_AGE}
Candidate Key:
{STUD_NO}
For this relation in table 4, STUD_NO -> STUD_STATE and STUD_STATE ->
STUD_COUNTRY are true. So STUD_COUNTRY is transitively dependent on
STUD_NO. It violates the third normal form. To convert it in third normal form, we will
decompose the relation STUDENT (STUD_NO, STUD_NAME, STUD_PHONE,
STUD_STATE, STUD_COUNTRY_STUD_AGE) as:
STUDENT (STUD_NO, STUD_NAME, STUD_PHONE, STUD_STATE, STUD_AGE)
STATE_COUNTRY (STATE, COUNTRY)
Example-2:
Consider relation R (A, B, C, D, E)
A -> BC,
CD -> E,
B -> D,
E -> A
All possible candidate keys in above relation are {A, E, CD, BC} All attribute are on right
sides of all functional dependencies are prime.

Boyce-Codd Normal Form (BCNF)

Boyce–Codd Normal Form (BCNF) is based on functional dependencies that take into
account all candidate keys in a relation; however, BCNF also has additional constraints
compared with the general definition of 3NF.
Rules for BCNF
Rule 1: The table should be in the 3rd Normal Form.
Rule 2: X should be a super key for every functional dependency (FD) X−>Y in a given
relation.
Note: To test whether a relation is in BCNF, we identify all the determinants and make sure
that they are candidate keys.

BCNF in DBMS
You came across a similar hierarchy known as the Chomsky Normal Form in the Theory of
Computation. Now, carefully study the hierarchy above. It can be inferred that every relation
in BCNF is also in 3NF. To put it another way, a relation in 3NF need not be in BCNF.
Ponder over this statement for a while.
To determine the highest normal form of a given relation R with functional dependencies, the
first step is to check whether the BCNF condition holds. If R is found to be in BCNF, it can
be safely deduced that the relation is also in 3NF, 2NF, and 1NF as the hierarchy shows. The
1NF has the least restrictive constraint – it only requires a relation R to have atomic values in
each tuple. The 2NF has a slightly more restrictive constraint.
The 3NF has a more restrictive constraint than the first two normal forms but is less
restrictive than the BCNF. In this manner, the restriction increases as we traverse down the
hierarchy.
Examples
Here, we are going to discuss some basic examples which let you understand the properties of
BCNF. We will discuss multiple examples here.
Example 1
Let us consider the student database, in which data of the student are mentioned.

Stu_ID Stu_Branch Stu_Course Branch_Number Stu_Course_No

Computer
101 Science & DBMS B_001 201
Engineering

Computer
Computer
101 Science & B_001 202
Networks
Engineering

Electronics &
VLSI
102 Communication B_003 401
Technology
Engineering

Electronics &
Mobile
102 Communication B_003 402
Communication
Engineering

Functional Dependency of the above is as mentioned:

Stu_ID −> Stu_Branch
Stu_Course −> {Branch_Number, Stu_Course_No}
Candidate Keys of the above table are: {Stu_ID, Stu_Course}
Why this Table is Not in BCNF?
The table present above is not in BCNF, because as we can see that neither Stu_ID nor
Stu_Course is a Super Key. As the rules mentioned above clearly tell that for a table to be in
BCNF, it must follow the property that for functional dependency X−>Y, X must be in Super
Key and here this property fails, that’s why this table is not in BCNF.
How to Satisfy BCNF?
For satisfying this table in BCNF, we must decompose it into further tables. Here is the full
procedure through which we transform this table into BCNF. Let us first divide this main
table into two tables Stu_Branch and Stu_Course Table.
Stu_Branch Table

Stu_I
D Stu_Branch

101 Computer Science & Engineering

102 Electronics & Communication Engineering

Candidate Key for this table: Stu_ID.

Stu_Course Table

Stu_Course Branch_Number Stu_Course_No

DBMS B_001 201

Computer Networks B_001 202

VLSI Technology B_003 401

Mobile Communication B_003 402

Candidate Key for this table: Stu_Course.

Stu_ID to Stu_Course_No Table
Stu_I
D Stu_Course_No

101 201

101 202

102 401

102 402

Candidate Key for this table: {Stu_ID, Stu_Course_No}.

After decomposing into further tables, now it is in BCNF, as it is passing the condition of
Super Key, that in functional dependency X−>Y, X is a Super Key.
Example 2
Find the highest normal form of a relation R (A, B, C, D, E) with FD set as:
{BC->D, AC->BE, B->E}
Explanation:
 Step-1: As we can see, (AC)+ = {A, C, B, E, D} but none of its subsets can determine
all attributes of the relation, So AC will be the candidate key. A or C can’t be derived
from any other attribute of the relation, so there will be only 1 candidate key {AC}.
 Step-2: Prime attributes are those attributes that are part of candidate key {A, C} in
this example and others will be non-prime {B, D, E} in this example.
 Step-3: The relation R is in 1st normal form as a relational DBMS does not allow
multi-valued or composite attributes.
The relation is in 2nd normal form because BC->D is in 2nd normal form (BC is not a proper
subset of candidate key AC) and AC->BE is in 2nd normal form (AC is candidate key) and
B->E is in 2nd normal form (B is not a proper subset of candidate key AC).
The relation is not in 3rd normal form because in BC->D (neither BC is a super key nor D is
a prime attribute) and in B->E (neither B is a super key nor E is a prime attribute) but to
satisfy 3rd normal for, either LHS of an FD should be super key or RHS should be a prime
attribute. So, the highest normal form of relation will be the 2nd Normal form.
Note: A prime attribute cannot be transitively dependent on a key in BCNF relation.
Consider these functional dependencies of some relation R
AB ->C
C ->B
AB ->B
Suppose it is known that the only candidate key of R is AB. A careful observation is required
to conclude that the above dependency is a Transitive Dependency as the prime attribute B
transitively depends on the key AB through C. Now, the first and the third FD are in BCNF as
they both contain the candidate key (or simply KEY) on their left sides. The second
dependency, however, is not in BCNF but is definitely in 3NF due to the presence of the
prime attribute on the right side. So, the highest normal form of R is 3NF as all three FDs
satisfy the necessary conditions to be in 3NF.
Example 3
For example, consider relation R (A, B, C)
A -> BC,
B -> A
A and B both are super keys, so the above relation is in BCNF.
Note: BCNF decomposition may always not be possible with dependency preserving,
however, it always satisfies the lossless join condition. For example, relation R (V, W, X, Y,
Z), with functional dependencies:
V, W -> X
Y, Z -> X
W -> Y
It would not satisfy dependency preserving BCNF decomposition.
Note: Redundancies are sometimes still present in a BCNF relation as it is not always
possible to eliminate them completely.

Data Models in DBMS
No ratings yet
Data Models in DBMS
67 pages
Data Models in DBMS
No ratings yet
Data Models in DBMS
5 pages
DBMS Data Models Overview
No ratings yet
DBMS Data Models Overview
14 pages
Data Models
No ratings yet
Data Models
5 pages
Unit 2
No ratings yet
Unit 2
25 pages
2020 DBMS
No ratings yet
2020 DBMS
46 pages
Newnnneee
No ratings yet
Newnnneee
19 pages
Data Models in DBMS
No ratings yet
Data Models in DBMS
5 pages
Unit - 5: File System Approach
No ratings yet
Unit - 5: File System Approach
14 pages
Data Model in DBMS
No ratings yet
Data Model in DBMS
5 pages
Chapter 1 Data Model in DBMS
No ratings yet
Chapter 1 Data Model in DBMS
5 pages
Data Models Overview & History
No ratings yet
Data Models Overview & History
13 pages
Fundamentals of Databases PDF
No ratings yet
Fundamentals of Databases PDF
160 pages
Unit 2 Data Models Lecture
No ratings yet
Unit 2 Data Models Lecture
39 pages
Introduction To Data Model L-1
No ratings yet
Introduction To Data Model L-1
17 pages
Data Models Lec3
No ratings yet
Data Models Lec3
28 pages
02 DB Design
No ratings yet
02 DB Design
39 pages
Week II
No ratings yet
Week II
22 pages
Lecture Notes 3 - Part1
No ratings yet
Lecture Notes 3 - Part1
75 pages
SS2 Term 1
No ratings yet
SS2 Term 1
31 pages
Data Models in DBMS: Types & Benefits
No ratings yet
Data Models in DBMS: Types & Benefits
4 pages
Dbms
No ratings yet
Dbms
62 pages
Unit 2 DBMS
No ratings yet
Unit 2 DBMS
22 pages
E-Note SS Two 1st Term Data Processing
No ratings yet
E-Note SS Two 1st Term Data Processing
27 pages
SQIT3043 Chapter 2 - Data Models
No ratings yet
SQIT3043 Chapter 2 - Data Models
14 pages
Week 5 part-II Data Models
No ratings yet
Week 5 part-II Data Models
32 pages
Unit I DBMS
No ratings yet
Unit I DBMS
78 pages
Chapter 4. Database System Architecture & Modeling
100% (1)
Chapter 4. Database System Architecture & Modeling
57 pages
2 Bic 21404 Chapter 2
No ratings yet
2 Bic 21404 Chapter 2
68 pages
Dbms Presentaion
100% (1)
Dbms Presentaion
11 pages
Topic Beyond Syllabus DBMS
No ratings yet
Topic Beyond Syllabus DBMS
12 pages
Week-3 LECTURE Databasemodels
No ratings yet
Week-3 LECTURE Databasemodels
50 pages
Dbms Unit 1
100% (1)
Dbms Unit 1
8 pages
DBMS Unit 2
No ratings yet
DBMS Unit 2
15 pages
DDM Question Bank @
100% (1)
DDM Question Bank @
20 pages
3.data Models
No ratings yet
3.data Models
5 pages
DBMS 4
No ratings yet
DBMS 4
18 pages
Data Models in DB
No ratings yet
Data Models in DB
5 pages
Data Models
No ratings yet
Data Models
6 pages
2 ND Unit DBMS
No ratings yet
2 ND Unit DBMS
23 pages
Types
No ratings yet
Types
2 pages
Department of Computer Science: Dual Degree Integrated Post Graduate Program
No ratings yet
Department of Computer Science: Dual Degree Integrated Post Graduate Program
31 pages
Database Systems - Lecture 5
No ratings yet
Database Systems - Lecture 5
7 pages
DBMS Chapter 2
No ratings yet
DBMS Chapter 2
27 pages
Unit 2 - Handouts
No ratings yet
Unit 2 - Handouts
8 pages
Database Data Models Overview
No ratings yet
Database Data Models Overview
6 pages
File System, Problems With File System Unit1
No ratings yet
File System, Problems With File System Unit1
14 pages
School of Computer Engineering: Kalinga Institute of Industrial Technology
No ratings yet
School of Computer Engineering: Kalinga Institute of Industrial Technology
18 pages
DBMS Detailed Notes
No ratings yet
DBMS Detailed Notes
57 pages
Chapter 2-Database System Concepts and Architecture
No ratings yet
Chapter 2-Database System Concepts and Architecture
55 pages
Birara Sisay Anley
0% (1)
Birara Sisay Anley
12 pages
DBMS Pyq
No ratings yet
DBMS Pyq
34 pages
HND Computing UNIT 38: Database Management: Faisal Saghir
No ratings yet
HND Computing UNIT 38: Database Management: Faisal Saghir
49 pages
Data Models
No ratings yet
Data Models
5 pages
2nd Chapter Slide
No ratings yet
2nd Chapter Slide
98 pages
Data Model and Its Types
100% (1)
Data Model and Its Types
29 pages
Ss 2 Data Processing First Term E-Note
No ratings yet
Ss 2 Data Processing First Term E-Note
48 pages
Chapter Two Note
No ratings yet
Chapter Two Note
25 pages
CH 3 Data Modeling
No ratings yet
CH 3 Data Modeling
31 pages
09 - Azure Data Engineering Cheatsheet
No ratings yet
09 - Azure Data Engineering Cheatsheet
37 pages
Lecture 1
No ratings yet
Lecture 1
33 pages
Assignmemnt 3
No ratings yet
Assignmemnt 3
66 pages
Enquiries in T24 - An Introduction
100% (4)
Enquiries in T24 - An Introduction
47 pages
Language Technology A First Overview: Hans Uszkoreit 1. Scope
No ratings yet
Language Technology A First Overview: Hans Uszkoreit 1. Scope
4 pages
RDBMS Basics for Beginners
No ratings yet
RDBMS Basics for Beginners
27 pages
PL/SQL Technical Assessment
No ratings yet
PL/SQL Technical Assessment
11 pages
AWS Cert Exam Dumps for Architects
No ratings yet
AWS Cert Exam Dumps for Architects
13 pages
RDBMS
No ratings yet
RDBMS
4 pages
SAP BI 7.0 InfoObjects Authorization
No ratings yet
SAP BI 7.0 InfoObjects Authorization
5 pages
Oracle 1Z0-071 Exam Prep Guide
No ratings yet
Oracle 1Z0-071 Exam Prep Guide
18 pages
Multilevel Security For Relational Databases
100% (2)
Multilevel Security For Relational Databases
296 pages
Oracle BI Apps796 Perf Tech NoteV7
No ratings yet
Oracle BI Apps796 Perf Tech NoteV7
71 pages
List of Search Engines
No ratings yet
List of Search Engines
16 pages
000 - Optimise Compute Resources
No ratings yet
000 - Optimise Compute Resources
7 pages
Soal Pentaho 2
No ratings yet
Soal Pentaho 2
6 pages
Power Bi Consultants in Uae PDF
No ratings yet
Power Bi Consultants in Uae PDF
5 pages
Employee Management System Code
No ratings yet
Employee Management System Code
10 pages
Tableau Developer Resume
No ratings yet
Tableau Developer Resume
3 pages
12th IP 10-2-2024 Set-B
No ratings yet
12th IP 10-2-2024 Set-B
1 page
Oracle BI Publisher 11G Guide
No ratings yet
Oracle BI Publisher 11G Guide
85 pages
Gopal Practical Work
No ratings yet
Gopal Practical Work
13 pages
SQL For Data Analysis
No ratings yet
SQL For Data Analysis
236 pages
DatabaseDesignDocumentV1 1
No ratings yet
DatabaseDesignDocumentV1 1
15 pages
Backing Up and Restoring Progeny Databases
No ratings yet
Backing Up and Restoring Progeny Databases
19 pages
Ad - 1Z0-051 - Oracle Database 11g - SQL Fundamentals I - Oracle Certification Exam
No ratings yet
Ad - 1Z0-051 - Oracle Database 11g - SQL Fundamentals I - Oracle Certification Exam
2 pages
Property Database (Market Value Finder)
No ratings yet
Property Database (Market Value Finder)
4 pages
SQL Table Operations Guide
No ratings yet
SQL Table Operations Guide
5 pages
Data Cleaning
No ratings yet
Data Cleaning
11 pages
CBSE Class 10 IT 2022-23 With Solutions
No ratings yet
CBSE Class 10 IT 2022-23 With Solutions
14 pages

Rdbms Unit - II

Uploaded by

Rdbms Unit - II

Uploaded by

Unit- II: Relational, ER Models and Normalization

Relational, ER Models and Normalization: Data Models - Relational Model – Domains -

1. Conceptual Data Model

Relational Model in DBMS

ROLL_NO NAME ADDRESS PHONE AGE

1 RAM DELHI 9455123451 18

2 RAMESH GURGAON 9652431543 18

3 SUJIT ROHTAK 9156253131 20

 Relation Instance: The set of tuples of a relation at a particular instance of time is

ROLL_NO NAME ADDRESS PHONE AGE BRANCH_CODE

1 RAM DELHI 9455123451 18 CS

2 RAMESH GURGAON 9652431543 18 CS

3 SUJIT ROHTAK 9156253131 20 ECE

ECE ELECTRONICS AND COMMUNICATION ENGINEERING

Different Types of Keys in the Relational Model

STUD_NO SNAME ADDRESS PHONE

1 Shyam Delhi 123456789

2 Rakesh Kolkata 223365796

3 Suraj Delhi 175468965

STUD_NO SNAME ADDRESS PHONE

1 Shyam Delhi 123456789

2 Rakesh Kolkata 223365796

3 Suraj Delhi 175468965

Relation between Primary Key, Candidate Key, and Super Key

STUD_NO TEACHER_NO COURSE_NO

Relation between Primary Key and Foreign Key

Different Types of Keys

EID Name Phone

EID Name Phone

NULL Sony 9234567892

EID Name DNO

Strong Entity and Weak Entity

one to one cardinality

one to many cardinality

many to one cardinality

many to many cardinality

Example 1: How to Convert ER Diagram to Relational Database

It should convert to:

It should convert into:

The relational schema for the ER Diagram is given below as:

Example 2: Reduction of ER diagram to Table

Figure: Table structure

roll_no name age

Here, {roll_no, name} → name is a trivial functional dependency, since the

roll_no name age

Here, roll_no → name is a non-trivial functional dependency, since the

roll_no name age

Here, roll_no → {name, age} is a multivalued functional dependency, since the

Boyce-Codd Normal Form (BCNF)

Stu_ID Stu_Branch Stu_Course Branch_Number Stu_Course_No

Functional Dependency of the above is as mentioned:

101 Computer Science & Engineering

102 Electronics & Communication Engineering

Candidate Key for this table: Stu_ID.

Stu_Course Branch_Number Stu_Course_No

DBMS B_001 201

Computer Networks B_001 202

VLSI Technology B_003 401

Mobile Communication B_003 402

Candidate Key for this table: Stu_Course.

Candidate Key for this table: {Stu_ID, Stu_Course_No}.

You might also like