0% found this document useful (0 votes)

31 views49 pages

Normalization

The document discusses the process of normalization in database design, detailing various normal forms (1NF, 2NF, 3NF, BCNF, and 4NF) based on keys and functional dependencies. It explains the importance of normalization to eliminate redundancy and anomalies, as well as the definitions of keys, attributes, and the practical use of these normal forms in achieving high-quality relational designs. Additionally, it covers decomposition techniques, lossless and lossy joins, and properties of decomposition to maintain data integrity.

Uploaded by

sohamparab38

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views49 pages

Normalization

Uploaded by

sohamparab38

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 49

Normalization

3 Normal Forms Based on Primary Keys

■ 3.1 Normalization of Relations

■ 3.2 Practical Use of Normal Forms
■ 3.3 Definitions of Keys and Attributes Participating
in Keys
■ 3.4 First Normal Form
■ 3.5 Second Normal Form
■ 3.6 Third Normal Form
3.1 Normalization of Relations (1)
■ Normalization:
■ The process of decomposing unsatisfactory "bad"
relations by breaking up their attributes into
smaller relations

■ Normal form:
■ Condition using keys and FDs of a relation to
certify whether a relation schema is in a particular
normal form
Normalization of Relations (2)
■ 2NF, 3NF, BCNF
■ based on keys and FDs of a relation schema
■ 4NF
■ based on keys, multi-valued dependencies :
MVDs; 5NF based on keys, join dependencies :
JDs
■ Additional properties may be needed to ensure
a good relational design
3.2 Practical Use of Normal Forms
■ Normalization is carried out in practice so that the
resulting designs are of high quality and meet the
desirable properties
■ The practical utility of these normal forms becomes
questionable when the constraints on which they are
based are hard to understand or to detect
■ The database designers need not normalize to the highest
possible normal form
■ (usually up to 3NF, BCNF or 4NF)
■ Denormalization:
■ The process of storing the join of higher normal form
relations as a base relation—which is in a lower normal form
3.3 Definitions of Keys and Attributes
Participating in Keys (1)

■ A key of a relation schema R = {A1, A2, ...., An}

is a set of attributes S subset-of R with the
property that no two tuples t1 and t2 in any legal
relation state r of R will have t1[S] = t2[S]
Eg: {SSN},{SSN,ENAME}, {SSN,ENAME,SEX}

■ A key K is a superkey with the additional

property that removal of any attribute from K will
cause K not to be a superkey any more.
Eg:{SSN,ENAME}, {SSN,ENAME,SEX}
Definitions of Keys and Attributes
Participating in Keys (2)
■ If a relation schema has more than one key, each is
called a candidate key.
■One of the candidate keys is arbitrarily designated to
be the primary key, and the others are called
secondary keys.
Eg: PRIMARY KEY = {SSN}, Secondary Key
={ESSN},
■ A Prime attribute must be a member of some
candidate key
Eg: SSN, PNO of Works-on {SSN,PNO}
■ A Nonprime attribute is not a prime attribute—that
is, it is not a member of any candidate key.
Problems due to redundancy
Insertion anomaly
Deletion Anamoly
Updation Anamoly

While updating the table, if one

Of the row missed to update,
Data inconsistency can occur.
First Normal Form

■ All the attributes must have atomic values

■ Disallows
■ composite attributes
■ multivalued attributes
■ nested relations; attributes whose values for an
individual tuple are non-atomic
Example
• In the student table, subject column is
multivalued attribute.
• Student table is not in 1NF

Normalize into 1NF

Figure 10.8 Normalization into 1NF
Second Normal Form (1)
■ Uses the concepts of FDs, primary key
■ Definitions
■ Prime attribute: An attribute that is member of the
candidate key K
■ Full functional dependency: a FD X -> Y where removal
of any attribute from X means the FD does not hold any
more
■ Examples:
■ {SSN, PNUMBER} -> HOURS is a full FD since neither SSN
-> HOURS nor PNUMBER -> HOURS hold
■ {SSN, PNUMBER} -> ENAME is not a full FD (it is called a
partial dependency ) since SSN -> ENAME also holds
Second Normal Form (2)
■ A relation should be in 1NF.

■ A relation schema R is in second normal form

(2NF) if every non-prime attribute A in R is fully
functionally dependent on the primary key

■ R can be decomposed into 2NF relations via the

process of 2NF normalization .
2NF
Third Normal Form (1)
■ Definition:
■ Transitive functional dependency: a FD X -> Z
that can be derived from two FDs X -> Y and Y ->
Z
■ Examples:
■ SSN -> DMGRSSN is a transitive FD
■ Since SSN -> DNUMBER and DNUMBER ->
DMGRSSN hold
■ SSN -> ENAME is non-transitive
■ Since there is no set of attributes X where SSN -> X
and X -> ENAME
Third Normal Form (2)
■ A relation schema R is in third normal form (3NF) if it is
in 2NF and no non-prime attribute A in R is transitively
dependent on the primary key
■ R can be decomposed into 3NF relations via the process
of 3NF normalization
■ NOTE:
■ In X -> Y and Y -> Z, with X as the primary key, we consider
this a problem only if Y is not a part of a candidate key.
■ When Y is a candidate key, there is no problem with the
transitive dependency .
■ E.g., Consider EMP (SSN, Emp#, Salary ).
■ Here, SSN -> Emp# -> Salary and Emp# is a candidate key.
Example 1
• Here,
• CK/PK :{rollno}
• FD: Rollno State
State City
• state is a non-prime attribute, which is trivially
dependent.
• Given relation is not in 3NF.
• Decompose relation as:
• Stud(Rollno,City), City(city,state)
Example 2

• Given relation is not in 3NF

• Solution:
• Split the relation into two relations, such as-
Normal Forms Defined Informally
■ 1st normal form
■ All attributes depend on the key
■ 2nd normal form
■ All attributes depend on the whole key
■ 3rd normal form
■ All attributes depend on nothing but the key
General Normal Form Definitions (For
Multiple Keys) (1)

■ The above definitions consider the primary key

only
■ The following more general definitions take into
account relations with multiple candidate keys
■ A relation schema R is in second normal form
(2NF) if every non-prime attribute A in R is fully
functionally dependent on every key of R
General Normal Form Definitions
Third normal form (3NF)
■ Definition:
■ Superkey of relation schema R - a set of attributes S of
R that contains a key of R

■ A relation schema R is in third normal form (3NF) if

whenever a FD X -> A holds in R, then either:
■ (a) X is a key of R, or
■ (b) A is a prime attribute of R
BCNF (Boyce-Codd Normal Form)
■ A relation schema R is in Boyce-Codd Normal Form
(BCNF) if whenever an FD X -> A holds in R, then X is a
superkey of R
■ In simple terms, for any case (say, X->Y), X can't
be a non-prime attribute.
■ Each normal form is strictly stronger than the previous
one
■ Every 2NF relation is in 1NF
■ Every 3NF relation is in 2NF
■ Every BCNF relation is in 3NF
■ There exist relations that are in 3NF but not in BCNF
■ The goal is to have each relation in BCNF (or 3NF)
Example
• 1NF
• 3NF

• BCNF
• We can decompose the table
Figure 10.12 Boyce-Codd normal form
Figure 10.13 a relation TEACH that is in
3NF but not in BCNF
Achieving the BCNF by Decomposition (1)
■ Two FDs exist in the relation TEACH:
■ fd1: { student, course} -> instructor
■ fd2: instructor -> course
■ {student, course} is a candidate key for this relation and
that the dependencies shown follow the pattern in Figure
10.12 (b).
■ So this relation is in 3NF but not in BCNF
■ A relation NOT in BCNF should be decomposed so as to
meet this property, while possibly forgoing the
preservation of all functional dependencies in the
decomposed relations.
Achieving the BCNF by Decomposition (2)
■ Three possible decompositions for relation TEACH
■ {student, instructor} and {student, course}
■ {course, instructor } and {course, student}
■ {instructor, course } and {instructor, student}
■ All three decompositions will lose fd1.
■ We have to settle for sacrificing the functional dependency
preservation. But we cannot sacrifice the non-additivity property
after decomposition.
■ Out of the above three, only the 3rd decomposition will not generate
spurious tuples after join.(and hence has the non-additivity property).
■ A test to determine whether a binary decomposition (decomposition
into two relations) is non-additive (lossless) is discussed in section
11.1.4 under Property LJ1. Verify that the third decomposition above
meets the property.
Fourth Normal Form (4NF)
• MULTIVALUED DEPENDANCY
• A B, is multivalued dependency if
3 conditions for Multivalued Dependency

1. A B ,for a single value of A, more than one value of B

exists.
2. Tables should have at least 3 columns.
(if table has only 2 columns, we can use 1NF to resolve it).
3. For this table with A,B,C columns, B and C should be
independent.

If ALL THE 3 CONDITIONS ARE TRUE, THEN WE CAN SAY THAT

THE TABLE MAY HAVE MULTI-VALUED DEPENDENCY.
EXAMPLE 1
• DECOMPOSITION of the table:
Example 2
Decomposition
• The process of decomposition in DBMS helps
us remove
– redundancy,
– Inconsistencies,
– anomalies from a database when we divide the
table into numerous tables.
• In simpler words, the process of
decomposition refers to dividing a relation X
into {X1, X2,……Xn}.
Types of Decomposition

• Lossless Join Decomposition (Non-additive):

• Consider there is a relation R which is decomposed into
sub relations R1 , R2 , …. , Rn.
• This decomposition is called lossless join decomposition
when the join of the sub relations results in the same
relation R that was decomposed.
• For lossless join decomposition, we always have
R1 ⋈ R2 ⋈ R3 ……. ⋈ Rn = R, where ⋈ is a natural join
operator
• Lossy Join Decomposition:
• Consider there is a relation R which is decomposed into sub
relations R1 , R2 , …. , Rn.
• This decomposition is called lossy join decomposition when
the join of the sub relations does not result in the same
relation R that was decomposed.
• The natural join of the sub relations is always found to have
some extraneous tuples.
• For lossy join decomposition, we always have-
• R1 ⋈ R2 ⋈ R3 ……. ⋈ Rn ⊃ R, where ⋈ is a natural join operator
Determining Whether Decomposition Is Lossless Or Lossy

• Consider a relation R is decomposed into two sub

relations R1 and R2. Then,
• If all the following conditions satisfy, then the
decomposition is lossless.
• If any of these conditions fail, then the decomposition
is lossy.

• Condition-01
• Union of both the sub relations must contain all the
attributes that are present in the original relation R.
• Thus, R1 ∪ R2 = R
• Condition-02
• Intersection of both the sub relations must not be
null.
• In other words, there must be some common
attribute which is present in both the sub
relations.
• Thus, R1 ∩ R2 ≠ ∅
• Condition-03
• Intersection of both the sub relations must be a
super key of either R1 or R2 or both.
• Thus, R1 ∩ R2 = Super key of R1 or R2
Properties of Decomposition
• Lossless:
• All the decomposition that we perform in Database management system should be
lossless.
• All the information should not be lost while performing the join on the sub-relation
to get back the original relation. It helps to remove the redundant data from the
database.
• Dependency Preservation:
• Dependency Preservation is an important technique in database management
system.
• It ensures that the functional dependencies between the entities is maintained while
performing decomposition.
• It helps to improve the database efficiency, maintain consistency and integrity.
• Lack of Data Redundancy:
• Data Redundancy is generally termed as duplicate data or repeated data.
• This property states that the decomposition performed should not suffer redundant
data.
• It will help us to get rid of unwanted data and focus only on the useful data or
information.

Normalization
No ratings yet
Normalization
15 pages
Normalization
No ratings yet
Normalization
57 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
47 pages
Ch10-Functional Dependencies and Normalization For Relational Databases
No ratings yet
Ch10-Functional Dependencies and Normalization For Relational Databases
31 pages
Unit 3 Updated FG
No ratings yet
Unit 3 Updated FG
16 pages
Unit 3 Normalization
No ratings yet
Unit 3 Normalization
70 pages
212 Lecture 11 Chapter8-Normalization
No ratings yet
212 Lecture 11 Chapter8-Normalization
52 pages
DBMS - Lecture - 4 - Normalization
No ratings yet
DBMS - Lecture - 4 - Normalization
37 pages
Normalization
No ratings yet
Normalization
31 pages
Normalization GFGC
No ratings yet
Normalization GFGC
44 pages
Normalization
No ratings yet
Normalization
15 pages
BES Dbms UNIT IV Notes
No ratings yet
BES Dbms UNIT IV Notes
7 pages
Notes On Normalization of Databases Normalization Is Due To E. F. Codd - Creator of The Relational Database Management
No ratings yet
Notes On Normalization of Databases Normalization Is Due To E. F. Codd - Creator of The Relational Database Management
4 pages
CH 14-Final-Normalization
No ratings yet
CH 14-Final-Normalization
39 pages
Unit Ii DBMS Mca 23
No ratings yet
Unit Ii DBMS Mca 23
15 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
61 pages
Module 3 Part 1
No ratings yet
Module 3 Part 1
14 pages
Normalization
No ratings yet
Normalization
74 pages
Unit 2 Normalization-3
No ratings yet
Unit 2 Normalization-3
39 pages
Session 21-24 Modified
No ratings yet
Session 21-24 Modified
33 pages
Unit3-Part2-Normalization-Normal Forms
No ratings yet
Unit3-Part2-Normalization-Normal Forms
20 pages
Normalization and Normal Form
No ratings yet
Normalization and Normal Form
11 pages
DBMS Unit-2
No ratings yet
DBMS Unit-2
39 pages
Database Normalisationl
No ratings yet
Database Normalisationl
15 pages
Lavy
No ratings yet
Lavy
55 pages
DBMS-UNIT-4 R16 (Ref-2)
No ratings yet
DBMS-UNIT-4 R16 (Ref-2)
10 pages
CIS340 Lecture 15-3
No ratings yet
CIS340 Lecture 15-3
42 pages
Normalization of Database
No ratings yet
Normalization of Database
10 pages
DBMS - Lecture 10
No ratings yet
DBMS - Lecture 10
28 pages
DBMS Normalization
No ratings yet
DBMS Normalization
53 pages
Normalization
No ratings yet
Normalization
5 pages
NORMALISATION
No ratings yet
NORMALISATION
15 pages
Normal Forms
No ratings yet
Normal Forms
15 pages
Normalization
No ratings yet
Normalization
6 pages
DBMS UNIT 4 - Class
No ratings yet
DBMS UNIT 4 - Class
14 pages
FALLSEM2018-19 ITE1003 ETH SJTG04 VL2018191004346 Reference Material I Normalization
No ratings yet
FALLSEM2018-19 ITE1003 ETH SJTG04 VL2018191004346 Reference Material I Normalization
31 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
5 pages
Semantics of The Relation Attributes: Each Tuple in A Relation Should Represent One Entity or Relationship Instance
No ratings yet
Semantics of The Relation Attributes: Each Tuple in A Relation Should Represent One Entity or Relationship Instance
36 pages
Unit V:Normalization: Normalization: Relational Database Design Pitfalls, Denormalized Data, Decomposition
No ratings yet
Unit V:Normalization: Normalization: Relational Database Design Pitfalls, Denormalized Data, Decomposition
30 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
9 pages
Normalisation
No ratings yet
Normalisation
29 pages
Normalization
No ratings yet
Normalization
48 pages
Functional Dependencies and Normilization
No ratings yet
Functional Dependencies and Normilization
60 pages
DBMS Module4
No ratings yet
DBMS Module4
16 pages
DBMS 5 FDB Functional Dependency
No ratings yet
DBMS 5 FDB Functional Dependency
30 pages
Normalization
No ratings yet
Normalization
35 pages
The Normal Forms 3NF and BCNF: BY Jasbir Jassu
No ratings yet
The Normal Forms 3NF and BCNF: BY Jasbir Jassu
25 pages
Database Design & Normalization Guide
No ratings yet
Database Design & Normalization Guide
30 pages
Week 6
No ratings yet
Week 6
36 pages
Normal Forms
No ratings yet
Normal Forms
15 pages
SQL Normalisation, Constraints, ERD and ACID Properties
No ratings yet
SQL Normalisation, Constraints, ERD and ACID Properties
9 pages
DBMS Unit-III
No ratings yet
DBMS Unit-III
42 pages
7 Normalization For Relational Databases
No ratings yet
7 Normalization For Relational Databases
38 pages
Normalization and Denormalization
No ratings yet
Normalization and Denormalization
44 pages
Normalization Unit 4
No ratings yet
Normalization Unit 4
34 pages
Module 1-2
No ratings yet
Module 1-2
78 pages
Module 2 Divide and Conquer Method
No ratings yet
Module 2 Divide and Conquer Method
28 pages
Int 21H
No ratings yet
Int 21H
3 pages
5.1 5.2 Relational Database Design
No ratings yet
5.1 5.2 Relational Database Design
62 pages
DMS CHP No 3
No ratings yet
DMS CHP No 3
25 pages
DCC 3
No ratings yet
DCC 3
37 pages
Lecture 6 - Dimensional Modeling
No ratings yet
Lecture 6 - Dimensional Modeling
99 pages
Electricity Bill Management System
No ratings yet
Electricity Bill Management System
8 pages
4 Marks PHP
No ratings yet
4 Marks PHP
31 pages
Quiz Sistem Basis Data
No ratings yet
Quiz Sistem Basis Data
77 pages
303database Handling Using Python
No ratings yet
303database Handling Using Python
3 pages
JBI Protocol Template Scoping Reviews 2024
No ratings yet
JBI Protocol Template Scoping Reviews 2024
7 pages
Tips & Tricks For The Success With Azure SQL Managed Instance
100% (1)
Tips & Tricks For The Success With Azure SQL Managed Instance
41 pages
Ab Initio Transform Components: We Have An Total of 13 Transformation Components
No ratings yet
Ab Initio Transform Components: We Have An Total of 13 Transformation Components
11 pages
W7D2 - Redis POC With AWS Setup & Spring (18june)
No ratings yet
W7D2 - Redis POC With AWS Setup & Spring (18june)
7 pages
Bca Bigdata Fifth - Sem Approved Syllabus
No ratings yet
Bca Bigdata Fifth - Sem Approved Syllabus
23 pages
SQLPDF
No ratings yet
SQLPDF
77 pages
A1Session7 MBIS4002
No ratings yet
A1Session7 MBIS4002
5 pages
Lecture-3 Relational Algebra I
No ratings yet
Lecture-3 Relational Algebra I
41 pages
How To Replicate Data From SAP Source To HANA Using SLT
No ratings yet
How To Replicate Data From SAP Source To HANA Using SLT
8 pages
Laksh New
No ratings yet
Laksh New
20 pages
Module 11 Maintaining InfoSec 211
No ratings yet
Module 11 Maintaining InfoSec 211
57 pages
Automating NetApp ONTAP With WFA
No ratings yet
Automating NetApp ONTAP With WFA
29 pages
Modern Information Retrieval Guide
No ratings yet
Modern Information Retrieval Guide
39 pages
Lab 11 SQL OUTER JOINS LEFT and RIGHT JOIN
No ratings yet
Lab 11 SQL OUTER JOINS LEFT and RIGHT JOIN
9 pages
DW & DM Unit 4 Notes
No ratings yet
DW & DM Unit 4 Notes
40 pages
Defensive Architecture of The Mediterranean - VI - 48
No ratings yet
Defensive Architecture of The Mediterranean - VI - 48
12 pages
Introduction To Organizational Systems: UNIT-2
No ratings yet
Introduction To Organizational Systems: UNIT-2
50 pages
Module 2
No ratings yet
Module 2
55 pages
JDBC Adoc
No ratings yet
JDBC Adoc
8 pages
Project Work Details For BCA 4th Sem - UPDATED
0% (1)
Project Work Details For BCA 4th Sem - UPDATED
9 pages
Business Analytics
No ratings yet
Business Analytics
60 pages
Alflytics Manual
No ratings yet
Alflytics Manual
52 pages
PHP Machine Test
No ratings yet
PHP Machine Test
4 pages
MahmoudShaaban CV
No ratings yet
MahmoudShaaban CV
1 page
Bioinformatics Database Guide
No ratings yet
Bioinformatics Database Guide
19 pages

Normalization

Uploaded by

Normalization

Uploaded by

Normalization

3 Normal Forms Based on Primary Keys

■ 3.1 Normalization of Relations

■ A key of a relation schema R = {A1, A2, ...., An}

■ A key K is a superkey with the additional

While updating the table, if one

■ All the attributes must have atomic values

Normalize into 1NF

■ A relation schema R is in second normal form

■ R can be decomposed into 2NF relations via the

• Given relation is not in 3NF

■ The above definitions consider the primary key

■ A relation schema R is in third normal form (3NF) if

1. A B ,for a single value of A, more than one value of B

If ALL THE 3 CONDITIONS ARE TRUE, THEN WE CAN SAY THAT

• Lossless Join Decomposition (Non-additive):

• Consider a relation R is decomposed into two sub

You might also like