DBMS
UNIT 2 Notes
Entity Relationship (ER) Modeling
Using High-Level Conceptual Data Models for Database Design
1. Database Design Process
o The first step is requirements collection and analysis.
o Database designers interview users to document their data requirements.
Requirements should be detailed and complete.
o Functional requirements (user-defined operations like retrievals and
updates) are specified alongside data requirements.
2. Conceptual Design Phase
o Once requirements are collected, a conceptual schema is created using a
high-level conceptual data model.
o The conceptual schema includes: Entity types, Relationships & Constraints.
o High-level data models do not include implementation details, making them
easier to understand.
o The conceptual schema serves as a reference to ensure all user
requirements are met. This step helps designers focus on data properties
rather than storage details.
3. Logical Design (Data Model Mapping)
o The next step in database design is the actual implementation of the
database, using a commercial DBMS. So, the conceptual schema is
transformed into an implementation data model (e.g., relational SQL model).
o This step is called logical design or data model mapping.
o Data model mapping may be automated or semi-automated by database
design tools.
4. Physical Design Phase
o Specifies internal storage structures, file organizations, indexes, and access
paths.
o Application programs are designed and implemented as database
transactions corresponding to the high-level transaction specifications.
ENTITIES and ATTRIBUTES
Entity: An entity is a real-world object with an independent existence that can have
data stored about it in a database. Entities are represented as tables in relational
databases.
Attribute: Each entity has an attribute, the particular properties or characteristics
that describes it.
For example:
The EMPLOYEE entity e1 has four attributes:
Name, Address, Age, and Home_phone with
their respective values.
TYPES OF ATTRIBUTES
1. Simple/Atomic Attributes – Attributes that are indivisible.
2. Composite Attributes – Attributes that can be divided into smaller subparts.
3. Single Valued Attributes – Attributes that can have only single values for a
particular entity. Example: Age is a single-valued attribute of a person.
4. Multi Valued Attributes – Attributes that can have a set of values for the same
entity. A multivalued attribute may have lower and upper bounds to constrain the
number of values allowed for each individual entity.
5. Stored Attributes – Attributes whose actual value is stored in the database.
Example: DOB.
6. Derived Attributes – Attribute whose actual values are not stored in the database
but it may be derived from the stored attribute. Example: the value of Age can be
determined from the current (today’s) date and the value of that person’s
Birth_date. The Age attribute is hence called a derived attribute and is said to be
derivable from.
7. Complex Attributes – Attribute that combines multivalued and composite
attributes within an entity. It represents hierarchical or nested data structures in
a database.
NULL Values: A NULL value in a database represents missing, unknown, or
inapplicable data. It is not the same as zero or an empty string; rather, it signifies
that a value is absent. Example: a College_degrees attribute applies only to people
with college degrees, therefore, a person with no college degree would have NULL for
College_degrees.
Entity Types, Entity Sets, Keys, and Value Sets
Entity Types and Entity Sets
A database usually contains groups of entities that are similar. Each entity type
defines a collection of entities that share the same attributes but have unique values
for those attributes.
Entity Type: Defines a set of entities that have the same attributes.
Entity Set: Collection of all entities of a particular entity type at a specific time.
Example: In a company database, EMPLOYEE and COMPANY are entity types, while
the collection of all employee records at a given moment is an entity set.
In ER diagrams, entity types are represented as rectangular boxes with attribute
names in ovals connected by lines.
The entity type represents the schema (intension), while the entity set represents the
extension (actual data instances).
Key Attributes of an Entity Type
An important constraint on the entities of an entity type is the key or uniqueness
constraint on attributes, therefore, a key attribute uniquely identifies each entity in
an entity set.
Key Attribute: An attribute whose values are distinct for each individual entity.
Examples: COMPANY entity type: Key attribute is Name (each company has a unique
name), PERSON entity type: Key attribute is Social Security Number (Ssn).
Each of the Vehicle_id and Registration
attributes of the entity type CAR is a key.
Registration attribute – composite key.
Composite Key: A key formed by combining multiple attributes (e.g., Registration =
{State, Number} for a CAR entity type).
Weak Entity Type: An entity type without a key attribute.
ER Diagram Representation: Key attributes are underlined in ovals. Unlike the
relational model, ER diagrams do not specify a primary key.
Value Sets (Domains) of Attributes
Each simple attribute is associated with a value set, specifying the allowed values for
that attribute.
Value Set (Domain): Defines the range of possible values for an attribute or set of
permitted values for a particular attribute
• Examples:
o Age attribute of EMPLOYEE: Integer values between 16 and 70.
o Name attribute: Strings of alphabetic characters.
Value sets correspond to data types like integer, string, Boolean, float, enumerated
types, etc. In UML and database design tools, additional data types like date and time
can be specified.
Mathematical Representation
An attribute A of an entity set E with a value set V can be represented as a function:
An attribute A of entity set E whose value set is V can be defined as a function from E
to the power set6 P(V) of V:
A : E → P(V)
We refer to the value of attribute A for entity e as A(e).
• Single-valued attributes: A(e) is a singleton set for each entity e in E.
• Multivalued attributes: No restriction on the number of values.
• NULL values: Represented by an empty set.
• Composite attributes: Value set is the power set of the Cartesian product of
component attribute value sets.
Value sets contain all possible values, but only a subset exists in the database at any
given time, reflecting the real-world data state.
Relationship Types, Relationship Sets, Roles, and Structural
Constraints
Attributes referring to other entity types indicate implicit relationships.
Initial entity designs often capture relationships as attributes, but these are later
refined into explicit relationships.
Relationship Types, Sets, and Instances:
A relationship type R among n entity types E1, E2, ..., En defines a relationship set.
Mathematically, a relationship set R is a collection of relationship instances ri where
each instance ri links n entities (e1, e2,……,en) from participating entity types.
Mathematically, a relationship set is a subset of the Cartesian product E1 × E2 × ... ×
En.
• Example: WORKS_FOR relates EMPLOYEE to DEPARTMENT.
Relationship Degree, Role Names, and Recursive Relationships:
• Degree of Relationship: Number of participating entity types.
o Binary Relationship (Degree 2): WORKS_FOR (EMPLOYEE and
DEPARTMENT)
o Ternary Relationship (Degree 3): SUPPLY (SUPPLIER, PART, PROJECT)
• Recursive/Self-referencing Relationships: When the same entity type
participates multiple times in different roles.
o Example: The SUPERVISION relationship type relates an employee to a
supervisor, where both employee and supervisor entities are members of the
same EMPLOYEE entity set. Hence, the EMPLOYEE entity type participates
twice in SUPERVISION: once in the role of supervisor(1) (or boss), and once
in the role of supervisee (or subordinate)(2)
• Role Names: Each entity type that participates in a relationship type plays a
particular role in the relationship. The role name signifies the role that a participating
entity from the entity type plays in each relationship instance, and it helps to explain
what the relationship means. For example, in the WORKS_FOR relationship type,
EMPLOYEE plays the role of employee
Constraints on Binary Relationships:
1. Cardinality Ratios: Specifies the maximum number of relationship instances an
entity can participate in.
o 1:1 (One-to-One) - MANAGES (EMPLOYEE manages a DEPARTMENT)
o 1:N (One-to-Many) - WORKS_FOR (EMPLOYEE works for one
DEPARTMENT, but a DEPARTMENT has many EMPLOYEES)
o M:N (Many-to-Many) - WORKS_ON (EMPLOYEE works on multiple
PROJECTS, and each PROJECT has multiple EMPLOYEES)
o N:1 (Many-to-one)
2. Participation Constraints: Specifies the minimum number of relationship
instances that each entity can participate in and is sometimes called the minimum
cardinality constraint. Two types of participation constraints:
o Total Participation (Existence Dependency): Every instance (or every
entity) must participate (e.g., every EMPLOYEE must WORK_FOR a
DEPARTMENT).
o Partial Participation: Some instances may not participate (e.g., not all
EMPLOYEES manage a DEPARTMENT).
o Represented in ER diagrams as:
▪ Double line for total participation
▪ Single line for partial participation
Weak Entity Types
Entity types that do not have their own key attributes are called weak entity types.
In contrast, entity types with a key attribute are strong entity types.
Weak entities are identified by their relationship with a specific owner entity type
and one of their attribute values, therefore, the relationship that connects a weak
entity type to its owner entity type is called the identifying relationship.
Total Participation Constraint: A weak entity type always has a total participation
constraint (existence dependency) with respect to its identifying relationship since it
cannot exist without the owner entity.
Example:
o Consider the DEPENDENT entity type related to EMPLOYEE.
o DEPENDENT Attributes: Name, Birth_date, Sex, Relationship (to the
employee).
o Two dependents of different employees may have the same values for these
attributes but are distinguished by the specific employee they are related to.
o The EMPLOYEE entity is the owner of the DEPENDENT entities.
Partial Key:
A partial key is an attribute that uniquely identifies weak entities related to the same
owner entity. In the DEPENDENT example, if no two dependents of the same
employee have the same first name, then Name acts as a partial key. In the worst
case, a composite attribute of all weak entity attributes may serve as the partial key.
Weak entity types and identifying relationships are represented by double lines
around their boxes and diamonds. Partial key attributes are underlined with a
dashed or dotted line.
ER Diagrams, Naming Conventions, and Design Issues
Examples:
1. An entity type DEPARTMENT with attributes Name, Number, Locations, Manager,
and Manager_start_date. Locations is the only multivalued attribute. Both Name
and Number are (separate) key attributes because each was specified to be unique.
2. An entity type PROJECT with attributes Name, Number, Location, and
Controlling_department. Both Name and Number are (separate) key attributes.
3. An entity type EMPLOYEE with attributes Name, Ssn, Sex, Address, Salary,
Birth_date, Department, and Supervisor. Both Name and Address may be
composite attributes; however, this was not specified in the requirements. We
must go back to the users to see if any of them will refer to the individual
components of Name—First_name, Middle_initial, Last_name—or of Address. In
our example, Name is modeled as a composite attribute, whereas Address is not,
presumably after consultation with the users.
4. An entity type DEPENDENT with attributes Employee, Dependent_name, Sex,
Birth_date, and Relationship (to the employee).
Relationship Types of Degree Higher than Two
The degree of a relationship type is determined by the number of participating
entity types. Binary relationships involve two entity types, while ternary relationships
involve three entity types.
Choosing Between Binary and Ternary (or Higher-Degree) Relationships
ER Diagram Representation:
A ternary relationship type is represented by an entity connected to three other
entities in an ER diagram.
Example: (a) The SUPPLY relationship (Supplier, Part, Project)
(b) Three binary relationships not equivalent to SUPPLY
(c) SUPPLY represented as a weak entity type
Weak Entities in Higher-Degree Relationships
A weak entity may be associated with a ternary relationship type where multiple
owner entities exist.
Example: INTERVIEW (Candidate, Company, Dept_Date) is identified by all three
components.
Constraints on Ternary (or Higher-Degree) Relationships
1. Cardinality Constraints:
Similar to binary relationships, ternary relationships can have 1, M, or N constraints
on each entity.
Example: If a Project-Part combination must be supplied by only one supplier, we
place 1 on the Supplier and M, N on the Project and Part.
2. (min, max) Constraints:
Defines the minimum and maximum number of relationships an entity can
participate in. Helps in understanding participation restrictions beyond just
cardinality.
A UNIVERSITY DATABASE