UNIT -2
The Data Link Layer: Introduction, Link layer addressing, Error detection and Correction: Cyclic codes,
Checksum, Forward error correction, Data link control: DLC Services, Data link layer protocols, HDLC,
Point to Point Protocol, Media Access control: Random Access, Controlled Access, Channelization,
Connecting devices and virtual LANs: Connecting Devices.
Introduction:
The Internet is a combination of networks glued together by connecting devices (routers or switches). If a
packet is to travel from a host to another host, it needs to pass through these networks.
NODES AND LINKS:
Communication at the data-link layer is node-to-node. A data unit from one point in the Internet
needs to pass through many networks (LANs and WANs) to reach another point. Theses LANs and WANs
are connected by routers. It is customary to refer to the two end hosts and the routers as nodes and the
networks in between as links. Figure 2.2 is a simple representation of links and nodes when the path of the
data unit is only six nodes.
FIGURE 2.2: NODES AND LINKS
The first node is the source host; the last node is the destination host. The other four nodes are four
routers. The first, the third, and the fifth links represent the three LANs; the second and the fourth links
represent the two WANs.
SERVICES:
The data-link layer is located between the physical and the network layers. The data link layer
provides services to the network layer; it receives services from the physical layer.
The duty scope of the data-link layer is node-to-node. When a packet is travelling in the Internet, the
data-link layer of a node (host or router) is responsible for delivering a datagram to the next node in the path.
For this purpose, the data-link layer of the sending node needs to encapsulate the datagram received
from the network in a frame, and the data-link layer of the receiving node needs to decapsulate the datagram
from the frame.
In other words, the data-link layer of the source host needs only to encapsulate, the data-link layer of
the destination host needs to decapsulate, but each intermediate node needs to both encapsulate and
decapsulate.
Figure 2.3 shows the encapsulation and decapsulation at the data-link layer. For simplicity, we have
assumed that we have only one router between the source and destination. The datagram received by the
data-link layer of the source host is encapsulated in a frame. The frame is logically transported from the
source host to the router.
The frame is decapsulated at the data-link layer of the router and encapsulated at another frame. The
new frame is logically transported from the router to the destination host. Note that, although we have shown
only two data-link layers at the router, the router actually has three data-link layers because it is connected to
three physical links.
FIGURE 2.3: A COMMUNICATION WITH ONLY THREE NODES
With the contents of the above figure in mind, we can list the services provided by a data-link layer
as shown below.
FRAMING: Definitely, the first service provided by the data-link layer is framing. The data-link
layer at each node needs to encapsulate the datagram (packet received from the network layer) in a frame
before sending it to the next node.
The node also needs to decapsulate the datagram from the frame received on the logical channel.
Although we have shown only a header for a frame, we will see in future chapters that a frame may have
both a header and a trailer. Different data-link layers have different formats for framing.
FLOW CONTROL: Whenever we have a producer and a consumer, we need to think about flow
control. If the producer produces items that cannot be consumed, accumulation of items occurs. The sending
data-link layer at the end of a link is a producer of frames; the receiving data-link layer at the other end of a
link is a consumer.
If the rate of produced frames is higher than the rate of consumed frames, frames at the receiving end
need to be buffered while waiting to be consumed (processed). Definitely, we cannot have an unlimited
buffer size at the receiving side. We have two choices. The first choice is to let the receiving data-link layer
drop the frames if its buffer is full.
The second choice is to let the receiving data-link layer send a feedback to the sending data-link layer
to ask it to stop or slow down. Different data-link-layer protocols use different strategies for flow control.
ERROR CONTROL: At the sending node, a frame in a data-link layer needs to be changed to bits,
transformed to electromagnetic signals, and transmitted through the transmission media. At the receiving
node, electromagnetic signals are received, transformed to bits, and put together to create a frame.
Since electromagnetic signals are susceptible to error, a frame is susceptible to error. The error needs
first to be detected. After detection, it needs to be either corrected at the receiver node or discarded and
retransmitted by the sending node.
CONGESTION CONTROL: Although a link may be congested with frames, which may result in
frame loss, most data-link-layer protocols do not directly use a congestion control to alleviate congestion,
although some wide-area networks do. In general, congestion control is considered an issue in the network
layer or the transport layer because of its end-to-end nature.
TWO CATEGORIES OF LINKS: Although two nodes are physically connected by a transmission
medium such as cable or air, we need to remember that the data-link layer controls how the medium is used.
We can have a data-link layer that uses the whole capacity of the medium; we can also have a data-link layer
that uses only part of the capacity of the link.
In other words, we can have a point-to-point link or a broadcast link. In a point-to-point link, the link
is dedicated to the two devices; in a broadcast link, the link is shared between several pairs of devices.
Two Sub layers:
To better understand the functionality of and the services provided by the link layer, we can divide
the data-link layer into two sub layers: data link control (DLC) and media access control (MAC).
The data link control sub layer deals with all issues common to both point-to-point and broadcast
links; the media access control sub layer deals only with issues specific to broadcast links.
LINK-LAYER ADDRESSING:
A link-layer address is sometimes called a link address, sometimes a physical address, and
sometimes a MAC address.
Since a link is controlled at the data-link layer, the addresses need to belong to the data-link layer.
When a datagram passes from the network layer to the data-link layer, the datagram will be encapsulated in a
frame and two data-link addresses are added to the frame header. These two addresses are changed every
time the frame moves from one link to another. Figure 2.4 demonstrates the concept in a small internet.
In the internet in Figure 2.4, we have three links and two routers. We also have shown only two
hosts: Alice (source) and Bob (destination). For each host, we have shown two addresses, the IP addresses
(N) and the link-layer addresses (L).
Note that a router has as many pairs of addresses as the number of links the router is connected to.
We have shown three frames, one in each link. Each frame carries the same datagram with the same source
and destination addresses (N1 and N8), but the link-layer addresses of the frame change from link to link.
In link 1, the link-layer addresses are L1 and L2. In link 2, they are L4 and L5. In link 3, they are L7
and L8.
FIGURE 2.4: IP ADDRESSES AND LINK-LAYER ADDRESSES IN A SMALL INTERNET
Note that the IP addresses and the link-layer addresses are not in the same order. For IP addresses,
the source address comes before the destination address; for link-layer addresses, the destination address
comes before the source.
THREE TYPES OF ADDRESSES:
Some link-layer protocols define three types of addresses: unicast, multicast, and broadcast.
UNICAST ADDRESS: Each host or each interface of a router is assigned a unicast address.
Unicasting means one-to-one communication. A frame with a unicast address destination is destined only for
one entity in the link.
MULTICAST ADDRESS: Some link-layer protocols define multicast addresses. Multicasting means
one-to-many communication. However, the jurisdiction is local (inside the link).
BROADCAST ADDRESS: Some link-layer protocols define a broadcast address. Broadcasting
means one-to-all communication. A frame with a destination broadcast address is sent to all entities in the
link.
Address Resolution Protocol (ARP):
Anytime a node has an IP datagram to send to another node in a link, it has the IP address of the
receiving node. The source host knows the IP address of the default router.
Each router except the last one in the path gets the IP address of the next router by using its
forwarding table. The last router knows the IP address of the destination host. However, the IP address of the
next node is not helpful in moving a frame through a link; we need the link-layer address of the next node.
This is the time when the Address Resolution Protocol (ARP) becomes helpful.
The ARP protocol is one of the auxiliary protocols defined in the network layer, as shown in Figure
2.5. It belongs to the network layer, but we discuss it here because it maps an IP address to a logical-link
address. ARP accepts an IP address from the IP protocol, maps the address to the corresponding link-layer
address, and passes it to the data-link layer.
FIGURE 2.5: POSITION OF ARP IN TCP/IP PROTOCOL SUITE
Anytime a host or a router needs to find the link-layer address of another host or router in its
network, it sends an ARP request packet. The packet includes the link-layer and IP addresses of the sender
and the IP address of the receiver. Because the sender does not know the link-layer address of the receiver,
the query is broadcast over the link using the link-layer broadcast address (see Figure 2.6).
Every host or router on the network receives and processes the ARP request packet, but only the
intended recipient recognizes its IP address and sends back an ARP response packet. The response packet
contains the recipient’s IP and link-layer addresses. The packet is unicast directly to the node that sent the
request packet.
In Figure 2.6a, the system on the left (A) has a packet that needs to be delivered to another system
(B) with IP address N2. System A needs to pass the packet to its data-link layer for the actual delivery, but it
does not know the physical address of the recipient.
It uses the services of ARP by asking the ARP protocol to send a broadcast ARP request packet to
ask for the physical address of a system with an IP address of N2. This packet is received by every system
on the physical network, but only system B will answer it, as shown in Figure 2.6b.
System B sends an ARP reply packet that includes its physical address. Now system A can send all
the packets it has for this destination using the physical address it received.
FIGURE 2.6: ARP OPERATION
Packet Format:
Figure 2.7 shows the format of an ARP packet. The names of the fields are self-explanatory. The
hardware type field defines the type of the link-layer protocol; Ethernet is given the type 1.
The protocol type field defines the network-layer protocol: IPv4 protocol is (0800)16. The source
hardware and source protocol addresses are variable-length fields defining the link-layer and network-layer
addresses of the sender.
The destination hardware address and destination protocol address fields define the receiver link-
layer and network-layer addresses. An ARP packet is encapsulated directly into a data-link frame. The frame
needs to have a field to show that the payload belongs to the ARP and not to the network-layer datagram.
FIGURE 2.7: ARP PACKET
ERROR DETECTION AND CORRECTION
Networks must be able to transfer data from one device to another with acceptable accuracy. For
most applications, a system must guarantee that the data received are identical to the data transmitted.
Any time data are transmitted from one node to the next, they can become corrupted in passage.
Many factors can alter one or more bits of a message. Some applications require a mechanism for detecting
and correcting errors.
Some applications can tolerate a small level of error. For example, random errors in audio or video
transmissions may be tolerable, but when we transfer text, we expect a very high level of accuracy.
At the data-link layer, if a frame is corrupted between the two nodes, it needs to be corrected before it
continues its journey to other nodes. However, most link-layer protocols simply discard the frame and let the
upper-layer protocols handle the retransmission of the frame. Some multimedia applications, however, try to
correct the corrupted frame.
Types of Errors
• When bits flow from 1 point to another, they are subject to unpredictable-changes ‘.’ of interference.
• The interference can change the shape of the signal.
• Two types of errors: 1) Single-bit error 2) Burst-error.
1) Single-Bit Error
Only 1 bit of a given data is changed from 1 to 0 or from 0 to 1 .
2) Burst Error Two or more bits in the data have changed from 1 to 0 or from 0 to 1 .
A burst-error occurs more than a single-bit error. This is because: Normally, the duration of noise is
longer than the duration of 1-bit.
When noise affects data, the noise also affects the bits.
The no. of corrupted-bits depends on data-rate and duration of noise.
Redundancy
The central concept in detecting/correcting errors is redundancy.
Some extra-bits along with the data have to be sent to detect/correct errors. These extra bits are called
redundant-bits.
The redundant-bits are added by the sender and removed by the receiver.
The presence of redundant-bits allows the receiver to detect/correct errors.
ERROR DETECTION AND ERROR CORRECTION
• Error-correction is more difficult than error-detection.
1) Error Detection
Here, we are checking whether any error has occurred or not.
2) Error Correction
Here, we need to know exact number of corrupted-bits and location of bits in the message.
Two important factors to be considered:
1) Number of errors and
2) Message-size.
Coding
• Redundancy is achieved through various coding-schemes.
1) Sender adds redundant-bits to the data-bits. This process creates a relationship between redundant-
bits and
data-bits.
2) Receiver checks the relationship between redundant-bits & data-bits to detect/correct errors.
• Two important factors to be considered:
1) Ratio of redundant-bits to the data-bits and
2) Robustness of the process.
Two broad categories of coding schemes: 1) Block-coding and 2) convolution coding is more complex.
.
1.) Block Coding
The message is divided into k-bit blocks. These blocks are called data-words.
Here, r-redundant-bits are added to each block to make the length n=k+r.
The resulting n-bit blocks are called code-words.
Since n>k, the number of possible code-words is larger than the number of possible data-words.
Block-coding process is 1-to-1; the same data-word is always encoded as the same code-word.
Thus, we have 2n-2k code-words that are not used. These code-words are invalid or illegal.
1.1) Error Detection
• If the following 2 conditions are met, the receiver can detect a change in the original code-word:
1) The receiver has a list of valid code-words.
2) The original code-word has changed to an invalid code-words.
1) At Sender
i) The sender creates code-words out of data-words by using a generator.
The generator applies the rules and procedures of encoding.
ii) During transmission, each code-word sent to the receiver may change.
2) At Receiver
i) a) If the received code-word is the same as one of the valid code-words, the code-word is accepted;
the corresponding data-word is extracted for use.
b) If the received code-word is invalid, the code-word is discarded.
ii) However, if the code-word is corrupted but the received code-word still matches a valid codeword, the
error
remains undetected.
• An error-detecting code can detect only the types of errors for which it is designed; other types of errors
may remain undetected.
Hamming Distance
• The main concept for error-control: Hamming distance.
• The Hamming distance b/w 2 words is the number of differences between the corresponding bits.
• Let d(x,y) = Hamming distance b/w 2 words x and y.
• Hamming distance can be found by
→ applying the XOR operation on the 2 words and
→ counting the number of 1s in the result.
• For example:
1) The Hamming distance d(000, 011) is 2 because 000⊕011= 011 (two 1s).
2) The Hamming distance d(10101, 11110) is 3 because 10101⊕11110 = 01011 (three 1s).
Hamming Distance and Error
Hamming distance between the received word and the sent code-word is the number of bits that are
corrupted during transmission.
For example: Let Sent code-word = 00000 Received word = 01101
Hamming distance = d(00000, 01101) =3. Thus, 3 bits are in error.
Minimum Hamming Distance for Error Detection
• Minimum Hamming distance is the smallest Hamming distance b/w all possible pairs of code-words.
• Let dmin = minimum Hamming distance.
• To find dmin value, we find the Hamming distances between all words and select the smallest one.
Minimum-distance for Error-detection
• If ‘s’ errors occur during transmission, the Hamming distance b/w the sent code-word and received
code-word is ‘s’
• If code has to detect upto ‘s’ errors, the minimum-distance b/w the valid codes must be ‘s+1’ i.e.
dmin=s+1.
We use a geometric approach to define dmin=s+1.
Let us assume that the sent code-word x is at the center of a circle with radius s.
All received code-words that are created by 0 to s errors are points inside the circle or on the
perimeter of the circle.
All other valid code-words must be outside the circle
• For example: A code scheme has a Hamming distance dmin = 4.
This code guarantees the detection of upto 3 errors (d = s + 1 or s = 3).
Linear Block Codes
• Almost all block codes belong to a subset of block codes called linear block codes.
• A linear block code is a code in which the XOR of 2 valid code-words creates another valid code-word.
(XOR or Addition modulo-2)
The code in Table 10.1 is a linear block code because the result of XORing any code-word with any
other code-word is a valid code-word.
For example, the XORing of the 2nd and 3rd code-words creates the 4th one.
Minimum Distance for Linear Block Codes
• Minimum Hamming distance is no. of 1s in the nonzero valid code-word with the smallest no. of 1s.
The numbers of 1s in the nonzero code-words are 2, 2, and 2.
So the minimum Hamming distance is dmin = 2.
Parity Check Code
• This code is a linear block code. This code can detect an odd number of errors.
• A k-bit data-word is changed to an n-bit code-word where n=k+1.
• One extra bit is called the parity-bit.
• The parity-bit is selected to make the total number of 1s in the code-word even.
• Minimum hamming distance dmin = 2. This means the code is a single-bit error-detecting code.
1) At Sender
The encoder uses a generator that takes a copy of a 4-bit data-word (a0, a1, a2, and a3) and generates
a parity-bit r0.
The encoder
→ accepts a copy of a 4-bit data-word (a0, a1, a2, and a3) and
→ generates a parity-bit r0 using a generator
→ generates a 5-bit code-word
The parity-bit & 4-bit data-word are added to make the number of 1s in the code-word even.
The addition is done by using the following:
The result of addition is the parity-bit.
1) If the no. of 1s in data-word = even, result = 0. (r0=0)
2) If the no. of 1s in data-word = odd, result = 1. (r0=1)
3) In both cases, the total number of 1s in the code-word is even.
The sender sends the code-word, which may be corrupted during transmission.
2) At Receiver
The receiver receives a 5-bit word.
The checker performs the same operation as the generator with one exception:
The addition is done over all 5 bits.
Cyclic Codes
• Cyclic codes are special linear block codes with one extra property:
If a code-word is cyclically shifted (rotated), the result is another code-word.
For ex: if code-word = 1011000 and we cyclically left-shift, then another code-word = 0110001.
• Let First-word = a0 to a6 and Second-word = b0 to b6, we can shift the bits by using the following:
Cyclic Redundancy Check (CRC)
• CRC is a cyclic code that is used in networks such as LANs and WANs.
Let Size of data-word = k bits (here k=4).
Size of code-word = n bits (here n=7).
Size of divisor = n-k+1 bits (here n-k+1=4).
• Here is how it works :
1) At Sender
n-k 0s is appended to the data-word to create augmented data-word. (here n-k=3).
The augmented data-word is fed into the generator (Figure 10.6).
The generator divides the augmented data-word by the divisor.
The remainder is called check-bits (r2r1r0).
The check-bits (r2r1r0) are appended to the data-word to create the code-word.
2) At Receiver
The possibly corrupted code-word is fed into the checker.
The checker is a replica of the generator.
The checker divides the code-word by the divisor.
The remainder is called syndrome bits (r2r1r0).
The syndrome bits are fed to the decision-logic-analyzer.
The decision-logic-analyzer performs following functions:
i) For No Error
If all syndrome-bits are 0s, the received code-word is accepted.
Data-word is extracted from received code-word .
ii) For Error
If all syndrome-bits are not 0s, the received code-word is discarded .
CHECKSUM
• Checksum is an error-detecting technique.
• In the Internet,
→ The checksum is mostly used at the network and transport layer.
→ The checksum is not used in the data link layer.
• Like linear and cyclic codes, the checksum is based on the concept of redundancy.
1) At Source
Firstly the message is divided into m-bit units.
Then, the generator creates an extra m-bit unit called the checksum.
The checksum is sent with the message.
2) At Destination
The checker creates a new checksum from the combination of the message and sentchecksum.
i) If the new checksum is all 0s, the message is accepted.
ii) ii) If the new checksum is not all 0s, the message is discarded
One's Complement
We can represent unsigned numbers between 0 and 2n-1 using only n bits.
If the number has more than n bits, the extra leftmost bits need to be added to the n rightmost bits
(wrapping).
A negative number can be represented by inverting all bits (changing 0 to 1 and 1 to 0).
This is the same as subtracting the number from 2n-1.
1) At Sender
The sender initializes the checksum to 0 and adds all data items and the checksum.
The result is 36.
However, 36 cannot be expressed in 4 bits.
The extra two bits are wrapped and added with the sum to create the wrapped sum value 6.
The sum is then complemented, resulting in the checksum value 9 (15 - 6 = 9).
The sender now sends six data items to the receiver including the checksum 9.
2) At Receiver
The receiver follows the same procedure as the sender.
It adds all data items (including the checksum); the result is 45.
The sum is wrapped and becomes 15. The wrapped sum is complemented and becomes 0.
Since the value of the checksum is 0, this means that the data is not corrupted. The receiver drops the
checksum and keeps the other data items.
If the checksum is not zero, the entire packet is dropped.
Internet Checksum
• Traditionally, the Internet has been using a 16-bit checksum.
• The sender or the receiver uses five steps.
EXAMPLE FOR MESSAGE-----FOROUZAN
Algorithm
Other Approaches to the Checksum
• If two 16-bit items are transposed in transmission, the checksum cannot catch this error.
• The reason is that the traditional checksum is not weighted: it treats each data item equally.
• In other words, the order of data items is immaterial to the calculation.
• Two approaches have been used to prevent this problem: 1)Fletcher and 2)Adler
Fletcher Checksum
• The Fletcher checksum was devised to weight each data item according to its position.
• Fletcher has proposed two algorithms: 8-bit and 16-bit (Figure 10.18).
• The first, 8-bit Fletcher, calculates on 8-bit data items and creates a 16-bit checksum.
The second, 16-bit Fletcher, calculates on 16-bit data items and creates a 32-bit checksum.
• The 8-bit Fletcher is calculated over data octets (bytes) and creates a 16-bit checksum.
• The calculation is done modulo 256 (28), which means the intermediate results are divided by 256 and the
remainder is kept.
• The algorithm uses two accumulators, L and R.
• The first simply adds data items together;
The second adds a weight to the calculation.
Adler Checksum
• The Adler checksum is a 32-bit checksum.
• It is similar to the 16-bit Fletcher with three differences (Figure 10.19).
1) Calculation is done on single bytes instead of 2 bytes at a time.
2) The modulus is a prime number (65,521) instead of 65,536.
3) L is initialized to 1 instead of 0.
• A prime modulo has a better detecting capability in some combinations of data.
FORWARD ERROR CORRECTION
• Retransmission of corrupted and lost packets is not useful for real-time multimedia transmission because it
creates an unacceptable delay in reproducing: we need to wait until the lost or corrupted packet is resent.
• We need to correct the error or reproduce the packet immediately.
• Several schemes have been designed and used that are collectively referred to as forward error correction
(FEC) techniques.
Using Hamming Distance
• To detect t errors, we need to have dmin = 2t + 1
• In other words, if we want to correct 10 bits in a packet, we need to make the minimum hamming distance
21 bits, which means a lot of redundant bits need to be sent with the data.
Using XOR
• Use the property of the exclusive OR operation as shown below.
• We divide a packet into N chunks, create the exclusive OR of all the chunks and send N + 1 chunks.
• If any chunk is lost or corrupted, it can be created at the receiver site.
• If N = 4, it means that we need to send 25 percent extra data and be able to correct the data if only one out
of four chunks is lost.
Chunk Interleaving
• Another way to achieve FEC in multimedia is to allow some small chunks to be missing at the receiver.
• We cannot afford to let all the chunks belonging to the same packet be missing.
However, we can afford to let one chunk be missing in each packet.
each packet is divided into 5 chunks (normally the number is much larger).
• Then, we can create data chunk-by-chunk (horizontally), but combine the chunks into packets
vertically.
• In this case, each packet sent carries a chunk from several original packets.
• If the packet is lost, we miss only one chunk in each packet.
• Normally, missing of a chunk is acceptable in multimedia communication.
Combining Hamming Distance and Interleaving
• Hamming distance and interleaving can be combined.
• Firstly, we can create n-bit packets that can correct t-bit errors.
• Then, we interleave m rows and send the bits column-by-column.
• In this way, we can automatically correct burst-errors up to m×t bit errors.