Module 5-1
Module 5-1
UNIT – V
                                           APPLICATION LAYER
     The Application Layer- DNS - Name Space - Resource Records - Name Servers- E-Mail - Architecture And
     Services – The User Agent –Message Format –Message Transfer –Final Delivery – WWW –Architecture –
     Static Web Pages – Dynamic Web Pages And Web Applications –HTTP –Network Security –Introduction
     To Cryptography –Substitution Ciphers-Transposition Ciphers-Public Key Algorithms – RSA –
     Authentication Protocols –Authentication Using Kerberos.
Working:-
   To map a name onto an IP address, an application program calls a library procedure called Resolver,
     passing it the name as a parameter.
   The resolver sends a UDP packet to a local DNS server, which then looks up the name and returns the IP
     address to the resolver, which then returns it to the caller.
   Armed with the IP address, the program can then establish a TCP connection with the destination, or
     send it UDP packets.
     The Internet is divided into several hundred top level domains, where each domain covers many hosts.
Each domain is partitioned into sub domains, and these are further partitioned as so on. All these domains can
be represented by a tree, in which the leaves represent domains that have no sub domains. A leaf domain may
contain a single host, or it may represent a company and contains thousands of hosts. Each domain is named by
the path upward from it to the root. The components are separated by periods (pronounced “dot”)
Eg: Sun Microsystems Engg. Department = eng.sun.com.
2.  RESOURCE RECORDS:
    Every domain can have a sent of resource records associated with it. For a single host, the most common
resource record is just its IP address. When a resolver gives a domain name to DNS, it gets both the resource
records associated with that name i.e., the real function of DNS is to map domain names into resource records.
A resource record is a 5-tuple and its format is as follows:
3. NAME SERVERS:
    It contains the entire database and responds to all queries about it. DNS name space is divided up into non-
overlapping zones, in which each zone contains some part of the tree and also contains name servers holding the
authoritative information about that zone.
Figure 5-2. Part of the DNS name space showing the division into zones.
When a resolver has a query about a domain name, it passes the query to one of the local name servers:
1. If the domain being sought falls under the jurisdiction of name server, it returns the authoritative resource
    records        (that comes from the authority that manages the record,and is always correct).
2. If the domain is remote and no information about the requested domain is available locally the name server
    sends a query message to the top level name server for the domain requested.
E.g.: A resolver of flits.cs.vle.nl wants to know the IP address of the host Linda.cs.yale.edu
ELECTRONIC MAIL
(a). Composition: It refers to the process of creating messages and answers. Any text editor is used for body of
     the message. While the system itself can provide assistance with addressing and numerous header fields
     attached to each message.
(b). Reporting: It has to do with telling the originator what happened to the message that is, whether it was
     delivered, rejected (or) lost.
(c). Transfer: It refers to moving messages from originator to the recipient.
(d). Displaying: Incoming messages are to be displayed so that people can read their email.
(e). Disposition: It concerns what the recipient dose with the message after receiving it. Possibilities include
     throwing it away before reading (or) after reading, saving it and so on.
Most systems allow users to create mailboxes to store incoming e-mail. Commands are needed to create and
destroy mailboxes, inspect the contents of mailboxes, insert and delete messages from mailboxes, and so on.
Figure 5-4: Envelopes and messages. (a) Paper mail. (b) Electronic mail.
   A user agent is normally a program (sometimes called a mail reader) that accepts a variety of commands for
composing, receiving, and replying to messages, as well as for manipulating mailboxes.
SENDING E-MAIL
    To send an e-mail message, a user must provide the message, the destination address, and possibly some
other parameters. The message can be produced with a free-standing text editor, a word processing program, or
possibly with a specialized text editor built into the user agent. The destination address must be in a format that
the user agent can deal with. Many user agents expect addresses of the form user@dns-address.
READING E-MAIL
   When a user agent is started up, it looks at the user's mailbox for incoming e-mail before displaying
anything on the screen. Then it may announce the number of messages in the mailbox or display a one-line
summary of each one and wait for a command.
RFC 822
Messages consist of a primitive envelope (described in RFC 821), some number of header fields, a blank line,
and then the message body. Each header field (logically) consists of a single line of ASCII text containing the
field name, a colon, and, for most fields, a value.
RFC 822 specified the headers but left the content entirely up to the users. Nowadays, on the worldwide
Internet, this approach is no longer adequate. The problems include sending and receiving
   1. Messages in languages with accents (e.g., French and German).
   2. Messages in non-Latin alphabets (e.g., Hebrew and Russian).
   3. Messages in languages without alphabets (e.g., Chinese and Japanese).
   4. Messages not containing text at all (e.g., audio or images).
A solution was proposed in RFC 1341 called MIME (Multipurpose Internet Mail Extensions)
        The basic idea of MIME is to continue to use the RFC 822 format, but to add structure to the message
body and define encoding rules for non-ASCII messages. By not deviating from RFC 822, MIME messages can
be sent using the existing mail programs and protocols. All that has to be changed are the sending and receiving
programs, which users can do for themselves.
MESSAGE TRANSFER
       The message transfer system is concerned with relaying messages from the originator to the recipient.
The simplest way to do this is to establish a transport connection from the source machine to the destination
machine and then just transfer the message.
        SMTP is a simple ASCII protocol. After establishing the TCP connection to port 25, the sending
machine, operating as the client, waits for the receiving machine, operating as the server, to talk first. The server
starts by sending a line of text giving its identity and telling whether it is prepared to receive mail. If it is not,
the client releases the connection and tries again later.
        Even though the SMTP protocol is completely well defined, a few problems can still arise.
       One problem relates to message length. Some older implementations cannot handle messages
       exceeding 64 KB.
       Another problem relates to timeouts. If the client and server have different timeouts, one of them may
       give up while the other is still busy, unexpectedly terminating the connection.
        For example, if host 1 holds mailing list A and host 2 holds mailing list B and each list contains an entry
for the other one, then a message sent to either list could generate a never-ending amount of e-mail traffic unless
somebody checks for it.
FINAL DELIVERY
With the advent of people who access the Internet by calling their ISP over a modem, it breaks down.
One solution is to have a message transfer agent on an ISP machine accept e-mail for its customers and store it
in their mailboxes on an ISP machine. Since this agent can be on-line all the time, e-mail can be sent to it 24
hours a day.
Figure:5-7
  (a) Sending and reading mail when the receiver has a permanent Internet connection and the user agent
                        runs on the same machine as the message transfer agent.
(b) Reading e-mail when the receiver has a dial-up connection to an ISP
POP3 begins when the user starts the mail reader. The mail reader calls up the ISP (unless there is already a
connection) and establishes a TCP connection with the message transfer agent at port 110. Once the connection
has been established, the POP3 protocol goes through three states in sequence:
   1. Authorization.
   2. Transactions.
   3. Update.
The authorization state deals with having the user log in.
The transaction state deals with the user collecting the e-mails and marking them for deletion from the mailbox.
POP3 normally downloads all stored messages at each contact, the result is that the user's e-mail quickly gets
spread over multiple machines, more or less at random; some of them not even the user's.
This disadvantage gave rise to an alternative final delivery protocol, IMAP (Internet Message Access
Protocol).
       The World Wide Web is an architectural framework for accessing linked documents spread out over
millions of machines all over the Internet. The initial proposal for a web of linked documents came from CERN
physicist Tim Berners-Lee in 1989.
ARCHITECTURAL OVERVIEW
        From the users' point of view, the Web consists of a vast, worldwide collection of documents or Web
pages. Each page may contain links to other pages anywhere in the world. Users can follow a link by clicking
on it, which then takes them to the page pointed to. This process can be repeated indefinitely.
         Pages are viewed with a program called a browser, of which Internet Explorer and Netscape Navigator
are two popular ones. The browser fetches the page requested, interprets the text and formatting commands on
it, and displays the page, properly formatted, on the screen.
       Strings of text that are links to other pages, called hyperlinks, are often highlighted, by underlining,
displaying them in a special color, or both.
         Here the browser is displaying a Web page on the client machine. When the user clicks on a line of text
that is linked to a page on the abcd.com server, the browser follows the hyperlink by sending a message to the
abcd.com server asking it for the page. When the page arrives, it is displayed. If this page contains a hyperlink
to a page on the xyz.com server that is clicked on, the browser then sends a request to that machine for the page.
When an item is selected, the browser follows the hyperlink and fetches the page selected. Therefore, the
embedded hyperlink needs a way to name any other page on the Web. Pages are named using URLs (Uniform
Resource Locators).
There are two possibilities: plug-ins and helper applications. A plug-in is a code module that the browser
fetches from a special directory on the disk and installs as an extension to itself.
The other way to extend a browser is to use a helper application. This is a complete program, running as a
separate process.
PROCESSING OF REQUEST
Figure 5-9. (a) Normal request-reply message sequence. (b) Sequence when TCP handoff is used
        Sometimes a trick, called TCP handoff, is used to get around this problem. With this trick, the TCP end
point is passed to the processing node so it can reply directly to the client.
        Consider a parallel between people and pages. In the United States, almost everyone has a social
security number, which is a unique identifier, as no two people are supposed to have the same one.
Nevertheless, if you are armed only with a social security number, there is no way to find the owner's address,
and certainly no way to tell whether you should write to the person in English, Spanish, or Chinese. The Web
has basically the same problems.
       The solution chosen identifies pages in a way that solves all three problems at once. Each page is
assigned a URL (Uniform Resource Locator) that effectively serves as the page's worldwide name.
       URLs have three parts: the protocol (also known as the scheme), the DNS name of the machine on
which the page is located, and a local name uniquely indicating the specific page (usually just a file name on the
machine where it resides). As an example, the Web site for the author's department contains several videos
about the university and the city of Amsterdam. The URL for the video page is
http://www.cs.vu.nl/video/index-en.html
       This URL consists of three parts: the protocol (http), the DNS name of the host (www.cs.vu.nl), and the
file name (video/index-en.html), with certain punctuation separating the pieces. The file name is a path relative
to the default Web directory at cs.vu.nl.
     The basis of the Web is transferring Web pages from server to client. In the simplest form, Web pages
      are static. They are just files sitting on some server waiting to be retrived.
     In this context, even a video is a static web page because it is just a file.
     In this section we will look at static web page in details. In the next one, we will examine dynamic
      content.
     HTML allows users to produce Web pages that include text, graphics, video, pointers toother Web
      pages, and more.
     HTML is a markup language, or language for describing how documents are to be formatted.
     Markup languages thus contain explicit commands for formatting. For example, in HTML, <b>means
      start boldface mode, and </b> means leave boldface mode.
     Writing a browser is then straightforward: the browser simply has to understand the markup commands.
      Embedding all the markup commands within each HTML file and standardizing them makes it possible
      for any Web browser to read and reformat any Web page.
     While it is certainly possible to write documents like this with any plain text editor, and many people do,
      it is also possible to use word processors or special HTML editors that do most of the work.
     A Web page consists of a head and a body, each enclosed by<html>and </html>tags.
     The head is bracketed by the <head>and </head>tags and the body is bracketed by the<body>and
      </body>tags. The strings inside the tags are called directives.
     Most, but not all, HTML tags have this format. That is, they use <something>to mark the beginning of
      something and </something>to mark its end.
     Tags can be in either lowercase or uppercase. Thus, <head>and <HEAD>mean the same thing, but
      lower case is best for compatibility.
     Some tags have (named) parameters, called attributes. For example, the<img>tag is used for including
      an image inline with the text. It has two attributes, src and alt. The first attribute gives the URL for the
      image.
     The list of special characters is given in the standard. All of them begin with an ampersand and end with
      a semicolon.
            For example,   produces a space, è produces e` and é producese´. Since <,
               >, and & have special meanings, they can be expressed onlywith their escape sequences, <,
               >, and &.
     The main item in the head is the title, delimited by <title>and </title>.The title itself is not displayed on
      the page. Some browsers use it to label the page’s window.
     Several headings are used in each heading is generated by an <hn>tag, where n is a digit in the range 1
      to 6. Thus, <h1>is the most important heading ;<h6>is the least important one.
Tag Description
            XML and XSL is (eXtensible Markup Language) and (eXtensible Style Language).
            HTML, with or without forms, does not provide any structure to web pages.
            It also mixes the content with the formatting, as e-commerce and other applications become
             more common, there is an increasing need for structuring pages and separating the content from
             the formatting.
            The W#C has developed an enhancement to HTML to allow web pages to be structured for
             automated processing.
            It defines a structure called book_list, which is a list of books. Each book has three fields, the
             title, author, and year of publication.
            In this example, each of the three fields is an indivisible entity, but it is also permitted to futher
             subdivide the fields.
            The author fields could have been done as follows to give a finer-grained control over searching
             and formatting.
       HTML keeps evolving to meet new demands. Many people in the industry feel that in the future, the
majority of web-enabled device will not be PCs, but wireless, handheld PDA-type device.
      These devices have limited memory for large browser full of heuristics that try to somehow deal with
       syntactically incorrect web pages.
      There are 6 major difference in HTML and XHML:
           XHTML pages and browser must strictly conform to the standard. No more shodly web pages.
           All tags and attributes must be in lower case, tags like <HTML> are not valid in XHTML.
           Closing tags are required, even for </p>. for tags that have no natural closing tag. Such as
            <br>,<hr> and <img>,a slash must preced the closing”>”.
                    Eg:<img src=”pic0001.jpg”/>
           Attribute must be contained within quotation marks.
                    Eg: <img src=”pic001.jpg” height=500/>
            The 500 has to be enclosed in quotation marks, just like the name of the JPEG file, even though
            500 is just a number.
           Tags must be nest properly. In the past, proper nesting was not required as long as final state
            achieved was correct. Tags closed in the inverse order that they were opened.
                    Eg: <center><b>vacation pictures</center></b>.
           Every document must specify its document type. For a discussion of all the changes, major and
            minor, see www.w3.org.
Dynamic web documents are created at both client and server sides.
CLIENT-SIDE GENERATION
        CGI, PHP, JSP, and ASP scripts solve the problem of handling forms and interactions with databases on
the server. They can all accept incoming information from forms, look up information in one or more databases,
and generate HTML pages with the results.
       Usually the server side scripting is done with PHP and client side scripting is javascript. Complete web
pages can be generated on the fly by various scripts on the server machine. Once they are received by the
browser, they are treated as normal HTML pages and displayed.
       Dynamic content generation is also possible on the client side. Web pages can be written in XML and
then converted to HTML according to XSL file. Javascript programs can perform arbitrary computations.
Finally plugins and helper applications can be used to display content in a variety of formats.
   The requirements of information security within an organization have undergone two major changes in the
    last decades.
   The generic name for the collection of tools designed to protect data and to prevent hackers is computer
    security.
   Another important thing is that affected security is the introduction of distributed systems and the use of
    networks and communication facilities for carrying data between terminal user and computer and between
    computer and compiler.
   Network security measures are needed to protect data during their transmission and to guarantee that data
    transmissions are authentic.
SECURITY ATTACKS:
SECURITY MECHANISMS:
SECURITY SERVICE:
2)AUTHENTICATION-The receiptient that the message is from source that it claim to be from.
     When a message is sent ,the receiver can prove that the alleged sender in fact sent themessage similarly
      when a message is received,the sender can prove that alleged receiver in fact received the message.
   Secrecy
   Integrity
   Availability
SECRECY
Secrecy requires that the information in a computer system only be accessible for reading by authorized parties.
This type of access includes printing, displaying and other forms of disclosure, including simply revealing the
existence of an object.
INTEGRITY
Integrity requires that computer system can be modified only by authorized parties. Modification includes
writing, changing status, deleting and creating.
AVAILABILITY
     The types of attacks on the security of a computer system or network can be viewed by the function of the
      computer system.
In general, there is a flow of information from a source to destination. This is called normal flow.
They are:
         Interruption
         Interception
         Modification
         Fabrication
INTERRUPTION
An asset of the system is destroyed or it becomes unavailable or unusable. This is an attack on availability.
Examples: destruction of a piece of hardware such as hard disk, the cutting of a communication line, or the
disabling of the file management system.
INTERCEPTION
An authorized party gains access to an asset. This is an attack on confidentiality. The authorized party could be
a person or program or a computer.
E.g.: wiretapping to capture data in a network and the illicit copying of files or programs.
MODIFICATION
An authorized party gains access to an asset. This is an attack on confidentiality. This is an attack on integrity.
E.g: changing the values in a data file, altering a program so that it performs differently and modifying the
content of messages being transmitted in a network.
FABRICATION
An authorized party inserts counterfeit objects into the system. This is an attack on authenticity. Examples
include the insertion of spurious messages in a network or the addition of records to a file.
PASSIVE ATTACKS
Passive attacks means the eavesdropping on or monitoring of, transmissions. The goal of the component is to
obtain information that is being transmitted. Two types of attacks are involved here.
In this, a telephone conversation, an e-mail message, a transferred file may contain sensitive or confidential
information. This helps to prevent the opponent from learning the content of these transmissions.
TRAFFIC ANALYSIS
In this encryption is used for masking the contents which helps to observe the pattern of the messages. The
opponent could determine the location and identity of communicating hosts and could observe the frequency
and length of messages being exchanged.
Passive attacks are difficult to detect because they do not involve any alteration of data.
ACTIVE ATTACKS
In this, the attacks involve some modification of the data stream or creation of false stream. This is divided into
four categories.
They are:
     Masquerade
     Replay
     Modification of messages
     Denial of service
   Masquerade
A masquerade takes place when one entity pretends to be a different entity. A masquerade attack usually
includes one of the other forms of active attacks.
 Replay
Replay involves the passive capture of a data unit and its subsequent retransmission to produce an unauthorized
effect.
This means that some portion of a legitimate message is altered or that messages are delayed or reordered to
produce an unauthorized effect.
 Denial of service
The denial of service prevents or inhibits the normal size or management of communication facilities.
It is quite difficult to prevent active attacks absolutely and would require physical protections of all
communications facilities and paths at all times.
CRYPTOGRAPHY:
CIPHERTEXT:Transformed message.
SECRET KEY:Some critical information used by the cipher knows only to sender and receiver.
              Substitution.
              Tranposition.
SUBSTITUTION:In which each element in the plaintext(bit,letter,group or letter is mapped into another
element.
              Symmetric
              Single key
              Secret key or conventional encryption.
    A Block cipher processes the input one block of elements at a time ,producing an output block for each
     input block.(ie.Encrypt one bit/character at a time)
WKLV LV HDVB.
    A stream cipher processs the input elements continuosly,producing output one element at a time.(ie.
     Break plaintext message in equal size blocks and encrypt each block as a unit).
     Eg:THIS IS EASY
INTRODUCTION TO CRYPTOGRAPHY
SUBSTITUTION TECHNIQUES:
    The two basic building block of all encryption techniques are substitution and transposition.
    A substitution techniques is one in which the letters of plaintext are replaced by other letters or by
     number or symbols.
CAESAR CIPHER:
    The earliest known use of a substitution cipher and the simplest,was by Julius Caesar.
    The Caesar cipher involves replacing each letter of alphabets with the letter standing three places further
     down the alphabets.
Plaintext: A B C D E F G H I J K L M N O P Q R S T U V W X Y Z.
Ciphertext: D E F G H I J K L M N O P Q R S T U V W X Y Z A B C
MONOALPHABETIC CIPHER:
Eg:A->X,B->Y,C->Z,D->A…….Z->W
PQXOYRZHP XQ QEOBB
    The basic attack takes advantage of the statistical properties of natural languages.
    In English, for example ,e is the most common letter, followed by t, o, a, n, i, etc.
MASTER OF COMPUTER APPLICATIONS                                                                         Page 23
                                        COMPUTER NETWORKS – UNIT -5 Y3/S5
      The most common two-letter combinations, or digrams, are th, in, er, re, and an.
      The most common three-letter combinations, or trigrams,are the, ing, and, and ion.
      A cryptanalyst trying to break a monoalphabetic cipher would start out by
      counting the relative frequencies of all letters in the ciphertext.
      He would then look at trigrams to find a common one of the form tXe, which strongly suggests that
      X is h. Similarly, if the pattern thYt occurs frequently, they probably stands for a.
TRANSPOSITION TECHNIQUES:
    A very different kind of mapping is achieved by performing some sort of permutation on plaintext
     letters.
    This technique is referred as transposition cipher.
    In which letters of the plaintext are written alternating between rows and the rows and then read
     sequentially to give cipher.
    To write the message in rectangle row by row and read the message off,column by column but permute
     the order of column.
    The order of column then becomes keyword AUTHOR and order the column by lexicographic order of
     the letters in the keyword.
One-Time pad:
Introduction:
      First described by Frank Miller in 1882.
      The one-time pad was re-invented in 1917 and patented a couple of years later.
      It is derived from the Vernam cipher, named after Gilbert Vernam, one of its inventors.
      Vernam's system was a cipher that combined a message with a key read from a punched tape.
      In its original form, Vernam's system was vulnerable because the key tape was a loop, which was reused
       whenever the loop made a full cycle.
      One-time use came later, when Joseph Mauborgne recognized that if the key tape were totally random,
       then cryptanalysis would be impossible.
                               Or
      The One-Time Pad is an evolution of the Vernham cipher, which was invented by Gilbert Vernham in
       1918, and used a long tape of random letters to encrypt the message.
      An Army Signal Corp officer, Joseph Mauborgne, proposed an improvement using a random key that
       was truly as long as the message, with no repetitions, which thus totally obscures the original message.
      Since any plaintext can be mapped to any ciphertext given some key, there is simply no way to
       determine which plaintext corresponds to a specific instance of ciphertext.
Define:
   Each new message requires a new key of the same length as the new message. Such a scheme, known as
       a one-time pad.
   It is unbreakable.
   It produces a random output that bears no statistical relationship to the plaintext.
   Because the ciphertext contains no information whatsoever about the plaintext, there is simply no way to
       break the code.
Example:
        Suppose Alice wishes to send the message "HELLO" to Bob. Assume two pads of paper containing
identical random sequences of letters were somehow previously produced and securely issued to both. Alice
chooses the appropriate unused page from the pad.
        The way to do this is normally arranged for in advance, as for instance 'use the 12th sheet on 1 May', or
'use the next available sheet for the next message'. The material on the selected sheet is the key for this message.
Each letter from the pad will be combined in a predetermined way with one letter of the message. It is common,
but not required, to assign each letter a numerical value, e.g., "A" is 0, "B" is 1, and so on. In this example, the
technique is to combine the key and the message using modular addition. The numerical values of
corresponding message and key letters are added together, modulo 26. If key material begins with "XMCKL"
and the message is "HELLO", then the coding would be done as follows:
           H    E    L     L    O message
         7 (H) 4 (E) 11 (L) 11 (L) 14 (O) message
       + 23 (X) 12 (M) 2 (C) 10 (K) 11 (L) key
       = 30    16   13    21    25 message + key
       = 4 (E) 16 (Q) 13 (N) 21 (V) 25 (Z) message + key (mod 26)
E Q N V Z → ciphertext
    If a number is larger than 25, then the remainder after subtraction of 26 is taken in modular arithmetic
fashion. This simply means that if the computations "go past" Z, the sequence starts again at A.
    The ciphertext to be sent to Bob is thus "EQNVZ". Bob uses the matching key page and the same process,
but in reverse, to obtain the plaintext. Here the key is subtracted from the ciphertext, again using modular
arithmetic:
            E    Q     N    V     Z ciphertext
          4 (E) 16 (Q) 13 (N) 21 (V) 25 (Z) ciphertext
       - 23 (X) 12 (M) 2 (C) 10 (K) 11 (L) key
       = -19     4   11    11   14 ciphertext – key
       = 7 (H) 4 (E) 11 (L) 11 (L) 14 (O) ciphertext – key (mod 26)
            H     E    L    L     O → message
Similar to the above, if a number is negative then 26 is added to make the number zero or higher.
      The Security of the one-time pas is entirely due to the randomness of the key.
      If the Stream of characters that constitute the key is truly random, then the stream of characters that
       constitute the ciphertext will be truly random.
    Thus, there are no patterns or regularities that a cryptanalyst can use to attack the ciphertext.
    The one-time pad offers complete security but, in practice, has two fundamental difficulties:
    1. There is the practical problem of making large quantitied of random keys. Any heavily used system
       might requires millions of random character on a regular basis. Supply truly random characters in this
       volume is a significant task.
    2. Even more daunting is the problem of key distribution and protection. For every message to be sent, a
       key of equal length is needed by both sender and receiver. Thus, a mammoth key distribution problem
       exists.
    Cryptographic Principles:
    • Redundancy
           – All encrypted messages must contain some redundancy, that is, information not needed to
               understand the message.
    • Freshness
           – Some measures must be taken to ensure that each message received can be verified as being
               fresh, that is, sent very recently.
    Redundancy Motivation:
    • Consider a mail-order company, The Couch Potato (TCP), with 60,000 products.
    • Ordering messages consist of a 16-byte customer name followed by a 3-byte data field.
RSA
Much practical security is based on it. Its major disadvantage is that it requires keys of at least 1024 bits for
good security (versus 128 bits for symmetric-key algorithms), which makes it quite slow.
      key generation
      encryption
      decryption
   RSA involves a public key and a private key. The public key can be known to everyone and is used for
encrypting messages. Messages encrypted with the public key can only be decrypted using the private key.
   The RSA method is based on some principles from number theory. We will now summarize how to use the
method.
   1. Choose two large primes, p and q (typically 1024 bits).
   2. Compute n = p x q and z = (p - 1) x (q - 1).
   3. Choose a number relatively prime to z and call it d.
   4. Find e such that e x d = 1 mod z.
The security of the method is based on the difficulty of factoring large numbers. If the cryptanalyst could factor
the (publicly known) n, he could then find p and q, and from these z.
With these parameters computed in advance, we are ready to begin encryption. Divide the plaintext (regarded as
a bit string) into blocks, so that each plaintext message, P, falls in the interval 0 P < n. Do that by grouping the
                                                                          k
plaintext into blocks of k bits, where k is the largest integer for which 2 < nis true.
                                           e                                         d
To encrypt a message, P, compute C = P (mod n). To decrypt C, compute P = C (mod n). It can be proven that
for all P in the specified range, the encryption and decryption functions are inverses. To perform the encryption,
you need e and n. To perform the decryption, you need d and n. Therefore, the public key consists of the pair (e,
n), and the private key consists of (d, n).
In the above example, the encryption of the plain text “SUZANNE” is shown:
p = 3, q = 11, n = 33, z = 20
Encryption:
C       = Me mod n
        = 887 mod 187
        = [ (884 mod 187) * ( 882 mod 187) * ( 881 mod 187] mod 187
        = [ (59,969,536 mod 187)(7744 mod 187)(88 mod 187)] mod187
        = (132 * 77 * 88) mod 187
        = 894,432 mod 187
        = 11
Decryption:
M        =Cd mod n
        =1123 mod 187
        =[(111 mod 187)* (112 mod 187)* (114 mod 187)*(118 mod 187)* (118mod 187)] mod 187
        =[(111mod187)*(121mod187)*(14,641mod187)*(214,358,881mod187)*(214,358,881mod187)]mod187
        =(11 * 121 * 55 * 33 * 33) mod 187
        =79,720,245 mod 187
        =88.
If p = 3, q = 11, n = 33, Φ(n) = 20,
                             d = 7 (because 7, 20 have no common factors)
                          =>7e = 1 mod 20
                               =3
Some form of chaining is needed for data encryption. However, in practice, most RSA-based systems use
public-key cryptography primarily for distributing one-time session keys for use with some symmetric-key
algorithm such as AES or triple DES. RSA is too slow for actually encrypting large volumes of data but is
widely used for key distribution.
        The sender uses an encryption algorithm, and the receiver uses a decryption
         algorithm.
 Cipher
 Key
        Eve is the person who somehow disturbs the communication between Alice and
         Bob by intercepting messages to uncover the data or by sending her own
         disguised messages.
 Two Categories
 The sender uses this key and an encryption algorithm to encrypt data;
         the receiver uses the same key and the corresponding decryption algorithm to
          decrypt the data .
 Asymmetric-Key Cryptography
        In asymmetric or public-key cryptography, there are two keys: a private key and
         a public key.
               o When the message is received by Bob, the private key is used to decrypt
                 the message.
     Three types of keys are used in cryptography: the secret key, the public key, and the
       private key.
 The first, the secret key, is the shared key used in symmetric-key cryptography.
     The second and the third are the public and private keys used in asymmetric-key
       cryptography.
 Comparison
 The sender puts the message in a box and locks the box by using a key.
            The receiver unlocks the box with a key and takes out the message.
       The difference lies in the mechanism of the locking and unlocking and the type
        of keys used.
 In symmetric-key cryptography, the same key locks and unlocks the box.
       In asymmetric-key cryptography, one key locks the box, but another key is
        needed to unlock it.
 SYMMETRIC-KEY CRYPTOGRAPHY
   An encryption system in which the sender and receiver of a message share a single,
     common key that is used to encrypt and decrypt the message.
   Symmetric-key systems are simpler and faster.
   The main drawback is that the two parties must exchange the key in a secure way.
   Public-key encryption avoids this problem because the public key can be distributed
    in a non-secure way, and the private key is never transmitted.
   Symmetric-key cryptography is sometimes called secret-key cryptography.
   The most popular symmetric-key system is the Data Encryption Standard (DES).
   The disadvantage of symmetric cryptography is that it presumes two parties have
    agreed on a key and been able to exchange that key in a secure manner prior to
    communication.
      Secret Key Authentication
     o Before two parties can send information securely, they must first exchange
        a secret key.
     o Even if the sender and receiver found a channel that they believed to be
        secure, in the past there has been no way to test the secrecy of each key.
     o Quantum cryptography solves this problem.
     o It allows the sender and receiver to test and guarantee the secrecy of each
        individual key.
 ASYMMETRIC-KEY CRYPTOGRAPHY
An asymmetric-key (or public-key) cipher uses two keys: one private and one public.
RSA :
           The most common public key algorithm is RSA, named for its inventors Rivest,
            Shamir, and Adleman (RSA).
 Selecting Keys
o Bob use the following steps to select the private and public keys:
        o Remember that a prime number is one that can be divided evenly only by 1 and
          itself.
        2. Bob multiplies the above two primes to find n, the modulus for encryption and
        decryption.
o In other words, n =p X q.
In RSA, e and n are announced to the public; d and ø are kept secret.
 Encryption
       o    she can change the message, usually a short one, to an integer. This is the
            plaintext.
o C=pᵉ(modn)
 Decryption
        o When he receives the ciphertext, he uses his private key d to decrypt the
          message:
P= Cᵈ(modn)
 Restriction
o For RSA to work, the value of P must be less than the value of n.
o If P is a large number, the plaintext needs to be divided into blocks to make P less than
  n.
 Applications
o RSA, therefore, is useful for short messages such as a small message or a symmetric key
  to be used for a symmetric-key cryptosystem.
o I RSA is used in digital signatures and other cryptosystems that often need to encrypt a
  small message without having access to a symmetric key.