Parsing Part - 1

Uploaded by

Md. Abdul Mukit

CSE-361: Compiler Design

Parsing: Part-I
Parsing
Parsing During Compilation
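[Figure: source program → lexical analyzer → parser → rest of front end → intermediate representation. The parser requests tokens from the lexical analyzer ("get next token") and passes a parse tree to the rest of the front end. The lexical analyzer is specified by regular expressions, the parser reports errors, and the phases share the symbol table.]

Parser:
• uses a grammar to check structure of tokens
• produces a parse tree
• syntactic errors and recovery
• recognize correct syntax
• report errors

Rest of front end:
• Collecting token information
• Perform type checking
• Intermediate code generation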
Parsers
We categorize the parsers into two groups:
1. Top-Down Parser: The parse tree is created top to bottom, starting from the root.
2. Bottom-Up Parser: The parse tree is created bottom to top, starting from the leaves.

Both Top-Down and Bottom-Up parsers scan the input from left to right (one symbol at a time).

Efficient Top-Down and Bottom-Up parsers can be implemented only for subclasses of context-free grammars:
▪ LL for top-down parsing
▪ LR for bottom-up parsing
Errors in Programs
Error Detection
Adequate Error Reporting is Not a Trivial Task
ERROR RECOVERY
ERROR RECOVERY MAY TRIGGER MORE ERRORS!
ERROR RECOVERY APPROACHES: PANIC MODE
ERROR RECOVERY APPROACHES: PHRASE-LEVEL RECOVERY
ERROR RECOVERY APPROACHES: ERROR PRODUCTIONS
ERROR RECOVERY APPROACHES: GLOBAL CORRECTION
Syntactical Analysis
Each language definition has rules that describe the syntax of well-formed programs.
• Format of the rules: context-free grammars
• Why not regular expressions/NFAs/DFAs?
▪ Source program constructs have recursive structure:
digits = [0-9]+;
expr = digits | "(" expr "+" expr ")"
◦ Finite automata cannot recognize recursive constructs, so they cannot ensure that expressions are well-bracketed: a machine with N states cannot remember a parenthesis-nesting depth greater than N.
◦ CFGs are more powerful, but also more costly to implement.
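The recursive rule for expr above can be checked with a small recursive recognizer. The following is a minimal sketch, not a production parser: the helper names `parse_expr` and `is_expr` are hypothetical, and it assumes the input contains no whitespace. The call stack plays the role of the unbounded memory that a finite automaton lacks.

```python
import re

DIGITS = re.compile(r"[0-9]+")  # digits = [0-9]+

def parse_expr(text, pos=0):
    """Return the index just past one expr starting at pos, or None on failure."""
    match = DIGITS.match(text, pos)
    if match:                                   # expr = digits
        return match.end()
    if pos < len(text) and text[pos] == "(":    # expr = "(" expr "+" expr ")"
        mid = parse_expr(text, pos + 1)
        if mid is None or mid >= len(text) or text[mid] != "+":
            return None
        end = parse_expr(text, mid + 1)
        if end is None or end >= len(text) or text[end] != ")":
            return None
        return end + 1
    return None

def is_expr(text):
    """True when the whole string is exactly one expr."""
    return parse_expr(text) == len(text)

print(is_expr("(1+(2+3))"))  # well-bracketed
print(is_expr("((1+2)"))     # missing a closing parenthesis
```

Each nested "(" adds one recursive call, so the nesting depth the recognizer can track is limited only by available memory, not by a fixed number of states.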
CFG versus Regular Expression
Language: set of strings
String: finite sequence of symbols taken from finite alphabet
Regular expressions and CFG’s both describe languages, but over different
alphabets
CFGs are strictly more expressive than REs:
Any language that can be described by an RE can also be generated by a CFG, but not vice versa.

Also known as Backus-Naur Form (BNF, Algol 60)

CONTEXT FREE GRAMMARS (CFG)
RULE ALTERNATIVE NOTATIONS
Notational Conventions
Terminals
◦ Lower-case letters early in the alphabet: a, b, c
◦ Operator symbols: +, -
◦ Punctuation symbols: parentheses, comma
◦ Boldface strings: id or if
Nonterminals:
◦ Upper-case letters early in the alphabet: A, B, C
◦ The letter S (start symbol)
◦ Lower-case italic names: expr or stmt
▪ Upper-case letters late in the alphabet, such as X, Y, Z, represent either nonterminals or terminals.
▪ Lower-case letters late in the alphabet, such as u, v, …, z, represent strings of terminals.
Notational Conventions
▪ Lower-case Greek letters, such as α, β, γ, represent strings of grammar symbols.
▪ Thus A → α indicates that there is a single nonterminal A on the left side of the production and a string of grammar symbols α to the right of the arrow.
▪ If A → α₁, A → α₂, …, A → αₖ are all productions with A on the left, we may write:
▪ A → α₁ | α₂ | … | αₖ
▪ Unless otherwise stated, the left side of the first production is the start symbol.
E → E A E | ( E ) | - E | id
A → + | - | * | / | ↑
SUMMARY OF NOTATIONAL CONVENTIONS
Context Free Grammars : A First Look
Production rules:

1. assign_stmt → id := expr ;
2. expr → expr operator term
3. expr → term
4. term → id
5. term → real
6. term → integer
7. operator → +
8. operator → -

Derivation: A sequence of grammar rule applications and substitutions that transform a starting nonterminal into a sequence of terminals / tokens.

Terminals: id real integer + - := ;
Nonterminals: assign_stmt, expr, operator, term
Start symbol: assign_stmt
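The eight production rules can be written down as data, and the terminal and nonterminal sets then follow mechanically. This is a minimal sketch under the assumption that a grammar is stored as a dictionary from each left-hand side to its list of right-hand sides; the names `GRAMMAR`, `nonterminals` and `terminals` are illustrative only.

```python
# Productions 1-8 of the assignment grammar, keyed by left-hand side.
GRAMMAR = {
    "assign_stmt": [["id", ":=", "expr", ";"]],
    "expr": [["expr", "operator", "term"], ["term"]],
    "term": [["id"], ["real"], ["integer"]],
    "operator": [["+"], ["-"]],
}
START = "assign_stmt"

def nonterminals(grammar):
    """Every symbol that appears on the left side of some production."""
    return set(grammar)

def terminals(grammar):
    """Every right-side symbol that never appears on a left side."""
    symbols = {sym for bodies in grammar.values() for body in bodies for sym in body}
    return symbols - set(grammar)

print(sorted(nonterminals(GRAMMAR)))
print(sorted(terminals(GRAMMAR)))
```

The computed sets agree with the Terminals and Nonterminals lists given with the grammar.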
Example Grammar:
Simple Arithmetic Expressions
1. expr → expr op expr
2. expr → ( expr )
3. expr → - expr
4. expr → id
5. op → +
6. op → -
7. op → *
8. op → /
9. op → ↑

9 Production rules
Terminals: id + - * / ↑ ( )
Nonterminals: expr, op
Start symbol: expr
DERIVATIONS
CFG Terminology
Derivation
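Let's derive: id := id + real - integer ;
Using the production rules of the assignment grammar (the rule applied at each step is shown on the right):
assign_stmt
→ id := expr ;                                  (rule 1)
→ id := expr operator term ;                    (rule 2)
→ id := expr operator term operator term ;      (rule 2)
→ id := term operator term operator term ;      (rule 3)
→ id := id operator term operator term ;        (rule 4)
→ id := id + term operator term ;               (rule 7)
→ id := id + real operator term ;               (rule 5)
→ id := id + real - term ;                      (rule 8)
→ id := id + real - integer ;                   (rule 6)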
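The derivation steps above always rewrite the leftmost nonterminal, so they can be replayed mechanically from a list of rule numbers. The sketch below assumes the numbering of the eight assignment-grammar productions, with rule 1 written with := as in the production list; `leftmost_step` and `derive` are hypothetical helper names.

```python
# Numbered productions of the assignment grammar: number -> (head, body).
PRODUCTIONS = {
    1: ("assign_stmt", ["id", ":=", "expr", ";"]),
    2: ("expr", ["expr", "operator", "term"]),
    3: ("expr", ["term"]),
    4: ("term", ["id"]),
    5: ("term", ["real"]),
    6: ("term", ["integer"]),
    7: ("operator", ["+"]),
    8: ("operator", ["-"]),
}
NONTERMINALS = {head for head, _ in PRODUCTIONS.values()}

def leftmost_step(form, rule):
    """Apply one production to the leftmost nonterminal of a sentential form."""
    head, body = PRODUCTIONS[rule]
    for i, symbol in enumerate(form):
        if symbol in NONTERMINALS:
            if symbol != head:
                raise ValueError(f"rule {rule} does not rewrite {symbol}")
            return form[:i] + body + form[i + 1:]
    raise ValueError("no nonterminal left to expand")

def derive(start, rules):
    """Return every sentential form produced by applying the rules in order."""
    forms = [[start]]
    for rule in rules:
        forms.append(leftmost_step(forms[-1], rule))
    return forms

steps = derive("assign_stmt", [1, 2, 2, 3, 4, 7, 5, 8, 6])
for form in steps:
    print(" ".join(form))
```

The last line printed is the derived sentence; every earlier line still contains at least one nonterminal.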
LEFTMOST DERIVATION
RIGHTMOST DERIVATION
PARSE TREE

Input: (id * id) + id
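A parse tree for this input can be written out as nested data and checked by reading its leaves from left to right. The sketch below is an illustration, assuming the simple arithmetic expression grammar (expr → expr op expr | ( expr ) | - expr | id); the tuple layout and the names `TREE`, `tree_yield` and `show` are hypothetical.

```python
# Each interior node is (nonterminal, [children]); each leaf is a terminal string.
TREE = ("expr", [
    ("expr", [
        "(",
        ("expr", [("expr", ["id"]), ("op", ["*"]), ("expr", ["id"])]),
        ")",
    ]),
    ("op", ["+"]),
    ("expr", ["id"]),
])

def tree_yield(node):
    """Collect the leaves of the tree from left to right."""
    if isinstance(node, str):
        return [node]
    _, children = node
    leaves = []
    for child in children:
        leaves.extend(tree_yield(child))
    return leaves

def show(node, depth=0):
    """Print the tree with one level of indentation per tree level."""
    if isinstance(node, str):
        print("  " * depth + node)
        return
    label, children = node
    print("  " * depth + label)
    for child in children:
        show(child, depth + 1)

show(TREE)
print(" ".join(tree_yield(TREE)))
```

The yield of the tree (its leaves read left to right) reproduces the input string, which is what makes it a parse tree for that input.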

AMBIGUOUS GRAMMAR

Two different parse trees!!

Which derivation of the parse tree is correct??
AMBIGUOUS GRAMMAR

YES
AMBIGUOUS GRAMMAR
PROBLEMS OF AMBIGUOUS GRAMMAR

Input: 4/2+2
The two parse trees evaluate to 4 and 1.

The parse tree that divides first, (4/2)+2, gives the Right Answer.

4/2+2=4
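The two results can be reproduced by evaluating both groupings of the input. This is a minimal sketch in which each tree is a nested tuple (left, operator, right) and / is taken to be ordinary real division; both choices are assumptions made for illustration.

```python
import operator

OPS = {"+": operator.add, "/": operator.truediv}

def evaluate(tree):
    """Evaluate a tree given as a number or a (left, op, right) tuple."""
    if isinstance(tree, (int, float)):
        return tree
    left, op, right = tree
    return OPS[op](evaluate(left), evaluate(right))

division_first = ((4, "/", 2), "+", 2)   # (4/2)+2
addition_first = (4, "/", (2, "+", 2))   # 4/(2+2)

print(evaluate(division_first))
print(evaluate(addition_first))
```

Only the first grouping matches the usual rule that division binds tighter than addition.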
PROBLEMS OF AMBIGUOUS GRAMMAR

The ambiguous grammar does not take Precedence and Associativity into account.
SOLUTION: REMOVING AMBIGUITY
Ambiguous Grammar:
How to Solve Associativity Problem?
Input: 3 + 2 + 6

Ambiguous Grammar
1. E → E + E
2. E → E * E
3. E → num

[Figure: Parse Tree-1 groups the input as 3 + (2 + 6); Parse Tree-2 groups it as (3 + 2) + 6.]

▪ Two different parse trees!!
▪ According to the Grammar both are correct.
▪ But Parse Tree-1 gives the WRONG grouping and Parse Tree-2 gives the RIGHT grouping for a left-associative operator. (For + both groupings evaluate to 11; for an operator such as - the two values would differ.)

✓ Operators with same precedence must be resolved by Associativity
✓ Some operators have left associativity (+, -, *, /) and some operators have right associativity (^)

Removing the associativity problem from the Grammar:
1. E → E + num
2. E → E * num
3. E → num

Each input now has a single, left-associative parse tree, but the Grammar still ignores precedence!!
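The ambiguity of the grammar E → E + E | E * E | num can be made concrete by enumerating every parse tree of a token list. The sketch below assumes the tokens alternate between numbers and operators; `parse_trees`, `show` and `evaluate` are hypothetical helper names.

```python
import operator

OPS = {"+": operator.add, "*": operator.mul}

def parse_trees(tokens):
    """Yield every parse tree of tokens under E -> E + E | E * E | num."""
    if len(tokens) == 1:
        yield tokens[0]                      # E -> num
        return
    # Try each operator in turn as the root of the tree.
    for i in range(1, len(tokens), 2):
        for left in parse_trees(tokens[:i]):
            for right in parse_trees(tokens[i + 1:]):
                yield (left, tokens[i], right)

def show(tree):
    """Render a tree as a fully parenthesized expression."""
    if isinstance(tree, int):
        return str(tree)
    left, op, right = tree
    return f"({show(left)} {op} {show(right)})"

def evaluate(tree):
    if isinstance(tree, int):
        return tree
    left, op, right = tree
    return OPS[op](evaluate(left), evaluate(right))

for tokens in ([3, "+", 2, "+", 6], [3, "+", 2, "*", 6]):
    for tree in parse_trees(tokens):
        print(show(tree), "=", evaluate(tree))
```

Each three-operand input produces two trees. For 3 + 2 + 6 they differ only in grouping, while for 3 + 2 * 6 they also differ in value.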
Ambiguous Grammar:
How to Solve Associativity Problem?(2)
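Input: 3 + 2 + 6

Removing the associativity problem from the Grammar:
1. E → E + num
2. E → E * num
3. E → num

[Figure: parse tree grouping the input as (3 + 2) + 6, with num(3) deepest in the tree and num(6) nearest the root.]

Each input now has a single, left-associative parse tree, but the Grammar still ignores precedence!!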
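The effect of the left-recursive rules E → E + num | E * num | num can be sketched with a loop that folds the tokens from left to right. The loop stands in for the left recursion and is an implementation assumption, as are the helper names `parse_left`, `show` and `evaluate`; the tokens are assumed to alternate between numbers and operators.

```python
import operator

OPS = {"+": operator.add, "*": operator.mul}

def parse_left(tokens):
    """Build the tree for E -> E + num | E * num | num by folding left to right."""
    tree = tokens[0]                                  # E -> num
    for i in range(1, len(tokens), 2):
        tree = (tree, tokens[i], tokens[i + 1])       # E -> E op num
    return tree

def show(tree):
    if isinstance(tree, int):
        return str(tree)
    left, op, right = tree
    return f"({show(left)} {op} {show(right)})"

def evaluate(tree):
    if isinstance(tree, int):
        return tree
    left, op, right = tree
    return OPS[op](evaluate(left), evaluate(right))

for tokens in ([3, "+", 2, "+", 6], [3, "+", 2, "*", 6]):
    tree = parse_left(tokens)
    print(show(tree), "=", evaluate(tree))
```

The grouping is left-associative in both cases, but 3 + 2 * 6 is grouped as (3 + 2) * 6, so these rules alone do not give * a higher precedence than +.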
Ambiguous Grammar:
How to Solve Precedence Problem?
Input: 3 + 2 * 6

Ambiguous Grammar
1. E → E + E
2. E → E * E
3. E → num

[Figure: Parse Tree-1 groups the input as 3 + (2 * 6); Parse Tree-2 groups it as (3 + 2) * 6.]

▪ Two different parse trees!!
▪ According to the Grammar both are correct.
▪ But Parse Tree-1 gives the RIGHT answer (15) and Parse Tree-2 gives the WRONG answer (30).

✓ Lower precedence operation rules should be declared in the upper level in the Grammar
✓ Higher precedence operation rules should be declared in the lower level in the Grammar

After Conversion to Unambiguous Grammar:
1. E → E + T | T
2. T → T * F | F
3. F → num
Ambiguous Grammar:
How to Solve Precedence Problem?(2)
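Input: 3 + 2 * 6

After Conversion to Unambiguous Grammar:
1. E → E + T | T
2. T → T * F | F
3. F → num

[Figure: parse tree. The root E expands to E + T. The left E derives T, then F, then num(3). The right T expands to T * F, where T derives F, then num(2), and F derives num(6).]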
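The layered grammar E → E + T | T, T → T * F | F, F → num can be turned into a small hand-written parser with one method per nonterminal. This is a sketch under stated assumptions: the left-recursive rules are implemented as loops (a common way to handle left recursion in a hand-written parser, not something the slides prescribe), and the names `tokenize`, `Parser`, `parse`, `show` and `evaluate` are hypothetical.

```python
import operator
import re

OPS = {"+": operator.add, "*": operator.mul}

def tokenize(text):
    """Split text into ints and the operators + and *; other characters are skipped."""
    return [int(t) if t.isdigit() else t for t in re.findall(r"\d+|[+*]", text)]

class Parser:
    def __init__(self, tokens):
        self.tokens = tokens
        self.pos = 0

    def peek(self):
        return self.tokens[self.pos] if self.pos < len(self.tokens) else None

    def parse_E(self):                 # E -> E + T | T
        tree = self.parse_T()
        while self.peek() == "+":
            self.pos += 1
            tree = (tree, "+", self.parse_T())
        return tree

    def parse_T(self):                 # T -> T * F | F
        tree = self.parse_F()
        while self.peek() == "*":
            self.pos += 1
            tree = (tree, "*", self.parse_F())
        return tree

    def parse_F(self):                 # F -> num
        token = self.peek()
        if not isinstance(token, int):
            raise SyntaxError(f"expected num, got {token!r}")
        self.pos += 1
        return token

def parse(text):
    parser = Parser(tokenize(text))
    tree = parser.parse_E()
    if parser.peek() is not None:
        raise SyntaxError(f"unexpected token {parser.peek()!r}")
    return tree

def show(tree):
    if isinstance(tree, int):
        return str(tree)
    left, op, right = tree
    return f"({show(left)} {op} {show(right)})"

def evaluate(tree):
    if isinstance(tree, int):
        return tree
    left, op, right = tree
    return OPS[op](evaluate(left), evaluate(right))

tree = parse("3 + 2 * 6")
print(show(tree), "=", evaluate(tree))
```

Because * is only reachable through T, which sits below E, every multiplication is grouped before the surrounding addition, and each loop builds its tree left-associatively.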
UNAMBIGUOUS GRAMMAR
Reading Materials
▪ Chapter 4 of your textbook: Compilers: Principles, Techniques, and Tools
THE END
