
Ch2 Modified

1) The document describes building a simple compiler by defining a programming language syntax using context-free grammar, developing a predictive parser, and implementing syntax-directed translation to generate intermediate code. 2) A context-free grammar consists of tokens, nonterminals, productions, and a start symbol. Productions specify rewriting rules to derive strings from the grammar. 3) Derivations and parse trees represent the structure of strings according to the grammar. Derivations apply productions to replace nonterminals, and parse trees visually depict the structure.

Uploaded by

Hassnain Abbas

A SIMPLE SYNTAX-DIRECTED TRANSLATOR

Chapter 2

Building a Simple Compiler

• Building our compiler involves:
– Defining the syntax of a programming language
– Developing a source-code parser (for our compiler we will use predictive parsing)
– Implementing syntax-directed translation to generate intermediate code

Syntax Definition
• A context-free grammar is a 4-tuple consisting of
– A set of tokens (terminal symbols)
– A set of nonterminals
– A set of productions
– A designated start symbol

Example Grammar

Context-free grammar for simple expressions:

G = <{list,digit}, {+,-,0,1,2,3,4,5,6,7,8,9}, P, list>

with productions P =

list → list + digit

list → list - digit

list → digit

digit → 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9
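
The 4-tuple above can be written down directly as a data structure. The following is a minimal sketch, not part of the original slides: the names Grammar, G and is_well_formed are hypothetical, and the field order follows the tuple <nonterminals, terminals, P, start> used in the example.

```python
from typing import NamedTuple


class Grammar(NamedTuple):
    """A context-free grammar as a 4-tuple (hypothetical helper type)."""
    nonterminals: frozenset
    terminals: frozenset
    productions: dict  # nonterminal -> list of right-hand sides (tuples of symbols)
    start: str


DIGITS = tuple("0123456789")

# The example grammar G for lists of digits separated by + and -.
G = Grammar(
    nonterminals=frozenset({"list", "digit"}),
    terminals=frozenset({"+", "-", *DIGITS}),
    productions={
        "list": [("list", "+", "digit"), ("list", "-", "digit"), ("digit",)],
        "digit": [(d,) for d in DIGITS],
    },
    start="list",
)


def is_well_formed(g: Grammar) -> bool:
    """Check that the start symbol and every symbol used in a production are declared."""
    symbols = g.nonterminals | g.terminals
    return (
        g.start in g.nonterminals
        and g.nonterminals.isdisjoint(g.terminals)
        and all(head in g.nonterminals for head in g.productions)
        and all(
            sym in symbols
            for bodies in g.productions.values()
            for body in bodies
            for sym in body
        )
    )


print(is_well_formed(G))
```

Storing the productions as a mapping from each nonterminal to its right-hand sides is only one possible choice; it makes "all productions for this nonterminal" a single lookup, which is what a derivation step needs.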

Derivation
• Given a CF grammar we can determine the
set of all strings (sequences of tokens)
generated by the grammar using derivation
– We begin with the start symbol
– In each step, we replace one nonterminal in the
current sentential form with one of the right-
hand sides of a production for that nonterminal

Derivation for the Example Grammar

list
⇒ list + digit
⇒ list - digit + digit
⇒ digit - digit + digit
⇒ 9 - digit + digit
⇒ 9 - 5 + digit
⇒ 9 - 5 + 2

This is an example of a leftmost derivation, because we replaced the leftmost nonterminal in each step.
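
The leftmost derivation of 9 - 5 + 2 can be replayed mechanically. This is a small sketch under stated assumptions: the helper leftmost_step and the list of chosen right-hand sides are illustrative names, and the productions are those of the example grammar G.

```python
# Productions of the example grammar G (nonterminal -> right-hand sides).
PRODUCTIONS = {
    "list": [["list", "+", "digit"], ["list", "-", "digit"], ["digit"]],
    "digit": [[d] for d in "0123456789"],
}


def leftmost_step(form, rhs):
    """Replace the leftmost nonterminal in the sentential form with rhs."""
    for i, sym in enumerate(form):
        if sym in PRODUCTIONS:
            if rhs not in PRODUCTIONS[sym]:
                raise ValueError(f"{sym} -> {' '.join(rhs)} is not a production")
            return form[:i] + rhs + form[i + 1:]
    raise ValueError("no nonterminal left to replace")


# Right-hand side chosen at each step, in the order used on the slide.
choices = [
    ["list", "+", "digit"],
    ["list", "-", "digit"],
    ["digit"],
    ["9"],
    ["5"],
    ["2"],
]

form = ["list"]
steps = [form]
for rhs in choices:
    form = leftmost_step(form, rhs)
    steps.append(form)

for s in steps:
    print(" ".join(s))
```

Each printed line is one sentential form; the last one contains only tokens, so the derivation is complete.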

Rightmost Derivation for the Example Grammar
Likewise, a rightmost derivation replaces the rightmost nonterminal in each step

with productions P =

list → digit + list
list → digit - list
list → digit
digit → 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

list
⇒ digit - list
⇒ digit - digit + list
⇒ digit - digit + digit
⇒ digit - digit + 2
⇒ digit - 5 + 2
⇒ 9 - 5 + 2

Parse Trees
• The root of the tree is labeled by the start symbol
• Each leaf of the tree is labeled by a terminal (=token) or ε
• Each interior node is labeled by a nonterminal
• If A → X1 X2 … Xn is a production, then node A has immediate children X1, X2, …, Xn where Xi is a (non)terminal or ε (ε denotes the empty string)

Parse Tree for the Example Grammar
Parse tree of the string 9-5+2 using grammar G (children are indented under their parent):

list
  list
    list
      digit
        9
    -
    digit
      5
  +
  digit
    2

The sequence of leaves is called the yield of the parse tree.

The Two Derivations for x – 2 * y

Leftmost derivation
Rule  Sentential Form
-     Expr
1     Expr Op Expr
3     <id,x> Op Expr
5     <id,x> – Expr
1     <id,x> – Expr Op Expr
2     <id,x> – <num,2> Op Expr
6     <id,x> – <num,2> * Expr
3     <id,x> – <num,2> * <id,y>

Rightmost derivation
Rule  Sentential Form
-     Expr
1     Expr Op Expr
3     Expr Op <id,y>
6     Expr * <id,y>
1     Expr Op Expr * <id,y>
2     Expr Op <num,2> * <id,y>
5     Expr – <num,2> * <id,y>
3     <id,x> – <num,2> * <id,y>

In both cases, Expr ⇒* id – num * id

• The two derivations produce different parse trees
• The parse trees imply different evaluation orders!

Derivations and Parse Trees

Parse tree for the leftmost derivation (children are indented under their parent):

E
  E
    x
  Op
    –
  E
    E
      2
    Op
      *
    E
      y

This evaluates as x – ( 2 * y )

Derivations and Parse Trees

Parse tree for the rightmost derivation:

E
  E
    E
      x
    Op
      –
    E
      2
  Op
    *
  E
    y

This evaluates as ( x – 2 ) * y
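
To see that the two parse trees really imply different results, they can be evaluated directly. The sketch below is illustrative only: the nested-tuple encoding of the trees, the evaluate helper and the sample values for x and y are assumptions, not part of the slides.

```python
import operator

OPS = {"-": operator.sub, "*": operator.mul}


def evaluate(tree, env):
    """Evaluate a tree given as a variable name, a number, or (left, op, right)."""
    if isinstance(tree, tuple):
        left, op, right = tree
        return OPS[op](evaluate(left, env), evaluate(right, env))
    if isinstance(tree, str):
        return env[tree]  # identifier: look up its value
    return tree           # numeric literal


leftmost_tree = ("x", "-", (2, "*", "y"))   # x - (2 * y)
rightmost_tree = (("x", "-", 2), "*", "y")  # (x - 2) * y

env = {"x": 10, "y": 3}  # sample values, chosen only for illustration

print(evaluate(leftmost_tree, env))   # 10 - (2 * 3) = 4
print(evaluate(rightmost_tree, env))  # (10 - 2) * 3 = 24
```

The same token sequence gives 4 under one tree and 24 under the other, which is why a grammar that admits both trees is a problem for a compiler.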

Ambiguity

Consider the following context-free grammar:

G = <{string}, {+,-,0,1,2,3,4,5,6,7,8,9}, P, string>

with productions P =

string → string + string | string - string | 0 | 1 | … | 9

This grammar is ambiguous, because more than one parse tree represents the string 9-5+2

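Ambiguity (cont’d)

Two parse trees for 9-5+2 (children are indented under their parent):

Tree 1, grouping (9 - 5) + 2:

string
  string
    string
      9
    -
    string
      5
  +
  string
    2

Tree 2, grouping 9 - (5 + 2):

string
  string
    9
  -
  string
    string
      5
    +
    string
      2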
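
Ambiguity can be checked by counting parse trees. The following sketch assumes the ambiguous grammar string → string + string | string - string | 0 | … | 9 and single-character tokens; the function name count_parse_trees is hypothetical.

```python
from functools import lru_cache

DIGITS = set("0123456789")
OPERATORS = {"+", "-"}


def count_parse_trees(tokens):
    """Count the parse trees of tokens under the ambiguous 'string' grammar."""
    tokens = tuple(tokens)

    @lru_cache(maxsize=None)
    def count(lo, hi):
        # Number of ways tokens[lo:hi] can be derived from 'string'.
        if hi - lo == 1:
            return 1 if tokens[lo] in DIGITS else 0
        total = 0
        # Try every operator position k as the root: string -> string op string.
        for k in range(lo + 1, hi - 1):
            if tokens[k] in OPERATORS:
                total += count(lo, k) * count(k + 1, hi)
        return total

    return count(0, len(tokens))


print(count_parse_trees("9-5+2"))
```

A count greater than one for some input means the grammar is ambiguous for that input; a count of zero means the input is not in the language.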

Associativity of Operators
Left-associative operators have left-recursive productions
left → left + term | term
String a+b+c has the same meaning as (a+b)+c
Right-associative operators have right-recursive productions
right → term = right | term
String a=b=c has the same meaning as a=(b=c)
Operators on the same line have the same associativity and precedence:
left-associative: + -

left-associative: * /
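
The effect of associativity shows up as soon as the operator is not associative in the mathematical sense. This sketch evaluates the same tokens both ways; treating "-" as right-associative is purely a hypothetical contrast, and both function names are illustrative.

```python
def eval_left_assoc(tokens):
    """Mirror left -> left - term | term: fold the operands from the left."""
    value = int(tokens[0])
    for i in range(1, len(tokens), 2):
        if tokens[i] != "-":
            raise SyntaxError(f"expected '-', found {tokens[i]!r}")
        value -= int(tokens[i + 1])
    return value


def eval_right_assoc(tokens):
    """Mirror right -> term - right | term: recurse on the rest of the input."""
    if len(tokens) == 1:
        return int(tokens[0])
    if tokens[1] != "-":
        raise SyntaxError(f"expected '-', found {tokens[1]!r}")
    return int(tokens[0]) - eval_right_assoc(tokens[2:])


tokens = list("9-5-2")
print(eval_left_assoc(tokens))   # (9 - 5) - 2 = 2
print(eval_right_assoc(tokens))  # 9 - (5 - 2) = 6
```

The loop corresponds to the left-recursive production and the recursion to the right-recursive one, which is the correspondence the slide states.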

Syntax of Statements

Syntax-Directed Translation
• Uses a CF grammar to specify the syntactic structure of the language
• Associates a set of attributes with the terminals and nonterminals of the grammar
• Associates with each production a set of semantic rules to compute the values of attributes
• A parse tree is traversed and the semantic rules are applied: after the tree traversal(s) are completed, the attribute values on the nonterminals contain the translated form of the input

Synthesized and Inherited Attributes
• Suppose a node N in a parse tree is labeled by the grammar symbol X. We write X.a to denote the value of attribute a of X at that node.
• An attribute is said to be …
– synthesized if its value at a parse-tree node is determined from the attribute values at the children of the node
– inherited if its value at a parse-tree node is determined from the attribute values at the node's parent and siblings (by enforcing the semantic rules of the parent's production)

Example Attribute Grammar

Syntax-directed definition for infix to postfix translation (// is the string concatenation operator)

Production             Semantic Rule
expr → expr1 + term    expr.t := expr1.t // term.t // “+”
expr → expr1 - term    expr.t := expr1.t // term.t // “-”
expr → term            expr.t := term.t
term → 0               term.t := “0”
term → 1               term.t := “1”
…                      …
term → 9               term.t := “9”

Example Annotated Parse Tree

Annotated parse tree for 9 - 5 + 2 (children are indented under their parent):

expr.t = “95-2+”
  expr.t = “95-”
    expr.t = “9”
      term.t = “9”
        9
    -
    term.t = “5”
      5
  +
  term.t = “2”
    2

Depth-First Traversals
procedure visit(n : node);
begin
    for each child m of n, from left to right do
        visit(m);
    evaluate semantic rules at node n
end
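
The visit procedure can be sketched in runnable form for the infix-to-postfix rules. This is an illustration under assumptions: the Node class, the term helper and the order list are hypothetical, and the semantic rules are hard-coded for the expr/term grammar rather than looked up per production.

```python
class Node:
    """Parse-tree node; t is the synthesized attribute (None for tokens)."""

    def __init__(self, label, children=()):
        self.label = label
        self.children = list(children)
        self.t = None


def visit(n, order):
    """Visit the children left to right, then evaluate the rule at n."""
    for m in n.children:
        visit(m, order)
    if n.label == "term":
        n.t = n.children[0].label            # term.t := the digit
    elif n.label == "expr" and len(n.children) == 1:
        n.t = n.children[0].t                # expr.t := term.t
    elif n.label == "expr":
        left, op, right = n.children
        n.t = left.t + right.t + op.label    # expr.t := expr1.t // term.t // op
    if n.t is not None:
        order.append((n.label, n.t))         # record evaluation order


def term(digit):
    return Node("term", [Node(digit)])


# Parse tree of 9 - 5 + 2 under the left-recursive expr grammar.
tree = Node("expr", [
    Node("expr", [Node("expr", [term("9")]), Node("-"), term("5")]),
    Node("+"),
    term("2"),
])

order = []
visit(tree, order)
print(tree.t)
```

Because every attribute here is synthesized, a single postorder pass is enough: each rule runs only after the attributes of the children are known.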

Depth-First Traversals (Example)

A depth-first traversal of the annotated parse tree for 9 - 5 + 2 evaluates the attributes in this order:

term.t = “9”
expr.t = “9”
term.t = “5”
expr.t = “95-”
term.t = “2”
expr.t = “95-2+”

Note: all attributes are of the synthesized type

Translation Schemes
• A translation scheme is a CF grammar embedded with semantic actions
• When drawing a parse tree for a translation scheme, we indicate an action by constructing an extra child for it, connected by a dashed line to the node that corresponds to the head of the production.

rest → + term { print(“+”) } rest

In the parse tree, the node rest has the children +, term, { print(“+”) } and rest; the embedded semantic action { print(“+”) } is the extra child attached by a dashed line.

Example Translation Scheme

expr → expr + term { print(“+”) }
expr → expr - term { print(“-”) }
expr → term
term → 0 { print(“0”) }
term → 1 { print(“1”) }
… …
term → 9 { print(“9”) }

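Example Translation Scheme (cont’d)

Parse tree for 9-5+2 with the semantic actions as extra children (children are indented under their parent):

expr
  expr
    expr
      term
        9
        { print(“9”) }
    -
    term
      5
      { print(“5”) }
    { print(“-”) }
  +
  term
    2
    { print(“2”) }
  { print(“+”) }

Translates 9-5+2 into postfix 95-2+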
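
A translation scheme can be executed directly by a recursive-descent procedure per nonterminal. The sketch below does not use the left-recursive expr productions as written; it assumes they are rewritten as expr → term rest with rest → + term { print(“+”) } rest | - term { print(“-”) } rest | ε, in the style of the rest production shown earlier. That rewrite, the function names and collecting output in a list instead of printing are all assumptions.

```python
def infix_to_postfix(text):
    """Translate single digits joined by + and - into postfix notation."""
    out = []   # collects what the semantic actions would print
    pos = 0

    def lookahead():
        return text[pos] if pos < len(text) else None

    def match(expected):
        nonlocal pos
        if lookahead() != expected:
            raise SyntaxError(f"expected {expected!r} at position {pos}")
        pos += 1

    def term():
        tok = lookahead()
        if tok is not None and tok in "0123456789":
            match(tok)
            out.append(tok)      # action: { print(digit) }
        else:
            raise SyntaxError(f"digit expected at position {pos}")

    def rest():
        tok = lookahead()
        if tok in ("+", "-"):
            match(tok)
            term()
            out.append(tok)      # action: { print("+") } or { print("-") }
            rest()
        # otherwise rest -> empty string: nothing to do

    term()
    rest()
    if lookahead() is not None:
        raise SyntaxError(f"unexpected {lookahead()!r} at position {pos}")
    return "".join(out)


print(infix_to_postfix("9-5+2"))
```

The position of each out.append call inside term and rest mirrors the position of the action inside the production body, which is what makes the operator come out after both of its operands.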

Parsing
• Parsing = process of determining if a string of tokens can be generated by a grammar
• For any CF grammar there is a parser that takes at most O(n³) time to parse a string of n tokens
• Top-down parsing “constructs” a parse tree from root to leaves
• Bottom-up parsing “constructs” a parse tree from leaves to root

Predictive Parsing
• Recursive descent parsing is a top-down parsing
method
– Each nonterminal has one (recursive) procedure that is
responsible for parsing the nonterminal’s syntactic
category of input tokens
– When a nonterminal has multiple productions, each production is implemented in a branch of a selection statement based on the lookahead information
• Predictive parsing is a special form of recursive descent parsing where we use one lookahead token to unambiguously determine the parse operations

Example Predictive Parser (Grammar)

type → simple
| ^ id
| array [ simple ] of type
simple → integer
| char
| num dotdot num

Example Predictive Parser (Execution Step 1)

Check lookahead and call match (calls made by a procedure are indented under it):

type()
  match(‘array’)

Input: array [ num dotdot num ] of integer
lookahead: array

Example Predictive Parser (Execution Step 2)

type()
  match(‘array’)
  match(‘[’)

Input: array [ num dotdot num ] of integer
lookahead: [

Example Predictive Parser (Execution Step 3)

type()
  match(‘array’)
  match(‘[’)
  simple()
    match(‘num’)

Input: array [ num dotdot num ] of integer
lookahead: num (first occurrence)

Example Predictive Parser (Execution Step 4)

type()
  match(‘array’)
  match(‘[’)
  simple()
    match(‘num’)
    match(‘dotdot’)

Input: array [ num dotdot num ] of integer
lookahead: dotdot

Example Predictive Parser (Execution Step 5)

type()
  match(‘array’)
  match(‘[’)
  simple()
    match(‘num’)
    match(‘dotdot’)
    match(‘num’)

Input: array [ num dotdot num ] of integer
lookahead: num (second occurrence)

Example Predictive Parser (Execution Step 6)

type()
  match(‘array’)
  match(‘[’)
  simple()
    match(‘num’)
    match(‘dotdot’)
    match(‘num’)
  match(‘]’)

Input: array [ num dotdot num ] of integer
lookahead: ]

Example Predictive Parser (Execution Step 7)

type()
  match(‘array’)
  match(‘[’)
  simple()
    match(‘num’)
    match(‘dotdot’)
    match(‘num’)
  match(‘]’)
  match(‘of’)

Input: array [ num dotdot num ] of integer
lookahead: of

Example Predictive Parser (Execution Step 8)

type()
  match(‘array’)
  match(‘[’)
  simple()
    match(‘num’)
    match(‘dotdot’)
    match(‘num’)
  match(‘]’)
  match(‘of’)
  type()
    simple()
      match(‘integer’)

Input: array [ num dotdot num ] of integer
lookahead: integer
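
The execution steps above can be reproduced with a small predictive parser. This is a sketch, not the parser from the slides: the Parser class, the "$" end marker, the trace list and the method names parse_type and parse_simple (standing in for type() and simple()) are assumptions, and the input is taken as space-separated token names.

```python
class Parser:
    """Predictive parser for the type/simple grammar with one lookahead token."""

    def __init__(self, tokens):
        self.tokens = list(tokens) + ["$"]  # "$" marks end of input (assumption)
        self.pos = 0
        self.trace = []                      # tokens matched so far, in order

    @property
    def lookahead(self):
        return self.tokens[self.pos]

    def match(self, t):
        if self.lookahead != t:
            raise SyntaxError(f"expected {t!r}, found {self.lookahead!r}")
        self.trace.append(t)
        self.pos += 1

    def parse_type(self):
        if self.lookahead in ("integer", "char", "num"):
            self.parse_simple()              # type -> simple
        elif self.lookahead == "^":
            self.match("^")                  # type -> ^ id
            self.match("id")
        elif self.lookahead == "array":
            self.match("array")              # type -> array [ simple ] of type
            self.match("[")
            self.parse_simple()
            self.match("]")
            self.match("of")
            self.parse_type()
        else:
            raise SyntaxError(f"unexpected {self.lookahead!r}")

    def parse_simple(self):
        if self.lookahead == "integer":
            self.match("integer")            # simple -> integer
        elif self.lookahead == "char":
            self.match("char")               # simple -> char
        elif self.lookahead == "num":
            self.match("num")                # simple -> num dotdot num
            self.match("dotdot")
            self.match("num")
        else:
            raise SyntaxError(f"unexpected {self.lookahead!r}")


def parse(text):
    """Parse a space-separated token string and return the match trace."""
    p = Parser(text.split())
    p.parse_type()
    if p.lookahead != "$":
        raise SyntaxError(f"trailing input starting at {p.lookahead!r}")
    return p.trace


print(parse("array [ num dotdot num ] of integer"))
```

Each branch is selected by looking only at the current lookahead token, which works here because the right-hand sides for a nonterminal all start with different tokens.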

Adding a Lexical Analyzer

• Typical tasks of the lexical analyzer:
– Remove white space and comments
– Encode constants as tokens
– Recognize keywords
– Recognize identifiers and store identifier names
in a global symbol table

The Lexical Analyzer “lexer”

Input: y := 31 + 28*x

The lexical analyzer lexan() turns the input into the token stream

<id, “y”> <assign, > <num, 31> <‘+’, > <num, 28> <‘*’, > <id, “x”>

The parser parse() receives each token (the lookahead) together with tokenval (the token attribute).
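
A lexer of this kind can be sketched with a regular expression. The code below is a minimal illustration under assumptions: the token patterns, the use of None for a missing attribute and returning a list are choices made here, and keyword recognition and the symbol table mentioned earlier are left out.

```python
import re

# One named group per token class; whitespace is matched so it can be skipped.
TOKEN_RE = re.compile(
    r"(?P<num>\d+)"
    r"|(?P<id>[A-Za-z_]\w*)"
    r"|(?P<assign>:=)"
    r"|(?P<op>[+\-*/])"
    r"|(?P<ws>\s+)"
)


def lexan(source):
    """Return a list of (token, tokenval) pairs for the source string."""
    tokens = []
    pos = 0
    while pos < len(source):
        m = TOKEN_RE.match(source, pos)
        if m is None:
            raise SyntaxError(f"unexpected character {source[pos]!r} at {pos}")
        pos = m.end()
        kind = m.lastgroup
        if kind == "ws":
            continue                                # remove white space
        if kind == "num":
            tokens.append(("num", int(m.group())))  # constant as token + value
        elif kind == "id":
            tokens.append(("id", m.group()))        # identifier with its name
        elif kind == "assign":
            tokens.append(("assign", None))
        else:
            tokens.append((m.group(), None))        # operator is its own token
    return tokens


print(lexan("y := 31 + 28*x"))
```

A parser would consume these pairs one at a time, using the first component as the lookahead and the second as tokenval.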
