KEMBAR78
Slides Regular | PDF | String (Computer Science) | Logic
0% found this document useful (0 votes)
80 views312 pages

Slides Regular

The document discusses deterministic finite automata (DFA) and regular languages. It provides examples of DFAs with different states and transitions labeled with binary symbols. The DFAs either accept or reject input strings based on whether the string causes the automaton to end in an accept state. The language recognized by a DFA is the set of strings that cause it to accept. Memory in DFAs is encoded in their finite states. The document also establishes a convention where a "sink state" is used to represent strings that can never be accepted.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
80 views312 pages

Slides Regular

The document discusses deterministic finite automata (DFA) and regular languages. It provides examples of DFAs with different states and transitions labeled with binary symbols. The DFAs either accept or reject input strings based on whether the string causes the automaton to end in an accept state. The language recognized by a DFA is the set of strings that cause it to accept. Memory in DFAs is encoded in their finite states. The document also establishes a convention where a "sink state" is used to represent strings that can never be accepted.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 312

Big picture

● All languages
● Decidable
Turing machines
● NP
● P
● Context-free
Context-free grammars, push-down automata
● Regular
Automata, non-deterministic automata,
regular expressions
DFA (Deterministic Finite Automata)
0 1
0 1
q0 qa
1 0 0
1
DFA (Deterministic Finite Automata)
0 1
0 1
q0 qa
1 0 0
1
● States , this DFA has 4 states

● Transitions
labelled with elements of the alphabet S = {0,1}
DFA (Deterministic Finite Automata)
0 1
0 1
q0 qa
1 0 0
1
Computation on input w:
● Begin in start state q0
● Read input string in a one-way fashion
● Follow the arrows matching input symbols
● When input ends: ACCEPT if in accept state
REJECT if not
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 0011
DFA (Deterministic Finite Automata)

always start in start state


0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 0011
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 0011
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 0011
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 0011
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 0011
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 0011 ACCEPT
because end in
accept state
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 010
DFA (Deterministic Finite Automata)

always start in start state


0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 010
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 010
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 010
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 010
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string


w = 010 REJECT
because does not
end in accept state
DFA (Deterministic Finite Automata)

0 1
0 1
q0 qa
1 0
0
1

Example: Input string w = 01 ACCEPT


w = 010 REJECT
w = 0011 ACCEPT
w = 00110 REJECT
DFA (Deterministic Finite Automata)

0 1
M :=
0 1
q0 qa
1 0
0
1
M recognizes language
L(M) = { w : w starts with 0 and ends with 1 }
L(M) is the language of strings causing M to accept

Example: 0101 is an element of L(M), 0101  L(M)


Example 1
M := q0 q1
S = {0,1} 1 0
0


00 causes M to accept, so 00 is in L(M) 00  L(M)
● 01 does not cause M to accept, so 01 not in L(M),
01  L(M)

0101  L(M)

01101100  L(M)

011010  L(M)
Example 1
M := q0 q1
S = {0,1} 1 0
0

L(M) = {w : w has an even number of 1 }

Note: If there is no 1, then there are zero 1,


zero is an even number, so M should accept.

Indeed 0000000  L(M)


Example
M := 1
S = {0,1}

● L(M) = ?
Example
M := 1
S = {0,1}

● L(M) = every possible string over {0,1}

= {0,1}*
Example 1

S = {0,1}
0 0
M :=
q0
1
1

0
● L(M) = ?
Example 1

S = {0,1}
0 0
M :=
q0
1
1

0

L(M) = all strings over {0,1} except empty string e
= {0,1}* - { e }
Example
S = {0,1} 0 1
1
M :=
0 0
q0
0
1
1 0
1

● L(M) = ?
Example
S = {0,1} 0 1
1
M :=
0 0
q0
0
1
1 0
1

● L(M) = { w : w starts and ends with same symbol }


● Memory is encoded in … what ?
Example
S = {0,1} 0 1
1
M := Remember 0
0 0
q0
0
1 Remember 1
1 0
1

● L(M) = { w : w starts and ends with same symbol }


● Memory is encoded in states.
DFA have finite states, so finite memory
Convention:

0 1
M :=
0 1
q0 qa
1 0
0
We already saw that 1

L(M) = { w : w starts with 0 and ends with 1 }

The arrow q0 leads to a “sink” state.


1 If followed, M can never accept
Convention:

0 1
M :=
0 1
q0 qa
0
Don't need to write such arrows:
If, from some state, read symbol with no
corresponding arrow, imagine M goes into “sink state”
that is not shown, and REJECT.

This makes pictures more compact.


Another convention:

List multiple transition on same arrow:

0,1,2

Means 0
1
2

This makes pictures more compact.


Example ∑ = {0,1}

M=
0,1 0,1

L(M) = ?
Example ∑ = {0,1}

M=
0,1 0,1

L(M) = ∑2 = {00,01,10,11}
Example from programming languages:
Recognize strings representing numbers:
S = {0,1,2,3,4,5,6,7,8,9, +, -, . }
+ 0,...,9
0,...,9
-
.
0,...,9

0,...,9

Note: 0,...,9 means 0,1,2,3,4,5,6,7,8,9: 10 transitions


Example from programming languages:
Recognize strings representing numbers:
S = {0,1,2,3,4,5,6,7,8,9, +, -, . }
+ 0,...,9
0,...,9
-
.
0,...,9

Possibly put sign (+, -) 0,...,9


Follow with arbitrarily many digits, but at least one
Possibly put decimal point
Follow with arbitrarily many digits, possibly none
Example from programming languages:
Recognize strings representing numbers:
S = {0,1,2,3,4,5,6,7,8,9, +, -, . }
+ 0,...,9
0,...,9
-
.
0,...,9

Input w = 17 ACCEPT 0,...,9


Input w = + REJECT
Input w = -3.25 ACCEPT
Input w = +2.35-. REJECT
Example
S = {0,1}

● What about { w : w has same number of 0 and 1 }

● Can you design a DFA that recognizes that?

● It seems you need infinite memory

● We will prove later that


there is no DFA that recognizes that language !
Next: formal definition of DFA
● Useful to prove various properties of DFA

● Especially important to prove that things CANNOT


be
recognized by DFA.

● Useful to practice mathematical notation


State diagram of a DFA:

● One or more states

● Exactly one start state

● Some number of accept states

● Labelled transitions exiting each state, 1


for every symbol in S
● Definition: A finite automaton (DFA)
is a 5-tuple (Q, S, d, q0, F) where

● Q is a finite set of states



S is the input alphabet

d : Q X S → Q is the transition function
●q
0 in Q is the start state

F  Q is the set of accept states

Q X S is the set of ordered pairs (a,b) : a ∈ Q, b ∈ ∑


Example {q,r,s}X{0,1}={(q,0),(q,1),(r,0),(r,1),(s,0),(s,1)}
1
q0 q1
1 0
0

● Example: above DFA is 5-tuple (Q, S, d, q0, F) where


● Q = { q0, q1}

S = {0,1}
● d(q
0 ,0) = ?
1
q0 q1
1 0
0

● Example: above DFA is 5-tuple (Q, S, d, q0, F) where


● Q = { q0, q1}

S = {0,1}
● d(q d(q0 ,1) = ?
0 ,0) = q0
1
q0 q1
1 0
0

● Example: above DFA is 5-tuple (Q, S, d, q0, F) where


● Q = { q0, q1}

S = {0,1}
● d(q d(q0 ,1) = q1
0 ,0) = q0
d(q1 ,0) = q1 d(q1 ,1) = q0
● q0 in Q is the start state
● F=?
1
q0 q1
1 0
0

● Example: above DFA is 5-tuple (Q, S, d, q0, F) where


● Q = { q0, q1}

S = {0,1}
● d(q d(q0 ,1) = q1
0 ,0) = q0
d(q1 ,0) = q1 d(q1 ,1) = q0
● q0 in Q is the start state
● F = { q0}  Q is the set of accept states
● Definition: A DFA (Q, S, d, q0, F) accepts a string w if
● w = w 1 w2 … w k where,  1  i  k, wi is in S
(the k symbols of w)

● The sequence of k+1 states r0, r1, .., rk where


ri = is state DFA is in after reading i-th symbol in w:
(1) r0 = q0, and
(2) ri+1 = d(ri ,wi+1 )  0  i < k
has rk in F
● We call this sequence the trace of the DFA on w
1
Example q0 q1
1 0
0

● Above DFA (Q, S, d, q0, F) accepts w = 011


1
Example q0 q1
1 0
0

● Above DFA (Q, S, d, q0, F) accepts w = 011


● w = 011 = w1 w2 w3 w1 = 0 w 2 = 1 w 3 = 1
1
Example q0 q1
1 0
0

● Above DFA (Q, S, d, q0, F) accepts w = 011


● w = 011 = w1 w2 w3 w1 = 0 w 2 = 1 w 3 = 1

We must show trace of DFA on w ends in F, that is:


● The sequence of 3+1=4 states r , r , r , r
0 1 2 3 such that:
(1) r0 = q0
(2) ri+1 = d(ri ,wi+1 )  0  i < 3
has r3 in F
1
Example q0 q1
1 0
0

● Above DFA (Q, S, d, q0, F) accepts w = 011


● w = 011 = w1 w2 w3 w1 = 0 w 2 = 1 w 3 = 1

● r0 = q0
● r1 := ?
1
Example q0 q1
1 0
0

● Above DFA (Q, S, d, q0, F) accepts w = 011


● w = 011 = w1 w2 w3 w1 = 0 w 2 = 1 w 3 = 1

● r0 = q0
● r1 = d(r0 ,w1 )=d(q0 ,0 ) = q0
●r
2 := ?
1
Example q0 q1
1 0
0

● Above DFA (Q, S, d, q0, F) accepts w = 011


● w = 011 = w1 w2 w3 w1 = 0 w 2 = 1 w 3 = 1

● r0 = q0
● r1 = d(r0 ,w1 )=d(q0 ,0 ) = q0
● r2 = d(r1 ,w2 )=d(q0 ,1 ) = q1
● r3 := ?
1
Example q0 q1
1 0
0

● Above DFA (Q, S, d, q0, F) accepts w = 011


● w = 011 = w1 w2 w3 w1 = 0 w 2 = 1 w 3 = 1

● r0 = q0
● r1 = d(r0 ,w1 )=d(q0 ,0 ) = q0
● r2 = d(r1 ,w2 )=d(q0 ,1 ) = q1
● r3 = d(r2 ,w3 )=d(q1 ,1 ) = q0
● r3 = q0 in F OK DONE!
● Definition: For a DFA M, we denote by L(M) the
set of strings accepted by M:
L(M) := { w : M accepts w}

We say M accepts or recognizes the language L(M)

● Definition: A language L is regular


if $ DFA M : L(M) = L
In the next lectures we want to:

● Understand power of regular languages

● Develop alternate, compact notation to specify


regular languages

Example: Unix command grep '\<c.*h\>' file


selects all words starting with c and ending with h
in file
● Understand power of regular languages:

● Suppose A, B are regular languages, what about


● not A := { w : w is not in A }
● A U B := { w : w in A or w in B }
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }

● Are these languages regular?


● Understand power of regular languages:

● Suppose A, B are regular languages, what about


● not A := { w : w is not in A }
● A U B := { w : w in A or w in B }
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }

● Terminology: Are regular languages closed


under not, U, o, * ?
● Theorem:
If A is a regular language, then so is (not A)
● Theorem:
If A is a regular language, then so is (not A)

● Proof idea: ?????????? the set of accept states


● Theorem:
If A is a regular language, then so is (not A)

● Proof idea: Complement the set of accept states


● Example
● Theorem:
If A is a regular language, then so is (not A)

● Proof idea: Complement the set of accept states


●Example:
M :=
1
q0 q1
1
0 0
L(M) =
{ w : w has even number of 1}
● Theorem:
If A is a regular language, then so is (not A)

● Proof idea: Complement the set of accept states


●Example:
M := M' :=
1 1
q0 q1 q0 q1
1 0 1 0
0 0
L(M) = L(M') = not L(M) =
{ w : w has even number of 1} { w : w has odd number of 1}
● Theorem: If A is a regular language, then so is (not A)
● Proof:
Given DFA M = (Q, S, d, q0, F) such that L(M) = A.
Define DFA M' = ??????????????????????????

This definition is the creative step of this proof,


the rest is (perhaps complicated but) mechanical
“unwrapping definitions”
● Theorem: If A is a regular language, then so is (not A)
● Proof:
Given DFA M = (Q, S, d, q0, F) such that L(M) = A.
Define DFA M' = (Q, S, d, q0, F'), where F' := not F.
● We need to show L(M') = not L(M), that is:
for any w, ??????????????????????????
● Theorem: If A is a regular language, then so is (not A)
● Proof:
Given DFA M = (Q, S, d, q0, F) such that L(M) = A.
Define DFA M' = (Q, S, d, q0, F'), where F' := not F.
● We need to show L(M') = not L(M), that is:
for any w, M' accepts w  M does not accept w.

● Note that the traces of M and M' on w … ?


● Theorem: If A is a regular language, then so is (not A)
● Proof:
Given DFA M = (Q, S, d, q0, F) such that L(M) = A.
Define DFA M' = (Q, S, d, q0, F'), where F' := not F.
● We need to show L(M') = not L(M), that is:
for any w, M' accepts w  M does not accept w

● Note that the traces of M and M' on w are equal

● Let rk be the last state in this trace


● Note that r in F'  r
k k not in F, since F' = not F. 
What is a proof?

● A proof is an explanation, written in English, of why


something is true.

● Every sentence must be logically connected to the


previous ones, often by “so”, “hence”, “since”, etc.

● Your audience is a human being, NOT a machine.


● Theorem: If A is a regular language, then so is (not A)
● Proof:
DFA M = (Q, S, d, q0, F) such that L(M) = A.
DFA M' = (Q, S, d, q0, F'), where F' := not F.
L(M') = not L(M)
M' accepts w  M does not accept w

Trace of M on w.

rk in F'  rk not in F, F' = not F. 


What is a proof?

Complement the set of accept states

Given DFA M = (Q, S, d, q0, F) such that L(M) = A.


Define DFA M' = (Q, S, d, q0, F'), where F' := not F.
● We need to show L(M') = not L(M), that is:
● for any w, M' accepts w  M does not accept w
● Note that the traces of M and M' on w are equal
● Let rk be the last state in this trace
● Note that rk in F'  rk not in F, since F' = not F. 

To know a proof means to know all the pyramid


Example ∑ = {0,1}

M=
0,1 0,1

L(M) = ∑2 = {00,01,10,11}

What is a DFA M' :


L(M') = not ∑2 = all strings except those of length 2 ?
Example ∑ = {0,1}

M' =
0,1 0,1 0,1 0,1

L(M') = not ∑2 = {0,1}* - {00,01,10,11}

Do not forget the convention about the sink state!


● Suppose A, B are regular languages, what about
● not A := { w : w is not in A } REGULAR
● A U B := { w : w in A or w in B }
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }
● Theorem: If A, B are regular, then so is A U B

● Proof idea: Take Cartesian product of states


In a pair (q,q'),
q tracks DFA for A,
q' tracks DFA for B.

● Next we see an example.


1
In it we abbreviate
1
with 1
Example 1 0
MB :=
MA := a b c d

0 1 1
0
L(MB) = B = ?
L(MA) = A = ?
Example 1 0
MB :=
MA := a b c d

0 1 1
0
L(MB) = B =
L(MA) = A =
{ w : w has odd number of 0}
{ w : w has even number of 1}

MAUB := How many states?


Example 1 0
MB :=
MA := a b c d

0 1 1
0
L(MB) = B =
L(MA) = A =
{ w : w has odd number of 0}
{ w : w has even number of 1}

MAUB := 1
a,c b,c

0
L(MAUB) = AUB = 0
1
{ w : w has even number of 1,
a,d b,d
or odd number of 0}
● Theorem: If A, B are regular, then so is A U B
● Proof:
Given DFA MA = (QA,S, δA,qA, FA) such that L(M) = A,
DFA MB = (QB,S, δB,qB, FB) such that L(M) = B.
Define DFA M = (Q, S, d, q0, F), where
Q := ?
● Theorem: If A, B are regular, then so is A U B
● Proof:
Given DFA MA = (QA,S, δA,qA, FA) such that L(M) = A,
DFA MB = (QB,S, δB,qB, FB) such that L(M) = B.
Define DFA M = (Q, S, d, q0, F), where
Q := QA X QB
q0 := ?
● Theorem: If A, B are regular, then so is A U B
● Proof:
Given DFA MA = (QA,S, δA,qA, FA) such that L(M) = A,
DFA MB = (QB,S, δB,qB, FB) such that L(M) = B.
Define DFA M = (Q, S, d, q0, F), where
Q := QA X QB
q0 := (qA , qB )
F := ?
● Theorem: If A, B are regular, then so is A U B
● Proof:
Given DFA MA = (QA,S, δA,qA, FA) such that L(M) = A,
DFA MB = (QB,S, δB,qB, FB) such that L(M) = B.
Define DFA M = (Q, S, d, q0, F), where
Q := QA X QB
q0 := (qA , qB )
F := {(q,q') ∈ Q : q ∈ FA or q' ∈ FB }
δ( (q,q'), v) := (?, ? )
● Theorem: If A, B are regular, then so is A U B
● Proof:
Given DFA MA = (QA,S, δA,qA, FA) such that L(M) = A,
DFA MB = (QB,S, δB,qB, FB) such that L(M) = B.
Define DFA M = (Q, S, d, q0, F), where
Q := QA X QB
q0 := (qA , qB )
F := {(q,q') ∈ Q : q ∈ FA or q' ∈ FB }
δ( (q,q'), v) := (δA (q,v), δB (q',v) )
● We need to show L(M) = A U B that is, for any w:
M accepts w  MA accepts w or MB accepts w
● Proof M accepts wMA accepts w or MB accepts w
● Suppose that M accepts w of length k.
● From the definitions of accept and M,
the trace (s0 , t0 ) , …, (sk , tk ) of M on w
has (sk,tk)∈?
● Proof M accepts wMA accepts w or MB accepts w
● Suppose that M accepts w of length k.
● From the definitions of accept and M,
the trace (s0 , t0 ) , …, (sk , tk ) of M on w
has (sk,tk)∈ F.

● By our definition of F, what can we say about (sk,tk ) ?


● Proof M accepts wMA accepts w or MB accepts w
● Suppose that M accepts w of length k.
● From the definitions of accept and M,
the trace (s0 , t0 ) , …, (sk , tk ) of M on w
has (sk,tk)∈ F.

● By our definition of F, sk ∈ FA or tk ∈ FB.

● Without loss of generality, assume sk ∈ FA.


Then MA accepts w because s0 , …, sk
is the trace of MA on w, and sk ∈ FA .
● Proof M accepts wMA accepts w or MB accepts w
● W/out loss of generality, assume MA accepts w, |w|=k

● From the definition of MA accepts w,


the trace r0 , …, rk of MA on w has rk in FA

● Let t0 , …, tk be the trace of MB on w

● M accepts w because the trace of M on w is


??????????
● Proof M accepts wMA accepts w or MB accepts w
● W/out loss of generality, assume MA accepts w, |w|=k

● From the definition of MA accepts w,


the trace r0 , …, rk of MA on w has rk in FA

● Let t0 , …, tk be the trace of MB on w

● M accepts w because the trace of M on w is


(r0 , t0 ), …, (rk , tk )
and (rk,tk) is in F, by our definition of F. 
● Suppose A, B are regular languages, what about
● not A := { w : w is not in A } REGULAR
● A U B := { w : w in A or w in B } REGULAR
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }

● Other two are more complicated!

● Plan: we introduce NFA


prove that NFA are equivalent to DFA
reprove A U B, prove A o B, A* regular, using NFA
Big picture
● All languages
● Decidable
Turing machines
● NP
● P
● Context-free
Context-free grammars, push-down automata
● Regular
Automata, non-deterministic automata,
regular expressions
Non deterministic finite automata (NFA)

● DFA: given state and input symbol, 1


unique choice for next state,
deterministic:

1
● Next we allow multiple choices,
non-deterministic 1


We also allow e-transitions: e
can follow without reading anything
Example of NFA
q0
b a
e
a q2
q1 a,b
Intuition of how it computes:
● Accept string w if there is a way to follow transitions
that ends in accept state

Transitions labelled with symbol in S = {a,b}
must be matched with input

e transitions can be followed without matching
Example of NFA
q0
b a
e
a q2
q1 a,b

Example:

Accept a (first follow e-transition )
● Accept baaa
ANOTHER Example of NFA

q1 b
b a
e
q0 q3
a,b q2 b
Example:
● Accept bab (two accepting paths, one
uses the e-transition)
● Reject ba (two possible paths, but neither
has final state = q1)
● Definition: A non-deterministic finite automaton (NFA)
is a 5-tuple (Q, S, d, q0, F) where

● Q is a finite set of states



S is the input alphabet

d : Q X (S U {e} ) → Powerset(Q)
●q
0 in Q is the start state

F  Q is the set of accept states

● Recall: Powerset(Q) = set of all subsets of Q


Example: Powerset({1,2}) = ?
● Definition: A non-deterministic finite automaton (NFA)
is a 5-tuple (Q, S, d, q0, F) where

● Q is a finite set of states



S is the input alphabet

d : Q X (S U {e} ) → Powerset(Q)
●q
0 in Q is the start state

F  Q is the set of accept states

● Recall: Powerset(Q) = set of all subsets of Q


Example: Powerset({1,2}) = {, {1}, {2}, {1,2} }
1
q0 q1
0, 1 e

● Example: above NFA is 5-tuple (Q, S, d, q0, F)


● Q = { q0, q1}

S = {0,1}
● d(q
0 ,0) = ?
1
q0 q1
0, 1 e

● Example: above NFA is 5-tuple (Q, S, d, q0, F)


● Q = { q0, q1}

S = {0,1}
● d(q
0 ,0) = {q0} d(q0 ,1) = ?
1
q0 q1
0, 1 e

● Example: above NFA is 5-tuple (Q, S, d, q0, F)


● Q = { q0, q1}

S = {0,1}
● d(q
0 ,0) = {q0} d(q0 ,1) = {q0, q1} d(q0 ,e) = ?
1
q0 q1
0, 1 e

● Example: above NFA is 5-tuple (Q, S, d, q0, F)


● Q = { q0, q1}

S = {0,1}
● d(q
0 ,0) = {q0} d(q0 ,1) = {q0, q1} d(q0 ,e) = 
d(q1 ,0) = ?
1
q0 q1
0, 1 e

● Example: above NFA is 5-tuple (Q, S, d, q0, F)


● Q = { q0, q1}

S = {0,1}
● d(q
0 ,0) = {q0} d(q0 ,1) = {q0, q1} d(q0 ,e) = 
d(q1 ,0) =  d(q1 ,1) = ?
1
q0 q1
0, 1 e

● Example: above NFA is 5-tuple (Q, S, d, q0, F)


● Q = { q0, q1}

S = {0,1}
● d(q
0 ,0) = {q0} d(q0 ,1) = {q0, q1} d(q0 ,e) = 
d(q1 ,0) =  d(q1 ,1) =  d(q1 ,e) = ?
1
q0 q1
0, 1 e

● Example: above NFA is 5-tuple (Q, S, d, q0, F)


● Q = { q0, q1}

S = {0,1}
● d(q
0 ,0) = {q0} d(q0 ,1) = {q0, q1} d(q0 ,e) = 
d(q1 ,0) =  d(q1 ,1) =  d(q1 ,e) = {q0}
● q0 in Q is the start state
● F=?
1
q0 q1
0, 1 e

● Example: above NFA is 5-tuple (Q, S, d, q0, F)


● Q = { q0, q1}

S = {0,1}
● d(q
0 ,0) = {q0} d(q0 ,1) = {q0, q1} d(q0 ,e) = 
d(q1 ,0) =  d(q1 ,1) =  d(q1 ,e) = {q0}
● q0 in Q is the start state
● F = { q1}  Q is the set of accept states
● Definition: A NFA (Q, S, d, q0, F) accepts a string w if
$ integer k, ∃ k strings w1 , w2 , …, wk such that
● w = w 1 w2 … w k where  1  i  k, wi  S U {e}
(the symbols of w, or e)

● $ sequence of k+1 states r0, r1, .., rk in Q such that:


● r0 = q0

ri+1  d(ri ,wi+1 )  0  i < k
● rk is in F

● Differences with DFA are in green


Back to first example NFA:
q0
b a
e
a q2
q1 a,b
Accepts w = baaa
w1 = b, w2 = a, w3 = a, w4= e, w5 = a
Accepting sequence of 5+1 = 6 states:
r0 = ?
Back to first example NFA:
q0
b a
e
a q2
q1 a,b
Accepts w = baaa
w1 = b, w2 = a, w3 = a, w4= e, w5 = a
Accepting sequence of 5+1 = 6 states:
r0 = q0, r1 = ?
Back to first example NFA:
q0
b a
e
a q2
q1 a,b
Accepts w = baaa
w1 = b, w2 = a, w3 = a, w4= e, w5 = a
Accepting sequence of 5+1 = 6 states:
r0 = q0, r1 = q1, r2 = ?
Transitions:
r1  d(r0,b) = {q1}
Back to first example NFA:
q0
b a
e
a q2
q1 a,b
Accepts w = baaa
w1 = b, w2 = a, w3 = a, w4= e, w5 = a
Accepting sequence of 5+1 = 6 states:
r0 = q0, r1 = q1, r2 = q2, r3 = ?
Transitions:
r1  d(r0,b) = {q1} r2  d(r1,a) = {q1,q2}
Back to first example NFA:
q0
b a
e
a q2
q1 a,b
Accepts w = baaa
w1 = b, w2 = a, w3 = a, w4= e, w5 = a
Accepting sequence of 5+1 = 6 states:
r0 = q0, r1 = q1, r2 = q2, r3 = q0, r4 = ?
Transitions:
r1  d(r0,b) = {q1} r2  d(r1,a) = {q1,q2}
r3  d(r2,a) = {q0}
Back to first example NFA:
q0
b a
e
a q2
q1 a,b
Accepts w = baaa
w1 = b, w2 = a, w3 = a, w4= e, w5 = a
Accepting sequence of 5+1 = 6 states:
r0 = q0, r1 = q1, r2 = q2, r3 = q0, r4 = q2, r5 = ?
Transitions:
r1  d(r0,b) = {q1} r2  d(r1,a) = {q1,q2}
r3  d(r2,a) = {q0} r4  d(r3,e) = {q2}
Back to first example NFA:
q0
b a
e
a q2
q1 a,b
Accepts w = baaa
w1 = b, w2 = a, w3 = a, w4= e, w5 = a
Accepting sequence of 5+1 = 6 states:
r0 = q0, r1 = q1, r2 = q2, r3 = q0, r4 = q2, r5 = q0
Transitions:
r1  d(r0,b) = {q1} r2  d(r1,a) = {q1,q2}
r3  d(r2,a) = {q0} r4  d(r3,e) = {q2} r5  d(r4,a) = {q0}
● NFA are at least as powerful as DFA,
because DFA are a special case of NFA

● Are NFA more powerful than DFA?

● Surprisingly, they are not:

● Theorem:
For every NFA N there is DFA M : L(M) = L(N)
● Theorem:
For every NFA N there is DFA M : L(M) = L(N)


Construction without e transitions

Given NFA N (Q, S, d, q, F)

Construct DFA M (Q', S, d', q', F') where:
● Q' := Powerset(Q)
● q' = {q}

F' = { S : S  Q' and S contains an element of F}
● d'(S, a) := U
s  S d(s,a)

= { t : t  d (s,a) for some s  S }



It remains to deal with e transitions

● Definition: Let S be a set of states.


E(S) := { q : q can be reached from some state
s in S traveling along 0 or more e transitions }


We think of following e transitions at beginning, or
right after reading an input symbol in S
● Theorem:
For every NFA N there is DFA M : L(M) = L(N)


Construction including e transitions

Given NFA N (Q, S, d, q, F)

Construct DFA M (Q', S, d', q', F') where:
● Q' := Powerset(Q)
● q' = E({q})

F' = { S : S  Q' and S contains an element of F}
● d'(S, a) := E( U
s  S d(s,a) )

= { t : t  E( d (s,a) ) for some s  S }


Example: NFA → DFA conversion

NFA DFA

1
{1,3} {3} 
b a
a e
2 3
a,b {2} {2,3} {1,2,3}

QDFA = Powerset(QNFA)
= Powerset({1,2,3}) {1} {1,2}
= {,{1},{2},{3},{1,2}...}
Example: NFA → DFA conversion

NFA DFA

1
{1,3} {3} 
b a
a e
2 3
a,b {2} {2,3} {1,2,3}

qDFA = E({qNFA})
= E({1}) {1} {1,2}
= {1,3}
Example: NFA → DFA conversion

NFA DFA

1
{1,3} {3} 
b a
a e
2 3
a,b {2} {2,3} {1,2,3}

FDFA = {S : S contains
an element of FNFA} {1} {1,2}
Example: NFA → DFA conversion

NFA DFA

1
{1,3} {3} 
b a
a e
2 3
a,b {2} {2,3} {1,2,3}

dDFA({1}, a)
= E(dNFA(1, a)) a
{1} {1,2}
= E() = 
Example: NFA → DFA conversion

NFA DFA

1
{1,3} {3} 
b a
a e
2 3
a,b {2} {2,3} {1,2,3}

dDFA({1}, b) b
= E(dNFA(1, b)) a
{1} {1,2}
= E({2}) = {2}
Example: NFA → DFA conversion

NFA DFA

1
{1,3} {3} 
b a
a e
2 3 a
a,b {2} {2,3} {1,2,3}

dDFA({2}, a) b
= E(dNFA(2, a)) a
{1} {1,2}
= E({2,3}) = {2,3}
Example: NFA → DFA conversion

NFA DFA

1
{1,3} {3} 
b a b
a e
2 3 a
a,b {2} {2,3} {1,2,3}

dDFA({2}, b) b
= E(dNFA(2, b)) a
{1} {1,2}
= E({3}) = {3}
Example: NFA → DFA conversion

NFA DFA

a
1
{1,3} {3} 
b a b
a e
2 3 a
a,b {2} {2,3} {1,2,3}

dDFA({3}, a) b
= E(dNFA(3, a)) a
{1} {1,2}
= E({1}) = {1,3}
Example: NFA → DFA conversion

NFA DFA

a b
1
{1,3} {3} 
b a b
a e
2 3 a
a,b {2} {2,3} {1,2,3}

dDFA({3}, b) b
= E(dNFA(3, b)) a
{1} {1,2}
= E() = 
Example: NFA → DFA conversion

NFA DFA

a b
1
{1,3} {3} 
b a b
a e
2 3 a
a,b {2} {2,3} {1,2,3}
a
dDFA({2,3}, a) b
= E(dNFA(2,a) U dNFA(3,a)) a
{1} {1,2}
= E({2,3} U {1}) = {1,2,3}
Example: NFA → DFA conversion

NFA DFA

a b
1
{1,3} {3} 
b a b
a e b
2 3 a
a,b {2} {2,3} {1,2,3}
a
dDFA({2,3}, b) b
= E(dNFA(2,b) U dNFA(3,b)) a
{1} {1,2}
= E({3} U ) = {3}
Example: NFA → DFA conversion

NFA DFA
a

a b
1
{1,3} {3} 
b a b
a e b
2 3 a
a,b {2} {2,3} {1,2,3}
a
dDFA({1,3}, a) b
= E(dNFA(1,a) U dNFA(3,a)) a
{1} {1,2}
= E( U {1}) = {1,3}
Example: NFA → DFA conversion

NFA DFA
a

a b
1
{1,3} {3} 
b a b b
a e b
2 3 a
a,b {2} {2,3} {1,2,3}
a
dDFA({1,3}, b) b
= E(dNFA(1,b) U dNFA(3,b)) a
{1} {1,2}
= E({2} U ) = {2}
Example: NFA → DFA conversion

NFA DFA
a

a b
1
{1,3} {3} 
b a b b
a e b
2 3 a
a,b {2} {2,3} {1,2,3}
a
dDFA({1,2}, a) b a
= E(dNFA(1,a) U dNFA(2,a)) a
{1} {1,2}
= E( U {2,3}) = {2,3}
Example: NFA → DFA conversion

NFA DFA
a

a b
1
{1,3} {3} 
b a b b
a e b
2 3 a
a,b {2} {2,3} {1,2,3}
a
dDFA({1,2}, b) b a,b
= E(dNFA(1,b) U dNFA(2,b)) a
{1} {1,2}
= E({2} U {3}) = {2,3}
Example: NFA → DFA conversion

NFA DFA
a

a b
1
{1,3} {3} 
b a b b a
a e b
2 3 a
a,b {2} {2,3} {1,2,3}
a
dDFA({1,2,3}, a) b a,b
=E(d (1,a) U dNFA(2,a) U dNFA(3,a))
{1} a
NFA
{1,2}
=E( U {2,3} U {1}) = {1,2,3}
Example: NFA → DFA conversion

NFA DFA
a

a b
1
{1,3} {3} 
b a b b a
a e b
2 3 a b
a,b {2} {2,3} {1,2,3}
a
dDFA({1,2,3}, b) b a,b
=E(d (1,b) U dNFA(2,b) U dNFA(3,b))
{1} a
NFA
{1,2}
=E({2} U {3} U ) = {2,3}
Example: NFA → DFA conversion

NFA DFA
a a,b

a b
1
{1,3} {3} 
b a b b a
a e b
2 3 a b
a,b {2} {2,3} {1,2,3}
a
b a,b
dDFA(, a) = 
{1} {1,2} a
dDFA(, b) = 
Example: NFA → DFA conversion

NFA DFA
a a,b

a b
1
{1,3} {3} 
b a b b a
a e b
2 3 a b
a,b {2} {2,3} {1,2,3}
a

We can delete the


unreachable states.
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
b
a
e  {1,2,3}
3

{2}
QDFA = Powerset(QNFA) {1,2}
{1,3}
= Powerset({1,2,3}) {2,3}
= {,{1},{2},{3},{1,2}...}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
b
a
e  {1,2,3}
3

{2}
qDFA = E({qNFA}) {1,2}
{1,3}
= E({1}) {2,3}
= {1}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
b
a
e  {1,2,3}
3

{2}
FDFA = {S : S contains {1,2}
{1,3}
an element of FNFA} {2,3}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a
b
a
e  {1,2,3}
3

{2}
dDFA({1}, a) {1,2}
{1,3}
= E(dNFA(1, a)) {2,3}
= E() = 
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a b
b
a
e  {1,2,3}
3

{2}
dDFA({1}, b) {1,2}
{1,3}
= E(dNFA(1, b)) {2,3}
= E({2,3}) = {1,2,3}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a b
b
a
e  {1,2,3}
3

{2} a
dDFA({2}, a) {1,2}
{1,3}
= E(dNFA(2, a)) {2,3}
= E({3}) = {1,3}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a b
b
a
e  {1,2,3}
3

b
{2} a
dDFA({2}, b) {1,2}
{1,3}
= E(dNFA(2, b)) {2,3}
= E() = 
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a a b
b
a
e  {1,2,3}
3

b
{2} a
dDFA({3}, a) {1,2}
{1,3}
= E(dNFA(3, a)) {2,3}
= E() = 
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b
b
a
e  {1,2,3}
3

b
{2} a
dDFA({3}, b) {1,2}
{1,3}
= E(dNFA(3, b)) {2,3}
= E() = 
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b
b
a
e  {1,2,3}
3

b
{2} a
dDFA({1,2}, a) {1,2}
{1,3} a
= E(dNFA(1,a) U dNFA(2,a)) {2,3}
= E( U {3}) = {1,3}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b
b
a
e  {1,2,3}
3

b b
{2} a
dDFA({1,2}, b) {1,2}
{1,3} a
= E(dNFA(1,b) U dNFA(2,b)) {2,3}
= E({2,3} U ) = {1,2,3}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b
b
a
e  {1,2,3}
3

b a b
{2} a
dDFA({1,3}, a) {1,2}
{1,3} a
= E(dNFA(1,a) U dNFA(3,a)) {2,3}
= E( U ) = 
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b
b
a
e  {1,2,3}
3

b a b b
{2} a
dDFA({1,3}, b) {1,2}
{1,3} a
= E(dNFA(1,b) U dNFA(3,b)) {2,3}
= E({2,3} U ) = {1,2,3}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b
b
a
e  {1,2,3}
3

b a b b
{2} a
dDFA({2,3}, a) {1,2}
{1,3} a
= E(dNFA(2,a) U dNFA(3,a)) {2,3}
a
= E({3} U ) = {1,3}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b
b
a
e  {1,2,3}
3

b b a b b
{2} a
dDFA({2,3}, b) {1,2}
{1,3} a
= E(dNFA(2,b) U dNFA(3,b)) {2,3}
a
= E( U ) = 
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b
b
a
e  {1,2,3}
3

b b a b b
a
{2} a
dDFA({1,2,3}, a) {1,2}
{1,3} a
=E(dNFA
(1,a) U dNFA(2,a) U dNFA(3,a)) {2,3}
a
=E( U {3} U ) = {1,3}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b b
b
a
e  {1,2,3}
3

b b a b b
a
{2} a
dDFA({1,2,3}, b) {1,2}
{1,3} a
=E(dNFA
(1,b) U dNFA(2,b) U dNFA(3,b)) {2,3}
a
=E({2,3} U  U ) = {1,2,3}
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {3} {1}
1 2
a,b a b b
b
a
e  a,b {1,2,3}
3

b b a b b
a
{2} a
{1,2}
dDFA(, a) =  {1,3} a
{2,3} a
dDFA(, b) = 
ANOTHER Example: NFA → DFA conversion

NFA DFA

b {1}
1 2
a b b
b
a
e  a,b {1,2,3}
3

a b
a

We can delete the {1,3}


unreachable states.
Summary: NFA and DFA recognize the same
languages

We now return to the question:


● Suppose A, B are regular languages, what about
● not A := { w : w is not in A } REGULAR
● A U B := { w : w in A or w in B } REGULAR
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }
Theorem: If A, B are regular languages, then so is
A U B := { w : w in A or w in B }

● Proof idea: Given DFA MA : L(MA) = A,


DFA MB : L(MB) = B,
● Construct NFA N : L(N) = A U B N

M M e
A B
U = e
MA N
M e
U B = e
Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := ?
MA N
M e
U B = e
Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := {q} U Q
A U QB , F := ?
MA N
M e
U B = e
Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := {q} U Q
A U QB , F := FA U FB
● d(r,x) := { dA(r,x) } if r in QA and x  e
● d(r,x) := ? if r in QB and x  e
MA N
M e
U B = e
Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := {q} U Q
A U QB , F := FA U FB
● d(r,x) := { dA(r,x) } if r in QA and x  e
● d(r,x) := { dB(r,x) } if r in QB and x  e

d(q,e) := ?
MA N
M e
U B = e
Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := {q} U Q
A U QB , F := FA U FB
● d(r,x) := { dA(r,x) } if r in QA and x  e
● d(r,x) := { dB(r,x) } if r in QB and x  e
● d(q,e) := {qA, qB}
● We have L(N) = A U B
Example
Is L = {w in {0,1}* : |w| is divisible by 3 OR
w starts with a 1} regular?
Example
Is L = {w in {0,1}* : |w| is divisible by 3 OR
w starts with a 1} regular?

OR is like U, so try to write L = L1 U L2


where L1, L2 are regular
Example
Is L = {w in {0,1}* : |w| is divisible by 3 OR
w starts with a 1} regular?

OR is like U, so try to write L = L1 U L2


where L1, L2 are regular
L1 = {w : |w| is div. by 3} L2 = {w : w starts with a 1}
Example
Is L = {w in {0,1}* : |w| is divisible by 3 OR
w starts with a 1} regular?

OR is like U, so try to write L = L1 U L2


where L1, L2 are regular
L1 = {w : |w| is div. by 3} L2 = {w : w starts with a 1}

M1 = 0,1

0,1 0,1

L(M1) = L1
Example
Is L = {w in {0,1}* : |w| is divisible by 3 OR
w starts with a 1} regular?

OR is like U, so try to write L = L1 U L2


where L1, L2 are regular
L1 = {w : |w| is div. by 3} L2 = {w : w starts with a 1}

M1 = 0,1 M2 = 0,1
1
0,1 0,1

L(M1) = L1 L(M2) = L2
Example
Is L = {w in {0,1}* : |w| is divisible by 3 OR
w starts with a 1} regular?

OR is like U, so try to write L = L1 U L2


where L1, L2 are regular
L1 = {w : |w| is div. by 3} L2 = {w : w starts with a 1}
0,1
M= L(M) = L(M1) U L(M2)
0,1 0,1
e = L1 U L2
1 0,1 =L
e
 L is regular.
We now return to the question:
● Suppose A, B are regular languages, then
● not A := { w : w is not in A } REGULAR
● A U B := { w : w in A or w in B } REGULAR
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }
Theorem: If A, B are regular languages, then so is
A o B := { w : w = xy for some
x in A and y in B }.
● Proof idea: Given DFAs M , M
A B for A, B

construct NFA N : L(N) = A o B.


M M
A oB

N e
e
= e
N
M MB
e
e
A o = e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := ?
N
M MB
e
e
A o = e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := Q U Q
A B , q := ?
N
M MB
e
e
A o = e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := Q U Q
A B , q := qA , F := ?
N
M MB
e
e
A o = e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := Q U Q
A B , q := qA , F := FB
● d(r,x) := ? if r in QA and x  e
N
M MB
e
e
A o = e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := Q U Q
A B , q := qA , F := FB
● d(r,x) := { dA(r,x) } if r in QA and x  e
● d(r,e) := ? if r in FA
N
M MB
e
e
A o = e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := Q U Q
A B , q := qA , F := FB
● d(r,x) := { dA(r,x) } if r in QA and x  e
● d(r,e) := { q
B } if r in FA
● d(r,x) := ? if r in QB and x  e
N
M MB
e
e
A o = e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

DFA MB = (QB, S, dB, qB, FB) : L(MB) = B,



Construct NFA N = (Q, S, d, q, F) where:
● Q := Q U Q
A B , q := qA , F := FB
● d(r,x) := { dA(r,x) } if r in QA and x  e
● d(r,e) := { q
B } if r in FA
● d(r,x) := { dB(r,x) } if r in QB and x  e
● We have L(N) = A o B
Example
Is L = {w in {0,1}* : w contains a 1 after a 0}
regular?

Note: L = {01, 0001001, 111001, … }


Example
Is L = {w in {0,1}* : w contains a 1 after a 0}
regular?

Let L0 = {w : w contains a 0}
L1 = {w : w contains a 1}. Then L = L0 o L1.
Example
Is L = {w in {0,1}* : w contains a 1 after a 0}
regular?

Let L0 = {w : w contains a 0}
L1 = {w : w contains a 1}. Then L = L0 o L1.

M0 = 1 0,1
0

L(M0) = L0
Example
Is L = {w in {0,1}* : w contains a 1 after a 0}
regular?

Let L0 = {w : w contains a 0}
L1 = {w : w contains a 1}. Then L = L0 o L1.

M0 = 1 0,1 M1 = 0 0,1
0 1

L(M0) = L0 L(M1) = L1
Example
Is L = {w in {0,1}* : w contains a 1 after a 0}
regular?

Let L0 = {w : w contains a 0}
L1 = {w : w contains a 1}. Then L = L0 o L1.
M = 1 0,1 0 0,1
0 e 1

L(M) = L(M0) o L(M1) = L0 o L1 = L

 L is regular.
We now return to the question:
● Suppose A, B are regular languages, then
● not A := { w : w is not in A } REGULAR
● A U B := { w : w in A or w in B } REGULAR
● A o B := { w1 w2 : w1 ∈A and w2 ∈ B } REGULAR
● A* := { w1 w2 … wk : k  0 , wi in A for every i }
Theorem: If A is a regular language, then so is
A* := { w : w = w1...wk, wi in A for i=1,...,k }

● Proof idea: Given DFA MA : L(MA) = A,


Construct NFA N : L(N) = A*
N
M
A
* e
e
= e
N
MA
* e e
= e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

Construct NFA N = (Q, S, d, q, F) where:


● Q := ?
N
MA
* e e
= e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

Construct NFA N = (Q, S, d, q, F) where:


● Q := {q} U Q
A , F := ?
N
MA
* e e
= e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

Construct NFA N = (Q, S, d, q, F) where:


● Q := {q} U Q
A , F := {q} U FA
● d(r,x) := ? if r in QA and x  e
N
MA
* e e
= e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

Construct NFA N = (Q, S, d, q, F) where:


● Q := {q} U Q
A , F := {q} U FA
● d(r,x) := { dA(r,x) } if r in QA and x  e
● d(r,e) := ? if r in {q} U FA
N
MA
* e e
= e

Construction:
A = (QA, S, dA, qA, FA) : L(MA) = A,
● Given DFA M

Construct NFA N = (Q, S, d, q, F) where:


● Q := {q} U Q
A , F := {q} U FA
● d(r,x) := { dA(r,x) } if r in QA and x  e
● d(r,e) := { qA } if r in {q} U FA
● We have L(N) = A*
Example
Is L = {w in {0,1}* : w has even length}
regular?
Example
Is L = {w in {0,1}* : w has even length}
regular?

Let L0 = {w : w has length = 2}. Then L = L0*.


Example
Is L = {w in {0,1}* : w has even length}
regular?

Let L0 = {w : w has length = 2}. Then L = L0*.

M0 =
0,1 0,1

L(M0) = L0
Example
Is L = {w in {0,1}* : w has even length}
regular?

Let L0 = {w : w has length = 2}. Then L = L0*.

M = e

e 0,1 0,1

L(M) = L(M0)* = L0* = L


 L is regular.
We now return to the question:
● Suppose A, B are regular languages, then
● not A := { w : w is not in A }
● A U B := { w : w in A or w in B }
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }

are all regular!


We now return to the question:
● Suppose A, B are regular languages, then
● not A := { w : w is not in A }
● A U B := { w : w in A or w in B }
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }

What about A ∩ B := { w : w in A and w in B } ?


We now return to the question:
● Suppose A, B are regular languages, then
● not A := { w : w is not in A }
● A U B := { w : w in A or w in B }
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }

De Morgan's laws: A ∩ B = not ( (not A) U (not B) )


By above, (not A) is regular, (not B) is regular,
(not A) U (not B) is regular,
not ( (not A) U (not B) ) = A ∩ B regular
We now return to the question:
● Suppose A, B are regular languages, then
● not A := { w : w is not in A }
● A U B := { w : w in A or w in B }
● A o B := { w1 w2 : w1 in A and w2 in B }
● A* := { w1 w2 … wk : k  0 , wi in A for every i }
● A ∩ B := { w : w in A and w in B }

are all regular


Big picture
● All languages
● Decidable
Turing machines
● NP
● P
● Context-free
Context-free grammars, push-down automata
● Regular
Automata, non-deterministic automata,
regular expressions
How to specify a regular language?

Write a picture → complicated

Write down formal definition → complicated


d(q0 ,0) = q0, …

Use symbols from S and operations *, o, U → good

({0} * U {1}) o {001}


Regular expressions: anything you can write with
 , ε , symbols from S, and operations *, o, U

Conventions:
● Write a instead of {a}
● Write AB for A o B
● Write ∑ for U
a∈∑ a So if ∑ = {a,b} then ∑ = a U b
● Operation * has precedence over o, and o over U
so 1 U 01* means 1U(0(1)*)

Example: 110, 0*, S*, S*001S*, (SS)*, 01 U 10


Definition Regular expressions RE over ∑ are:
Ø
ε
a if a in S
R R' if R, R' are RE
R U R' if R, R' are RE
R* if R is RE
Definition The language described by RE:
L(Ø) = Ø
L( ε ) = { ε }
L(a) = {a} if a in ∑
L(R R') = L(R) o L(R')
L(R U R') = L(R) U L(R')
L(R*) = L(R)*
Example ∑ = { a, b}
RE Language
● ab U ba ?
● a*
● (a U b)*
● a*ba*
● ∑*b∑*
● ∑*aab∑*
● (∑∑)*
● a*(a*ba*ba*)*
● a*baba*a Ø
Example ∑ = { a, b}
RE Language
● ab U ba {ab, ba}
● a*
● (a U b)*
● a*ba*
● ∑*b∑*
● ∑*aab∑*
● (∑∑)*
● a*(a*ba*ba*)*
● a*baba*a Ø
Example ∑ = { a, b}
RE Language
● ab U ba {ab, ba}
● a* {ε, a, aa, … } = { w : w has only a}
● (a U b)*
● a*ba*
● ∑*b∑*
● ∑*aab∑*
● (∑∑)*
● a*(a*ba*ba*)*
● a*baba*a Ø
Example ∑ = { a, b}
RE Language
● ab U ba {ab, ba}
● a* {ε, a, aa, … } = { w : w has only a}
● (a U b)* all strings
● a*ba*
● ∑*b∑*
● ∑*aab∑*
● (∑∑)*
● a*(a*ba*ba*)*
● a*baba*a Ø
Example ∑ = { a, b}
RE Language
● ab U ba {ab, ba}
● a* {ε, a, aa, … } = { w : w has only a}
● (a U b)* all strings
● a*ba* {w : w has exactly one b}
● ∑*b∑*
● ∑*aab∑*
● (∑∑)*
● a*(a*ba*ba*)*
● a*baba*a Ø
Example ∑ = { a, b}
RE Language
● ab U ba {ab, ba}
● a* {ε, a, aa, … } = { w : w has only a}
● (a U b)* all strings
● a*ba* {w : w has exactly one b}
● ∑*b∑* {w : w has at least one b}
● ∑*aab∑*
● (∑∑)*
● a*(a*ba*ba*)*
● a*baba*a Ø
Example ∑ = { a, b}
RE Language
● ab U ba {ab, ba}
● a* {ε, a, aa, … } = { w : w has only a}
● (a U b)* all strings
● a*ba* {w : w has exactly one b}
● ∑*b∑* {w : w has at least one b}
● ∑*aab∑* {w : w contains the string aab}
● (∑∑)*
● a*(a*ba*ba*)*
● a*baba*a Ø
Example ∑ = { a, b}
RE Language
● ab U ba {ab, ba}
● a* {ε, a, aa, … } = { w : w has only a}
● (a U b)* all strings
● a*ba* {w : w has exactly one b}
● ∑*b∑* {w : w has at least one b}
● ∑*aab∑* {w : w contains the string aab}
● (∑∑)* {w : w has even length}
● a*(a*ba*ba*)*
● a*baba*a Ø
Example ∑ = { a, b}
RE Language
● ab U ba {ab, ba}
● a* {ε, a, aa, … } = { w : w has only a}
● (a U b)* all strings
● a*ba* {w : w has exactly one b}
● ∑*b∑* {w : w has at least one b}
● ∑*aab∑* {w : w contains the string aab}
● (∑∑)* {w : w has even length}
● a*(a*ba*ba*)* {w : w contains even number of b}
● a*baba*a Ø
Example ∑ = { a, b}
RE Language
● ab U ba {ab, ba}
● a* {ε, a, aa, … } = { w : w has only a}
● (a U b)* all strings
● a*ba* {w : w has exactly one b}
● ∑*b∑* {w : w has at least one b}
● ∑*aab∑* {w : w contains the string aab}
● (∑∑)* {w : w has even length}
● a*(a*ba*ba*)* {w : w contains even number of b}
● a*baba*a Ø Ø (anything o Ø = Ø)
Theorem: For every RE R there is NFA M: L(M) = L(R)
Theorem: For every RE R there is NFA M: L(M) = L(R)
Construction:

R= M := ?
Theorem: For every RE R there is NFA M: L(M) = L(R)
Construction:

R= M :=


R=e M := ?
Theorem: For every RE R there is NFA M: L(M) = L(R)
Construction:

R= M :=


R=e M :=

● R=a M := ?
Theorem: For every RE R there is NFA M: L(M) = L(R)
Construction:

R= M :=


R=e M :=

● R=a M :=
a

● R = R U R' ?
Theorem: For every RE R there is NFA M: L(M) = L(R)
Construction:

R= M :=


R=e M :=

● R=a M :=
a

● R = R U R' use construction for A U B seen earlier


● R = R o R' ?
Theorem: For every RE R there is NFA M: L(M) = L(R)
Construction:

R= M :=


R=e M :=

● R=a M :=
a

● R = R U R' use construction for A U B seen earlier


● R = R o R' use construction for A o B seen earlier
● R = R* ?
Theorem: For every RE R there is NFA M: L(M) = L(R)
Construction:

R= M :=


R=e M :=

● R=a M :=
a

● R = R U R' use construction for A U B seen earlier


● R = R o R' use construction for A o B seen earlier
● R = R* use construction for A* seen earlier
Example: RE → NFA

RE = (ab U a)*
Example: RE → NFA

RE = (ab U a)*

a
Ma =

L(Ma)=L(a)
Example: RE → NFA

RE = (ab U a)*

b
Ma = a Mb =

L(Ma)=L(a) L(Mb)=L(b)
Example: RE → NFA

RE = (ab U a)*

Mab =
a e b

L(Mab)=L(ab)
Example: RE → NFA

RE = (ab U a)*

a
Mab = Ma =
a e b

L(Mab)=L(ab) L(Ma)=L(a)
Example: RE → NFA

RE = (ab U a)*

Mab U a =
a e b
e
e a

L(Mab U a)=L(ab U a)
Example: RE → NFA

RE = (ab U a)*

M(ab U a)* =
e
a e b
e e
e a

e
L(M(ab U a)*)=L((ab U a)*)=L(RE)
ANOTHER Example: RE → NFA

RE =(e U a)ba*
ANOTHER Example: RE → NFA

RE =(e U a)ba*

Me =

L(Me)=L(e)
ANOTHER Example: RE → NFA

RE =(e U a)ba*

a
Me = Ma =

L(Me)=L(e) L(Ma)=L(a)
ANOTHER Example: RE → NFA

RE =(e U a)ba*

Me U a =
e

e
a

L(Me U a)=L(e U a)
ANOTHER Example: RE → NFA

RE =(e U a)ba*

b
Me U a = Mb =
e
L(Mb)=L(b)
e
a

L(Me U a)=L(e U a)
ANOTHER Example: RE → NFA

RE =(e U a)ba*

M(e U a)b =
e
e b

e e
a

L(M(e U a)b)=L((e U a)b)


ANOTHER Example: RE → NFA

RE =(e U a)ba*

M(e U a)b = Ma = a

e
e b

e e L(Ma)=L(a)
a

L(M(e U a)b)=L((e U a)b)


ANOTHER Example: RE → NFA

RE =(e U a)ba*

M(e U a)b = e
Ma* = e a
e
e b

e e L(Ma*)=L(a*)
a

L(M(e U a)b)=L((e U a)b)


ANOTHER Example: RE → NFA

RE =(e U a)ba*

M(e U a)ba* =
e
e b e e
e e a
e
a

L(M(e U a)ba*)=L((e U a)ba*)=L(RE)


Recap:

Here “” means “can be converted to”

We have seen: RE  NFA  DFA

Next we see: DFA  RE

In two steps: DFA  Generalized NFA  RE


Generalized NFA (GNFA)
a U b*

a*b* ab
q0 qa

Nondeterministic

Transitions labelled by RE

Read blocks of input symbols at a time


Generalized NFA (GNFA)
a U b*

a*b* ab
q0 qa

Convention:
Unique final state
Exactly one transition between each pair of states
except nothing going into start state
nothing going out of final state
If arrow not shown in picture, label = 
●Definition: A generalized finite automaton (GNFA)
● is a 5-tuple (Q, S, d, q , q ) where
0 a
● Q is a finite set of states

S is the input alphabet
● d : (Q - {q }) X (Q – {q }) → Regular Expressions
a 0
● q0 in Q is the start state
● qa in Q is the accept state
● Definition: GNFA (Q, S, d, q0, qa) accepts a string w if
● ∃integer k, ∃ k strings w1 , w2 , …, wk  S*
such that w = w1 w2 … wk
(divide w in k strings)

● $ sequence of k+1 states r0, r1, .., rk in Q such that:


● r0 = q0

wi+1 L(d(ri ,ri+1 ))  0  i < k
● rk = q a

● Differences with NFA are in green


Example b*

a* ab
q0 q1 qa

Accepts w = aaabbab
w1=?
Example b*

a* ab
q0 q1 qa

Accepts w = aaabbab
w1=aaa w2=?
Example b*

a* ab
q0 q1 qa

Accepts w = aaabbab
w1=aaa w2=bb w3=ab
r0=q0 r1=?
Example b*

a* ab
q0 q1 qa

Accepts w = aaabbab
w1=aaa w2=bb w3=ab
r0=q0 r1=q1 r2=?

w1 = aaa  L(d(r0,r1)) = L(d(q0,q1)) = L(a*)


Example b*

a* ab
q0 q1 qa

Accepts w = aaabbab
w1=aaa w2=bb w3=ab
r0=q0 r1=q1 r2=q1 r3 = ?

w1 = aaa  L(d(r0,r1)) = L(d(q0,q1)) = L(a*)


w2 = bb  L(d(r1,r2)) = L(d(q1,q1)) = L(b*)
Example b*

a* ab
q0 q1 qa

Accepts w = aaabbab
w1=aaa w2=bb w3=ab
r0=q0 r1=q1 r2=q1 r3 = qa

w1 = aaa  L(d(r0,r1)) = L(d(q0,q1)) = L(a*)


w2 = bb  L(d(r1,r2)) = L(d(q1,q1)) = L(b*)
w3 = ab  L(d(r2,r3)) = L(d(q1,qa)) = L(ab)
Theorem:  DFA M  GNFA N : L(N) = L(M)
Construction:
To ensure unique transition between each pair:

1 1U0

To ensure unique final state, no transitions ingoing


start state, no transitions outgoing final state:
e
e e
e
Theorem:  GNFA N  RE R : L(R) = L(N)
Construction:
If N has 2 states, then N = S
q0 qa
thus R := S
If N has > 2 states, eliminate some state qr  q0, qa :
for every ordered pair qi, qj (possibly equal)
that are connected through qr
R2
R1 R3 R1R2*R3 U R4
qi qr qj qi qj
R4

Repeat until 2 states remain


Example: DFA → GNFA → RE

DFA a b

b,c q2
q1
Example: DFA → GNFA → RE

GNFA a b

e bUc q2 e qa
q0 q1
Example: DFA → GNFA → RE

a b

e bUc q2 e qa
q0 q1

Eliminate q1: re-draw GNFA with all other states

q0 q2 e qa
Example: DFA → GNFA → RE

a b

e bUc q2 e qa
q0 q1

Eliminate q1: find a path through q1

q0 q2 e qa
Example: DFA → GNFA → RE
Ø
a b

e bUc q2 e qa
q0 q1

Eliminate q1: add edge to new GNFA


Don't forget: no arrow means label Ø
b

e a* (b U c) U Ø q2 e qa
q0
Example: DFA → GNFA → RE

a b

e bUc q2 e qa
q0 q1

Eliminate q1: simplify RE on new edge

a* (b U c) q2 e qa
q0
Example: DFA → GNFA → RE

a b

e bUc q2 e qa
q0 q1

Eliminate q1: if no more paths through q1, start over

a* (b U c) q2 e qa
q0
Example: DFA → GNFA → RE

a* (b U c) q2 e qa
q0

Eliminate q2: re-draw GNFA with all other states

q0 qa
Example: DFA → GNFA → RE

a* (b U c) q2 e qa
q0

Eliminate q2: find a path through q2

q0 qa
Example: DFA → GNFA → RE

a* (b U c) q2 e qa
q0

Eliminate q2: add edge to new GNFA

a* (b U c) b* e U Ø qa
q0
Example: DFA → GNFA → RE

a* (b U c) q2 e qa
q0

Eliminate q2: simplify RE on new edge

a* (b U c) b* qa
q0
Example: DFA → GNFA → RE

a* (b U c) q2 e qa
q0

Eliminate q2: if no more paths through q2, start over

a* (b U c) b* qa
q0
Example: DFA → GNFA → RE

a* (b U c) b* qa
q0

Only two states remain:

RE = a* (b U c) b*
ANOTHER Example: DFA → GNFA → RE
a
DFA
q1 b q3
c
a
c q2

b
ANOTHER Example: DFA → GNFA → RE
a
GNFA
q0 e q1 b q3 e qa
c
a
c q2

b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2

b
re-draw GNFA with
all other states
q3 e qa
q0

a
q2

b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2

b
find a path
through q1
q3 e qa
q0

a
q2

b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2

b
add edge to
new GNFA
e a*b U Ø q3 e qa
q0

a
q2

b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2

b
find another
path through q1
e a*b U Ø q3 e qa
q0

a
q2

b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2

b
add edge to
new GNFA
e a*b U Ø q3 e qa
q0

a
e a*c U Ø q2

b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2

b
find another
path through q1
e a*b U Ø q3 e qa
q0

a
e a*c U Ø q2

b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2 don't forget current
b q2 → q3 edge!
add edge to
This time is not Ø !
new GNFA
e a*b U Ø q3 e qa
q0

ca*b U a
e a*c U Ø q2

b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2

b
find another
path through q1
e a*b U Ø q3 e qa
q0

ca*b U a
e a*c U Ø q2

b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2 don't forget current
b q2 → q2 edge!
add edge to
new GNFA
e a*b U Ø q3 e qa
q0

ca*b U a
e a*c U Ø q2

ca*c U b
ANOTHER Example: DFA → GNFA → RE
a

q0 e q1 b q3 e qa
c
a
Eliminate q1: c q2

b
when no more paths
through q1, start over
(and simplify a*b q3 e qa
q0
REs)
a*c ca*b U a
q2

ca*c U b
ANOTHER Example: DFA → GNFA → RE

a*b q3 e qa
q0

a*c ca*b U a
q2

Eliminate q2: ca*c U b

re-draw GNFA with


all other states

q0 a*b q3 e qa
ANOTHER Example: DFA → GNFA → RE

a*b q3 e qa
q0

a*c ca*b U a
q2

Eliminate q2: ca*c U b


find a path through q2

q0 a*b q3 e qa
ANOTHER Example: DFA → GNFA → RE

a*b q3 e qa
q0

a*c ca*b U a
q2

Eliminate q2: ca*c U b

add edge to new GNFA

a*c(ca*c U b)*(ca*b U a) U a*b


q0 q3 e qa
ANOTHER Example: DFA → GNFA → RE

a*b q3 e qa
q0

a*c ca*b U a
q2
Eliminate q2:
ca*c U b
when no more paths
through q2, start over

a*c(ca*c U b)*(ca*b U a) U a*b


q0 q3 e qa
ANOTHER Example: DFA → GNFA → RE

a*c(ca*c U b)*(ca*b U a) U a*b


q0 q3 e qa

Eliminate q3:
re-draw GNFA with
all other states

q0 qa
ANOTHER Example: DFA → GNFA → RE

a*c(ca*c U b)*(ca*b U a) U a*b


q0 q3 e qa

Eliminate q3: Ø
find a path through q3 Ø

don't forget: no arrow means Ø

q0 qa
ANOTHER Example: DFA → GNFA → RE

a*c(ca*c U b)*(ca*b U a) U a*b


q0 q3 e qa

Eliminate q3: Ø

add edge to new GNFA Ø

(a*c(ca*c U b)*(ca*b U a) U a*b) Ø* ε U Ø


q0 qa
ANOTHER Example: DFA → GNFA → RE

a*c(ca*c U b)*(ca*b U a) U a*b


q0 q3 e qa

Eliminate q3:
when no more paths through q3, start over
(and simplify REs)
don't forget: Ø*= ε

q0
a*c(ca*c U b)*(ca*b U a) U a*b
qa
ANOTHER Example: DFA → GNFA → RE

q0
a*c(ca*c U b)*(ca*b U a) U a*b
qa

Only two states remain:

RE = a*c(ca*c U b)*(ca*b U a) U a*b


Recap:
Here “” means “can be converted to”

RE  DFA  NFA
Any of the three recognize exactly
the regular languages (initially defined using DFA)
These conversions are used every time you enter
an RE, for example for pattern matching using grep

● The RE is converted to an NFA


● Then the NFA is converted to a DFA
● The DFA representation is used to pattern-match

Optimizations have been devised,


but this is still the general approach.
What language is NOT regular?

Is { 0n 1n : n  0 } = {ε, 01, 0011, 000111, … } regular?


Pumping lemma:
L regular language   p 0
 w  L, |w|  p
 x,y,z : w= xyz, |y|> 0, |xy| p
 i  0 : xyiz  L

Recall y0 = e, y1 = y, y2 = yy, y3 = yyy, ...


Pumping lemma:
L regular language   p 0
 w  L, |w|  p
 x,y,z : w= xyz, |y|> 0, |xy| p
Proof Idea:  i  0 : xyiz  L
Let M be a DFA recognizing L. Choose p := |Q|
Let w  L, |w|  p.
Among the first p+1 states of the trace of M on w,
2 states must be the same q.
y = portion of w that brings q back to q
can repeat or remove y and still accept string
Pumping lemma:
L regular language   p 0 A
 w  L, |w|  p
 x,y,z : w= xyz, |y|> 0, |xy| p
 i  0 : xyiz  L

Useful to prove L NOT regular. Use contrapositive:


L regular language  A
same as
(not A)  L not regular
Pumping lemma (contrapositive)
 p 0 not A
 w  L, |w|  p  L not regular
 x,y,z : w = xyz, |y| > 0, |xy|  p
 i  0 : xyiz  L

To prove L not regular it is enough to prove not A

Not A is the stuff in the box.


Proving something like
 bla  bla  bla  bla bla
means winning a game

Theory is all about winning games!


Example NAME THE BIGGEST NUMBER GAME

● Two players:
You, Adversary.
● Rules:
First Adversary says a number.
Then You say a number.
You win if your number is bigger.

Can you win this game?


Example NAME THE BIGGEST NUMBER GAME

● Two players:
You, Adversary.
● Rules:
First Adversary says a number.
Then You say a number.
You win if your number is bigger.

You have winning strategy:


if adversary says x, you say x+1
Example NAME THE BIGGEST NUMBER GAME

● Two players:
You, Adversary. , 
● Rules:
First Adversary says a number. xy:y>x
Then You say a number.
You win if your number is bigger.

You have winning strategy: Claim is true


if adversary says x, you say x+1
Another example:

Theorem:  NFA N  DFA M : L(M) = L(N)

We already saw a winning strategy for this game


What is it?
Another example:

Theorem:  NFA N  DFA M : L(M) = L(N)

We already saw a winning strategy for this game


The power set construction.
Games with more moves:
Chess, Checkers, Tic-Tac-Toe

You can win if


 move of the Adversary
 move You can make
 move of the Adversary
 move You can make

: You checkmate
Pumping lemma (contrapositive)
 p 0
 w  L, |w|  p  L not regular
 x,y,z : w = xyz, |y| > 0, |xy|  p
 i  0 : xyiz  L
Rules of the game:
Adversary picks p,
You pick w ∈ L of length  p,
Adversary decomposes w in xyz, where |y| > 0, |xy|p
You pick i  0
Finally, you win if xyiz  L
Theorem: L := {0n 1n : n  0} is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := 0p 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := 2
You must show xyyz  L:
Since |xy|p and w = xyz = 0p 1p , y only has 0
So xyyz = 0p + |y| 1p
Since |y| > 0, this is not of the form 0n 1n DONE
Theorem: L := {w : w has as many 0 as 1} not regular
Same Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := ?  i  0 : xyiz  L
Theorem: L := {w : w has as many 0 as 1} not regular
Same Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := 0p 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := ?
Theorem: L := {w : w has as many 0 as 1} not regular
Same Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := 0p 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := 2
You must show xyyz  L:
Since |xy|p and w = xyz = 0p 1p , y only has 0
So xyyz = ?
Theorem: L := {w : w has as many 0 as 1} not regular
Same Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := 0p 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := 2
You must show xyyz  L:
Since |xy|p and w = xyz = 0p 1p , y only has 0
So xyyz = 0p + |y| 1p
Since |y| > 0, not as many 0 as 1 DONE
Theorem: L := {0j 1k : j > k} is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := ?  i  0 : xyiz  L
Theorem: L := {0j 1k : j > k} is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := 0p+1 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := ?
Theorem: L := {0j 1k : j > k} is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := 0p+1 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := 0
You must show xz  L:
Since |xy|p and w = xyz = 0p+1 1p , y only has 0
So xz = 0p + 1 - |y| 1p
Since |y| > 0, this is not of the form 0j 1k with j > k
Theorem: L := {uu : u  {0,1}* } is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := ?  i  0 : xyiz  L
Theorem: L := {uu : u  {0,1}* } is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := 0p1 0p 1  i  0 : xyiz  L
Adversary moves x,y,z
You move i := ?
Theorem: L := {uu : u  {0,1}* } is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := 0p 1 0p 1  i  0 : xyiz  L
Adversary moves x,y,z
You move i := 2
You must show xyyz  L:
Since |xy|p and w = xyz = 0p 1 0p 1 , y only has 0
So xyyz = 0p + |y| 1 0p 1
Since |y| > 0, first half of xyyz only 0, so xyyz  L
2
n
Theorem: L := { 1 : n  0 } is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
You move w := ?  i  0 : xyiz  L
2
n
Theorem: L := { 1 : n  0 } is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
2
You move w := 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := ?
2
n
Theorem: L := { 1 : n  0 } is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
2
You move w := 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := 2
You must show xyyz  L:
Since |xy|p, |xyyz|  ?
2
n
Theorem: L := { 1 : n  0 } is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
2
You move w := 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := 2
You must show xyyz  L:
Since |xy|p, |xyyz|  p2 + p < (p+1)2
Since |y| > 0, |xyyz| > ?
2
n
Theorem: L := { 1 : n  0 } is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
2
You move w := 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := 2
You must show xyyz  L:
Since |xy|p, |xyyz|  p2 + p < (p+1)2
Since |y| > 0, |xyyz| > p2
So |xyyz| cannot be … what ?
2
n
Theorem: L := { 1 : n  0 } is not regular
Proof:  p 0
Use pumping lemma  w  L, |w|  p
Adversary moves p  x,y,z : w = xyz, |y| > 0, |xy|  p
2
You move w := 1p  i  0 : xyiz  L
Adversary moves x,y,z
You move i := 2
You must show xyyz  L:
Since |xy|p, |xyyz|  p2 + p < (p+1)2
Since |y| > 0, |xyyz| > p2
So |xyyz| cannot be a square. xyyz  L
Big picture
● All languages
● Decidable
Turing machines
● NP
● P
● Context-free
Context-free grammars, push-down automata
● Regular
Automata, non-deterministic automata,
regular expressions

You might also like