THEORY OF COMPUTATIONdung/Site/Theory_of_Computation_files/full.pdf · { the set of students taking...

$: THEORY OF COMPUTATIONdung/Site/Theory_of_Computation_files/full.pdf · { the set of students taking the theory of compu-tation course The elements or members of a set are the ob-jects$
THEORY OF COMPUTATION

There are four sorts of men:

He who knows not and knows not he knows not: he is afool - shun him;

He who knows not and knows he knows not: he is simple– teach him;

He who knows and knows not he knows: he is asleep –wake him;

He who knows and knows he knows: he is wise – followshim

Arabian proverb

Chapter 1: Sets, Relations andLanguages

Recommended Readings: Textbook + A Basis for Theo-retical Computer Science by M. A. Arbib, A. J. Kfoury,R N. Moll.

A set is a collection of objects.

Examples of sets are:

– the set of integers, denoted by Z,

– the set of nonnegative integers, also called natu-

ral numbers, denoted by N,

– the set of truth values B = {T, F}– the set of students taking the theory of compu-

tation course

The elements or members of a set are the ob-

jects comprising it. If b is an element of a set L,

then we write b ∈ L.

There are two ways to display a set:

either explicitly listing the elements belonging to

it like in the case of the set of truth values

B = {T, F},

or by specifying the properties that characterize

the elements of this set like: the set of even integers

{x |x ∈ Z and xmod 2 = 0}

Two sets are equal if they have the same ele-

ments

A set without any element is called the empty set

A set with infinitely many elements is said to be

infinite

A set with finitely many elements is said to be fi-

nite

A set X is a subset of a set Y, written X ⊆ Y , if

each element of X is also an element of Y.

X is a proper subset of Y if X is a subset of

Y and X and Y are not equal.

For example, the set of nonnegative integers is a

proper subset of the set of integers, N ⊂ Z.

Basic facts: For any set A, the empty set ∅ is a

subset of A, ∅ ⊆ A, and A is a subset of A, A ⊆ A.

If A ⊆ B and B ⊆ A then A = B.

Set operations: Union, Difference, In-

tersection

The union of two sets A,B is the set of elements

which belongs to at least one of them.

The intersection of two sets A,B is the set of ele-

ments which belongs to both of them.

We say two sets are disjoint if their intersection

is empty.

The set difference A\B of two sets A,B is the set

of those elements in A that are not in B.

Laws for set operations:

Idempotency A ∪ A = A

A ∩ A = A

Commutativity A ∪B = B ∪ AA ∩B = B ∩ A

Associativity (A∪B)∪C = A∪(B∪C)

(A ∩B) ∩ C = A ∩ (B ∩ C)

Suppose now that there is a big set U such that

both A, and B are subsets of U. Let B = U \ B,

B = U \B. Then

A \B = A ∩BA ∩B = A ∪BA ∪B = A ∩B

Exercises Prove the above equalities.

�

Distributivity A ∪ (B ∩ C) = (A ∪ B) ∩(A ∪ C)

A∩(B∪C) = (A∩B)∪(A∩C)

Proof

1. To show A∪ (B ∩C) = (A∪B)∩ (A∪C), we

show A ∪ (B ∩ C) ⊆ (A ∪ B) ∩ (A ∪ C) and

(A ∪B) ∩ (A ∪ C) ⊆ A ∪ (B ∩ C).(a) Let x ∈ A ∪ (B ∩ C).

From x ∈ A ∪ (B ∩ C), it follows: x ∈ A or

x ∈ B ∩ C.If x ∈ A then x ∈ A ∪ B and x ∈ A ∪ C.

Hence x ∈ (A ∪B) ∩ (A ∪ C).If x ∈ B ∩ C then x ∈ B and x ∈ C. If

x ∈ B then x ∈ A ∪ B. If x ∈ C then

x ∈ A ∪ C. Therefore if x ∈ B ∩ C then

x ∈ (A ∪B) ∩ (A ∪ C).

Hence from x ∈ A or x ∈ B ∩ C we can

conclude x ∈ (A ∪ B) ∩ (A ∪ C). Therefore

we have proved that if x ∈ A∪ (B ∩C) then

x ∈ (A ∪B) ∩ (A ∪ C).

(b) Let x ∈ (A ∪ B) ∩ (A ∪ C). To show x ∈A ∪ (B ∩ C).

From x ∈ (A ∪B) ∩ (A ∪ C), it follows: x ∈A ∪ B and x ∈ A ∪ C. Hence (x ∈ A or

x ∈ B) and (x ∈ A or x ∈ C). Hence there

are four cases: (x ∈ A) or (x ∈ A and x ∈ B)

or (x ∈ A and x ∈ C) or (x ∈ B and x ∈ C).

It follows that x ∈ A or (x ∈ B and x ∈ C).

That means x ∈ A ∪ (B ∩ C).

2. Exercise.

�

Absorption A ∩ (A ∪B) = A

A ∪ (A ∩B) = A

Proof Exercise.

�

DeMorgan’s Laws

A\(B∪C) = (A\B)∩(A\C)

A\(B∩C) = (A\B)∪(A\C)

Proof

1. Suppose there is a set U such that all sets A,B,C

are subsets of U. Hence

A \ (B ∪ C) = A ∩ (B ∪ C) = A ∩ (B ∩ C)

= (A ∩B) ∩ (A ∩ C) = (A \B) ∩ (A \ C)

2. Exercise.

�

If S is a collection of sets then⋃

S is the set

whose elements are the elements of the sets in S.

Example: S = {{a, b}, {c}, {a, d}}⋃S = {a, b, c, d}

S = { {a, b}, {c}, {a, {d, a} } }⋃S = {a, b, c, {d, a} }

The power set of a set S, denoted by 2S, is the

set of all subsets of S.

Example: S = {a} , 2S = {∅, {a} }

S = {a, b} , 2S = {∅, {a}, {b}, {a, b} }

A partition of a set S is a set Π of subsets of

S, i.e. Π ⊆ 2S, such that

1. Each element of Π is nonempty

2. Distinct members of Π are disjoint

3.⋃Π = S

Example S = {a, b, c}

Π = { {a}, {b, c} } is a partition of S.

A = { {a}, {a, b}, {c} } is not a partition.

B = { {a}, {b, c}, ∅ } is not a partition.

Functions or Maps

Ordered pairs are written as (a,b) where a is

the first component and b the second.

The Cartesian product of two sets A,B, de-

noted by A×B, is the set of all ordered pairs (a,b)

with a ∈ A, b ∈ B.

Example {a, b} × {a} = {(a, a), (b, a)}

The plane could be represented as a Cartesian

product

R×R

where R is the set of all real numbers.

A map or function from a set A to a set B,

denoted f : A → B, is an assignment to each

element a in A a single element, denoted by f(a),

in B.

A is called the domain of f and B is the codomain

of f.

f(a) is called the image of a under f.

The range of f is denoted by

f (A) = {b | there is a in A such that b = f (a)}

For A′ ⊆ A, f (A′) = {f (a) : a ∈ A′} is called

the image of A′ under f.

Example of functions:

The integer addition +: N×N → N is a func-

tion from N×N to N

Exercises 1) If A is empty, how many functions are

there from A to B ?

2) If B is empty, how many functions are there

from A to B ?

3) If A contains exactly one element, how many

functions are there from A to B ?

A function f : A −→ B is one to one if for

any two distinct elements a, a′ ∈ A, f (a) 6= f (a′).

A function f : A −→ B is onto B ifB = f (A).

A function f : A −→ B is a bijection be-

tween A and B if it is both one-to-one and onto

B.

Let A and B be two finite sets. A and B have the

same number of elements iff there is a bijection be-

tween A and B

Given a function f : A → B and g : B → C, the

composition f ◦ g : A→ C is defined by:

f ◦ g(a) = g(f (a))

Relations

A binary relation on two sets A,B is a subset

of A×B.

An example of binary relation is the greater or

equal relation≥ in the set of integersR = {(n,m) |n ≥m} ⊆ Z× Z.

An ordered n-tuple is written as (a1, . . . , an)

where ai is the ith component of (a1, . . . , an).

The n-fold Cartesian product of the sets

A1, . . . , An, denoted by A1× . . .×An, is the set of

all ordered n-tuples with ai ∈ Ai for i = 1, . . . , n.

An n-ary relation on sets A1, . . . , An is a subset

of A1 × . . .× An.

Property (A function is a special relation)

A function from a set A to a set B is a binary

relation R on A,B such that following property is

satisfied:

– For each a ∈ A there is exactly one ordered pair

in R with first component a.

�

A binary relation R ⊆ A × B has an inverse

R−1 ⊆ B × A defined by:

(b, a) ∈ R−1 iff (a, b) ∈ R.

For binary relations Q ⊆ A×B and R ⊆ B×C,

the composition Q ◦R is defined by

Q◦R = {(a, c) : ∃ b ∈ B s.t. (a, b) ∈ Q and (b, c) ∈ R}

Note that for two functions f : A −→ B and

g : B −→ C, f ◦ g : A −→ C.

Exercise Show that if f : A −→ B is a bijec-

tion, then f−1 is also a bijection from B to A.

Special Types of Relations

A relation R ⊆ A × A can be represented as a

directed graph.

A relation R ⊆ A× A is reflexive if for each

a ∈ A, (a, a) ∈ R.

A relation R ⊆ A×A is symmetric if (a, b) ∈R whenever (b, a) ∈ R.

Symmetric relation can be represented by undi-

rected graph.

A relation R ⊆ A × A is transitive if when-

ever (a, b) ∈ R and (b, c) ∈ R then (a, c) ∈ R

A relation R ⊆ A× A is an equivalent rela-

tion if it is reflexive, transitive and symmetric.

Let R be an equivalent relation on a set A. Then

for each a in A, the equivalence class of a with

respect to R is denoted by [a]R and is defined for-

mally by

[a]R = {b | (a, b) ∈ R}

When the context R is clear, we simply write [a]

for [a]R

A representation of an equivalent relation R ⊆A × A as an undirected graph consists of a num-

ber of ”clusters” where within clusters each pair is

connected by a line. The set of nodes in a cluster

is an equivalence class.

Property Let R be an equivalent relation on a

set A. Then for any two elements a,b in A, either

[a]R = [b]R or [a]R, [b]R are disjoint.

Proof Exercise.

LetR be an equivalent relation on a set A. Define

A modulo R to be the set

A/R = {[a]R | a ∈ A}

Theorem 1. Let R ⊆ A × A be an equivalent

relation. Then A modulo R is a partition of A.

Proof Exercise.

Example of Partition and equivalence relation:

Let Z+ be the set of positive integers. Define an

equivalent relation ≡⊆ Z+ × Z+ as follows:

(p, q) ≡ (r, s) iff ps = qr

The ≡ is an equivalent relation characterizing

the rational numbers, i.e.

The set Z+×Z+ modulo ≡ is the set of rational

numbers.

A relation R ⊆ A × A is antisymmetric if

whenever (a, b) ∈ R and a 6= b then (b, a) 6∈ R

A relation that is reflexive, transitive and anti-

symmetric is called a partial order.

A partial order R ⊆ A × A is called a total

order if for all a, b ∈ A, either (a, b) ∈ R or

(b, a) ∈ R.

A path from a to b in a binary relation R ⊆A×A is a sequence (a1, . . . , an), n ≥ 1 such that

a = a1, b = an, for each i = 1, . . . n−1, (ai, ai+1) ∈R.

A path (a1, . . . , an) is a cycle if a1 = an and all

ai’s are distinct.

Two sets A, B are equinumerous if there is a

bijection f : A −→ B.

We say that the cardinality of A is n if A is

equinumeruos with the set {1,...,n}.

A is countably infinite if it is equinumerous

with the set of natural numbers N .

A is uncountable if it is not equinumerous with

the set of natural numbers N .

Exercise Every subset of a finite set is finite.

Exercise Every subset of a countably infinite set is

finite or countably infinite.

Exercise The set N ×N is countably infinite.

Theorem 2. The union of countably infinite col-

lection of countably infinite sets is again count-

able infinite.

Exercixe Is the set of all finite subsets of N count-

ably infinite ?

Three Fundamental ProofTechniques

Principle of Mathematical Induction: Let

A be a set of natural number such that

1. 0 ∈ A, and

2. for each natural number n, if {0, 1, . . . , n} ⊆ A

then n + 1 ∈ A.

Then A = N.

Exercise Prove that for each natural number n

1 + . . . + n =n(n + 1)

2

Exercise Suppose f, g : N −→ (N − {0}) satisfy

following properties:

1. g(0) ≤ f (0), and

2. for each n ≥ 0,

g(n + 1)

g(n)≤ f (n + 1)

f (n)

Then for any n ≥ 0: g(n) ≤ f (n).

The pigeon principle If A, B are finite sets

and |A| > |B|. Then there is no one-to-one func-

tion from A to B.

Exercise Suppose there are 100 new students for

the august semester at AIT and only 97 vacant

rooms. Is it possible to give each student a sepa-

rate room ? Prove it.

Exercise If there are 10 scholarships and 50 appli-

cants. Is it possible that each applicant get a schol-

arship. (Note that scholarship can not be shared).

Exercise Could you try to prove the pigeon prin-

ciple using mathematical induction ?

The Diagonalization Principle Let R be a

binary relation on a set A, and let

–

D = {a | a ∈ A and (a, a) 6∈ R}D is called the diagonal set of R.

– for each a ∈ A,

Ra = {b | b ∈ A and (a, b) ∈ R}

Then for each a ∈ A , D 6= Ra.

Exercise Let A = {1, 2, 3, 4, 5} and

R = {(1, 3), (2, 4), (3, 5), (5, 1), (2, 3), (1, 1), (4, 2), (4, 4)}.Determine the sets D,R1, R2, R3, R4, R5. Check

whethere the diagonal principle holds in this case.

Exercise Would the diagonal principle still holds if

the sets Ra are replaced by the sets Ra = {b | b ∈A and (b, a) ∈ R}

Exercise Would the diagonal principle still holds

if the sets D is replaced by the set C = {a | a ∈A and (a, a) ∈ R}.

Theorem 3. The set 2N is uncountably infinite

Proof (Sketch) Suppose the theorem is not cor-

rect. Therefore, the set 2N is countably infinite, i.e.

there is a bijection (i.e. one-one and onto) function

f : N→ 2N.

Define R = {(n,m) |m ∈ f (n) }. Apply the

diagonal principle on R and show a contradiction

to the assumption that f is a bijection.

�

Exercice Is the set of all infinite subsets of N count-

ably infinite ?

Algorithms

Definition

Let R ⊆ A2 be a directed graph on a set A.

The reflexive transitive closure of R is the

relation

R∗ = {(a, b) | a, b ∈ A and there is a path from a to b in R}

Algorithms

The question we interested in this course is:

What is an algorithm

Example of an algorithm:

Initially S := ∅for i = 1,..., n do

for each i-tuple (b1, . . . , bi) ∈ Ai do

if (b1, . . . , bi) is a path in R then add (b1, bi)

to S

How much time the algorithm needs to terminate ?

The total number of operations can be no more

than

n + n2 + . . . + nn

that belongs to O(nn+1) where n is the number

of elements of A. This algorithm is therefore not

efficient.

An efficient algorithm:

Initially S = R ∪ {(ai, ai) | ai ∈ A}for each j = 1,2,...,n do

for each i = 1,2,...,n and k = 1,2,...,n do

If (ai, aj), (aj, ak) ∈ S but (ai, ak) 6∈ Sthen add (ai, ak) to S.

The algorithm terminate after no more than n3

steps.

Is this algorithm correct, i.e. S = R∗ after ter-

mination of the algorithm. (Could you use mathe-

matical induction to show it correctness ?)

Rate of Growth of Functions

Definition

1. Let f be a function from N into N . The order

of f, denoted by O(f ) is the set of all functions

g : N → N such that there are positive natural

numbers c, d > 0 such that for each n ∈ Ng(n) ≤ c.f (n) + d

2. We write f^_ g iff f ∈ O(g) and g ∈ O(f ).

�

Lemma^_ is a equivalence relation.

Proof Exercise.

Definition The equivalence class of a function

f : N −→ N with respect to^_ is called the

rate of growth of f.

Lemma Let f (n) = a0 + a1.n + . . . + ak.nk be

a polynomial of degree k with nonnegative coeffi-

cients. Then

1. f^_ nk

2. nk ∈ O(nk+1)

3. nk+1 6∈ O(nk)

Proof

1. Let c = a0 + a1 + . . . + ak and d = 0. It is easy

to see: f (n) ≤ c.nk.

2. Obviously nk ≤ nk+1.

3. Suppose nk+1 ∈ O(nk). Hence, there is c, d > 0:

nk+1 ≤ c.nk + d. Let n = c + d. It follows (c +

d).nk = c.nk + d.nk > c.nk + d. Contradiction.

That means that assuming nk+1 ∈ O(nk) leads

to a contradiction. Therefore nk+1 6∈ O(nk).

Lemma 1. Let f, g : N −→ N be monotonic,

i.e. for all m ≥ n: f (m) ≥ f (n) and g(m) ≥g(n). Further let n0 ∈ N be a natural number

such that for each n ≥ n0,

g(n + 1)

g(n)≤ f (n + 1)

f (n)

Then g ∈ O(f )

Proof Let c be a number such that

g(n0) ≤ cf (n0)

From an earlier exercise:

∀n ≥ n0 : g(n) ≤ c.f (n)

Let d = g(n0). We can prove that for each n ∈ N :

g(n) ≤ cf (n) + d

�

Theorem Let r > 1. Then

1. ni ∈ O(rn)

2. rn 6∈ O(ni)

Proof

1. It follows immediately from lemma 1. (Elabo-

rate)

2. Suppose rn ∈ O(ni). Hence ni^_ rn

^_ ni+1.

Contradiction to lemma 1. (Elaborate)

Alphabet and Languages

An alphabet is a finite set of symbols.

A string over an alphabet is a finite sequence

of symbols from this alphabet.

The empty string is denoted by e.

The length of a string is its length as a sequence.

The concatenation of two strings x,y, denoted

by x ◦ y or simply xy, is the string x followed by

string y.

The concatenation is associative, i.e (x ◦ y) ◦z = x ◦ (y ◦ z).

A string v is a substring of a string w if there

are strings x, y such that w = xvy.

For each string w, each natural number i, define

– w0 = e

– wn+1 = wn ◦ w

This is an example of inductive definition.

The reversal of a string is defined inductively

as follows:

– eR = e

– (ua)R = auR

Exercise Prove (w ◦ v)R = vR ◦ wR

The set of all strings over an alphabet Σ is de-

noted by Σ∗.

Any set of strings over an alphabet Σ is a lan-

guage.

Question: Is Σ∗ countably infinite ?

Language Operations

– Complement The complement of a language

A over Σ is Σ∗ − A– Concatenation Let L1, L2 be two languages

over Σ. The concatenation of L1, L2, denoted

by L1 ◦ L2 is defined by:

L1 ◦ L2 = {x ◦ y |x ∈ L1, y ∈ L2}– Denote

1. L0 = {e},2. L1 = L

3. L2 = L ◦ L4. Ln+1 = Ln ◦ LThe Kleen star of a language L, denoted by

L∗, is

L∗ = {w1 ◦ w2 ◦ . . . ◦ wn |wi ∈ L, n ≥ 0}= L0 ∪ L1 ∪ L2 ∪ . . . ∪ Ln ∪ . . .

Notes ∅∗ = {e}

Define

L+ = {w1 ◦ w2 ◦ . . . ◦ wn |wi ∈ L, n ≥ 1}

Regular Languages

The class of regular languages is the least set of

languages satisfying the following properties:

– The emptyset is regular

– For each character a ∈ Σ, the language {a} is

regular

– If A, B are regular languages then A∗, A ∪ B,

A ◦B are also regular.

Finite Representation ofLanguages

If we want to study languages, we need to rep-

resent them. For example, a context-free language

could be represented by a context-free grammar.

Such a representation must be finite even if the

represented language is infinite, otherwise it would

be useless (just imagine an infinite set of grammar

for Thai, Chinese or English. Would you be able to

learn such a set of grammar ?).

On the other handside, different languages should

have different representations.

Could we have enough representations for every

possible language over a nonempty alphabet Σ ?

How many languages there are over an nonempty

alphabet Σ ?

Since Σ∗ is countable infinite, 2Σ∗

is uncountable

infinite.

The representation of languages must be again

some form of language. hence every finite represen-

tation can be viewed as a finite string over some al-

phabet Σ ′. Since Σ ′∗ is countable infinite, we can

not find a finite representation for each langauge

over Σ.

Finte representation of regular languages

The set of regular expresions over Σ is the least

language over Σ ∪ {∗, (, ), ∅,∪, ◦} satisfying the

following properties:

– ∅ a regular expression

– Every symbol in Σ is a regular expression

– If α, β are regular expressions, then so are (αβ),

α∗, α ∪ β

Any regular expression represents a language as

follows:

– L(∅) = empty set

– Every symbol a ∈ Σ, L(a) = {a}– If α, β are regular expressions, then L(αβ) =

L(α)◦L(β), L(α∗) = L(α)∗, L(α∪β) = L(α)∪L(β)

Theorem 4. A language A is regular iff there

exists a regular expression α such that A = L(α)

Proof Exercise.

Exercise Show that the Kleen star, concatenation

and union operators are monotonic, i.e. for lan-

guages L0, L1, L, followings hold:

1. If L0 ⊆ L1 then L∗0 ⊆ L∗12. If L0 ⊆ L1 then L0 ◦ L ⊆ L1 ◦ L3. If L0 ⊆ L1 then L0 ∪ L ⊆ L1 ∪ L

Exercise Check whether the following equations

are correct.

1. ((a ∪ b)∗)∗ = (a ∪ b)∗2. (a ∪ b)∗(a ∪ b)∗ = (a ∪ b)∗3. (a ∪ b)∗ = a∗ ∪ (a∗b)∗

4. (a ∪ b)∗ = ∅∗ ∪ (a∗b)∗

5. (a ∪ b)∗ = (b∗a)∗ ∪ (a∗b)∗

Exercise Exercise 1.8.7 in text book.

Question Given a regular language L, and

a string w, how could we check whether w ∈ L ?

Example Let L = (a ∪ b∗a)∗

w = aababa.

Question: w ∈ L?

Algorithm:

s : x := get-next-symbol;

if x := end-of-file then accept;

else if x := a then goto s;

else if x := b then goto q;

q : x := get-next-symbol;

if x := end-of-file then reject;



s : x := get-next-symbol;

if x := end-of-file then accept;



q : x := get-next-symbol;

if x := end-of-file then reject;



Fig. 1.

Fig. 2.

Fig. 3.

A shortened representation of the algorithm in

above figure is M = (K, Σ, δ, s, F ):

1. K = {s, q}

2. Σ = {a, b}

3. F = {s}

4. δ : K ×Σ −→ K

δ = {(s, a, s), (s, b, q), (q, a, s), (q, b, q)}

Chapter 2: Finite Automata

Chapter 2.1: Deterministic Finite

Automata

Definition 1. A deterministic finite automa-

ton is a quintuple M = (K, Σ, δ, s, F ) where

– K is a finite set of states

– Σ is an alphabet

– s ∈ K is the initial state

– F ⊆ K is the set of final states

– δ, the transition function, is a function from

K ×Σ to K.

�

An example of finite automata is given in figure

3.

A configuration of a finite automaton M =

(K, Σ, δ, s, F ) is a pair (q, w) where q ∈ K and

w ∈ Σ∗

We say that a configuration (q,w) yields an-

other configuration (q′, w′) in one step, denoted

by (q, w) `M (q′, w′), if there is a symbol σ ∈ Σsuch that w = σw′ and q′ = δ(q, σ).

`∗M is the reflexive and transitive closure of `M ,

i.e. we write (q0, w0) `∗M (qn, wn) iff there are

(q1, w1), . . . , (qn1, wn−1) such that

(q0, w0) `M (q1, w1) `M . . . `M (qn1, wn−1) `M(qn, wn)

– A string w is said to be accepted by M =

(K, Σ, δ, s, F ) if and only if there exists a state

q ∈ F such that (s, w) `∗M (q, e)

– The language accepted by M, L(M), is the

set of all strings accepted by M.

Exercise Design an automaton for recognizing

the language a∗ba∗b.

Chapter 2.2: Nondeterministic Finite

Automata

Definition 2. A nondeterministic finite au-

tomaton is a quintuple M = (K, Σ, ∆, s, F )

where


– Σ is an alphabet



– ∆, the transition relation, is a finite subset of

K × (Σ ∪ {e})×K.

Examples A nondeterministic machine for

(a ∪ ab)∗.

Fig. 4.

Fig. 5.

M = (K, Σ, δ, s, F ):

– K = {s, q}

– Σ = {a, b}

– F = {s}

– ∆ ⊆ K × (Σ ∪ {e})×K:

∆ = {(s, a, q), (q, e, s), (q, b, s)}

Exercise Design a deterministic machine for (a ∪ab)∗.

Exercise Could you design a nondeterministic ma-

chine for (a ∪ ab)∗ without using e.

A configuration of a nondeterministic finite

automata M = (K, Σ, ∆, s, F ) is a pair (q, w)

where q ∈ K and w ∈ Σ∗

We say that a configuration (q,w) yields an-

other configuration (q′, w′) in one step, denoted

by (q, w) `M (q′, w′), if there is u ∈ Σ ∪ {e} such

that w = uw′ and (q, u, q′) ∈ ∆.

`∗M is the reflexive and transitive closure of `M ,

i.e. we write (q0, w0) `∗M (qn, wn) iff there are

(q1, w1), . . . , (qn1, wn−1) such that

(q0, w0) `M (q1, w1) `M . . . `M (qn1, wn−1) `M(qn, wn)

– A string w is said to be accepted by M =


q ∈ F such that (s, w) `∗M (q, e)

– The language accepted by M, L(M), is the


Exercise Design a deterministic and a nondeter-

ministic automaton for recognizing the language

(ab ∪ aba)∗.

Chapter 2.3: Equivalence of

Deterministic and Nondeterministic

Finite Automata

Two finite automata M, M ′ are equivalent iff

L(M) = L(M ′).

Theorem 5. For each nondeterministic finite

automaton there exists an equivalent determin-

istic finite automaton.

Proof Let M = (K, Σ, ∆, s, F ) be a nondeter-

ministic automata. For each q ∈ K, define

E(q) = {p ∈ K | (q, e) `∗M (p, e) }

Define M ′ = (K ′, Σ, δ′, s′, F ′).

– K ′ = 2K

– s′ = E(s)

–

δ′ : K ′ ×Σ −→ K ′

δ′(Q, σ) =⋃q∈Q

{E(p) | p ∈ K and (q, σ, p) ∈ ∆}

– F ′ = {Q ⊆ K | Q ∩ F 6= ∅ }

Property For each w ∈ Σ∗:

(q, w) `∗M (p, e) iff (E(q), w) `∗M ′ (P, e)

for some P ⊆ K such that p ∈ P . �

Therefore, it follows that

w ∈ L(M) iff

(s, w) `∗M (p, e) for some p ∈ F iff

(s′, w) `∗M ′ (P, e) and P ∈ F ′ iff

w ∈ L(M ′).

�

Example Let Σ = {a1, . . . , an}, n ≥ 2 and

L = {w | there is one symbol inΣ not appearing in w }

A nondeterministic finite automaton accepting

L:

M = (K,Σ,∆, s, F ) where

– K = {s, q1, . . . , qn}

– F = {q1, . . . , qn}

– ∆ = {(s, e, qi) | 1 ≤ i ≤ n}∪

{(qi, σ, qi) | 1 ≤ i ≤ n, σ 6= ai }

Translate M into an equivalent deterministic fi-

nite automaton.

Chapter 2.4: Properties of Languages

Accepted by Finite Automata

Theorem 6. The class of languages accepted

by finte automata is closed under

1. union

2. concatenation

3. Kleen star

4. complementation

5. intersection

Theorem 7. A language is regular iff it is ac-

cepted by a finite automata

Proof From the definition of regular languages,

it is clear that every regular language is accepted

by a finite automata (exercise).

It remains to prove that every language accepted

by a finite automata is also regular.

M = (K, Σ, ∆, s, F ) be a deterministic finite

automaton.

Let K = {q1, . . . , qn} with s = q1 and Σ =

{a1, . . . , an}.

R(i, j, k) = {σ1 . . . σm ∈ Σ∗ | (qi, σ1 . . . σm) `M

(qk1, σ2 . . . σm) `M . . . `M (qkm−1, σm) `M (qj, e)

and max{k1, . . . , km−1} ≤ k }

It is clear

L(M) =⋃qj∈F

R(1, j, n)

Now we want to show that R(i, j, k) are regular.

We prove this by induction on k.

Base case: k = 0. Obvious from the definition of

R(i, j, 0)

Inductive Step: Follows from the following equation

R(i, j, k) =

R(i, j, k−1)∪R(i, k, k−1)◦R(k, k, k−1)∗◦R(k, j, k−1)

Pumping Theorem

Theorem 8. Let L be a regular language. There

is an integer n > 0 such that any string w ∈ Lwith |w| ≥ n can be rewritten as w = xyz such

that y 6= e, |xy| ≤ n and xykz ∈ L for any

k ≥ 0.

�

Example

L = {aibi | i ≥ 0 } is not regular.

Example

L = {w ∈ {a, b}∗ |w has an equal number of a’s and b’s }

is not regular.

STATE MINIMIZATION

Problem: Given a regular language L, design

a finite automaton M such that L = L(M) and M

has as few states as possible, i.e. for each finite au-

tomaton M ′ that accepts L, the number of states

of M’ is greater or equal the number of states of M

Definition 3. Let M = (K,Σ, δ, s, F ) be a de-

terministic automaton. We say that two strings

x, y ∈ Σ∗ are equivalent wrt M, denoted by

x ∼M y, if there is a state q such that (s, x) `∗M(q, e) and (s, y) `∗M (q, e)

Definition 4. Let L ⊆ Σ∗ and x, y ∈ Σ∗. We

say that x and y are equivalent wrt to L,

denoted by x ≈L y if for all z ∈ Σ∗,

xz ∈ L iff yz ∈ L.

It is obvious that for all x, y ∈ Σ∗ and a ∈ Σ,

x ≈L y implies xa ≈L ya

Theorem 9. For any deterministic finite au-

tomaton M = (K,Σ, δ, s, F ) and for any string

x, y ∈ Σ∗, if x ∼M y then x ≈L(M) y.

Let M = (K,Σ, δ, s, F ) be a deterministic fi-

nite automaton and q ∈ K and Lq be the set of

all strings x such that (s, x) `∗M (q, e). Further for

each x ∈ Σ∗, let [x]M be the equivalent class of x

wrt ∼M .

It is obvious that for each x ∈ Lq following as-

sertions hold:

– Lq = [x]M .

– The mapping µ(q) = Lq is a bijection from K

onto {[x]M | x ∈ Σ∗} such that for all p, q ∈ K,

δ(q, a) = p iff [xa]M = Lp

Therefore M is equivalent to the following au-

tomaton M ′ = (K ′, Σ, δ′, s′, F ′) where

– K ′ = {[x]M |x ∈ Σ∗}– s′ = [e]M– δ′([x]m, a) = [xa]M– F ′ = {[x]M | (s, x) `∗M (q, e), and q ∈ F}

Let L ⊆ Σ∗ be a regular language.

Define a deterministic finite automaton M =

(K,Σ, δ, s, F ) as follows:

– K = {[x]L | x ∈ Σ∗} where [x]L is the equiva-

lent class of x wrt ≈L– s = [e]L– F = {[x]L | x ∈ L}– δ([x]L, a) = [xa]L

Theorem 10.

L = L(M)

Question Given a deterministic finite automa-

ton M = (K,Σ, δ, s, F ), how to construct a deter-

ministic finite automaton M ′ such that L(M) =

L(M ′) and M ′ has a minimal number of states.

– Define

L′p = {w | ∃f ∈ F : (p, w) `∗M (f, e)}

L′p,n = {w |w ∈ L′p, |w| ≤ n}– For p, q ∈ K, define

p ≡ q

iff L′p = L′q

– For p, q ∈ K, define

p ≡n q iff L′p,n = L′q,n

It is obvious that

≡0⊇≡1⊇ . . . ⊇ ≡n⊇ . . . ⊇≡Because K is finite, there is n such that

≡n = ≡n+1 = ≡

Lemma 2. For any two states p,q and any num-

ber n ≥ 0, p ≡n+1 q iff following conditions

hold:

– p ≡n q– for each a ∈ Σ: δ(p, a) ≡n δ(q, a)

ALGORITHMS FOR FINITE

AUTOMATA

Theorem 11. 1. There is an exponential algo-

rithm which given a nondeterministic finite

automaton, constructs an equivalent determin-

istic finite automaton.

2. There is a polynomial algorithm which given

a deterministic finite automaton, constructs

an equivalent deterministic finite automaton

with a minimal number of states.

3. There is a polynomial algorithm which given

two deterministic finite automata, decides whether

they are equivalent.

�

Theorem 12. Given a deterministic finite au-

tomaton M = (K,Σ, δ, s, F ) and a string w,

there is an algorithm to decide whether w is ac-

cepted by M in O(|w|) time.

�

Theorem 13. Given a nondeterministic finite

automaton M = (K,Σ, δ, s, F ) and a string w,

there is a polynomial algorithm to decide whether

w is accepted by M.

Proof The algorithm is given below:

S0 := E(s); n := 0;

repeat the following

{n := n + 1;

σ := the n-th input symbol;

if σ 6= end-of-file then

Sn :=⋃{E(q) | ∃p ∈ Sn−1 : (p, σ, q) ∈ ∆ }

}until σ = end-of-file

if Sn−1 ∩ F 6= ∅ then accept else reject

The algorithm is polynomial.

�

Context Free Languages

Definition 5. A context-free grammar G is

a quadruple (V,Σ,R, S) where

– V is an alphabet

– Σ (the set of terminals) is a subset of V

– R (the set of rules) is a finite subset of (V −Σ)× V ∗

– S (the start symbol) is an element of V −Σ.

�

– The elements of V − Σ are called nontermi-

nals.

– We write AG−→ u for any rule (A, u) ∈ R.

– For any strings w, v ∈ V ∗, we write

wG

=⇒ v

iff there is a rule AG−→ u in R and w = xAy

and v = xuy.

– A sequence

w0G

=⇒ w1G

=⇒ . . .G

=⇒ wn, n ≥ 0

is called a derivation of wn from w0.

– We write

wG

=⇒∗v

iff there is a derivation of v from w.

The language generated by G is

L(G) = {w ∈ Σ∗ |S G=⇒

∗w }

A context-free language is a language gen-

erated by some context free grammar.

Example A grammar for arithmetic expressions:

– Σ = {x,+, ∗, (, )}– V = Σ ∪ {T, F,E}– R = { E −→ E + T

E −→ T

T −→ T ∗ FT −→ F

F −→ (E)

F −→ x }

Give a derivation for (x), (x + x) ∗ x.

Regular Languages are Context-free

Theorem 14. Any regular language is context

free

Proof Let L be a regular language accepted by a

deterministic finite automatonM = (K,Σ, δ, S, F ).

Construct a grammar G = (V,Σ,R, S) as follows:

– V = K ∪Σ– R consists of rules of the forms• P → aQ where δ(P, a) = Q, and• P → e for P ∈ F

We show by induction on n that

(S, a1 . . . an) `M (Q1, a2 . . . an) `M . . . `M (Qn, e)

iff

SG

=⇒ a1Q1G

=⇒ . . .G

=⇒ a1 . . . anQn

– Basic step: n = 0. Obvious.– Inductive Step: Suppose the assertion holds for

n. We show that it holds for n + 1. Let

(S, a1 . . . anan+1) `M (Q1, a2 . . . an) `M . . . `M (Qn+1, e)

be a computation in M. Hence,

(S, a1 . . . an) `M (Q1, a2 . . . an) `M . . . `M (Qn, e)

is also a computation in M. From induction hy-

pothesis,

SG

=⇒ a1Q1G

=⇒ . . .G

=⇒ a1 . . . anQn

is a derivation wrtG. From (Qn, an+1) `M (Qn+1, e),

it follows δ(Qn, an+1) = Qn+1. Hence Qn →an+1Qn+1 is a rule in R. Therefore a1 . . . anQn

G=⇒

a1 . . . anan+1Qn+1.

The other direction holds obviously.

Hence for any string w ∈ Σ∗, w ∈ L iff

(S,w) `∗M (P, e) and P ∈ F iff SG

=⇒∗wP

and P → e in R iff SG

=⇒∗w.

Parse Trees

Let G = (V,Σ,R, S)

1. For each a ∈ Σ, the tree

◦ a

is a parse tree. The root of this tree is a which

is also its only leaf. The yield of this tree is also a.

2. If A −→ e is a rule in G, then

Fig. 6.

is a parse tree whose root is A, whose only leaf

is e and whose yield is e.

3. If

Fig. 7.

are parse trees (n ≥ 1), with roots A1, . . . , An

and yields y1, . . . , yn respectively, and A −→A1 . . . An is a rule in R then

Fig. 8.

is also a parse tree whose root is A, whose leafs

are the leaves of T1, . . . , Tn and whose yield is

y1 . . . yn.

4. Nothing else is a parse tree

�

Examples Construct parse trees yielding (x), (x+

x) ∗x wrt the grammar for arithmetic expressions.

Lemma 3. Let G = (V,Σ,R, S) be a context-

free grammar, and let A ∈ V −Σ, and w ∈ Σ∗.Then the following statements are equivalent

1. A⇒∗ w2. There is a parse tree with root A and yield w.

– We write xL

=⇒ y if the nonterminal symbol

being replaced is the leftmost nonterminal sym-

bol in the string, i.e

x = wAv, y = wuv where w ∈ Σ∗, v ∈ V ∗,

A→ u ∈ R.

A leftmost derivation is of the form

x1L

=⇒ x2L

=⇒ . . .L

=⇒ xn

– We write xR

=⇒ y if the nonterminal symbol be-

ing replaced is the rightmost nonterminal symbol

in the string, i.e

x = wAv, y = wuv where v ∈ Σ∗, w ∈ V ∗,

A→ u ∈ R.

A rightmost derivation is of the form

x1R

=⇒ x2R

=⇒ . . .R

=⇒ xn

Theorem 15. Let G = (V,Σ,R, S) be a context-

free grammar, and let A ∈ V −Σ, and w ∈ Σ∗.

Then the following statements are equivalent:

1. A⇒∗ w

2. There is a parse tree with root A and yield w

3. There is a leftmost derivation AL

=⇒∗w

4. There is a rightmost derivation AR

=⇒∗w

AMBIGUITY

Let Σ = {x,+, ∗, (, )}. Consider the following

two grammars:

– G1 = (V1, Σ,R1, E)

• V1 = Σ ∪ {T, F,E}

• R1 = { E −→ E + T E −→ TT −→ T ∗ F T −→ FF −→ (E) F −→ x }

– G2 = (V2, Σ,R2, E)

• V2 = Σ ∪ {E}

• R2 = { E −→ E + E E −→ E ∗ EE −→ (E) E −→ x }

G1 is unambiguous in the sense that for each

string w ∈ L(G1), there is exactly one parse tree

that yields w. On the contrary, G2 is ambiguous

as there are distinct parse trees yielding the same

string.

Exercise Could we use the following grammar

as the generator of arithmetic expressions in pro-

gramming languages:

– Σ = {x,+, ∗, (, )}– V = Σ ∪ {T, F,E}– R = { E −→ E ∗ T

E −→ T

T −→ T + F

T −→ F

F −→ (E)

F −→ x }

Pushdown Automata

Definition 6. A pushdown automata is a

sixtuple M = (K, Σ, , Γ, ∆, s, F ) where


– Σ is an alphabet (the input symbols)

– Γ is an alphabet (the stack symbols)



– ∆, the transition relation, is a finite subset of

(K × (Σ ∪ {e})× Γ ∗)× (K × Γ ∗)

�

For ease of undertsanding, we often write

(p, a, α)→ (q, β)

for ((p, a, α), (q, β)) ∈ ∆

Let L = {wcwR | w ∈ {a, b}∗ }.

L is accepted by pushdown automaton M =

(K,Σ, Γ,∆, s, F )

– K = {s, p, f}, F = {f}

– Σ = {a, b, c}, Γ = {a, b}

– ∆ consists of the following rules

(s, a, e)→ (p, a) (s, b, e)→ (p, b)

(s, c, e)→ (f, e)

(p, a, e)→ (p, a) (p, b, e)→ (p, b)

(p, c, e)→ (f, e)

(f, a, a)→ (f, e) (f, b, b)→ (f, e)

Fig. 9.

A configuration of a pushdown automata is a

triple

(q, w, v)

where q ∈ K, w ∈ Σ∗ and v ∈ Γ ∗

A configuration (q,w,v) yields another configu-

ration (q′, w′, v′) in one step, denoted by

(q, w, v) `M (q′, w′, v′)

if there is a rule (q, a, α)→ (q′, β) in ∆ such that

w = aw′, v = αv0, v′ = βv0.

We write

(q, w, v) `∗M (q′, w′, v′)

if there is a sequence

(q1, w1, v1) `M . . . `M (qn, wn, vn)

such that (q, w, v) = (q1, w1, v1), (qn, wn, vn) =

(q′, w′, v′).

A string w is said to be accepted by M =


q ∈ F such that (s, w, e) `∗M (q, e, e)

The language accepted by M, L(M), is the


Theorem 16. A language is context-free iff it

is accepted by a pushdown automaton.

Lemma 4. Each context-free language is accepted

by some pushdown automaton.

Proof Let G = (V,Σ,R, S) be a context-free

grammar. The language generated by G is accepted

by the pushdown automatonM = (K,Σ, Γ,∆, s, F ):

– K = {s, q}, F = {q}– Γ = V

– ∆ consists of the following transitions:

• (s, e, e)→ (q, S)

• (q, e, A)→ (q, x) for each rule A→ x in R

• (q, a, a)→ (q, e) for each a ∈ Σ.

�

Exercise Construct a pushdown automaton accept-

ing the language generated by the following gram-

mar G = (V,Σ,R, S) where

– V = {S, a, b}, Σ = {a, b}– R consists of rules

S → aSa S → bSb, S → e

A pushdown automata M = (K,Σ, Γ,∆, s, F ) is

called simple if following conditions are satisfied:

For each ((q, a, β)(p, γ)) ∈ ∆ such that q 6= s:

β ∈ Γ and |γ| ≤ 2

Claim For every pushdown automaton there ex-

ists an equivalent simple pushdown automata.

Proof Let M = (K,Σ, Γ,∆, s, F ) be a push

down automaton. Construct a simple PA M ′ =

(K ′, Σ, Γ ∪ {Z}, ∆′, s′, {f ′}) as follows:

– Add to ∆ the transitions:

((s′, e, e)(s, Z))

((f, e, Z)(f ′, e)) for each f ∈ F– Replace transitions with |β| ≥ 2.

Let ((q, a, β)(p, γ)) be a transition with β =

B1 . . . Bn with n > 1.

Replace this transition by the following transi-

tions

((q, e, B1)(qB1, e))

((qB1, e, B2)(qB1B2, e))

.....

.....

((qB1B2...Bn−2, e, Bn−1)(qB1B2...Bn−1, e))

((qB1B2...Bn−1, a, Bn)(p, γ))

– Get rid of transition with |γ| > 2

Let ((q, a, β)(p, γ)) be a transition with γ =

C1 . . . Cn with m > 1.

Replace this transition by the following transi-

tions

((q, a, β)(r1, Cm))

((r1, e, e)(r2, Cm−1))

....

....

((rm−2, e, e)(rm−1, C2))

((rm−1, e, e)(p, C1))

– Get rid of transition with β = e

Replace all transitions of the form ((q, a, e)(p, γ))

with q 6= s′ by all transitions of the form ((q, a, A)(p, γA))

Lemma 5. Any language accepted by a simple

pushdown automata is context-free.

Proof LetM = (K,Σ, Γ,∆, s, F ) be a pushdown

automata and M ′ be constructed as above.

Define G = (V,Σ,R, S) as follows:

– V = {S} ∪Σ ∪ K × (Γ ∪ {e})×K– R consists of the following rules:

• S −→ 〈s, Z, f ′〉• for each transtition ((q, a, B), (r, C)) ∈ ∆′

and for each p ∈ K ′:〈q, B, p〉 −→ a〈r, C, p〉• for each transition ((q, a, B), (r, C1C2)) ∈ ∆′,

for all p, p′ ∈ K ′〈q, B, p〉 −→ a〈r, C1, p

′〉〈p′, C2, p〉• 〈q, e, q〉 −→ e for each q ∈ K ′

Properties of Context-freeLanguages

Theorem 17. The context-free languages are closed

under union, concatenation and Kleen star.

Proof Let L, L′ be context-free languages gener-

ated by context-free grammars G = (V,Σ,R, S),

G′ = (V ′, Σ,R′, S ′) where we assume that V −Σand V ′ −Σ are disjoint.

Construct new grammars fromG, G′ to generate

L ∪ L′, L ◦ L′, L∗.�

Theorem 18. The intersection of a context-free

language with a regular language is a context-

free language.

Proof Let L be a context-free language accepted

by pushdown automaton M = (K,Σ, Γ,∆, s, F )

and L′ be a regular language accepted by a deter-

ministic finite automaton M ′ = (K ′, Σ, δ, s′, F ′)

where K,K ′ are disjoint.

Construct a new pushdown automaton fromM, M ′

to accept L ∩ L′.

Theorem 19. The intersection of two context

free languages is not always context-free.

The complementation of a context free lan-

guage is not always context-free.

Proof Let L0 = {ambncn |m,n ≥ 0 }, L1 =

{ambmcn |m,n ≥ 0 }. Both L0, L1 are context free

(exercise).

L = L0 ∩ L1 = {anbncn |n ≥ 0 }. We show

below that L is not context-free.

We use the equation L0∩L1 = L0 ∪ L1 to show

that the complementation of a context free lan-

guage is not always context-free (exercise).

�

Theorem 20. (Pumping Theorem)

Let G = (V,Σ,R, S) be a context-free gram-

mar. Then there is a number n such that for

any string w ∈ L(G) of length greater than n

can be rewritten as w = uvxyz in such a way

that either v or y is nonempty and uvkxykz is

in L(G) for any k.

Proof The fanout of a grammarG = (V,Σ,R, S),

denoted by φ(G) is the largest number ofsymbols

on the right handsid of any rule in G. A path in

the parse tree is a sequence of distinct nodes each

connected to the previous one by a line segment;

the first node is the root and the last node is a

leaf. The length of a path is the number of line

segments in it. The height of the parse tree is the

length of the longest path in it.

Lemma 6. The yield of any parse tree of G

with the height h has length at most φ(G)h.

Lemma 7. Let G = (V,Σ,R, S) be a context-

free grammar. Then any string w ∈ L(G) of

length greater than φ(G)|V−Σ| can be rewritten

as w = uvxyz in such a way that either v or y

is nonempty and uvkxykz is in L(G) for any k.

Algorithmic Properties

Definition 7. A context-free grammar G = (V,Σ,R, S)

is said to be in Chomsky normal form if all rules

are of the form

A→ BC

where A,B,C ∈ V

Theorem 21. There is a polynomial algorithm

which given a context-free grammar G and a

string w, decides whether w ∈ L(G) ?

Proof The proof consists of three main steps:

– A polynomial translation of G into an grammar

G′ in Chomsky normal form such that L(G) −(Σ ∪ {e}) = L(G′).

– A polynomial algorithm to decide whether w ∈L(G′) for |w| ≥ 2.

– A polynomial algorithm to decide whether a ∈L(G) for a ∈ Σ ∪ {e}.

Lemma 8. For every context-free grammar G =

(V,Σ,R, S) there is a context-free grammar G′

in Chomsky normal form such that L(G′) =

L(G)− ({e} ∪Σ).

G′ could be constructed in time polynomial to

the size of G.

Proof

– Replace each rule A→ B1B2 . . . Bn by

A→ B1A1

A1 → B2A2

........................

An−2 → Bn−1Bn

where A1, . . . , An−2 are new nonterminals.

– Construct E = {A |A⇒∗ e} as follows:

• E := ∅• While there is a rule A→ α with α ∈ E∗ and

A 6∈ Edo E := E ∪ {A}.

– Removing e-rules:

• Delete all rules of the form A→ e

• For each rule of the form

A→ BC or A→ CB where B ∈ Eadd the rule A→ C

– Construct for each A, D(A) = {B |A ⇒∗ B}as follows:

• D(A) = {A}• While there is a rule B → C with B ∈ D(A)

and C 6∈ D(A)

do D(A) := D(A) ∪ {C}.

– Removing rules containing only one symbol on

the right hand side:

• Delete all rules of the form A→ B

• Replace each rule of the form A→ BC by all

possible rules of the form A → B′C ′ where

B′ ∈ D(B), C ′ ∈ D(C)

• Add the rule S → BC for each rule A→ BC

such that A ∈ D(S)− {S}.�

Lemma 9. Let G be a grammar in Chomski

normal form. There is a polynomial algorithm

to decide whether w ∈ L(G) for |w| ≥ 2.

Proof Let w = x1 . . . xn. Define

N [i, j] = {A |A⇒∗ xi . . . xj}

The sets N [i, j] could be computed by

– N [i, i] = {xi}

– N [i, j] = {A |A→ BC, B ∈ N [i, s],

C ∈ N [s + 1, j] }

w ∈ L(G) iff w ∈ N [1, n].

�

Example Translate the grammar

S → (S) S → SS S → e

into Chomski normal form

S → SS S → ()

S → (S1 S1 → S)

Check whether the string ( ( ) ( ( ) ) ) is gener-

ated by the Chomski normal form grammar.

Fig. 10.

TURING MACHINES

Definition 8. A Turing machine is a 5-tuple

M = (K, Σ, δ, s,H) where

– K is a finite set of states,

– Σ is an alphabet containing the blank symbol

t and the left end symbol ., but not con-

taining the symbols ← and →


– H ⊆ K is the set of halting states

– δ, the transition function, is a function from

(K−H)×Σ to K× (Σ ∪{←,→}) such that

• for all q ∈ K −H there exists p such that

δ(q, .) = (p,→)

• for all q ∈ K −H, and a ∈ Σ, if δ(q, a) =

(p, b) then b 6= .

Example Design a Turing machine for accepting

L = {w |w ∈ {a, b}∗, |w| is even }.

M = (K, Σ, δ, s, H) with

– K = {s, q0, q1, y, n}, H = {y, n}

– Σ = {a, b}

– δ : (K −H)×Σ −→ K × (Σ ∪ {←,→})

q σ δ(q, r)

s t q0,→q0 t y,tq0 a or b q1,→q1 t n,tq1 a or b q0,→

M accepts aa:

(s, .taa) `M (q0, . t aa) `M (q1, . t aa) `M(q0, . t aat) `M (y, . t aat)

M rejects a:

(s, .ta) `M (q0, . t a) `M (q1, . t at) `M(n, . t at)

A configuration of a Turing machine M =

(K, Σ, δ, s,H) is a member of

K × .Σ∗ × (Σ∗ × (Σ − {t}) ∪ {e})

Definition 9. Let M = (K, Σ, δ, s,H) and let

(q1, w1a1u1) and (q2, w2a2u2) be configurations of

M. Then

(q1, w1a1u1) `M (q2, w2a2u2)

iff for some b ∈ Σ ∪ {←,→}, δ(q1, a1) = (q2, b)

and either

1. b ∈ Σ, w1 = w2, u1 = u2 and a2 = b

or

2. b =←, w1 = w2a2, and either

(a) u2 = a1u1, if a1 6= t or u1 6= e

or

(b) u2 = e, if a1 = t and u1 = e

or

3. b =→, w2 = w1a1, and either

(a) u1 = a2u2or

(b) u1 = u2 = e and a2 = t

A halted configuration is of the form (h,w1aw2)

with h ∈ H .

A computation is a sequence of configurations

C1, C2, . . . , Cn such that C1 `M C2 `M . . . `MCn. We say that the computation has length n and

C1 yields Cn.

M is said to halt on input w iff (s, .tw) yields

some halted configuration.

COMBINING TURING MACHINES

Symbol writing machine: For each symbol

a ∈ Σ ∪ {←,→} − {.}, Ma is a Turing machine

which write the symbol a into the scanned tape

square.

Ma = ({s, h}, Σ, δ, s, {h}):δ(s, b) = (h, a) for each b ∈ Σ − {.}

Notations:

We often write

– a for Ma

– L for M←– R for M→– Lt: Finds the first blank square to the left of the

currently scanned symbol

– Rt: Finds the first blank square to the right of

the currently scanned symbol

An initial configuration is of the form (s, .tw)

Definition 10. Given is a Turing machine M =

(K,Σ, δ, s,H) with H = {y, n}. Any halting

configuration whose state is y is said to be an

accepting state while a halting configuration whose

state is n is said to be a rejecting state.

We say that M accepts an input w ∈ (Σ −{t, .})∗ if (s, .tw) yields an accepting configu-

ration.

We say that M rejects an input w ∈ (Σ −{t, .})∗ if (s, .tw) yields a rejecting configura-

tion.

Let Σ0 ⊆ Σ − {t, .} be an alphabet, called

input alphabet. We say that M decides a lan-

guage L ⊆ Σ∗0 if for any string w ∈ Σ∗0If w ∈ L then M accepts w

If w 6∈ L then M rejects w

A language L is called recursive if it is de-

cided by some Turing machine.

Example Show that

L0 = {anbn |n ≥ 0 } and L1 = {anbncn |n ≥ 0 }

are recursive.

Fig. 11.

Fig. 12.

Example Eeach deterministic finite automaton could

be ”simulated” by a TM.

Let M = (K,Σ, δ, s, F ) be a DFA.

Define a TM M ′ = (K∪{y, n}, Σ ′, δ′, s′, {y, n})with

Σ ′ = Σ ∪ {.,t} and

δ′ : K ×Σ ′ → (K ∪ {y, n})× (Σ ′ ∪ {←,→})

– δ(s′,t) = (s,→)

– δ′(p, a) = (δ(p),→) for p ∈ K, a ∈ Σ– δ′(p,t) = (y,t) for p ∈ F– δ′(p,t) = (n,t) for p 6∈ F

It holds:

∀w ∈ Σ∗ : w ∈ L(M) iff (s′, .tw) `∗M (y, .wt)

∀w ∈ Σ∗ : w 6∈ L(M) iff (s′, .tw) `∗M (n, .wt)

�

Recursive Functions

Definition 11. Let M = (K, Σ, δ, s, {h}). Let

Σ0 ⊆ (Σ − {t, .} and w ∈ Σ∗0 . Suppose

(s, .tw) `∗M (h, .tu)

Then u is called the output of M on input w

and denoted by M(w).

1. We say that M computes function

f : Σ∗0 → Σ∗0

if for all w ∈ Σ∗0 , M(w) = f (w).

2. We say that M computes function

f : Nk → N, (k ≥ 1)

if for all binary numbers w1, . . . , wk,

M(w; . . . ;wk) = f (w1, . . . , wk)

where f (w1, . . . , wk) is also in binary repre-

sentation.

A function f is recursive if there is a Turing

machine that computes f.

Example Show that g(n) = 2n and f (n) = n + 1

are recursive functions.

Recursively Enumerable Languages

Definition 12. Let M = (K,Σ, δ, s,H) be a

Turing machine. Let Σ0 ⊆ Σ − {t, .} be an

alphabet, and L ⊆ Σ∗0 .

We say that M semidecides L if for any

string w ∈ Σ∗0 :

w ∈ L iff M halts on inputw

A language L is called recursively enumer-

able if it is semidecided by some Turing ma-

chine.

Theorem 22. – If a language is recursive then

it is also recursively enumerable.

– If a language is recursive then its complement

is also recursive.

Extensions of Turing Machines

We study two important extensions of Turing

machines:

1. Allowing several tapes instead of one

2. Allowing nondeterministism

Multi-tape Turing Machines

A k-tape Turing machine consists of a finite

control together with k infinite tapes. Each tape

is scanned by a read/write head. The machine can

in one step sense the symbols scanned by all its

heads and then depending on those symbols and

it s current state, rewrite some of those scanned

squares or move some of the heads to the left or

right.

Definition 13. A k-tape Turing machine is

a 5-tuple M = (K, Σ, δ, s, H) where

– K, Σ, s, H are as in the definition of ordinary

Turing machine

– δ, the transition function, is a function

δ : (K −H)×Σk −→ K × (Σ ∪ {←,→})k

Example Design a two-tapes machine to decide

L = {wcw |w ∈ {a, b}∗ } where c 6∈ {a, b}.

| B | t | w | c | w | t | ...

↑

| B | t | t | ...

↑

q r1 r2 δ(q, r1, r2)

s t t p,→,→p t t n,t,tp σ t p′,t, σp′ t σ p,→,→p c t r2,→,←r2 x σ r2, x,←r2 x t f, x,→f σ σ f,→,→f t t y,t,tf x y n, x, y if x 6= y

σ stands for a or b

x stands for a or b or ty stands for a or b or t

Extending Turing machines with more than one

tape does not increase the power of the machines.

Theorem 23. Let k > 1 and M = (K, Σ, δ, s, H)

be a k-tape Turing machine.

Then there is a standard Turing machine M ′ =

(K ′, Σ ′, δ′, s′, H) where Σ ⊆ Σ ′, and such that

for any w ∈ (Σ − {t, .})∗ :

M halts on input w with output y on its first

tape after t steps if and only if

M ′ halts on input x with the same output after

a polynomial number of steps of t and the size

of x.

Nondeterministic Turing machines

Definition 14. A nondeterministic Turing

machine is a quintuple M = (K, Σ, ∆, s, H)

where

– K, Σ, s, H are as for standard Turing ma-

chines, and

– ∆, the transition relation, is a subset of

((K −H)×Σ)× (K × (Σ ∪ {←,→}) )

Definition 15. Let M = (K, Σ, ∆, s) and let

(q1, w1a1u1) and (q2, w2a2u2) be configurations of

M. Then

(q1, w1a1u1) `M (q2, w2a2u2)

iff for some b ∈ Σ∪{←,→}, ((q1, a1), (q2, b)) ∈∆ and

1. either b ∈ Σ, w1 = w2, u1 = u2 and a2 = b

2. or b =←, w1 = w2a2, and

(a) either u2 = a1u1, if a1 6= t or u1 6= e

(b) or u2 = e, if a1 = t and u1 = e

3. or b =→, w2 = w1a1, and

(a) either u1 = a2u2(b) or u1 = u2 = e and a2 = t

Definition 16. Let M = (K,Σ, δ, s,H) be a

nondeterministic Turing machine.

We say that M accepts w ∈ (Σ − {.,t})∗ if

(s, .tw) `∗M (h, uav) for some h ∈ H, a ∈ Σ,

u, v ∈ Σ∗.We say that M semidecides L ⊆ (Σ−{.,t})∗

if for any string w ∈ (Σ − {.,t})∗: w ∈ L iff

M halts on input w.

Definition 17. Let M = (K,Σ, δ, s, {y, n}) be

a nondeterministic Turing machine.

We say that M decides L ⊆ (Σ − {.,t})∗if the following conditions hold for any string

w ∈ (Σ − {.,t})∗:

1. There is a number N depending on M and w

such that no computation starting from (s, .tw)

has a length greater than N

2. w ∈ L iff (s, .tw) `∗M (y, uav) for some

a ∈ Σ, u, v ∈ Σ∗.

We say that M computes a function f : (Σ−{.,t})∗ → (Σ−{.,t})∗ if the following condi-

tions hold for any string w ∈ (Σ − {.,t})∗:

1. There is a number N depending on M and w

such that no computation starting from (s, .tw)

has a length greater than N

2. (s, .tw) `∗M (h, uav) iff ua = .t, v = f (w).

Theorem 24. If a nondeterministic Turing ma-

chine semidecides or decides a language or com-

putes a function then there is a standard one

that semidecides or decides the same language

or computes the same function.

Unrestricted Grammars

Definition 18. A unrestricted grammar G

is a quadruple (V,Σ,R, S) where

– V is an alphabet

– Σ (the set of terminals) is a subset of V

– R (the set of rules) is a finite subset of

V ∗(V −Σ)V ∗ × V ∗– S (the start symbol) is an element of V −Σ.

�

– We write uG−→ v for any rule (u, v) ∈ R.

– For any strings w,w′ ∈ V ∗, we write

wG

=⇒ w′

iff there is a rule uG−→ v and w = xuy and

w′ = xvy.

– A sequence

w0G

=⇒ w1G

=⇒ . . .G

=⇒ wn, n ≥ 0

is called a derivation of wn from w0.

– We write

wG

=⇒∗v

iff there is a derivation of v from w.

The language generated by G is

L(G) = {w ∈ Σ∗ |S G=⇒

∗w }

Example Design an unrestricted grammar for L =

{anbncn |n ≥ 1 }

G = (V,Σ,R, S) where

– V = {S,A,B,C, Ta, Tb, Tc, a, b, c}– Σ = {a, b, c}

– R consists of the following rules:

S → ABCS S → Tc

CA→ AC BA→ AB CB → BC

CTc → Tcc

BTc → Tbb BTb → Tbb

ATb → Taa ATa → Taa

Ta → e

�

Theorem 25. A language is generated by an

unrestricted grammar if and only if it is recur-

sively enumerable.

UNDECIDABILITY

The Church - Turing Thesis

An algorithm is anything that can be viewed

as corresponding to a Turing machine that halts

on all inputs.

Nothing is considered to be an algorithm if it

cannot be rendered as a Turing machine that is

guaranteed to halt on all inputs.

Universal Turing Machine

Let M = (K,Σ, δ, s,H) be a Turing machine. M

is now encoded over the alphabet {a, q, 0, 1} as

follows:

– Let i, j be smallest integer such that 2i ≥ |K|and 2j ≥ |Σ| + 2

– each state in K is represented by a string from

q{0, 1}i– each symbol in Σ is represented by a string from

a{0, 1}j– The representation of the special symbols is as

follows:

t a0j

. a0j−11

← a0j−210

→ a0j−211

– The start state is represented by q0i

The representation of a Turing machine M,

denoted by ”M”, consists of a sequence of strings of

the form (p,b,r,c) with p,r representations of states

and b,c representations of symbols.

Example Let M = (K,Σ, δ, {h}) be a TM where

K = {s, p, q, h}, Σ = {0, 1,t, .} and

δ(s,t) = (p,→),

δ(p, 0) = (p,→), δ(p, 1) = (p,→),

δ(p,t) = (q, 0),

δ(q, 0) = (q,←), δ(q, 1) = (q,←),

δ(q,t) = (h,t).

s q00

p q01

q q10

h q11

t a000

B a001

← a010

→ a011

0 a100

1 a101

The representation ”M” is the following string:

(q00, a000, q01, a011)(q01, a100, q01, a011)(q01, a101, q01, a011)

(q01, a000, q10, a100)(q10, a100, q10, a011)(q10, a101, q10, a011)

(q10, a000, q11, a000)

The universal Turing machine U is a ma-

chine which uses the encodings of other machine to

direct its operations.

Intuitively U takes two arguments: a description

of a Turing machine M, ”M”, and a description of

an input string w, ”w”.

U has the following property:

U halts on ”M””w” iff M halts on w

U is constructed in two steps:

1. A three-tape universal machineU ′ is constructed.

2. Transform U ′ into an equivalent single tape ma-

chine U .

U ′ simulates a TM M as follows:

1. The first tape contains the encoding of the tape

of M

2. The second tape contains the encoding of M

3. The third tape contains the encoding of the state

of M at the current point in the simulated com-

putation

U ′ starts with the string ”M””w” on its first tape

and the other tapes blank.

– ”M” is moved onto the second tape, and

– ”w” is shifted down to the left end of the first

tape, preceding it by .t

– U ′ extracts the coding of the initial state of M

and puts it on the third tape.

The halting problem

Halting Problem:

Given an arbitrary Turing machine M and an

input w, is there an algorithm which can

decide whether M accepts w ?

Let H = {”M””w” | Turing machine M accepts

input string w }.

From the Church-Turing thesis,

”Yes” answer to the halting problem iff the lan-

guage H is recursive.

It is clear that the universal Turing machine ac-

cepts H . Hence H recursively enumerable.

Theorem 26. H is not recursive.

Proof Suppose H is recursive. Let

H1 = {w |w = ”M” for some TM M and M accepts w }

Because H is recursive, H1 is also recursive. Let

ΣU = {a, q, 0, 1, (, ), , } be the alphabet of the uni-

versal TM.

Define R ⊆ Σ∗U ×Σ∗U by

(u,w) ∈ R iff u = ”M” for some TM machine M accepting w

For each string u, Ru = {w | (u,w) ∈ R }

Therefore, for each language L,

L is r.e. iff ∃u = ”M” : L = Ru = L(M).

Let D = {w | (w,w) 6∈ R }

From the diagonalization principle, there is no u

such that D = Ru. Therefore D is not r.e.

It is not difficult (exercise) to see that

H1 = {w | (w,w) ∈ R } = D

Since H1 is recursive and D = H1, D is also

recursive, hence r.e. Contradiction because we have

showed that D is not r.e. Hence, the assumption

that H is recursive leads to a contradiction. H is

hence not recursive. �

Undecidable Problems

Since H is not recursive, there is no algorithm to

decide whether a TM M accepts an input string w.

We say

the halting problem is undecidable.

Definition 19. Let L1, L2 ⊆ Σ∗ be languages.

A reduction from L1 to L2 is a recursive

function τ : Σ∗ → Σ∗ such that x ∈ L1 iff

τ (x) ∈ L2.

Theorem 27. If L1 is not recursive and there

is a reduction from L1 to L2 then L2 is not re-

cursive.

Theorem 28. The following problems are un-

decidable:

1. Given a Turing machine M, does M hold on

the empty tape ?

2. Given a Turing machine M, is the language

accepted by M empty ?

3. Given a Turing machine M, does M accepts

every input ?

4. Given Turing machines M1,M2, do they ac-

cept the same language ?

5. Given a Turing machine M, Is the language

accepted by M regular ? Is it context-free ? Is

it decidable ?

Proof

1. Let He = {”M” | M hold on the empty tape}.We give a reduction τe from H to He. Let w =

a1 . . . an. Define

τe(”M””w”) = ”M ′”

where M ′ = Ra1Ra2 . . . RanLtM .

It is clear that M accepts w iff M ′ accepts the

empty tape. From the reduction theorem, He is

not recursive. Therefore, the problem is undecid-

able.

2. Let Hi = {”M” |M halts on some input}.We reduce He to Hi. Let M be a TM.

Fig. 13.

Let τi(”M”) = ”M ′”. It is clear that if M halts

on empty tape then M ′ halts on all inputs (and

hence some input).

Hi is not recursive.

3. Similar to the previous case.

4. Let Ha = {”M” | M accepts all inputs}. Ha is

not recursive.

Let Heq = {”M”M ′” |M,M ′ accept the same

language}.Let u be a string representing a TM that accepts

all inputs in the first step, i.e. δ(s,t) = (h,t).

Define

τa(”M”) = ”M”u

It is clear that ”M” ∈ Ha iff ”M”u ∈ Heq.

Since Ha is not recursive, Heq is not recursive.

5. See Rice theorem

Theorem 29. (Rice’s Theorem)

Suppose that C is a proper nonempty subset

of the class of all recursively enumerable lan-

guages. The the following question is undecid-

able: Given a Turing machine M, is L(M) ∈ C

Proof Let H ′ = {M |L(M) ∈ C }. We reduce H

to H ′. Let L ∈ C and ML that accepts L.

Without loss of generality, we could assume that

∅ 6∈ C.

Let ”M””w” ∈ H . Let τ (”M””w”) = ”M ′”

where M ′ is a TM rendering the following algo-

rithm:

If U(”M””w”) halts then ML(x)

where x is the input of M ′.

It is clear that if M accepts w then M ′ accepts

a language in C. Hence, τ is a reduction from H to

H ′. Therefore, H ′ is not recursive.

Computational Complexity

Example Consider the problem of a travelling sale

man who has to visit 10 offices in 10 cities. The sale

man has a map with distances between the cities

and he should find an itinerary with the shortest

distance.

A simple algorithm would check all possible itineraries.

The number would be 9! = 362880. If the number

of cities is say 40, no presently forseeable computer

could handle it as it would take billions of years to

finish.

What represents a practically feasible algorithm ?

The Class P

Definition 20. A Turing machine M = (K,Σ, δ, s,H)

is said to be polynomially bounded if there

is a polynomial p(n) such that for any input x,

there is no configuration C such that

(s, .tx) `p(|x|)+1M C

A language is said polynomially decidable

if there is a polynomially bounded Turing ma-

chine that decides it.

The class of all polynomial decidable languages

is denoted by P.

Theorem 30. P is closed under complementa-

tion

Some Well-Known Problems

Reachability Problem

Given a directed graph G ⊆ V × V where V =

{v1, . . . , vn} and two nodes vi, vj, is there a path

from vi to vj ?

Independent Set

Given an undirected graph G and an integer

K ≥ 2, is there a set of nodes C with |C| ≥ K

such that for all vi, vj ∈ C, there is no edge be-

tween vi and vj ?

Boolean Satisfiability

X = {x1, . . . , xn} : a finite set of boolean vari-

ables and

X = {x1, . . . , xn} where xi is the negation of

xi.

The elements of X ∪X are called literals; vari-

ables are positive literals, whereas negations of

variables are negative literals.

A clause is a set of literals. A Boolean for-

mula is a set of clauses.

A truth assignment T is a mapping from X

into {>,⊥} where > and ⊥ stand for true and

false respectively.

We say that a truth assignment T satisfies a

Boolean formula F if for each clause C of F there

is at least one variable xi such that xi ∈ C and

T (xi) = > or xi ∈ C and T (xi) = ⊥.

F is satisfiable if there is a truth assignment

that satisfies F.

Satisfiability: Given a Boolean formula F, is F

satisfiable ?

3-Satisfiability: Given a Boolean formula F whose

clause contains only three or fewer literals, is F sat-

isfiable ?

2-Satisfiability: Given a Boolean formula F whose

clause contains only two or fewer literals, is F sat-

isfiable ?

Reachability is in P .

We have given before an polynomial algorithm

for computing the set of all pairs (v, u) in a graph

such that there is a path from v to u. Hence reach-

ability is in P .

2-satisfiability is in P .

A 2-satisfiability problem could be reduced to

reachability by presenting any clause x ∨ y as a

link from node x to y and from y to x.

An instance of 2-satisfiability is unsatisfiable if

there is a variable x such that there is a path from

x to x and vice versa (exercise: by mathematical

induction).

Are there practical problems that are recursive

but non-polynomially decidable ?

The Class NP

Definition 21. A nondeterministic Turing ma-

chine M = (K,Σ,∆, s,H) is said to be poly-

nomially bounded if there is a polynomial

p(n) such that for any input x, there is no con-

figuration C such that

(s, .tx) `p(|x|)+1M C

A language is said to be nondeterministic

polynomially decidable if there is a nonde-

terministic polynomially bounded Turing machine

that decides it.

The class of all nondeterministic polynomial

decidable languages is denoted by NP.

The Boolean satisfiability, 3-satisfiability, indepen-

dent set problems are all in NP .

Theorem 31. P ⊆ NP

Open Problem P = NP ?

NP - Completeness

Definition 22. – A function τ : Σ∗ → Σ∗ is

said to be polynomial-time computable

if there is a polynomial-bounded Turing ma-

chine that computes it.

– Let L1, L2 ⊆ Σ∗ be languages. A polynomial

reduction from L1 to L2 is a polynomial-

time computable function τ : Σ∗ → Σ∗ such

that x ∈ L1 iff τ (x) ∈ L2.

Definition 23. A language L is said to be NP-

complete if and only if:

1. L ∈ NP2. For every language L′ ∈ NP, there is a polynomial-

time transformation from L′ to L.

Theorem 32. Let L be a NP-complete language.

Then P = NP iff L ∈ P

Theorem 33. Satisfiablity is NP-complete.

Theorem 34. 3-satisfiability is NP-complete.

Proof We give a polynomial reduction from satis-

fiability to 3-satisfiability.

Let C ≡ (x1 ∨ x2 ∨ . . . ∨ xk).Let y1, . . . , yk−3 be new Boolean variables. C is

reduced to following set of 3-clauses:

(x1∨x2∨y1), (y1∨x3∨y2), (y2∨x4∨y3), . . . ,(yk−3 ∨ xk−1 ∨ xk)

τ (F ) is the collections of all 3-clauses obtained

by transforming clauses in F.

F is satisfiable iff τ (F ) is satisfiable.

�

Theorem 35. Independent set problem is NP-

complete.

Proof We reduce 3-satisfiability to independent

set problem where K = m where m is the number

of clauses.

For each clause, there are three nodes labelled

by the literals in the clause. Further, two nodes

labelled by complementary literals are linked.

For example:

(x1∨x2∨x3) (x1∨x2∨x3) (x1∨x2∨x3) (x1∨x2 ∨ x3)

Fig. 14.

�

THEORY OF COMPUTATIONdung/Site/Theory_of_Computation_files/full.pdf · { the set of students taking...

Documents

Transcript of THEORY OF COMPUTATIONdung/Site/Theory_of_Computation_files/full.pdf · { the set of students taking...