Ambiguity in the Grammar \S \rightarrow ABA,\; A \rightarrow aA \mid \epsilon,\; B \rightarrow bB \mid \epsilon\ | AI Research | Coursify

Ambiguity in the Grammar (S \rightarrow ABA,; A \rightarrow aA \mid \epsilon,; B \rightarrow bB \mid \epsilon)

Verified Sources

May 20, 2026

A context-free grammar is called ambiguous if there exists at least one string in its language that admits two distinct parse trees; equivalently, the same string has two different leftmost derivations or two different rightmost derivations.2 Ambiguity matters because a parser may assign multiple syntactic structures to the same input, which makes interpretation non-unique.2

For the grammar

S \rightarrow ABA

A \rightarrow aA \mid \epsilon

B \rightarrow bB \mid \epsilon

the nonterminal $A$ generates any number of $a$ 's, including none, so $L(A)=\{a^i \mid i\ge 0\}$ , and $B$ generates any number of $b$ 's, including none, so $L(B)=\{b^j \mid j\ge 0\}$ .2 Therefore, the start symbol generates strings of the form

L(S)=\{a^i b^j a^k \mid i,j,k \ge 0\}.

This can also be written as $a^\* b^\* a^\*$ .2 The key source of ambiguity is that the two occurrences of $A$ in $S \to ABA$ can both generate $a$ -strings, so when a derived string contains only $a$ 's, or when the middle $B$ vanishes to $\epsilon$ , there may be multiple ways to distribute the same $a$ 's between the left and right $A$ .2

A compact structural view is:

Because both $A_1$ and $A_2$ can independently derive $a^\*$ and $B$ can derive $\epsilon$ , the boundary between the left and right blocks of $a$ 's is not unique for some strings.2

Parse trees, ambiguity, and Chomsky normal form - Defines ambiguity via existence of a string with at least two parse trees. ↩ ↩² ↩³ ↩⁴ ↩⁵
CS 373: Theory of Computation - University of Illinois - Explains ambiguity of CFGs in terms of different parse trees and derivations. ↩ ↩² ↩³
Context-Free Grammars and Languages - Discusses ambiguity, parse trees, and why multiple parses create non-unique structure. ↩ ↩²
Unambiguous Grammar - Includes this exact grammar as an ambiguity exercise and supports the analysis of how the productions generate strings. ↩ ↩²

Ambiguous Grammar

Core Definition

To prove a grammar ambiguous, it is enough to find one string in the language that has two distinct parse trees or two distinct leftmost derivations.2

Parse trees, ambiguity, and Chomsky normal form - Defines ambiguity via existence of a string with at least two parse trees. ↩
CS 373: Theory of Computation - University of Illinois - Explains ambiguity of CFGs in terms of different parse trees and derivations. ↩

Why this grammar is ambiguous

Let us choose the string

w = a.

This string belongs to $L(G)$ because we may let one $A$ generate $a$ , let $B \Rightarrow \epsilon$ , and let the other $A \Rightarrow \epsilon$ . However, there is also another derivation in which the first $A \Rightarrow \epsilon$ , $B \Rightarrow \epsilon$ , and the second $A$ generates $a$ . Since these correspond to different syntactic structures, the grammar is ambiguous.2

More generally, every string $a^n$ with $n\ge 1$ is ambiguous in this grammar. Since $B \Rightarrow \epsilon$ , we have

S \Rightarrow ABA \Rightarrow a^i \epsilon a^k = a^{i+k},

and for any fixed $n$ , there are multiple choices of $(i,k)$ such that $i+k=n$ . For example, $a^2$ can be split as $a^2\epsilon\epsilon$ , $a\epsilon a$ , or $\epsilon\epsilon a^2$ . At least two such splits yield distinct parse trees, which is sufficient for ambiguity.2

An abstract view of the ambiguity for $a^n$ is:

Left (A) contributes	(B) contributes	Right (A) contributes	Result
(a^n)	(\epsilon)	(\epsilon)	(a^n)
(a^{n-1})	(\epsilon)	(a)	(a^n)
(\cdots)	(\epsilon)	(\cdots)	(a^n)
(\epsilon)	(\epsilon)	(a^n)	(a^n)

Thus the grammar does not assign a unique structure to such strings.2

Unambiguous Grammar - Includes this exact grammar as an ambiguity exercise and supports the analysis of how the productions generate strings. ↩ ↩²
Parse trees, ambiguity, and Chomsky normal form - Defines ambiguity via existence of a string with at least two parse trees. ↩ ↩² ↩³
CS 373: Theory of Computation - University of Illinois - Explains ambiguity of CFGs in terms of different parse trees and derivations. ↩ ↩²
Context-Free Grammars and Languages - Discusses ambiguity, parse trees, and why multiple parses create non-unique structure. ↩

Proof That the Grammar Is Ambiguous

1
Step 1
From $A \rightarrow aA \mid \epsilon$ , the nonterminal $A$ generates $a^*$ . From $B \rightarrow bB \mid \epsilon$ , the nonterminal $B$ generates $b^*$ . Hence $S \rightarrow ABA$ generates $a^*b^*a^*$ .2

Footnotes

Parse trees, ambiguity, and Chomsky normal form - Defines ambiguity via existence of a string with at least two parse trees. ↩

Unambiguous Grammar - Includes this exact grammar as an ambiguity exercise and supports the analysis of how the productions generate strings. ↩
2
Step 2
Choose the shortest nontrivial string $a$ . It is in the language because one occurrence of $A$ can produce $a$ , while $B$ and the other $A$ can both produce $\epsilon$ .

Footnotes

Unambiguous Grammar - Includes this exact grammar as an ambiguity exercise and supports the analysis of how the productions generate strings. ↩
3
Step 3
$S \Rightarrow ABA \Rightarrow aABA \Rightarrow aBA \Rightarrow aA \Rightarrow a.$ Here the left $A$ contributes the terminal $a$ , while $B$ and the right $A$ both derive $\epsilon$ .
4
Step 4
$S \Rightarrow ABA \Rightarrow BA \Rightarrow A \Rightarrow aA \Rightarrow a.$ Here the left $A$ derives $\epsilon$ , $B$ derives $\epsilon$ , and the right $A$ contributes the terminal $a$ .
5
Step 5
These are two distinct leftmost derivations of the same terminal string $a$ . Therefore the grammar is ambiguous.2

Footnotes

CS 373: Theory of Computation - University of Illinois - Explains ambiguity of CFGs in terms of different parse trees and derivations. ↩

Context-Free Grammars and Languages - Discusses ambiguity, parse trees, and why multiple parses create non-unique structure. ↩

Two distinct parse trees for the string (a)

Below are two different parse trees for the same string $a$ .

Parse Tree 1: the left $A$ produces $a$ , while $B$ and the right $A$ produce $\epsilon$ .

Parse Tree 2: the left $A$ produces $\epsilon$ , $B$ produces $\epsilon$ , and the right $A$ produces $a$ .

Since these parse trees are structurally different but yield the same terminal string $a$ , the grammar is ambiguous by definition.2

Parse trees, ambiguity, and Chomsky normal form - Defines ambiguity via existence of a string with at least two parse trees. ↩
CS 373: Theory of Computation - University of Illinois - Explains ambiguity of CFGs in terms of different parse trees and derivations. ↩

Common Mistake

Different derivation orders do not by themselves prove ambiguity. The important point is that the same string must have different parse trees, or equivalently different leftmost derivations.2

CS 373: Theory of Computation - University of Illinois - Explains ambiguity of CFGs in terms of different parse trees and derivations. ↩
Context-Free Grammars and Languages - Discusses ambiguity, parse trees, and why multiple parses create non-unique structure. ↩

The grammar generates $a^*b^*a^*$ . The source of ambiguity is that both copies of $A$ generate the same language $a^*$ , and $B$ may disappear by deriving $\epsilon$ . Therefore some strings, especially $a^n$ , can be split between the two $A$ symbols in multiple ways.2

Parse trees, ambiguity, and Chomsky normal form - Defines ambiguity via existence of a string with at least two parse trees. ↩
Unambiguous Grammar - Includes this exact grammar as an ambiguity exercise and supports the analysis of how the productions generate strings. ↩

Number of Possible Splits of the String (a^n)

If (B \Rightarrow \epsilon), the (n) copies of (a) can be divided between the two occurrences of (A) in (n+1) ways.

Formal interpretation of the chart

For a string $a^n$ , if $B \Rightarrow \epsilon$ , then the left occurrence of $A$ may generate $a^i$ and the right occurrence may generate $a^{n-i}$ for any integer $i$ with $0 \le i \le n$ . Hence the number of possible splits is

n+1.

So for $n=1$ , there are $2$ splits; for $n=2$ , there are $3$ splits; and in general the grammar offers multiple structural decompositions of the same terminal string. The existence of more than one valid decomposition for even one value of $n\ge 1$ already proves ambiguity.2

This also shows that the ambiguity is not accidental or isolated: it is a systematic effect caused by overlapping generative roles of the two $A$ nonterminals combined with the epsilon production $B \Rightarrow \epsilon$ .2

Unambiguous Grammar - Includes this exact grammar as an ambiguity exercise and supports the analysis of how the productions generate strings. ↩ ↩²
Parse trees, ambiguity, and Chomsky normal form - Defines ambiguity via existence of a string with at least two parse trees. ↩ ↩²
CS 373: Theory of Computation - University of Illinois - Explains ambiguity of CFGs in terms of different parse trees and derivations. ↩

Further Clarifications

Exam Strategy

When asked to prove a grammar ambiguous, first compute what each nonterminal generates, then look for overlap caused by repeated nonterminals or epsilon-productions. Short strings often expose ambiguity fastest.2

Parse trees, ambiguity, and Chomsky normal form - Defines ambiguity via existence of a string with at least two parse trees. ↩
Unambiguous Grammar - Includes this exact grammar as an ambiguity exercise and supports the analysis of how the productions generate strings. ↩

Knowledge Check

Question 1 of 4

Q1Single choice

When is a context-free grammar called ambiguous?

When at least one string in its language has two distinct parse trees

When it contains an epsilon-production

When it generates infinitely many strings

When it has more than one nonterminal

Explore Related Topics

Lexical Analysis and the Main Structure Used: Finite Automata

Lexical analysis relies on finite automata—typically deterministic finite automata (DFA)—to recognize token patterns defined by regular expressions.

Regular expressions for identifiers, numbers, etc., are converted to NFAs then to a DFA for fast scanning.
The DFA processes the source character by character, tracking a single current state and emitting a token at each accepting state.
Queues, stacks, and trees support other compiler phases (parsing, AST construction) but are not the primary model for token recognition.
Lexers output a stream of tokens that the parser consumes for syntax analysis.

Which Grammar Type Is the Most Powerful? Understanding Type-0, Type-1, Type-2, and Type-3 Grammars

The most expressive grammar in the Chomsky hierarchy is the unrestricted Type‑0 grammar, which generates all recursively enumerable languages and matches the power of a Turing machine.

The hierarchy is strict: $L_3 \subset L_2 \subset L_1 \subset L_0$ , so each higher type can generate everything the lower types can, plus more.
Type‑3 (regular) → finite automaton; Type‑2 (context‑free) → pushdown automaton; Type‑1 (context‑sensitive) → linear‑bounded automaton; Type‑0 (unrestricted) → Turing machine.
Example languages: $\{a^n b^n \mid n \ge 0\}$ is context‑free but not regular; $\{a^n b^n c^n \mid n \ge 1\}$ is context‑sensitive but not context‑free.
“More powerful” refers to expressive capacity (ability to generate a larger class of languages), not ease of parsing or practical use.

Syntax-Directed Translation: Infix to Prefix Notation

The module shows how a syntax‑directed translation scheme using only synthesized attributes can convert infix arithmetic expressions into prefix (Polish) notation while preserving operator precedence and left‑associativity.

Grammar: E → E + T | E - T | T; T → T * F | F; F → digit, enforcing precedence ( *  > + / - ).
Semantic actions compute a val string for each non‑terminal, concatenating the operator before its operand strings.
Example results: 9 - 5 + 2 → + - 9 5 2; 9 - 5 * 2 → - 9 * 5 2.
Synthesized (S‑attributed) attributes allow immediate bottom‑up evaluation during LR‑style parsing.
Left‑recursive rules enable left‑associativity; to use LL parsers the grammar must be transformed and inherited attributes introduced.

Research more with Coursify

Ambiguity in the Grammar (S \rightarrow ABA,; A \rightarrow aA \mid \epsilon,; B \rightarrow bB \mid \epsilon)

AI Summary

Footnotes

Ambiguous Grammar

Core Definition

Footnotes

Why this grammar is ambiguous

Footnotes

Proof That the Grammar Is Ambiguous

Footnotes

Footnotes

Footnotes

Two distinct parse trees for the string (a)

Footnotes

Common Mistake

Footnotes

Footnotes

Number of Possible Splits of the String (a^n)

Formal interpretation of the chart

Footnotes

Further Clarifications

Exam Strategy

Footnotes

Knowledge Check

Explore Related Topics