Grammars for Natural Language



Grammars for Natural Language

In Natural Language Processing, grammars define the rules that explain how words combine to form valid sentences in natural language.

Natural language grammars must handle:

  • complex sentence structures
  • ambiguity
  • auxiliary verbs
  • question formation
  • movement of words
  • uncertainty in interpretation

Unlike programming languages, natural language grammar is flexible and ambiguous.

Auxiliary Verbs and Verb Phrases

Auxiliary Verbs

Auxiliary verbs (also called helping verbs) assist the main verb to express:

  • tense
  • voice
  • mood
  • aspect

Common Auxiliary Verbs

TypeExamplesFunction
Primary Auxiliariesbe, have, dotense and voice
Modal Auxiliariescan, may, must, willpossibility, permission

Structure of Verb Phrase (VP)

A Verb Phrase (VP) contains:

  • auxiliary verbs
  • main verb
  • sometimes objects or complements

Verb Phrase Rule (Grammar)

RuleMeaning
VP → Aux + Vauxiliary + main verb
VP → V + NPverb + noun phrase
VP → Aux + V + NPfull verb phrase

Example Sentence

Sentence: “She has been reading a book.”

Verb phrase structure:

ComponentRole
hasauxiliary
beenauxiliary
readingmain verb
bookobject

Movement Phenomenon in Language

Movement phenomenon refers to rearranging words from their original position in a sentence.

This concept comes from Transformational Grammar.

Movement is common in:

  • questions
  • passive sentences
  • relative clauses

Example

Statement: “You are reading the book.”

Question: “What are you reading?”

Movement occurs:

Original PositionMoved Position
object (what)beginning of sentence

Types of Movement

TypeExample
WH-movementWhat did you buy?
Auxiliary movementAre you ready?
Passive movementThe book was written by Ram

Movement makes grammar more complex for parsers.

Handling Questions in Context-Free Grammars

Context-Free Grammar (CFG)

CFG is a grammar system where each rule has the form:

A → B C

where A is a non-terminal symbol.

CFG is widely used in Computational Linguistics.

Grammar Rules for Questions

To generate questions, additional rules are required.

RuleDescription
S → Aux NP VPquestion structure
S → WH Aux NP VPWH question
WH → what, where, whoquestion words

Example

Sentence: “You are reading a book.”

CFG transformation:

StepResult
statementYou are reading a book
auxiliary movementAre you reading a book?

WH question:

“What are you reading?”

CFG rule applied:

S → WH Aux NP VP

Human Preferences in Parsing

Humans often prefer simpler interpretations of sentences.

Natural language sentences may have multiple parse trees.

Example:

Sentence: “I saw the man with the telescope.”

Two meanings:

InterpretationMeaning
I used telescopeinstrument interpretation
man had telescopemodifier interpretation

Humans usually choose the most natural interpretation first.

Parsing Preference Strategies

StrategyMeaning
Minimal Attachmentchoose simplest structure
Late Closureattach words to current phrase
Right Associationprefer recent phrase

These strategies help reduce ambiguity.

Encoding Uncertainty

Natural language contains uncertainty and ambiguity.

To manage this, NLP uses probabilistic models.

Example: Probabilistic Context-Free Grammar (PCFG)

Example Grammar with Probabilities

RuleProbability
S → NP VP0.9
VP → V NP0.6
VP → V0.4

Parser selects the most probable parse tree.

Advantages

AdvantageExplanation
Handles ambiguitymultiple interpretations
Statistical decisionprobability based
Better accuracyrealistic language modeling

Deterministic Parser

A Deterministic Parser processes a sentence step by step without backtracking.

It always chooses one parsing action at each step.

Example Parsing Process

Sentence: “The boy eats apples.”

Parser actions:

StepAction
1read “The”
2combine Det + N → NP
3read verb
4form VP
5NP + VP → S

Characteristics

FeatureExplanation
No backtrackingfaster parsing
Efficientsuitable for real-time systems
Uses rulesgrammar-based decisions

Summary Table

TopicKey Idea
Auxiliary verbshelping verbs in verb phrases
Verb phrasecombination of auxiliary + main verb
Movement phenomenonwords move position in questions
CFG for questionsgrammar rules for WH questions
Human parsing preferencesimplest interpretation
Encoding uncertaintyprobabilistic grammar
Deterministic parserparsing without backtracking

Short Answer 

Grammars for natural language describe how sentences are formed using grammatical rules. Auxiliary verbs and verb phrases help express tense, aspect, and modality in sentences. Movement phenomenon refers to rearranging words in a sentence, especially during question formation or passive construction. Context-Free Grammars can handle questions by introducing special rules for auxiliary verbs and WH words. Humans naturally prefer simpler sentence structures during parsing, which is explained through strategies such as minimal attachment and late closure. Because natural language contains ambiguity, uncertainty is handled using probabilistic models like PCFG. Deterministic parsers process sentences sequentially without backtracking, making them efficient for real-time natural language processing systems.