Morphology Syntax Prosody Semantics
The Urdu grammar is based on Lexical Functional Grammar, which
allows for a modular architecture. Morphology, syntax, semantics and
prosody are therefore encoded at independent levels providing the
necessary flexibility.
Each level of analysis uses a different type of representation,
depending on how the content is best represented. All of these levels interact with each other, providing a broad grammatical
coverage of each string entered. Furthermore, the grammar can be used
for both: parsing and generation.
Morphology (xfst)
|
Syntax (XLE),
c- and f-structure |
→
|
Prosody
p-structure |
Semantics (xfr)
|
The morphology is based on finite state machines, mainly the finite
state morphology tools described (and developed) by Beesley and
Karttunen (2003). Using this technology, the grammar can deal with the
full range of inflectional and derivational morphology in Urdu,
including difficult phenomena such as reduplication.
The morphological analyzer provides each string with tags, e.g.
The syntax component is at the core of the Urdu grammar. Its theoretical background is Lexical Functional grammar, which believes in a modular architecture. It runs on a platform named XLE, which has been developed by Xerox/PARC. The syntax depends on two pillars: first on c-structure, which forms the basic constituent "tree" and linear precedence; the second main pillar is the f-structure, which encodes grammatical relations and functional information.