ACL RD-TEC 1.0 Summarization of W02-1201
Paper Title:
A STUDY IN URDU CORPUS CONSTRUCTION
Authors: Dara Becker and Kashif Riaz
Primarily assigned technology terms:
- broadcasting
- character recognition
- corpus building
- corpus construction
- encoding
- hyphenation
- interfaces
- language engineering
- language processing
- learning
- learning algorithms
- machine learning
- machine learning algorithms
- natural language processing
- nlp
- optical character recognition
- processing
- processor
- recognition
- recognition technology
- standardization
- tagging
Other assigned terms:
- case
- community
- corpora
- corpus encoding standard
- document
- document type definition
- foreign word
- implementation
- language data
- language processing research
- mapping
- mappings
- meaning
- measure
- metadata
- method
- methodology
- natural language
- paragraph
- parts of speech
- persian
- process
- semantic
- semantic value
- source text
- surface form
- tags
- technology
- term
- text
- unicode character set
- user
- web site
- word
- word order
- writing system
- xml document
- xml format