TOKENIZER: Related Papers in ACL Anthology
Back to Document Index
Back to Term Index
-
Concordance view (Keyword-In-Context) for the term tokenizer in the ACL ARC 2.0;
- Concordance view in the ACL ARC 1.0 (Sketch Engine Service).
TOKENIZER can be found in the following ACL ARC 1.0 documents (click to explore):
- (ACL ID: A00-1032) language independent morphological analysis
- (ACL ID: A00-1033) a divide-and-conquer strategy for shallow parsing of german free texts
- (ACL ID: A00-1045) improving testsuites via instrumentation
- (ACL ID: A00-2004) advances in domain independent linear text segmentation
- (ACL ID: A92-1018) a practical part-of-speech tagger
- (ACL ID: A92-1027) an efficient chart-based algorithm for partial-parsing of unrestricted texts
- (ACL ID: A94-1013) adaptive sentence boundary disambiguation
- (ACL ID: A94-1030) improving chinese tokenization with linguistic filters on statistical lexical acquisition
- (ACL ID: A97-1006) natural language dialogue service for appointment scheduling agents
- (ACL ID: A97-1031) an information extraction core system for real world german text processing
- (ACL ID: A97-2004) duke's trainable information and meaning extraction system (duke times)
- (ACL ID: C00-1072) the automated acquisition of topic signatures for text summarization
- (ACL ID: C00-2095) a formalism for universal segmentation of text
- (ACL ID: C02-1002) a cheap and fast way to build useful translation lexicons
- (ACL ID: C02-2005) scaled log likelihood ratios for the detection of abbreviations in text corpora
- (ACL ID: C04-1021) modern natural language interfaces to databases
- (ACL ID: C04-1037) optimizing disambiguation in swahili
- (ACL ID: C04-1122) named entity discovery using comparable news articles
- (ACL ID: C04-1192) fine-grained word sense disambiguation based on parallel corpora, word alignment, word clustering and aligned wordnets
- (ACL ID: C94-2108) content characterization using word shape tokens
- (ACL ID: C96-1072) learning to recognize names across languages
- (ACL ID: C96-1087) a probabilistic approach to compound noun indexing in korean texts
- (ACL ID: C96-2192) tagging spoken language using written language statistics
- (ACL ID: E06-2003) linguastream
- (ACL ID: E06-2024) a suite of shallow processing tools for portuguese
- (ACL ID: E99-1018) pos disambiguation and unknown word guessing with decision trees
- (ACL ID: H05-1048) detection of entity mentions occuring in english and chinese text
- (ACL ID: H93-1026) fastus
- (ACL ID: H93-1037) lingstat
- (ACL ID: H93-1061) a semantic concordance
- (ACL ID: H93-1087) lingstat
- (ACL ID: H94-1029) the automatic component of the lingstat machine-aided translation system
- (ACL ID: I05-5003) using machine translation evaluation techniques to determine sentence-level semantic equivalence
- (ACL ID: J03-3002) articles the web as a parallel corpus
- (ACL ID: J03-4004) disambiguating nouns, verbs, and adjectives using automatically acquired selectional preferences
- (ACL ID: J92-1002) an estimate of an upper bound for the entropy of english
- (ACL ID: J96-1001) translating collocations for bilingual lexicons
- (ACL ID: J97-2002) adaptive multilingual sentence boundary disambiguation
- (ACL ID: M91-1023) gte
- (ACL ID: M91-1031) synchronetics
- (ACL ID: M93-1018) sra
- (ACL ID: M93-1019) sri
- (ACL ID: M93-1024) description of the link system used for muc-5
- (ACL ID: M95-1008) knight-ridder information's value adding name finder
- (ACL ID: M95-1015) university of pennsylvania
- (ACL ID: M95-1018) sra
- (ACL ID: M95-1020) sterling software
- (ACL ID: M98-1008) american university in cairo
- (ACL ID: M98-1016) description of the kent ridge digital labs system used for muc-7
- (ACL ID: N04-1013) speed and accuracy in shallow and deep stochastic parsing
- (ACL ID: N04-2007) a preliminary look into the use of named entity information for bioscience text tokenization
- (ACL ID: N04-4038) automatic tagging of arabic text
- (ACL ID: N06-2043) illuminating trouble tickets with sublanguage theory
- (ACL ID: P01-1059) producing biographical summaries
- (ACL ID: P02-1056) an integrated archictecture for shallow and deep processing
- (ACL ID: P03-1051) language model based arabic word segmentation
- (ACL ID: P03-2019) integrating information extraction and automatic hyperlinking
- (ACL ID: P04-1010) data-driven strategies for an automated dialogue system
- (ACL ID: P04-1025) extracting regulatory gene expression networks from pubmed
- (ACL ID: P04-1057) error mining for wide-coverage grammar engineering
- (ACL ID: P04-3031) nltk
- (ACL ID: P05-1057) log-linear models for word alignment
- (ACL ID: P05-1064) a phonotactic language model for spoken language identification
- (ACL ID: P05-1076) automatic acquisition of adjectival subcategorization from corpora
- (ACL ID: P05-2005) exploiting named entity taggers in a second language
- (ACL ID: P05-3007) high throughput modularized nlp system for clinical text
- (ACL ID: P05-3017) supporting annotation layers for natural language processing
- (ACL ID: P06-1032) correcting esl errors using phrasal smt techniques
- (ACL ID: P06-4019) outilex, a linguistic platform for text processing
- (ACL ID: P93-1006) using bracketed parses to evaluate a grammar checking application
- (ACL ID: P95-1032) a pattern matching method for finding noun and proper noun translations from noisy parallel corpora
- (ACL ID: P96-1015) directed replacement
- (ACL ID: P96-1050) a synopsis of learning to recognize names across languages
- (ACL ID: P97-1039) a portable algorithm for mapping bitext correspondence
- (ACL ID: P98-1004) a simple hybrid aligner for generating lexical correspondences in parallel texts
- (ACL ID: P98-1030) terminology finite-state preprocessing for computational lfg
- (ACL ID: P98-1050) multext-east
- (ACL ID: P98-1066) a layered approach to nlp-based information retrieval
- (ACL ID: P98-1108) use of mutual information based character clusters in dictionary-less morphological analysis of japanese
- (ACL ID: W00-0506) pre-processing closed captions for machine translation
- (ACL ID: W01-0513) is knowledge-free induction of multiword unit dictionary headwords a solved problem?
- (ACL ID: W01-1411) towards a simple and accurate statistical approach to learning translation relationships among words
- (ACL ID: W01-1513) how to integrate linguistic information in files and generate feedback for grammar errors
- (ACL ID: W02-0101) teaching nlp/cl through games
- (ACL ID: W02-0109) nltk
- (ACL ID: W02-0301) tuning support vector machines for biomedical named entity recognition
- (ACL ID: W02-1210) efficient deep processing of japanese
- (ACL ID: W03-0504) summarization of noisy documents
- (ACL ID: W03-0801) the talent system
- (ACL ID: W03-0810) accelerating corporate research in the development, application, and deployment of human language technologies
- (ACL ID: W03-0812) sdl---a description language for building nlp systems
- (ACL ID: W03-1002) statistical machine translation using coercive two-level syntactic transduction
- (ACL ID: W03-1606) normalization and paraphrasing using symbolic methods
- (ACL ID: W04-0409) integrating morphology with multi-word expression processing in turkish
- (ACL ID: W04-0508) answering questions in the genomics domain
- (ACL ID: W04-0808) an evaluation exercise for romanian word sense disambiguation
- (ACL ID: W04-0833) simple features for statistical word sense disambiguation
- (ACL ID: W04-1613) letter-to-sound conversion for urdu text-to-speech system
- (ACL ID: W04-1615) farsisum - a persian text summarizer
- (ACL ID: W04-2213) building parallel corpora for econtent professionals
- (ACL ID: W04-2710) annotating wordnet
- (ACL ID: W04-3111) integrated annotation for biomedical information extraction
- (ACL ID: W04-3234) trained named entity recognition using distributional clusters
- (ACL ID: W05-0101) teaching applied natural language processing
- (ACL ID: W05-0405) feature-based segmentation of narrative documents
- (ACL ID: W05-0603) search engine statistics beyond the n-gram
- (ACL ID: W06-0302) toward opinion summarization
- (ACL ID: W06-0508) a hybrid approach for extracting semantic relations from texts
- (ACL ID: W06-1206) automated multiword expression prediction for grammar engineering
- (ACL ID: W06-1601) unsupervised discovery of a statistical verb lexicon
- (ACL ID: W06-1640) partially supervised coreference resolution for opinion summarization through structured rule learning
- (ACL ID: W06-1658) entity annotation based on inverse index operations
- (ACL ID: W06-2714) middleware for creating and combining multi-dimensional nlp markup
- (ACL ID: W06-2718) a standoff annotation interface between delph-in components
- (ACL ID: W06-3103) morpho-syntactic arabic preprocessing for arabic to english statistical machine translation
- (ACL ID: W06-3114) manual and automatic evaluation of machine translation between european languages
- (ACL ID: W06-3317) using dependency parsing and probabilistic inference to extract relationships between genes, proteins and malignancies implicit among multiple biomedical research abstracts
- (ACL ID: W06-3328) bootstrapping and evaluating named entity recognition in the biomedical domain
- (ACL ID: W06-3604) all-word prediction as the ultimate confusible disambiguation
- (ACL ID: W93-0104) internal and external evidence in the identification and semantic categorization of proper names
- (ACL ID: W95-0114) compiling bilingual lexicon entries from a non-parallel english-chinese corpus
- (ACL ID: W96-0109) exploiting text structure for topic identification
- (ACL ID: W97-0110) corpus based statistical generalization tree in rule optimization
- (ACL ID: W97-0113) data reliability and its effects on automatic abstracting
- (ACL ID: W97-0304) text segmentation using exponential models
- (ACL ID: W97-0809) the use of lexical semantics in information extraction
- (ACL ID: W97-1008) what makes a word
- (ACL ID: W97-1508) lexical resource reconciliation in the xerox linguistic environment
- (ACL ID: W98-0211) how to build a (quite general) linguistic diagram editor
- (ACL ID: W98-1001) discovering lexical information by tagging arabic newspaper text
- (ACL ID: W98-1002) tagarab
- (ACL ID: W98-1118) exploiting diverse knowledge sources via maximum entropy in named entity recognition
- (ACL ID: W98-1125) discourse parsing
- (ACL ID: W98-1208) implementing a sense tagger in a general architecture for text engineering
- (ACL ID: W99-0605) cross-language information retrieval for technical documents
- (ACL ID: X96-1058) met name recognition with japanese fastus
- (ACL ID: X98-1010) coreference resolution strategies from an application perspective
- (ACL ID: X98-1017) the smart/empire tipster ir system
* See also a list of some of the related terms to tokenizer.
Back to Description Index