TOKENIZER: Related Papers in ACL Anthology

Back to Document Index
Back to Term Index

Concordance view (Keyword-In-Context) for the term tokenizer in the ACL ARC 2.0;
Concordance view in the ACL ARC 1.0 (Sketch Engine Service).

TOKENIZER can be found in the following ACL ARC 1.0 documents (click to explore):

(ACL ID: A00-1032) language independent morphological analysis
(ACL ID: A00-1033) a divide-and-conquer strategy for shallow parsing of german free texts
(ACL ID: A00-1045) improving testsuites via instrumentation
(ACL ID: A00-2004) advances in domain independent linear text segmentation
(ACL ID: A92-1018) a practical part-of-speech tagger
(ACL ID: A92-1027) an efficient chart-based algorithm for partial-parsing of unrestricted texts
(ACL ID: A94-1013) adaptive sentence boundary disambiguation
(ACL ID: A94-1030) improving chinese tokenization with linguistic filters on statistical lexical acquisition
(ACL ID: A97-1006) natural language dialogue service for appointment scheduling agents
(ACL ID: A97-1031) an information extraction core system for real world german text processing
(ACL ID: A97-2004) duke's trainable information and meaning extraction system (duke times)
(ACL ID: C00-1072) the automated acquisition of topic signatures for text summarization
(ACL ID: C00-2095) a formalism for universal segmentation of text
(ACL ID: C02-1002) a cheap and fast way to build useful translation lexicons
(ACL ID: C02-2005) scaled log likelihood ratios for the detection of abbreviations in text corpora
(ACL ID: C04-1021) modern natural language interfaces to databases
(ACL ID: C04-1037) optimizing disambiguation in swahili
(ACL ID: C04-1122) named entity discovery using comparable news articles
(ACL ID: C04-1192) fine-grained word sense disambiguation based on parallel corpora, word alignment, word clustering and aligned wordnets
(ACL ID: C94-2108) content characterization using word shape tokens
(ACL ID: C96-1072) learning to recognize names across languages
(ACL ID: C96-1087) a probabilistic approach to compound noun indexing in korean texts
(ACL ID: C96-2192) tagging spoken language using written language statistics
(ACL ID: E06-2003) linguastream
(ACL ID: E06-2024) a suite of shallow processing tools for portuguese
(ACL ID: E99-1018) pos disambiguation and unknown word guessing with decision trees
(ACL ID: H05-1048) detection of entity mentions occuring in english and chinese text
(ACL ID: H93-1026) fastus
(ACL ID: H93-1037) lingstat
(ACL ID: H93-1061) a semantic concordance
(ACL ID: H93-1087) lingstat
(ACL ID: H94-1029) the automatic component of the lingstat machine-aided translation system
(ACL ID: I05-5003) using machine translation evaluation techniques to determine sentence-level semantic equivalence
(ACL ID: J03-3002) articles the web as a parallel corpus
(ACL ID: J03-4004) disambiguating nouns, verbs, and adjectives using automatically acquired selectional preferences
(ACL ID: J92-1002) an estimate of an upper bound for the entropy of english
(ACL ID: J96-1001) translating collocations for bilingual lexicons
(ACL ID: J97-2002) adaptive multilingual sentence boundary disambiguation
(ACL ID: M91-1023) gte
(ACL ID: M91-1031) synchronetics
(ACL ID: M93-1018) sra
(ACL ID: M93-1019) sri
(ACL ID: M93-1024) description of the link system used for muc-5
(ACL ID: M95-1008) knight-ridder information's value adding name finder
(ACL ID: M95-1015) university of pennsylvania
(ACL ID: M95-1018) sra
(ACL ID: M95-1020) sterling software
(ACL ID: M98-1008) american university in cairo
(ACL ID: M98-1016) description of the kent ridge digital labs system used for muc-7
(ACL ID: N04-1013) speed and accuracy in shallow and deep stochastic parsing
(ACL ID: N04-2007) a preliminary look into the use of named entity information for bioscience text tokenization
(ACL ID: N04-4038) automatic tagging of arabic text
(ACL ID: N06-2043) illuminating trouble tickets with sublanguage theory
(ACL ID: P01-1059) producing biographical summaries
(ACL ID: P02-1056) an integrated archictecture for shallow and deep processing
(ACL ID: P03-1051) language model based arabic word segmentation
(ACL ID: P03-2019) integrating information extraction and automatic hyperlinking
(ACL ID: P04-1010) data-driven strategies for an automated dialogue system
(ACL ID: P04-1025) extracting regulatory gene expression networks from pubmed
(ACL ID: P04-1057) error mining for wide-coverage grammar engineering
(ACL ID: P04-3031) nltk
(ACL ID: P05-1057) log-linear models for word alignment
(ACL ID: P05-1064) a phonotactic language model for spoken language identification
(ACL ID: P05-1076) automatic acquisition of adjectival subcategorization from corpora
(ACL ID: P05-2005) exploiting named entity taggers in a second language
(ACL ID: P05-3007) high throughput modularized nlp system for clinical text
(ACL ID: P05-3017) supporting annotation layers for natural language processing
(ACL ID: P06-1032) correcting esl errors using phrasal smt techniques
(ACL ID: P06-4019) outilex, a linguistic platform for text processing
(ACL ID: P93-1006) using bracketed parses to evaluate a grammar checking application
(ACL ID: P95-1032) a pattern matching method for finding noun and proper noun translations from noisy parallel corpora
(ACL ID: P96-1015) directed replacement
(ACL ID: P96-1050) a synopsis of learning to recognize names across languages
(ACL ID: P97-1039) a portable algorithm for mapping bitext correspondence
(ACL ID: P98-1004) a simple hybrid aligner for generating lexical correspondences in parallel texts
(ACL ID: P98-1030) terminology finite-state preprocessing for computational lfg
(ACL ID: P98-1050) multext-east
(ACL ID: P98-1066) a layered approach to nlp-based information retrieval
(ACL ID: P98-1108) use of mutual information based character clusters in dictionary-less morphological analysis of japanese
(ACL ID: W00-0506) pre-processing closed captions for machine translation
(ACL ID: W01-0513) is knowledge-free induction of multiword unit dictionary headwords a solved problem?
(ACL ID: W01-1411) towards a simple and accurate statistical approach to learning translation relationships among words
(ACL ID: W01-1513) how to integrate linguistic information in files and generate feedback for grammar errors
(ACL ID: W02-0101) teaching nlp/cl through games
(ACL ID: W02-0109) nltk
(ACL ID: W02-0301) tuning support vector machines for biomedical named entity recognition
(ACL ID: W02-1210) efficient deep processing of japanese
(ACL ID: W03-0504) summarization of noisy documents
(ACL ID: W03-0801) the talent system
(ACL ID: W03-0810) accelerating corporate research in the development, application, and deployment of human language technologies
(ACL ID: W03-0812) sdl---a description language for building nlp systems
(ACL ID: W03-1002) statistical machine translation using coercive two-level syntactic transduction
(ACL ID: W03-1606) normalization and paraphrasing using symbolic methods
(ACL ID: W04-0409) integrating morphology with multi-word expression processing in turkish
(ACL ID: W04-0508) answering questions in the genomics domain
(ACL ID: W04-0808) an evaluation exercise for romanian word sense disambiguation
(ACL ID: W04-0833) simple features for statistical word sense disambiguation
(ACL ID: W04-1613) letter-to-sound conversion for urdu text-to-speech system
(ACL ID: W04-1615) farsisum - a persian text summarizer
(ACL ID: W04-2213) building parallel corpora for econtent professionals
(ACL ID: W04-2710) annotating wordnet
(ACL ID: W04-3111) integrated annotation for biomedical information extraction
(ACL ID: W04-3234) trained named entity recognition using distributional clusters
(ACL ID: W05-0101) teaching applied natural language processing
(ACL ID: W05-0405) feature-based segmentation of narrative documents
(ACL ID: W05-0603) search engine statistics beyond the n-gram
(ACL ID: W06-0302) toward opinion summarization
(ACL ID: W06-0508) a hybrid approach for extracting semantic relations from texts
(ACL ID: W06-1206) automated multiword expression prediction for grammar engineering
(ACL ID: W06-1601) unsupervised discovery of a statistical verb lexicon
(ACL ID: W06-1640) partially supervised coreference resolution for opinion summarization through structured rule learning
(ACL ID: W06-1658) entity annotation based on inverse index operations
(ACL ID: W06-2714) middleware for creating and combining multi-dimensional nlp markup
(ACL ID: W06-2718) a standoff annotation interface between delph-in components
(ACL ID: W06-3103) morpho-syntactic arabic preprocessing for arabic to english statistical machine translation
(ACL ID: W06-3114) manual and automatic evaluation of machine translation between european languages
(ACL ID: W06-3317) using dependency parsing and probabilistic inference to extract relationships between genes, proteins and malignancies implicit among multiple biomedical research abstracts
(ACL ID: W06-3328) bootstrapping and evaluating named entity recognition in the biomedical domain
(ACL ID: W06-3604) all-word prediction as the ultimate confusible disambiguation
(ACL ID: W93-0104) internal and external evidence in the identification and semantic categorization of proper names
(ACL ID: W95-0114) compiling bilingual lexicon entries from a non-parallel english-chinese corpus
(ACL ID: W96-0109) exploiting text structure for topic identification
(ACL ID: W97-0110) corpus based statistical generalization tree in rule optimization
(ACL ID: W97-0113) data reliability and its effects on automatic abstracting
(ACL ID: W97-0304) text segmentation using exponential models
(ACL ID: W97-0809) the use of lexical semantics in information extraction
(ACL ID: W97-1008) what makes a word
(ACL ID: W97-1508) lexical resource reconciliation in the xerox linguistic environment
(ACL ID: W98-0211) how to build a (quite general) linguistic diagram editor
(ACL ID: W98-1001) discovering lexical information by tagging arabic newspaper text
(ACL ID: W98-1002) tagarab
(ACL ID: W98-1118) exploiting diverse knowledge sources via maximum entropy in named entity recognition
(ACL ID: W98-1125) discourse parsing
(ACL ID: W98-1208) implementing a sense tagger in a general architecture for text engineering
(ACL ID: W99-0605) cross-language information retrieval for technical documents
(ACL ID: X96-1058) met name recognition with japanese fastus
(ACL ID: X98-1010) coreference resolution strategies from an application perspective
(ACL ID: X98-1017) the smart/empire tipster ir system

* See also a list of some of the related terms to tokenizer.

Back to Description Index