WORD SEGMENTATION: Related Papers in ACL Anthology
Back to Document Index
Back to Term Index
-
Concordance view (Keyword-In-Context) for the term word segmentation in the ACL ARC 2.0;
- Concordance view in the ACL ARC 1.0 (Sketch Engine Service).
WORD SEGMENTATION can be found in the following ACL ARC 1.0 documents (click to explore):
- (ACL ID: A00-2032) mostly-unsupervised statistical segmentation of japanese
- (ACL ID: A92-1020) a corpus-based statistical approach to automatic book indexing
- (ACL ID: A97-1018) cseg&tagl.0
- (ACL ID: A97-1034) using sgml as a basis for data-intensive nlp
- (ACL ID: A97-1049) an intelligent multilingual information browsing and retrieval system using information extraction
- (ACL ID: C00-1024) a muitilingual news summarizer
- (ACL ID: C00-1026) automatic semantic classification for chinese unknown compound nouns
- (ACL ID: C00-1084) automatic semantic sequence extraction from unrestricted non-tagged texts
- (ACL ID: C00-2095) a formalism for universal segmentation of text
- (ACL ID: C00-2116) automatic corpus-based thai word extraction with the c4.5 learning algorithm
- (ACL ID: C00-2119) using a broad-coverage parser for word-breaking in japanese
- (ACL ID: C02-1056) paraphrasing of chinese utterances
- (ACL ID: C02-1078) a probabilistic method for analyzing japanese anaphora integrating zero pronoun detection and resolution
- (ACL ID: C02-1089) applying an nvef word-pair identifier to the chinese syllable-to-word conversion problem
- (ACL ID: C02-1101) detecting errors in corpora using support vector machines
- (ACL ID: C02-1140) bringing the dictionary to the user
- (ACL ID: C02-1143) simple features for chinese word sense disambiguation
- (ACL ID: C02-1145) building a large-scale annotated chinese corpus
- (ACL ID: C02-1148) investigating the relationship between word segmentation performance and retrieval performance in chinese ir
- (ACL ID: C02-1162) identifying concepts across languages
- (ACL ID: C02-1163) machine translation by interaction between paraphraser and transfer
- (ACL ID: C02-2019) morphological analysis of the spontaneous speech corpus
- (ACL ID: C04-1066) japanese unknown word identification by character-based chunking
- (ACL ID: C04-1067) chinese and japanese word segmentation using word-level and character-level information
- (ACL ID: C04-1081) chinese segmentation and new word detection using conditional random fields
- (ACL ID: C04-1098) a trigger language model-based ir system
- (ACL ID: C04-1132) learning a robust word sense disambiguation model using hypernyms in definition sentences
- (ACL ID: C04-1145) morpheme-based derivation of bipolar semantic orientation of chinese words
- (ACL ID: C04-1152) efficient unsupervised recursive word segmentation using minimum description length
- (ACL ID: C04-1175) combining prediction by partial matching and logistic regression for thai word segmentation
- (ACL ID: C88-2135) a computer readability formula of japanese texts for machine scoring
- (ACL ID: C90-1012) the generalized lr parser/compiler v8-4
- (ACL ID: C92-4173) tokenization as the initial phase in nlp
- (ACL ID: C94-1009) building an mt dictionary from parallel texts based on linguistic and statistical information
- (ACL ID: C94-1032) a stochastic japanese morphological analyzer using a forward-dp backward-a* n-best search algorithm
- (ACL ID: C94-1091) classifier assignment by corpus-based approach
- (ACL ID: C94-1093) restructuring tagged corpora with morpheme adjustment rules
- (ACL ID: C94-1096) an ibm-pc environment for chinese corpus analysis
- (ACL ID: C94-2153) an efficient syntactic tagging tool for corpora
- (ACL ID: C94-2198) word class discovery for postprocessing chinese handwriting recognition
- (ACL ID: C94-2209) blending segmentation with tagging in chinese language corpus processing
- (ACL ID: C96-1031) gramcheck
- (ACL ID: C96-1035) chinese word segmentation based on maximum matching and word binding force
- (ACL ID: C96-1039) identification and classification of proper nouns in chinese texts
- (ACL ID: C96-1089) learning bilingual collocations by word-level sorting
- (ACL ID: C96-2104) a portable & quick japanese parser
- (ACL ID: C96-2136) context-based spelling correction for japanese ocr
- (ACL ID: C96-2184) segmentation standard for chinese natural language processing
- (ACL ID: C96-2194) a gradual refinement model for a robust thai morphological analyzer
- (ACL ID: C96-2208) the automatic extraction of open compounds from text corpora
- (ACL ID: E89-1020) it would be much easier if went were goed
- (ACL ID: H01-1021) evaluating question-answering techniques in chinese
- (ACL ID: H01-1035) inducing multilingual text analysis tools via robust projection across aligned corpora
- (ACL ID: H01-1057) non-dictionary-based thai word segmentation using decision trees
- (ACL ID: H01-1071) towards automatic sign translation
- (ACL ID: H93-1044) session 8
- (ACL ID: H93-1045) example-based correction of word segmentation and part of speech labelling
- (ACL ID: H93-1049) hypothesizing word association from untagged text
- (ACL ID: H94-1045) session 8 &
- (ACL ID: H94-1054) japanese word segmentation by hidden markov model
- (ACL ID: I05-2003) a hybrid chinese language model based on a combination of ontology with statistical method
- (ACL ID: I05-2010) applying a mix word-pair identifier to the chinese syllable-to-word conversion problem
- (ACL ID: I05-2014) bleu in characters
- (ACL ID: I05-2015) building an annotated japanese-chinese parallel corpus �c a part of nict multilingual corpora
- (ACL ID: I05-2039) the influence of data homogeneity on nlp system performance
- (ACL ID: I05-3002) using word-pair identifier to improve chinese input system
- (ACL ID: I05-3010) turn-taking in mandarin dialogue
- (ACL ID: I05-3017) the second international chinese word segmentation bakeoff
- (ACL ID: I05-3018) combination of machine learning methods for optimum chinese word segmentation
- (ACL ID: I05-3019) unigram language model for chinese word segmentation
- (ACL ID: I05-3020) report to bmm-based chinese word segmentor with context-based unknown word identifier for the second international chinese word segmentation bakeoff
- (ACL ID: I05-3022) chinese word segmentation in ftrd beijing
- (ACL ID: I05-3023) perceptron learning for chinese word segmentation
- (ACL ID: I05-3025) a maximum entropy approach to chinese word segmentation
- (ACL ID: I05-3026) description of the hku chinese word segmentation system for sighan bakeoff 2005
- (ACL ID: I05-3028) chinese word segmentation with multiple postprocessors in hit-irlab
- (ACL ID: I05-3029) maximal match chinese segmentation augmented by resources generated from a very large dictionary for post-processing
- (ACL ID: I05-3031) two-phase lmr-rc tagging for chinese word segmentation
- (ACL ID: I05-3033) towards a hybrid model for chinese word segmentation
- (ACL ID: J00-3004) a compression-based algorithm for chinese word segmentation
- (ACL ID: J01-1001) using suffix arrays to compute term frequency and document frequency for all substrings in a corpus
- (ACL ID: J01-2001) unsupervised learning of the morphology of a natural language
- (ACL ID: J05-4005) chinese word segmentation and named entity recognition
- (ACL ID: J96-3004) a stochastic finite-state word-segmentation algorithm for chinese
- (ACL ID: J96-4004) a statistically emergent approach for language processing
- (ACL ID: M93-1010) bbn
- (ACL ID: M93-1028) report from the text analysis techniques topic session
- (ACL ID: M98-1016) description of the kent ridge digital labs system used for muc-7
- (ACL ID: N01-1024) knowledge-free induction of inflectional morphologies
- (ACL ID: N03-1018) a generative probabilistic ocr model for nlp applications
- (ACL ID: N03-1025) language and task independent text categorization with simple language models
- (ACL ID: N03-2035) a context-sensitive homograph disambiguation in thai text-to-speech synthesis
- (ACL ID: N04-2008) greek word segmentation using minimal information
- (ACL ID: N04-4010) using n-best lists for named entity recognition from chinese speech
- (ACL ID: N04-4015) morphological analysis for statistical machine translation
- (ACL ID: N06-2021) initial study on automatic identification of speaker role in broadcast news speech
- (ACL ID: P01-1053) automatic detection of syllable boundaries combining the advantages of treebank and bracketed corpora training
- (ACL ID: P02-1064) an empirical study of active learning with support vector machines forjapanese word segmentation
- (ACL ID: P03-1004) fast methods for kernel-based text analysis
- (ACL ID: P03-1051) language model based arabic word segmentation
- (ACL ID: P03-1061) morphological analysis of a large spontaneous speech corpus in japanese
- (ACL ID: P03-1066) unsupervised learning of dependency structure for language modeling
- (ACL ID: P05-1034) dependency treelet translation
- (ACL ID: P06-1027) semi-supervised conditional random fields for improved sequence segmentation and labeling
- (ACL ID: P06-1060) factorizing complex models
- (ACL ID: P06-1069) a comparison and semi-quantitative analysis of words and character-bigrams as features in chinese text categorization
- (ACL ID: P06-1077) tree-to-string alignment template for statistical machine translation
- (ACL ID: P06-1078) incorporating speech recognition confidence into discriminative named entity recognition of speech data
- (ACL ID: P06-1085) contextual dependencies in unsupervised word segmentation
- (ACL ID: P06-1090) a clustered global phrase reordering model for statistical machine translation
- (ACL ID: P06-1126) discriminative pruning of language models for chinese word segmentation
- (ACL ID: P06-2026) chinese-english term translation mining based on semantic prediction
- (ACL ID: P06-2045) a collaborative framework for collecting thai unknown words from the web
- (ACL ID: P06-2056) unsupervised segmentation of chinese text by use of branching entropy
- (ACL ID: P06-2099) compiling a lexicon of cooking actions for animation generation
- (ACL ID: P06-2108) using word support model to improve chinese input system
- (ACL ID: P06-2121) hal-based cascaded model for variable-length semantic pattern induction from psychiatry web resources
- (ACL ID: P06-2125) an hmm-based approach to automatic phrasing for mandarin text-to-speech synthesis
- (ACL ID: P06-3008) discursive usage of six chinese punctuation marks
- (ACL ID: P06-4010) chinese named entity and relation identification system
- (ACL ID: P86-1024) a sentence analysis method for a japanese book reading machine for the blind
- (ACL ID: P93-1046) integrating word boundary identification with sentence understanding
- (ACL ID: P94-1010) a stochastic finite-state word-segmentation algorithm for chinese
- (ACL ID: P96-1018) high-performance bilingual text alignment using statistical and dictionary information
- (ACL ID: P97-1041) a trainable rule-based algorithm for word segmentation
- (ACL ID: P98-1031) named entity scoring for speech input
- (ACL ID: P98-2138) combining trigram and winnow in thai ocr error correction
- (ACL ID: P98-2152) japanese ocr error correction using character shape similarity and statistical language model
- (ACL ID: P98-2206) chinese word segmentation without using lexicon and hand-crafted training data
- (ACL ID: P99-1036) a part of speech estimation method for japanese unknown words using a statistical model of morphology and context
- (ACL ID: W00-0504) mandarin-english information (mei)
- (ACL ID: W00-0703) pronunciation by analogy in normal and impaired readers
- (ACL ID: W00-0712) knowledge-free induction of morphology using latent semantic analysis
- (ACL ID: W00-0803) chinese-japanese cross language information retrieval
- (ACL ID: W00-0903) comparing corpora and lexical ambiguity
- (ACL ID: W00-1203) knowledge extraction for identification of chinese organization names
- (ACL ID: W00-1205) sinica treebank
- (ACL ID: W00-1206) enhancement of a chinese discourse marker tagger with c4.5
- (ACL ID: W00-1207) statistically-enhanced new word identification in a rule-based chinese system
- (ACL ID: W00-1212) a block-based robust dependency parser for unrestricted chinese text
- (ACL ID: W00-1214) machine learning methods for chinese web page categorization
- (ACL ID: W00-1314) word alignment of english-chinese bilingual corpus based on chucks
- (ACL ID: W01-1412) a comparative study on translation units for bilingual lexicon extraction
- (ACL ID: W01-1623) toward a large spontaneous mandarin dialogue corpus
- (ACL ID: W02-1206) lexicon-based orthographic disambiguation in cjk intelligent information retrieval
- (ACL ID: W02-1207) a state of the art of thai language resources and thai language behavior analysis and modeling
- (ACL ID: W02-1210) efficient deep processing of japanese
- (ACL ID: W02-1211) constructing of a large-scale chinese-english parallel corpus
- (ACL ID: W02-1607) building a training corpus for word sense disambiguation in english-to-vietnamese machine translation
- (ACL ID: W02-1800) coling-02
- (ACL ID: W02-1806) pcfg parsing for restricted classical chinese texts
- (ACL ID: W02-1812) a word segmentation method with dynamic adapting to text using inductive learning
- (ACL ID: W02-1813) using the segmentation corpus to define an inventory of concatenative units for cantonese speech synthesis
- (ACL ID: W02-1814) extracting pronunciation-translated names from chinese texts using bootstrapping approach
- (ACL ID: W02-1815) combining classifiers for chinese word segmentation
- (ACL ID: W02-1817) automatic recognition of chinese unknown words based on roles tagging
- (ACL ID: W02-1819) learning rules for chinese prosodic phrase prediction
- (ACL ID: W03-0314) learning sequence-to-sequence correspondences from parallel corpora via sequential pattern mining
- (ACL ID: W03-0430) early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons
- (ACL ID: W03-0433) a stacked, voted, stacked model for named entity recognition
- (ACL ID: W03-1025) a maximum entropy chinese character-based parser
- (ACL ID: W03-1026) howtogetachinesename(entity)
- (ACL ID: W03-1102) a practical text summarizer by paragraph extraction for thai
- (ACL ID: W03-1106) text classification in asian languages without word segmentation
- (ACL ID: W03-1118) text categorization using automatically acquired domain ontology
- (ACL ID: W03-1204) evaluation of features for sentence extraction on different types of corpora
- (ACL ID: W03-1309) protein name tagging for biomedical annotation in text
- (ACL ID: W03-1506) multi-language named-entity recognition system based on hmm
- (ACL ID: W03-1701) unsupervised training for overlapping ambiguity resolution in chinese word segmentation
- (ACL ID: W03-1705) a bottom-up merging algorithm for chinese unknown word extraction
- (ACL ID: W03-1708) chiners
- (ACL ID: W03-1710) modeling of long distance context dependency in chinese
- (ACL ID: W03-1711) a chinese efficient analyser integrating word segmentation, part-of-speech tagging, partial parsing and full parsing
- (ACL ID: W03-1718) single character chinese named entity recognition
- (ACL ID: W03-1719) the first international chinese word segmentation bakeoff
- (ACL ID: W03-1721) chinese word segmentation using minimal linguistic knowledge
- (ACL ID: W03-1722) chinese word segmentation at peking university
- (ACL ID: W03-1723) a two-stage statistical word segmentation system for chinese
- (ACL ID: W03-1724) integrating ngram model and case-based learning for chinese word segmentation
- (ACL ID: W03-1725) a unicode based adaptive segmentor
- (ACL ID: W03-1727) chinese word segmentation in msr-nlp
- (ACL ID: W03-1728) chinese word segmentation as lmr tagging
- (ACL ID: W03-1729) systran's chinese word segmentation
- (ACL ID: W03-1731) chunking-based chinese word tokenization
- (ACL ID: W04-0110) segment predictability as a cue in word segmentation
- (ACL ID: W04-0703) event clustering on streaming news using co-reference chains and event words
- (ACL ID: W04-0705) applying coreference to improve name recognition
- (ACL ID: W04-1018) chinese text summarization based on thematic area detection
- (ACL ID: W04-1100) proceedings of the third sighan workshop on chinese language processing
- (ACL ID: W04-1105) an enhanced model for chinese word segmentation and part-of-speech tagging
- (ACL ID: W04-1107) chinese chunking with another type of spec
- (ACL ID: W04-1110) automatic alignment and extraction of bilingual domain ontology for medical domain web search
- (ACL ID: W04-1112) chinese term extraction from web pages based on compound term productivity
- (ACL ID: W04-1114) the construction of a chinese shallow treebank
- (ACL ID: W04-1118) do we need chinese word segmentation for statistical machine translation?
- (ACL ID: W04-1119) a semi-supervised approach to build annotated corpus for chinese named entity recognition
- (ACL ID: W04-1120) a new chinese natural language understanding architecture based on multilayer search mechanism
- (ACL ID: W04-1122) an integrated method for chinese unknown word extraction
- (ACL ID: W04-1307) on the acquisition of phonological representations
- (ACL ID: W04-1313) modelling atypical syntax processing
- (ACL ID: W04-2208) multilingual aligned parallel treebank corpus reflecting contextual information and its applications
- (ACL ID: W04-2602) towards full automation of lexicon construction
- (ACL ID: W04-3227) phrase pair rescoring with term weighting for statistical machine translatio
- (ACL ID: W04-3230) applying conditional random fields to japanese morphological analysis
- (ACL ID: W04-3236) chinese part-of-speech tagging
- (ACL ID: W04-3248) a new approach for english-chinese named entity alignment
- (ACL ID: W05-0506) a second language acquisition model using example generalization and concept categories
- (ACL ID: W05-0706) choosing an optimal architecture for segmentation and pos-tagging of modern hebrew
- (ACL ID: W05-0804) bilingual word spectral clustering for statistical machine translation
- (ACL ID: W06-0102) regional variation of domain-specific lexical items
- (ACL ID: W06-0103) mining atomic chinese abbreviation pairs
- (ACL ID: W06-0109) the role of lexical resources in cjk natural language processing
- (ACL ID: W06-0110) hybrid models for chinese named entity recognition
- (ACL ID: W06-0115) the third international chinese language processing bakeoff
- (ACL ID: W06-0116) chinese named entity recognition with conditional random fields
- (ACL ID: W06-0117) france telecom r&d beijing word segmenter for sighan bakeoff 2006
- (ACL ID: W06-0118) voting between dictionary-based and subword tagging models for chinese word segmentation
- (ACL ID: W06-0120) on closed task of chinese word segmentation
- (ACL ID: W06-0121) chinese word segmentation with maximum entropy and n-gram language model
- (ACL ID: W06-0124) boosting for chinese named entity recognition
- (ACL ID: W06-0125) chinese word segmentation and named entity recognition based on a context-dependent mutual information independence model
- (ACL ID: W06-0129) character language models for chinese word segmentation and named entity recognition
- (ACL ID: W06-0130) chinese named entity recognition with conditional probabilistic models
- (ACL ID: W06-0131) poc-nlw template for chinese word segmentation
- (ACL ID: W06-0132) chinese word segmentation and named entity recognition based on conditional random fields models
- (ACL ID: W06-0133) maximum entropy word segmentation of chinese text
- (ACL ID: W06-0134) a pragmatic chinese word segmentation system
- (ACL ID: W06-0135) netease automatic chinese word segmentation
- (ACL ID: W06-0136) n-gram based two-step algorithm for word segmentation
- (ACL ID: W06-0137) chinese word segmentation based on an approach of maximum entropy modeling
- (ACL ID: W06-0138) using part-of-speech reranking to improve chinese word segmentation
- (ACL ID: W06-0140) chinese named entity recognition with a multi-phase model
- (ACL ID: W06-0502) multilingual ontology acquisition from multiple mrds
- (ACL ID: W06-0608) the hinoki sensebank — a large-scale word sense tagged corpus of japanese —
- (ACL ID: W06-0703) question pre-processing in a qa system on internet discussion groups
- (ACL ID: W06-1002) the role of lexical resources in cjk natural language processing
- (ACL ID: W06-1631) capturing out-of-vocabulary words in arabic text
- (ACL ID: W06-1655) a hybrid markov/semi-markov conditional random field for sequence segmentation
- (ACL ID: W06-1660) empirical study on the performance stability of named entity recognition model across domains
- (ACL ID: W06-1905) keyword translation accuracy and cross-lingual question answering inchinese and japanese
- (ACL ID: W06-2808) anomaly detecting within dynamic chinese chat text
- (ACL ID: W06-3103) morpho-syntactic arabic preprocessing for arabic to english statistical machine translation
- (ACL ID: W06-3208) morphology induction from limited noisy data using approximate string matching
- (ACL ID: W06-3210) a naive theory of affixation and an algorithm for extraction
- (ACL ID: W93-0305) hmm-based part-of-speech tagging for chinese corpora
- (ACL ID: W93-0311) corpus-based adaptation mechanisms for chinese homophone disambiguation
- (ACL ID: W93-0312) example-based sense tagging of running chinese text
- (ACL ID: W95-0109) automatic construction of a chinese electronic dictionary
- (ACL ID: W96-0113) a re-estimation method for stochastic language modeling from ambiguous observations
- (ACL ID: W96-0205) automatic extraction of new words from japanese texts using generalized forward-backward search
- (ACL ID: W97-0120) a self-organizing japanese word segmenter using heuristic word identification and re-estimation
- (ACL ID: W97-0126) a statistical approach to thai morphological analyzer
- (ACL ID: W97-0312) learning to tag multilingual texts through observation
- (ACL ID: W97-0316) lexicon effects on chinese information retrieval
- (ACL ID: W97-0901) reuse of a proper noun recognition system in commercial and operational nlp applications
- (ACL ID: W98-0905) an approach to the automatic acquisition of phonotactic constraints
- (ACL ID: W99-0612) language independent named entity recognition combining morphological and contextual evidence
- (ACL ID: X93-1011) trw japanese fast data finder
- (ACL ID: X96-1021) oleada
- (ACL ID: X96-1026) chinese information extraction and retrieval
- (ACL ID: X96-1055) approaches in met (multi-lingual entity task)
- (ACL ID: X98-1019) improving english and chinese ad-hoc retrieval
* See also a list of some of the related terms to word segmentation.
Back to Description Index