lr,12-2-P01-1008,ak identification of paraphrases </term> from a <term> corpus of multiple English translations </term>
lr,44-1-N03-1004,ak for <term> answers </term> in multiple <term> corpora </term> . The <term> answering agents </term>
lr,19-4-N03-1012,ak successfully classifies 73.2 % in a <term> German corpus </term> of 2.284 <term> SRHs </term> as either
lr,12-1-N03-2006,ak </term> based on a <term> small-sized bilingual corpus </term> , we use an <term> out-of-domain bilingual
lr,19-1-N03-2006,ak , we use an <term> out-of-domain bilingual corpus </term> and , in addition , the <term> language
lr,32-1-N03-2006,ak model </term> of an <term> in-domain monolingual corpus </term> . We conducted experiments with an
lr,18-3-N03-2006,ak of using an <term> out-of-domain bilingual corpus </term> and the possibility of using the <term>
lr,10-5-N03-2025,ak Markov Model </term> is trained on a <term> corpus </term> automatically tagged by the first
lr,19-2-N03-4010,ak candidates </term> from the given <term> text corpus </term> . The operation of the <term> system
other,15-1-P03-1009,ak classes </term> from undisambiguated <term> corpus data </term> . We describe a new approach
lr,15-5-P03-1031,ak information </term> obtained from <term> dialogue corpora </term> . Unlike conventional methods that
lr,17-2-P03-1050,ak a <term> small ( 10K sentences ) parallel corpus </term> as its sole <term> training resources
lr,6-2-P03-1051,ak by a <term> small manually segmented Arabic corpus </term> and uses it to bootstrap an <term>
lr,27-2-P03-1051,ak </term> from a <term> large unsegmented Arabic corpus </term> . The <term> algorithm </term> uses a
lr,8-4-P03-1051,ak estimated from a <term> small manually segmented corpus </term> of about 110,000 <term> words </term>
lr,18-5-P03-1051,ak from a <term> 155 million word unsegmented corpus </term> , and re-estimate the <term> model
lr,34-5-P03-1051,ak <term> vocabulary </term> and <term> training corpus </term> . The resulting <term> Arabic word
lr,15-6-P03-1051,ak exact match accuracy </term> on a <term> test corpus </term> containing 28,449 <term> word tokens
lr,24-7-P03-1051,ak can create a <term> small manually segmented corpus </term> of the <term> language </term> of interest
lr,15-2-P03-1058,ak </term> from <term> English-Chinese parallel corpora </term> , which are then used for disambiguating
hide detail