lr-prod,1-1-H92-1074,bq <term> CSR ( Connected Speech Recognition ) corpus </term> represents a new <term> DARPA speech
lr-prod,7-2-H92-1074,bq now old <term> Resource Management ( RM ) corpus </term> that has fueled <term> DARPA speech
lr,9-5-P06-2059,bq experiment , the method could construct a <term> corpus </term> consisting of 126,610 <term> sentences
lr,12-2-P01-1008,bq identification of paraphrases </term> from a <term> corpus of multiple English translations </term>
lr,6-3-P06-1052,bq </term> . We evaluate the algorithm on a <term> corpus </term> , and show that it reduces the degree
lr,10-5-N03-2025,bq Markov Model </term> is trained on a <term> corpus </term> automatically tagged by the first
lr,3-3-H92-1026,bq process </term> in a novel way . We use a <term> corpus of bracketed sentences </term> , called a
lr,9-1-P03-1068,bq of a large , <term> semantically annotated corpus </term> resource as reliable basis for the
lr,7-2-P03-1051,bq by a small <term> manually segmented Arabic corpus </term> and uses it to bootstrap an <term>
lr,28-2-P03-1051,bq </term> from a large <term> unsegmented Arabic corpus </term> . The <term> algorithm </term> uses a
lr,27-4-P06-2001,bq </term> to train , that is , a bigger <term> corpus </term> written by one unique <term> author
lr,20-1-N03-2006,bq , we use an out-of-domain <term> bilingual corpus </term> and , in addition , the <term> language
lr,19-3-N03-2006,bq of using an out-of-domain <term> bilingual corpus </term> and the possibility of using the <term>
lr,2-3-I05-4010,bq in detail . The resultant <term> bilingual corpus </term> , 10.4 M <term> English words </term>
lr,13-1-N03-2006,bq </term> based on a small-sized <term> bilingual corpus </term> , we use an out-of-domain <term> bilingual
lr,23-2-C04-1116,bq each author 's text as a coherent <term> corpus </term> . Our approach is based on the idea
lr-prod,2-3-H92-1074,bq for the past 5 years . The new <term> CSR corpus </term> supports research on major new problems
lr-prod,29-4-H92-1074,bq dynamic challenge of extending the <term> CSR corpus </term> to meet future needs . <term> Language
lr-prod,7-4-H92-1074,bq paper presents an overview of the <term> CSR corpus </term> , reviews the definition and development
tech,4-2-N06-4001,bq InfoMagnets </term> aims at making <term> exploratory corpus analysis </term> accessible to researchers
hide detail