tech,4-1-P03-1050,bq This paper presents an <term> unsupervised learning approach </term> to building a <term> non-English ( Arabic ) stemmer </term> .
tech,10-1-P03-1050,bq This paper presents an <term> unsupervised learning approach </term> to building a <term> non-English ( Arabic ) stemmer </term> .
model,1-2-P03-1050,bq The <term> stemming model </term> is based on <term> statistical machine translation </term> and it uses an <term> English stemmer </term> and a small ( 10K sentences ) <term> parallel corpus </term> as its sole <term> training resources </term> .
tech,6-2-P03-1050,bq The <term> stemming model </term> is based on <term> statistical machine translation </term> and it uses an <term> English stemmer </term> and a small ( 10K sentences ) <term> parallel corpus </term> as its sole <term> training resources </term> .
tech,13-2-P03-1050,bq The <term> stemming model </term> is based on <term> statistical machine translation </term> and it uses an <term> English stemmer </term> and a small ( 10K sentences ) <term> parallel corpus </term> as its sole <term> training resources </term> .
lr,22-2-P03-1050,bq The <term> stemming model </term> is based on <term> statistical machine translation </term> and it uses an <term> English stemmer </term> and a small ( 10K sentences ) <term> parallel corpus </term> as its sole <term> training resources </term> .
lr,27-2-P03-1050,bq The <term> stemming model </term> is based on <term> statistical machine translation </term> and it uses an <term> English stemmer </term> and a small ( 10K sentences ) <term> parallel corpus </term> as its sole <term> training resources </term> .
lr,1-3-P03-1050,bq No <term> parallel text </term> is needed after the <term> training phase </term> .
other,7-3-P03-1050,bq No <term> parallel text </term> is needed after the <term> training phase </term> .
lr,0-4-P03-1050,bq No <term> parallel text </term> is needed after the <term> training phase </term> . <term> Monolingual , unannotated text </term> can be used to further improve the <term> stemmer </term> by allowing it to adapt to a desired <term> domain </term> or <term> genre </term> .
tech,11-4-P03-1050,bq <term> Monolingual , unannotated text </term> can be used to further improve the <term> stemmer </term> by allowing it to adapt to a desired <term> domain </term> or <term> genre </term> .
other,20-4-P03-1050,bq <term> Monolingual , unannotated text </term> can be used to further improve the <term> stemmer </term> by allowing it to adapt to a desired <term> domain </term> or <term> genre </term> .
other,22-4-P03-1050,bq <term> Monolingual , unannotated text </term> can be used to further improve the <term> stemmer </term> by allowing it to adapt to a desired <term> domain </term> or <term> genre </term> .
other,7-5-P03-1050,bq Examples and results will be given for <term> Arabic </term> , but the approach is applicable to any <term> language </term> that needs <term> affix removal </term> .
other,16-5-P03-1050,bq Examples and results will be given for <term> Arabic </term> , but the approach is applicable to any <term> language </term> that needs <term> affix removal </term> .
tech,19-5-P03-1050,bq Examples and results will be given for <term> Arabic </term> , but the approach is applicable to any <term> language </term> that needs <term> affix removal </term> .
tech,1-6-P03-1050,bq Our <term> resource-frugal approach </term> results in 87.5 % <term> agreement </term> with a state of the art , proprietary <term> Arabic stemmer </term> built using <term> rules </term> , <term> affix lists </term> , and <term> human annotated text </term> , in addition to an <term> unsupervised component </term> .
measure(ment),7-6-P03-1050,bq Our <term> resource-frugal approach </term> results in 87.5 % <term> agreement </term> with a state of the art , proprietary <term> Arabic stemmer </term> built using <term> rules </term> , <term> affix lists </term> , and <term> human annotated text </term> , in addition to an <term> unsupervised component </term> .
tech,16-6-P03-1050,bq Our <term> resource-frugal approach </term> results in 87.5 % <term> agreement </term> with a state of the art , proprietary <term> Arabic stemmer </term> built using <term> rules </term> , <term> affix lists </term> , and <term> human annotated text </term> , in addition to an <term> unsupervised component </term> .
model,20-6-P03-1050,bq Our <term> resource-frugal approach </term> results in 87.5 % <term> agreement </term> with a state of the art , proprietary <term> Arabic stemmer </term> built using <term> rules </term> , <term> affix lists </term> , and <term> human annotated text </term> , in addition to an <term> unsupervised component </term> .
hide detail