#8395Using this approach, we extract parallel datafrom large Chinese, Arabic, and English non-parallel newspaper corpora.
lr,18-5-J05-4003,ak
fromscratch by starting with a very small
<term>
parallel corpus
</term>
( 100,000 words ) and exploiting
#8447We also show that a good-quality MT system can be built fromscratch by starting with a very small parallel corpus (100,000 words) and exploiting a largenon-parallel corpus.