tech,42-1-N03-1018,bq In this paper , we introduce a <term> generative probabilistic optical character recognition ( OCR ) model </term> that describes an end-to-end process in the <term> noisy channel framework </term> , progressing from generation of <term> true text </term> through its transformation into the <term> noisy output </term> of an <term> OCR system </term> .
tech,7-2-N03-1018,bq The <term> model </term> is designed for use in <term> error correction </term> , with a focus on <term> post-processing </term> the <term> output </term> of black-box <term> OCR systems </term> in order to make it more useful for <term> NLP tasks </term> .