lr,26-3-H90-1060,ak usual <term> pooling </term> of all the <term> speech data </term> from many <term> speakers </term> prior
other,16-7-H90-1060,ak reference ) speaker </term> and the <term> target speaker </term> . Each <term> reference model </term>
other,10-8-H90-1060,ak is transformed to the space of the <term> target speaker </term> and combined by averaging . Using
other,6-9-H90-1060,ak 40 <term> utterances </term> from the <term> target speaker </term> for <term> adaptation </term> , the <term>
other,22-4-H90-1060,ak a standard <term> grammar </term> and <term> test set </term> from the <term> DARPA Resource Management
other,10-5-H90-1060,ak comparable to our best condition for this <term> test suite </term> , using 109 <term> training speakers
tech,33-3-H90-1060,ak many <term> speakers </term> prior to <term> training </term> . With only 12 <term> training speakers
other,9-7-H90-1060,ak is estimated independently for each <term> training ( reference ) speaker </term> and the <term> target speaker </term>
other,6-3-H90-1060,ak . In addition , combination of the <term> training speakers </term> is done by averaging the statistics
other,3-4-H90-1060,ak <term> training </term> . With only 12 <term> training speakers </term> for <term> SI recognition </term> , we
other,15-5-H90-1060,ak <term> test suite </term> , using 109 <term> training speakers </term> . Second , we show a significant
other,3-9-H90-1060,ak combined by averaging . Using only 40 <term> utterances </term> from the <term> target speaker </term>
measure(ment),14-4-H90-1060,ak recognition </term> , we achieved a 7.5 % <term> word error rate </term> on a standard <term> grammar </term>
hide detail