tech,8-6-H90-1060,ak show a significant improvement for <term> speaker adaptation ( SA ) </term> using the new <term> SI corpus </term>
lr,16-6-H90-1060,ak adaptation ( SA ) </term> using the new <term> SI corpus </term> and a small amount of <term> speech
lr,23-6-H90-1060,ak corpus </term> and a small amount of <term> speech </term> from the <term> new ( target ) speaker
other,26-6-H90-1060,ak amount of <term> speech </term> from the <term> new ( target ) speaker </term> . A <term> probabilistic spectral mapping
model,1-7-H90-1060,ak <term> new ( target ) speaker </term> . A <term> probabilistic spectral mapping </term> is estimated independently for each
other,9-7-H90-1060,ak is estimated independently for each <term> training ( reference ) speaker </term> and the <term> target speaker </term>
other,16-7-H90-1060,ak reference ) speaker </term> and the <term> target speaker </term> . Each <term> reference model </term>
model,1-8-H90-1060,ak the <term> target speaker </term> . Each <term> reference model </term> is transformed to the space of the
other,10-8-H90-1060,ak is transformed to the space of the <term> target speaker </term> and combined by averaging . Using
other,3-9-H90-1060,ak combined by averaging . Using only 40 <term> utterances </term> from the <term> target speaker </term>
other,6-9-H90-1060,ak 40 <term> utterances </term> from the <term> target speaker </term> for <term> adaptation </term> , the <term>
tech,9-9-H90-1060,ak the <term> target speaker </term> for <term> adaptation </term> , the <term> error rate </term> dropped
measure(ment),12-9-H90-1060,ak </term> for <term> adaptation </term> , the <term> error rate </term> dropped to 4.1 % --- a 45 % reduction