#21186Second, we show a significant improvement forspeaker adaptation ( SA ) using the new SI corpus and a small amount of speech from the new (target) speaker.
lr,16-6-H90-1060,ak
adaptation ( SA )
</term>
using the new
<term>
SI corpus
</term>
and a small amount of
<term>
speech
#21194Second, we show a significant improvement for speaker adaptation (SA) using the newSI corpus and a small amount of speech from the new (target) speaker.
lr,23-6-H90-1060,ak
corpus
</term>
and a small amount of
<term>
speech
</term>
from the
<term>
new ( target ) speaker
#21201Second, we show a significant improvement for speaker adaptation (SA) using the new SI corpus and a small amount ofspeech from the new (target) speaker.
other,26-6-H90-1060,ak
amount of
<term>
speech
</term>
from the
<term>
new ( target ) speaker
</term>
. A
<term>
probabilistic spectral mapping
#21204Second, we show a significant improvement for speaker adaptation (SA) using the new SI corpus and a small amount of speech from thenew ( target ) speaker.
model,1-7-H90-1060,ak
<term>
new ( target ) speaker
</term>
. A
<term>
probabilistic spectral mapping
</term>
is estimated independently for each
#21211Aprobabilistic spectral mapping is estimated independently for each training (reference) speaker and the target speaker.
other,9-7-H90-1060,ak
is estimated independently for each
<term>
training ( reference ) speaker
</term>
and the
<term>
target speaker
</term>
#21219A probabilistic spectral mapping is estimated independently for eachtraining ( reference ) speaker and the target speaker.
other,16-7-H90-1060,ak
reference ) speaker
</term>
and the
<term>
target speaker
</term>
. Each
<term>
reference model
</term>
#21226A probabilistic spectral mapping is estimated independently for each training (reference) speaker and thetarget speaker.
model,1-8-H90-1060,ak
the
<term>
target speaker
</term>
. Each
<term>
reference model
</term>
is transformed to the space of the
#21230Eachreference model is transformed to the space of the target speaker and combined by averaging.
other,10-8-H90-1060,ak
is transformed to the space of the
<term>
target speaker
</term>
and combined by averaging . Using
#21239Each reference model is transformed to the space of thetarget speaker and combined by averaging.
other,3-9-H90-1060,ak
combined by averaging . Using only 40
<term>
utterances
</term>
from the
<term>
target speaker
</term>
#21249Using only 40utterances from the target speaker for adaptation, the error rate dropped to 4.1% --- a 45% reduction in error compared to the SI result.
other,6-9-H90-1060,ak
40
<term>
utterances
</term>
from the
<term>
target speaker
</term>
for
<term>
adaptation
</term>
, the
<term>
#21252Using only 40 utterances from thetarget speaker for adaptation, the error rate dropped to 4.1% --- a 45% reduction in error compared to the SI result.
tech,9-9-H90-1060,ak
the
<term>
target speaker
</term>
for
<term>
adaptation
</term>
, the
<term>
error rate
</term>
dropped
#21255Using only 40 utterances from the target speaker foradaptation, the error rate dropped to 4.1% --- a 45% reduction in error compared to the SI result.
measure(ment),12-9-H90-1060,ak
</term>
for
<term>
adaptation
</term>
, the
<term>
error rate
</term>
dropped to 4.1 % --- a 45 % reduction
#21258Using only 40 utterances from the target speaker for adaptation, theerror rate dropped to 4.1% --- a 45% reduction in error compared to the SI result.