Papers

Peer-reviewed International journal
Dec, 2011

Robust seed model training for speaker adaptation using pseudo-speaker features generated by inverse CMLLR transformation

Proceedings of 2011 Automatic Speech Recognition and Understanding Workshop (ASRU 2011)
  • Arata Itoh
  • ,
  • Sunao Hara
  • ,
  • Norihide Kitaoka
  • ,
  • Kazuya Takeda

First page
169
Last page
172
Language
English
Publishing type
Research paper (international conference proceedings)
DOI
10.1109/ASRU.2011.6163925
Publisher
IEEE

In this paper, we propose a novel acoustic model training method which is suitable for speaker adaptation in speech recognition. Our method is based on feature generation from a small amount of speakers' data. For decades, speaker adaptation methods have been widely used. Such adaptation methods need some amount of adaptation data and if the data is not sufficient, speech recognition performance degrade significantly. If the seed models to be adapted to a specific speaker can widely cover more speakers, speaker adaptation can perform robustly. To make such robust seed models, we adopt inverse maximum likelihood linear regression (MLLR) transformation-based feature generation, and then train our seed models using these features. First we obtain MLLR transformation matrices from a limited number of existing speakers. Then we extract the bases of the MLLR transformation matrices using PCA. The distribution of the weight parameters to express the MLLR transformation matrices for the existing speakers is estimated. Next we generate pseudo-speaker MLLR transformations by sampling the weight parameters from the distribution, and apply the inverse of the transformation to the normalized existing speaker features to generate the pseudo-speakers' features. Finally, using these features, we train the acoustic seed models. Using this seed models, we obtained better speaker adaptation results than using simply environmentally adapted models. © 2011 IEEE.

Link information
DOI
https://doi.org/10.1109/ASRU.2011.6163925
DBLP
https://dblp.uni-trier.de/rec/conf/asru/ItohHKT11
URL
http://dblp.uni-trier.de/db/conf/asru/asru2011.html#conf/asru/ItohHKT11
Scopus
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84858993998&origin=inward
Scopus Citedby
https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=84858993998&origin=inward
ID information
  • DOI : 10.1109/ASRU.2011.6163925
  • DBLP ID : conf/asru/ItohHKT11
  • SCOPUS ID : 84858993998

Export
BibTeX RIS