論文

査読有り
2022年

Experimental evaluation of the effect of phoneme time stretching on speaker embedding

Nonlinear Theory and Its Applications, IEICE
  • Taichi Fukawa
  • ,
  • Kenya Jin'no

13
2
開始ページ
277
終了ページ
281
記述言語
英語
掲載種別
研究論文(学術雑誌)
DOI
10.1587/nolta.13.277
出版者・発行元
The Institute of Electronics, Information and Communication Engineers

For an indefinite length spectrogram sequence of phonemes, we experimentally verified two methods of obtaining speaker embedding by transforming it to fixed length: adding padding and time stretching. We confirmed that both methods can maintain the extraction performance. We also confirm that the fixed frame length does not affect the results.

リンク情報
DOI
https://doi.org/10.1587/nolta.13.277
CiNii Research
https://cir.nii.ac.jp/crid/1390573242800278912?lang=ja
URL
https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-19K12163/
ID情報
  • DOI : 10.1587/nolta.13.277
  • eISSN : 2185-4106
  • CiNii Research ID : 1390573242800278912
  • ORCIDのPut Code : 110802754

エクスポート
BibTeX RIS