2022年
Experimental evaluation of the effect of phoneme time stretching on speaker embedding
Nonlinear Theory and Its Applications, IEICE
- ,
- 巻
- 13
- 号
- 2
- 開始ページ
- 277
- 終了ページ
- 281
- 記述言語
- 英語
- 掲載種別
- 研究論文(学術雑誌)
- DOI
- 10.1587/nolta.13.277
- 出版者・発行元
- The Institute of Electronics, Information and Communication Engineers
For an indefinite length spectrogram sequence of phonemes, we experimentally verified two methods of obtaining speaker embedding by transforming it to fixed length: adding padding and time stretching. We confirmed that both methods can maintain the extraction performance. We also confirm that the fixed frame length does not affect the results.
- リンク情報
- ID情報
-
- DOI : 10.1587/nolta.13.277
- eISSN : 2185-4106
- CiNii Research ID : 1390573242800278912
- ORCIDのPut Code : 110802754