F0 contour approximation model for a one-stream tonal word recognition system

AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS

Nutthacha Prukkanon
Kosin Chamnongthai
Yoshikazu Miyanaga

巻: 70
号: 5
開始ページ: 681
終了ページ: 688
記述言語: 英語
掲載種別
DOI: 10.1016/j.aeue.2016.02.006
出版者・発行元: ELSEVIER GMBH, URBAN & FISCHER VERLAG

The performance of a non-tonal speech recognition system degrades when confronted with the task of recognizing tonal words. Several speech recognition applications require tonal word recognition. Therefore, this paper considers how to create a suitable tone model for a tonal syllable recognition system serving application devices based on a one-stream scheme. The fundamental frequency contour (F0 contour) approximation model is proposed here to estimate F0 continuity contours for all of a tonal word. The processes of approximation include voice detection, F0 smoothing, F0 forecasting, and F0 normalization. To model the F0 contours of unvoiced regions belonging to F0 forecasting, a linear regression function is used to create an approximate F0 contour. Experimental results indicate that the proposed model improves the accuracy of tonal word recognition by 8.6% and 12.2%, respectively, compared with conventional random and exponential approaches. (C) 2016 Elsevier GmbH. All rights reserved.

リンク情報

DOI: https://doi.org/10.1016/j.aeue.2016.02.006
Web of Science: https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000373865600022&DestApp=WOS_CPL
URL: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84959473571&origin=inward

ID情報

DOI : 10.1016/j.aeue.2016.02.006
ISSN : 1434-8411
eISSN : 1618-0399
SCOPUS ID : 84959473571
Web of Science ID : WOS:000373865600022

エクスポート: BibTeX RIS

宮永喜一

MISC

F0 contour approximation model for a one-stream tonal word recognition system

メニュー

共著者の一覧