Paper

Peer-reviewed
2015

MULTI-MODAL SERVICE OPERATION ESTIMATION USING DNN-BASED ACOUSTIC BAG-OF-FEATURES

2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO)
  • Satoshi Tamura
  • Takuya Uno
  • Masanori Takehara
  • Satoru Hayamizu
  • Takeshi Kurata

First page
2291
Last page
2295
Language
English
Publication type
Research paper (international conference proceedings)
DOI
10.1109/EUSIPCO.2015.7362793
Publisher
IEEE

In service engineering, it is important to estimate when a worker did something and what they did, because this information provides crucial evidence for improving service quality and working environments. For Service Operation Estimation (SOE), acoustic information is one of the key modalities; environmental or background sounds in particular contain effective cues. This paper focuses on two aspects: (1) extracting powerful and robust acoustic features by using stacked denoising autoencoder and bag-of-features techniques, and (2) investigating a multi-modal SOE scheme that combines the audio features with other sensor data as well as non-sensor information. We conducted evaluation experiments using multi-modal data recorded in a restaurant. SOE performance improved in comparison to conventional acoustic features, and the effectiveness of our multi-modal SOE scheme was also confirmed.

Links
DOI
https://doi.org/10.1109/EUSIPCO.2015.7362793
DBLP
https://dblp.uni-trier.de/rec/conf/eusipco/TamuraUTHK15
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000377943800461&DestApp=WOS_CPL
URL
http://dblp.uni-trier.de/db/conf/eusipco/eusipco2015.html#conf/eusipco/TamuraUTHK15
Identifiers
  • DOI : 10.1109/EUSIPCO.2015.7362793
  • ISSN : 2076-1465
  • DBLP ID : conf/eusipco/TamuraUTHK15
  • ORCID Put Code : 34740453
  • Web of Science ID : WOS:000377943800461
