論文

査読有り
2015年

Scene Understanding Based on Sound and Text Information for a Cooking Support Robot

CURRENT APPROACHES IN APPLIED ARTIFICIAL INTELLIGENCE
  • Ryosuke Kojima
  • ,
  • Osamu Sugiyama
  • ,
  • Kazuhiro Nakadai

9101
開始ページ
665
終了ページ
674
記述言語
英語
掲載種別
研究論文(国際会議プロシーディングス)
DOI
10.1007/978-3-319-19066-2_64
出版者・発行元
SPRINGER-VERLAG BERLIN

We address noise-robust "auditory scene understanding" for a robot defined by extracting 6W (What, When, Where, Who, Why, hoW) information on the surrounding environment. Although such a robot has been studied in the field of robot audition, only the first four Ws except for "why" and "how" were in scope. Thus, this paper mainly focuses on extracting "how" information, in particular, on cooking scenes to realize a cooking support robot. In this case, "how" information is regarded as a cooking procedure, we construct sound-based cooking procedure recognition based on two models. One is a conventional statistical model, Gaussian Mixture Model (GMM), which is used for an acoustic model to recognize a cooking sound event such as stirring, cutting and so on. The other is a Hierarchical Hidden Markov Model (HHMM), which is used for a recipe model to recognize a sequence of cooking events, i.e., a cooking procedure. We constructed a prototype system for cooking recipe and procedure recognition. Preliminary results showed that the proposed GMM-HHMM based system outperformed a conventional GMM-HMM based system in terms of noise-robustness in cooking recipe recognition and our system was able to correct misrecognition of cooking sound events using recipe model in cooking procedure recognition.

リンク情報
DOI
https://doi.org/10.1007/978-3-319-19066-2_64
DBLP
https://dblp.uni-trier.de/rec/conf/ieaaie/KojimaSN15
J-GLOBAL
https://jglobal.jst.go.jp/detail?JGLOBAL_ID=201702207987252829
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000363236300064&DestApp=WOS_CPL
URL
http://dblp.uni-trier.de/db/conf/ieaaie/ieaaie2015.html#conf/ieaaie/KojimaSN15
ID情報
  • DOI : 10.1007/978-3-319-19066-2_64
  • ISSN : 0302-9743
  • DBLP ID : conf/ieaaie/KojimaSN15
  • J-Global ID : 201702207987252829
  • Web of Science ID : WOS:000363236300064

エクスポート
BibTeX RIS