論文

査読有り
2010年

AUTOMATIC AUDIO TAGGING USING COVARIATE SHIFT ADAPTATION

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING
  • Gordon Wichern
  • ,
  • Makoto Yamada
  • ,
  • Harvey Thornburg
  • ,
  • Masashi Sugiyama
  • ,
  • Andreas Spanias

開始ページ
253
終了ページ
256
記述言語
英語
掲載種別
研究論文(国際会議プロシーディングス)
DOI
10.1109/ICASSP.2010.5495973
出版者・発行元
IEEE

Automatically annotating or tagging unlabeled audio files has several applications, such as database organization and recommender systems. We are interested in the case where the system is trained using clean high-quality audio files, but most of the files that need to be automatically tagged during the test phase are heavily compressed and noisy, for instance if they were captured on a mobile device. In this situation we assume the audio files follow a covariate shift model in the acoustic feature space, i.e., the feature distributions are different in the training and test phases, but the conditional distribution of labels given features remains unchanged. Our method uses a specially designed audio similarity measure as input to a set of weighted logistic regressors, which attempt to alleviate the influence of covariate shift. Results on a freely available database of sound files contributed and labeled by non-expert users, demonstrate effective automatic tagging performance.

リンク情報
DOI
https://doi.org/10.1109/ICASSP.2010.5495973
DBLP
https://dblp.uni-trier.de/rec/conf/icassp/WichernYTSS10
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000287096000063&DestApp=WOS_CPL
URL
http://dblp.uni-trier.de/db/conf/icassp/icassp2010.html#conf/icassp/WichernYTSS10
ID情報
  • DOI : 10.1109/ICASSP.2010.5495973
  • ISSN : 1520-6149
  • DBLP ID : conf/icassp/WichernYTSS10
  • Web of Science ID : WOS:000287096000063

エクスポート
BibTeX RIS