2011年
BAYESIAN NONPARAMETRIC SPECTROGRAM MODELING BASED ON INFINITE FACTORIAL INFINITE HIDDEN MARKOV MODEL
2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA)
- ,
- ,
- ,
- ,
- ,
- 開始ページ
- 325
- 終了ページ
- 328
- 記述言語
- 英語
- 掲載種別
- 研究論文(国際会議プロシーディングス)
- DOI
- 10.1109/ASPAA.2011.6082324
- 出版者・発行元
- IEEE
This paper presents a Bayesian nonparametric latent source discovery method for music signal analysis. In audio signal analysis, an important goal is to decompose music signals into individual notes, with applications such as music transcription, source separation or note-level manipulation. Recently, the use of latent variable decompositions, especially nonnegative matrix factorization (NMF), has been a very active area of research. These methods are facing two, mutually dependent, problems: first, instrument sounds often exhibit time-varying spectra, and grasping this time-varying nature is an important factor to characterize the diversity of each instrument; moreover, in many cases we do not know in advance the number of sources and which instruments are played. Conventional decompositions generally fail to cope with these issues as they suffer from the difficulties of automatically determining the number of sources and automatically grouping spectra into single events. We address both these problems by developing a Bayesian nonparametric fusion of NMF and hidden Markov model (HMM). Our model decomposes music spectrograms in an automatically estimated number of components, each of which consisting in an HMM whose number of states is also automatically estimated from the data.
- リンク情報
-
- DOI
- https://doi.org/10.1109/ASPAA.2011.6082324
- Web of Science
- https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000298302900082&DestApp=WOS_CPL
- URL
- http://www.scopus.com/inward/record.url?eid=2-s2.0-83455246038&partnerID=MN8TOARS
- URL
- http://orcid.org/0000-0003-4385-7170
- ID情報
-
- DOI : 10.1109/ASPAA.2011.6082324
- ORCIDのPut Code : 61223446
- SCOPUS ID : 83455246038
- Web of Science ID : WOS:000298302900082