MISC

査読有り
2002年

Clustering of gene expression data by mixture of PCA models

ARTIFICIAL NEURAL NETWORKS - ICANN 2002
  • T Yoshioka
  • ,
  • R Morioka
  • ,
  • K Kobayashi
  • ,
  • S Oba
  • ,
  • N Ogawsawara
  • ,
  • S Ishii

2415
開始ページ
522
終了ページ
527
記述言語
英語
掲載種別
出版者・発行元
SPRINGER-VERLAG BERLIN

Clustering techniques, such as hierarchical clustering, k-means algorithm and self-organizing maps, are widely used to analyze gene expression data. Results of these algorithms depend on several parameters, e.g., the number of clusters. However, there is no theoretical criterion to determine such parameters'. In order to overcome this problem, we propose a method using mixture of PCA models trained by a variational Bayes (VB) estimation. In our method, good clustering results are selected based on the free energy obtained within the VB estimation. Furthermore, by taking an ensemble of estimation results, a robust clustering is achieved without any biological knowledge. Our method is applied to a clustering problem for gene expression data during a sporulation of Bacillus subtilis and it is able to capture characteristics of the sigma cascade.

リンク情報
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000181441900085&DestApp=WOS_CPL
ID情報
  • ISSN : 0302-9743
  • Web of Science ID : WOS:000181441900085

エクスポート
BibTeX RIS