MISC

2006年5月

Alignment-free classification of G-protein-coupled receptors using self-organizing maps

JOURNAL OF CHEMICAL INFORMATION AND MODELING
  • JM Otaki
  • ,
  • A Mori
  • ,
  • Y Itoh
  • ,
  • T Nakayama
  • ,
  • H Yamamoto

46
3
開始ページ
1479
終了ページ
1490
記述言語
英語
掲載種別
DOI
10.1021/ci050382y
出版者・発行元
AMER CHEMICAL SOC

Proteins are classified mainly on the basis of alignments of amino acid sequences. Drug discovery processes based on pharmacologically important proteins such as G-protein-coupled receptors (GPCRs) may be facilitated if more information is extracted directly from the primary sequences. Here, we investigate an alignment-free approach to protein classification using self-organizing maps (SOMs), a kind of artificial neural network, which needs only primary sequences of proteins and determines their relative locations in a two-dimensional lattice of neurons through an adaptive process. We first showed that a set of 1397 aligned samples of Class A GPCRs can be classified by our SOM program into 15 conventional categories with 99.2% accuracy. Similarly, a nonaligned raw sequence data set of 4116 samples was categorized into 15 conventional families with 97.8% accuracy in a cross-validation test. Orphan GPCRs were also classified appropriately using the result of the SOM learning. A supposedly diverse family of olfactory receptors formed the most distinctive cluster in the map, whereas amine and peptide families exhibited diffuse distributions. A feature of this kind in the map can be interpreted to reflect hierarchical family composition. Interestingly, some orphan receptors that were categorized as olfactory were somatosensory chemoreceptors. These results suggest the applicability and potential of the SOM program to classification prediction and knowledge discovery from protein sequences.

リンク情報
DOI
https://doi.org/10.1021/ci050382y
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000237658400058&DestApp=WOS_CPL
ID情報
  • DOI : 10.1021/ci050382y
  • ISSN : 1549-9596
  • Web of Science ID : WOS:000237658400058

エクスポート
BibTeX RIS