Paper

Peer-reviewed
June 2006

Coverage of whole proteome by structural genomics observed through protein homology modeling database

Journal of Structural and Functional Genomics
  • Kei Yura
  • Akihiro Yamaguchi
  • Mitiko Go

Volume
7
Issue
2
First page
65
Last page
76
Language
English
Publication type
Research paper (academic journal)
DOI
10.1007/s10969-006-9010-3
Publisher
2

We have been developing FAMSBASE, a protein homology-modeling database covering all open reading frames (ORFs) predicted from genome sequences. The latest update of FAMSBASE (http://daisy.nagahama-i-bio.ac.jp/Famsbase/), which is based on the protein three-dimensional (3D) structures released by November 2003, contains modeled 3D structures for 368,724 ORFs derived from the genomes of 276 species, namely 17 archaebacterial, 130 eubacterial, 18 eukaryotic and 111 phage genomes. These 276 genomes are predicted to have 734,193 ORFs in total, so the current FAMSBASE contains protein 3D structures for approximately 50% of the ORF products. However, cases in which a modeled 3D structure covers an entire ORF product are rare. When the portion of an ORF covered by a 3D structure is compared across the three kingdoms of life, approximately 60% of the ORFs in archaebacteria and eubacteria have modeled 3D structures covering almost the entire amino acid sequence, whereas the percentage falls to about 30% in eukaryotes. When annual differences in the number of ORFs with modeled 3D structures are calculated, the fraction of soluble proteins with modeled 3D structures has increased by 5% for archaebacteria and by 7% for eubacteria over the last 3 years. Assuming that this rate is maintained and that determining 3D structures for predicted disordered regions is unattainable, model structures for all soluble proteins of prokaryotes, excluding the putative disordered regions, will be in hand within 15 years. For eukaryotic proteins, they will be in hand within 25 years. The 3D structures available at those times will not be structures of the entire proteins encoded in single ORFs, but structures of separate structural domains. Measuring or predicting the spatial arrangement of structural domains within an ORF will then become the next issue for structural genomics. © 2006 Springer Science+Business Media B.V.
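As a quick consistency check on the figures quoted in the abstract, the following minimal sketch recomputes the genome total and the "approximately 50%" coverage from the stated counts. The variable and function names are illustrative assumptions, not part of FAMSBASE, and the sketch does not attempt to reproduce the 15-year and 25-year projections, since the abstract does not give the baseline fractions they rest on.

```python
# Counts quoted in the abstract; all names below are illustrative only.
genomes_by_group = {
    "archaebacterial": 17,
    "eubacterial": 130,
    "eukaryotic": 18,
    "phage": 111,
}
modeled_orfs = 368_724   # ORFs with a modeled 3D structure
total_orfs = 734_193     # ORFs predicted from all genomes


def coverage_fraction(modeled: int, total: int) -> float:
    """Return the fraction of ORFs that have a modeled 3D structure."""
    if total <= 0:
        raise ValueError("total must be positive")
    return modeled / total


total_genomes = sum(genomes_by_group.values())
fraction = coverage_fraction(modeled_orfs, total_orfs)

print(f"genomes: {total_genomes}")
print(f"ORFs with a modeled structure: {fraction:.1%}")
```

The group counts sum to the 276 genomes stated in the abstract, and the ratio comes out at just over one half, in line with the abstract's rounded figure.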

Links
DOI
https://doi.org/10.1007/s10969-006-9010-3
PubMed
https://www.ncbi.nlm.nih.gov/pubmed/17146617
Identifiers
  • DOI : 10.1007/s10969-006-9010-3
  • ISSN : 1345-711X
  • PubMed ID : 17146617
  • SCOPUS ID : 33846152637
