論文

査読有り
2008年3月

Hidden active. information in a random compound library: Extraction using a pseudo-structure-activity relationship model

JOURNAL OF CHEMICAL INFORMATION AND MODELING
  • Hiroaki Fukunishi
  • ,
  • Reiji Teramoto
  • ,
  • Jiro Shimada

48
3
開始ページ
575
終了ページ
582
記述言語
英語
掲載種別
研究論文(学術雑誌)
DOI
10.1021/ci7003384
出版者・発行元
AMER CHEMICAL SOC

We propose a hypothesis that "a model of active compound can be provided by integrating information of compounds high-ranked by docking simulation of a random compound library". In our hypothesis, the inclusion of true active compounds in the high-ranked compound is not necessary. We regard the high-ranked compounds as being pseudo-active compounds.-As a method to embody our hypothesis, we introduce a pseudo-structure-activity relationship (PSAR) model. Although the PSAR model is the same as a quantitative structure activity relationship (QSAR) model, in terms of statistical methodology, the implications of the training data are different. Known active compounds (ligands) are used as training data in the QSAR model, whereas the pseudo-active compounds are used in the PSAR model. In this study, Random Forest was used as a machine-learning algorithm. From tests for four functionally different targets, estrogen receptor antagonist (ER), thymidine kinase (TK), thrombin, and acetylcholine esterase (AChE), using five scoring functions, we obtained three conclusions: (1) the PSAR models significantly gave higher percentages of known ligands found than random sampling, and these results are sufficient to support our hypothesis; (2) the PSAR models gave higher percentages of known ligands found than normal scoring by scoring function, and these results demonstrate the practical usefulness of the PSAR model; and (3) the PSAR model can assess compounds failed in the docking simulation. Note that PSAR and QSAR models are used in different situations; the advantage of the PSAR model emerges when no ligand is available as training data or when one wants to find novel types of ligands, whereas the QSAR model is effective for finding compounds similar to known ligands when the ligands are already known.

リンク情報
DOI
https://doi.org/10.1021/ci7003384
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000254353500011&DestApp=WOS_CPL
ID情報
  • DOI : 10.1021/ci7003384
  • ISSN : 1549-9596
  • Web of Science ID : WOS:000254353500011

エクスポート
BibTeX RIS