論文

査読有り 国際誌
2014年6月

Computational chemogenomics: Is it more than inductive transfer?

Journal of Computer-Aided Molecular Design
  • J. B. Brown
  • ,
  • Yasushi Okuno
  • ,
  • Gilles Marcou
  • ,
  • Alexandre Varnek
  • ,
  • Dragos Horvath

28
6
開始ページ
597
終了ページ
618
記述言語
英語
掲載種別
研究論文(学術雑誌)
DOI
10.1007/s10822-014-9743-1
出版者・発行元
SPRINGER

High-throughput assays challenge us to extract knowledge from multi-ligand, multi-target activity data. In QSAR, weights are statically fitted to each ligand descriptor with respect to a single endpoint or target. However, computational chemogenomics (CG) has demonstrated benefits of learning from entire grids of data at once, rather than building target-specific QSARs. A possible reason for this is the emergence of inductive knowledge transfer (IT) between targets, providing statistical robustness to the model, with no assumption about the structure of the targets. Relevant protein descriptors in CG should allow one to learn how to dynamically adjust ligand attribute weights with respect to protein structure. Hence, models built through explicit learning (EL) by including protein information, while benefitting from IT enhancement, should provide additional predictive capability, notably for protein deorphanization. This interplay between IT and EL in CG modeling is not sufficiently studied. While IT is likely to occur irrespective of the injected target information, it is not clear whether and when boosting due to EL may occur. EL is only possible if protein description is appropriate to the target set under investigation. The key issue here is the search for evidence of genuine EL exceeding expectations based on pure IT. We explore the problem in the context of Support Vector Regression, using more than 9,400 pK-i p K i values of 31 GPCRs, where compound-protein interactions are represented by the concatenation of vectorial descriptions of compounds and proteins. This provides a unified framework to generate both IT-enhanced and potentially EL-enabled models, where the difference is toggled by supplied protein information. For EL-enabled models, protein information includes genuine protein descriptors such as typical sequence-based terms, but also the experimentally determined affinity cross-correlation fingerprints. These latter benchmark the expected behavior of a quasi-ideal descriptor capturing the actual functional protein-protein relatedness, and therefore thought to be the most likely to enable EL. EL- and IT-based methods were benchmarked alongside classical QSAR, with respect to cross-validation and deorphanization challenges. A rational method for projecting benchmarked methodologies into a strategy space is given, in the aims that the projection will provide directions for the types of molecule designs possible using a given methodology. While EL-enabled strategies outperform classical QSARs and favorably compare to similar published results, they are, in all respects evaluated herein, not strongly distinguished from IT-enhanced models. Moreover, EL-enabled strategies failed to prove superior in deorphanization challenges. Therefore, this paper raises caution that, contrary to common belief and intuitive expectation, the benefits of chemogenomics models over classical QSAR are quite possibly due less to the injection of protein-related information, and rather impacted more by the effect of inductive transfer, due to simultaneous learning from all of the modeled endpoints. These results show that the field of protein descriptor research needs further improvements to truly realize the expected benefit of EL. © 2014 Springer International Publishing.

リンク情報
DOI
https://doi.org/10.1007/s10822-014-9743-1
DBLP
https://dblp.uni-trier.de/rec/journals/jcamd/BrownOMVH14
PubMed
https://www.ncbi.nlm.nih.gov/pubmed/24771144
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000337162700001&DestApp=WOS_CPL
Scopus
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84902801801&origin=inward
Scopus Citedby
https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=84902801801&origin=inward
URL
https://www.wikidata.org/entity/Q30361816
URL
https://dblp.uni-trier.de/db/journals/jcamd/jcamd28.html#BrownOMVH14
ID情報
  • DOI : 10.1007/s10822-014-9743-1
  • ISSN : 0920-654X
  • eISSN : 1573-4951
  • DBLP ID : journals/jcamd/BrownOMVH14
  • PubMed ID : 24771144
  • SCOPUS ID : 84902801801
  • Web of Science ID : WOS:000337162700001

エクスポート
BibTeX RIS