Computational chemogenomics: Is it more than inductive transfer?

Journal of Computer-Aided Molecular Design

J. B. Brown
Yasushi Okuno
Gilles Marcou
Alexandre Varnek
Dragos Horvath

巻: 28
号: 6
開始ページ: 597
終了ページ: 618
記述言語: 英語
掲載種別: 研究論文（学術雑誌）
DOI: 10.1007/s10822-014-9743-1
出版者・発行元: SPRINGER

High-throughput assays challenge us to extract knowledge from multi-ligand, multi-target activity data. In QSAR, weights are statically fitted to each ligand descriptor with respect to a single endpoint or target. However, computational chemogenomics (CG) has demonstrated benefits of learning from entire grids of data at once, rather than building target-specific QSARs. A possible reason for this is the emergence of inductive knowledge transfer (IT) between targets, providing statistical robustness to the model, with no assumption about the structure of the targets. Relevant protein descriptors in CG should allow one to learn how to dynamically adjust ligand attribute weights with respect to protein structure. Hence, models built through explicit learning (EL) by including protein information, while benefitting from IT enhancement, should provide additional predictive capability, notably for protein deorphanization. This interplay between IT and EL in CG modeling is not sufficiently studied. While IT is likely to occur irrespective of the injected target information, it is not clear whether and when boosting due to EL may occur. EL is only possible if protein description is appropriate to the target set under investigation. The key issue here is the search for evidence of genuine EL exceeding expectations based on pure IT. We explore the problem in the context of Support Vector Regression, using more than 9,400 pK-i p K i values of 31 GPCRs, where compound-protein interactions are represented by the concatenation of vectorial descriptions of compounds and proteins. This provides a unified framework to generate both IT-enhanced and potentially EL-enabled models, where the difference is toggled by supplied protein information. For EL-enabled models, protein information includes genuine protein descriptors such as typical sequence-based terms, but also the experimentally determined affinity cross-correlation fingerprints. These latter benchmark the expected behavior of a quasi-ideal descriptor capturing the actual functional protein-protein relatedness, and therefore thought to be the most likely to enable EL. EL- and IT-based methods were benchmarked alongside classical QSAR, with respect to cross-validation and deorphanization challenges. A rational method for projecting benchmarked methodologies into a strategy space is given, in the aims that the projection will provide directions for the types of molecule designs possible using a given methodology. While EL-enabled strategies outperform classical QSARs and favorably compare to similar published results, they are, in all respects evaluated herein, not strongly distinguished from IT-enhanced models. Moreover, EL-enabled strategies failed to prove superior in deorphanization challenges. Therefore, this paper raises caution that, contrary to common belief and intuitive expectation, the benefits of chemogenomics models over classical QSAR are quite possibly due less to the injection of protein-related information, and rather impacted more by the effect of inductive transfer, due to simultaneous learning from all of the modeled endpoints. These results show that the field of protein descriptor research needs further improvements to truly realize the expected benefit of EL. © 2014 Springer International Publishing.

リンク情報

DOI: https://doi.org/10.1007/s10822-014-9743-1
DBLP: https://dblp.uni-trier.de/rec/journals/jcamd/BrownOMVH14
PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24771144
Web of Science: https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000337162700001&DestApp=WOS_CPL
Scopus: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84902801801&origin=inward
Scopus Citedby: https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=84902801801&origin=inward
URL: https://www.wikidata.org/entity/Q30361816
URL: https://dblp.uni-trier.de/db/journals/jcamd/jcamd28.html#BrownOMVH14

ID情報

DOI : 10.1007/s10822-014-9743-1
ISSN : 0920-654X
eISSN : 1573-4951
DBLP ID : journals/jcamd/BrownOMVH14
PubMed ID : 24771144
SCOPUS ID : 84902801801
Web of Science ID : WOS:000337162700001

エクスポート: BibTeX RIS

奥野恭史

論文

Computational chemogenomics: Is it more than inductive transfer?

メニュー

共著者の一覧