Structure-Aware Multimodal Deep Learning for Drug-Protein Interaction Prediction

No Thumbnail Available
File version
Author(s)
Wang, Penglei
Zheng, Shuangjia
Jiang, Yize
Li, Chengtao
Liu, Junhong
Wen, Chang
Patronov, Atanas
Qian, Dahong
Chen, Hongming
Yang, Yuedong
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2022
Size
File type(s)
Location
License
Abstract

Identifying drug-protein interactions (DPIs) is crucial in drug discovery, and a number of machine learning methods have been developed to predict DPIs. Existing methods usually use unrealistic data sets with hidden bias, which will limit the accuracy of virtual screening methods. Meanwhile, most DPI prediction methods pay more attention to molecular representation but lack effective research on protein representation and high-level associations between different instances. To this end, we present the novel structure-aware multimodal deep DPI prediction model, STAMP-DPI, which was trained on a curated industry-scale benchmark data set. We built a high-quality benchmark data set named GalaxyDB for DPI prediction. This industry-scale data set along with an unbiased training procedure resulted in a more robust benchmark study. For informative protein representation, we constructed a structure-aware graph neural network method from the protein sequence by combining predicted contact maps and graph neural networks. Through further integration of structure-based representation and high-level pretrained embeddings for molecules and proteins, our model effectively captures the feature representation of the interactions between them. As a result, STAMP-DPI outperformed state-of-the-art DPI prediction methods by decreasing 7.00% mean square error (MSE) in the Davis data set and improving 8.89% area under the curve (AUC) in the GalaxyDB data set. Moreover, our model is an interpretable model with the transformer-based interaction mechanism, which can accurately reveal the binding sites between molecules and proteins.

Journal Title

Journal of Chemical Information and Modeling

Conference Title
Book Title
Edition
Volume

62

Issue

5

Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject

Medicinal and biomolecular chemistry

Theoretical and computational chemistry

Theory of computation

Science & Technology

Life Sciences & Biomedicine

Physical Sciences

Chemistry, Medicinal

Persistent link to this record
Citation

Wang, P; Zheng, S; Jiang, Y; Li, C; Liu, J; Wen, C; Patronov, A; Qian, D; Chen, H; Yang, Y, Structure-Aware Multimodal Deep Learning for Drug-Protein Interaction Prediction, Journal of Chemical Information and Modeling, 2022, 62 (5), pp. 1308-1317

Collections