Learning epistatic interactions from sequence-activity data to predict enantioselectivity
File version
Accepted Manuscript (AM)
Author(s)
Gumulya, Yosephine
Malde, Alpeshkumar K
Boden, Mikael
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
License
Abstract
Enzymes with a high selectivity are desirable for improving economics of chemical synthesis of enantiopure compounds. To improve enzyme selectivity mutations are often introduced near the catalytic active site. In this compact environment epistatic interactions between residues, where contributions to selectivity are non-additive, play a significant role in determining the degree of selectivity. Using support vector machine regression models we map mutations to the experimentally characterised enantioselectivities for a set of 136 variants of the epoxide hydrolase from the fungus Aspergillus niger (AnEH). We investigate whether the influence a mutation has on enzyme selectivity can be accurately predicted through linear models, and whether prediction accuracy can be improved using higher-order counterparts. Comparing linear and polynomial degree = 2 models, mean Pearson coefficients (r) from 50×5-fold cross-validation increase from 0.84 to 0.91 respectively. Equivalent models tested on interaction-minimised sequences achieve values of r= 0.90 and r= 0.93. As expected, testing on a simulated control data set with no interactions results in no significant improvements from higher-order models. Additional experimentally derived AnEH mutants are tested with linear and polynomial degree = 2 models, with values increasing from r= 0.51 to r= 0.87 respectively. The study demonstrates that linear models perform well, however the representation of epistatic interactions in predictive models improves identification of selectivity-enhancing mutations. The improvement is attributed to higher-order kernel functions that represent epistatic interactions between residues.
Journal Title
Journal of Computer-Aided Molecular Design
Conference Title
Book Title
Edition
Volume
31
Issue
12
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
© 2017 Springer. This is an electronic version of an article published in Journal of Computer-Aided Molecular Design, 2017, 31 (12), pp. 1085-1096. Journal of Computer-Aided Molecular Design is available online at: http://link.springer.com/ with the open URL of your article.
Item Access Status
Note
Access the data
Related item(s)
Subject
Theoretical and computational chemistry
Cheminformatics and quantitative structure-activity relationships
Science & Technology
Life Sciences & Biomedicine
Technology
Biochemistry & Molecular Biology
Biophysics
Persistent link to this record
Citation
Zaugg, J; Gumulya, Y; Malde, AK; Boden, M, Learning epistatic interactions from sequence-activity data to predict enantioselectivity, Journal of Computer-Aided Molecular Design, 2017, 31 (12), pp. 1085-1096