Learning epistatic interactions from sequence-activity data to predict enantioselectivity

File version

Accepted Manuscript (AM)

Author(s)
Zaugg, Julian
Gumulya, Yosephine
Malde, Alpeshkumar K
Boden, Mikael
Date
2017
Abstract

Enzymes with high selectivity are desirable for improving the economics of chemical synthesis of enantiopure compounds. To improve enzyme selectivity, mutations are often introduced near the catalytic active site. In this compact environment, epistatic interactions between residues, where contributions to selectivity are non-additive, play a significant role in determining the degree of selectivity. Using support vector machine regression models, we map mutations to the experimentally characterised enantioselectivities for a set of 136 variants of the epoxide hydrolase from the fungus Aspergillus niger (AnEH). We investigate whether the influence a mutation has on enzyme selectivity can be accurately predicted through linear models, and whether prediction accuracy can be improved using higher-order counterparts. Moving from linear to polynomial (degree = 2) models, the mean Pearson coefficient (r) from 50×5-fold cross-validation increases from 0.84 to 0.91. Equivalent models tested on interaction-minimised sequences achieve r = 0.90 and r = 0.93, respectively. As expected, testing on a simulated control data set with no interactions yields no significant improvement from higher-order models. When additional experimentally derived AnEH mutants are tested with the linear and polynomial (degree = 2) models, r increases from 0.51 to 0.87. The study demonstrates that linear models perform well, but that representing epistatic interactions in predictive models improves the identification of selectivity-enhancing mutations. The improvement is attributed to higher-order kernel functions that represent epistatic interactions between residues.
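To illustrate why a degree-2 polynomial kernel can represent pairwise (epistatic) terms that a linear model cannot, the following is a minimal sketch and not the authors' implementation. It assumes each variant is encoded as a binary vector marking which mutations are present, and that the kernel has the common inhomogeneous form (x · y + 1)^2; the abstract specifies neither. The function names and example vectors are hypothetical.

```python
from itertools import combinations
from math import isclose, sqrt


def poly2_kernel(x, y):
    """Inhomogeneous polynomial kernel of degree 2: (x . y + 1)^2 (assumed form)."""
    dot = sum(a * b for a, b in zip(x, y))
    return (dot + 1) ** 2


def explicit_features(x):
    """Explicit feature map whose inner product reproduces poly2_kernel.

    Besides a constant and the single-site terms available to a linear model,
    it contains one product term per pair of sites, which is where
    non-additive (pairwise) contributions can be represented.
    """
    feats = [1.0]                                              # constant term
    feats += [sqrt(2) * a for a in x]                          # single-site terms
    feats += [a * a for a in x]                                # squared terms
    feats += [sqrt(2) * a * b for a, b in combinations(x, 2)]  # pairwise terms
    return feats


def inner(u, v):
    """Plain inner product of two equal-length feature lists."""
    return sum(a * b for a, b in zip(u, v))


# Hypothetical variants: 1 = mutation present at that site, 0 = wild type.
variant_a = [1, 0, 1, 0]
variant_b = [1, 1, 1, 0]

k_implicit = poly2_kernel(variant_a, variant_b)
k_explicit = inner(explicit_features(variant_a), explicit_features(variant_b))
print(k_implicit, k_explicit)
```

The two values agree up to floating-point rounding, so under these assumptions a model using this kernel is effectively a linear model over single-site and pairwise features, without those pairwise features ever being enumerated explicitly.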

Journal Title

Journal of Computer-Aided Molecular Design

Volume

31

Issue

12

Rights Statement

© 2017 Springer. This is an electronic version of an article published in Journal of Computer-Aided Molecular Design, 2017, 31 (12), pp. 1085-1096. Journal of Computer-Aided Molecular Design is available online at: http://link.springer.com/ with the open URL of your article.

Subject

Theoretical and computational chemistry

Cheminformatics and quantitative structure-activity relationships

Science & Technology

Life Sciences & Biomedicine

Technology

Biochemistry & Molecular Biology

Biophysics

Citation

Zaugg, J; Gumulya, Y; Malde, AK; Boden, M, Learning epistatic interactions from sequence-activity data to predict enantioselectivity, Journal of Computer-Aided Molecular Design, 2017, 31 (12), pp. 1085-1096
