Predicting lysine-malonylation sites of proteins using sequence and predicted structural features

Author(s)
Taherzadeh, Ghazaleh
Yang, Yuedong
Xu, Haodong
Xue, Yu
Liew, Alan Wee-Chung
Zhou, Yaoqi
Date
2018
Abstract

Malonylation is a recently discovered post‐translational modification (PTM) in which a malonyl group attaches to a lysine (K) amino acid residue of a protein. In this work, a novel machine learning model, SPRINT‐Mal, is developed to predict malonylation sites by employing sequence and predicted structural features. Evolutionary information and physicochemical properties are found to be the two most discriminative features, whereas a structural feature called half‐sphere exposure provides additional improvement to the prediction performance. SPRINT‐Mal trained on mouse data yields robust performance for 10‐fold cross‐validation and an independent test set, with Area Under the Curve (AUC) values of 0.74 and 0.76 and Matthews’ Correlation Coefficient (MCC) values of 0.213 and 0.20, respectively. Moreover, SPRINT‐Mal achieved comparable performance when tested on H. sapiens proteins without species‐specific training, but not when tested on proteins from the bacterium S. erythraea. This suggests similar underlying physicochemical mechanisms between mouse and human but not between mouse and bacterium. SPRINT‐Mal is freely available as an online server at: http://sparks-lab.org/server/SPRINT-Mal/. © 2018 Wiley Periodicals, Inc.
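
For readers unfamiliar with the two metrics quoted in the abstract, the following is a minimal sketch of how AUC and MCC are commonly computed for a binary site predictor. It is not taken from SPRINT‐Mal: the function names `mcc` and `auc`, the toy labels and scores, and the 0.5 decision threshold are all assumptions made purely for illustration.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews' Correlation Coefficient from confusion-matrix counts.

    Returns 0.0 when any marginal total is zero (the usual convention).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


def auc(labels, scores):
    """Area under the ROC curve via pairwise ranking.

    Equals the fraction of (positive, negative) pairs in which the
    positive example receives the higher score; ties count as half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = malonylated lysine, 0 = non-malonylated lysine.
labels = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.4, 0.5, 0.2, 0.7, 0.1]

# Assumed decision threshold of 0.5 to turn scores into class predictions.
preds = [1 if s >= 0.5 else 0 for s in scores]
tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
tn = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 0)
fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)

toy_auc = auc(labels, scores)
toy_mcc = mcc(tp, tn, fp, fn)
```

AUC is threshold‐independent, whereas MCC depends on the chosen cut‐off. That is one reason a model can show a moderate AUC alongside a comparatively low MCC, particularly when positive sites are much rarer than negative ones.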

Journal Title

Journal of Computational Chemistry

Volume

39

Issue

22

Subject

Physical chemistry

Theoretical and computational chemistry

Theoretical and computational chemistry not elsewhere classified

Nanotechnology

Support vector machines

Lysine‐malonylation sites prediction

Post translational modification
