SPIN2: Predicting sequence profiles from protein structures using deep neural networks
File version
Accepted Manuscript (AM)
Author(s)
Li, Zhixiu
Hanson, Jack
Heffernan, Rhys
Lyons, James
Paliwal, Kuldip
Dehzangi, Abdollah
Yang, Yuedong
Zhou, Yaoqi
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
License
Abstract
Designing protein sequences that can fold into a given structure is a well‐known inverse protein‐folding problem. One important characteristic to attain for a protein design program is the ability to recover wild‐type sequences given their native backbone structures. The highest average sequence identity accuracy achieved by current protein‐design programs in this problem is around 30%, achieved by our previous system, SPIN. SPIN is a program that predicts sequences compatible with a provided structure using a neural network with fragment‐based local and energy‐based nonlocal profiles. Our new model, SPIN2, uses a deep neural network and additional structural features to improve on SPIN. SPIN2 achieves over 34% in sequence recovery in 10‐fold cross‐validation and independent tests, a 4% improvement over the previous version. The sequence profiles generated from SPIN2 are expected to be useful for improving existing fold recognition and protein design techniques. SPIN2 is available at http://sparks-lab.org.
Journal Title
Proteins: Structure, Function and Bioinformatics
Conference Title
Book Title
Edition
Volume
86
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
© 2018 Wiley Periodicals, Inc. This is the peer reviewed version of the following article: SPIN2: Predicting sequence profiles from protein structures using deep neural networks, Proteins: Structure, Function, and Bioinformatics, Volume86, Issue6, June 2018, Pages 629-633, which has been published in final form at 10.1002/prot.25489. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving (http://olabout.wiley.com/WileyCDA/Section/id-828039.html)
Item Access Status
Note
Access the data
Related item(s)
Subject
Mathematical sciences
Biological sciences
Other biological sciences not elsewhere classified