SPIN2: Predicting sequence profiles from protein structures using deep neural networks

View/ Open
File version
Accepted Manuscript (AM)
Author(s)
O'Connell, James
Li, Zhixiu
Hanson, Jack
Heffernan, Rhys
Lyons, James
Paliwal, Kuldip
Dehzangi, Abdollah
Yang, Yuedong
Zhou, Yaoqi
Griffith University Author(s)
Year published
2018
Metadata
Show full item recordAbstract
Designing protein sequences that can fold into a given structure is a well‐known inverse protein‐folding problem. One important characteristic to attain for a protein design program is the ability to recover wild‐type sequences given their native backbone structures. The highest average sequence identity accuracy achieved by current protein‐design programs in this problem is around 30%, achieved by our previous system, SPIN. SPIN is a program that predicts sequences compatible with a provided structure using a neural network with fragment‐based local and energy‐based nonlocal profiles. Our new model, SPIN2, uses a deep neural ...
View more >Designing protein sequences that can fold into a given structure is a well‐known inverse protein‐folding problem. One important characteristic to attain for a protein design program is the ability to recover wild‐type sequences given their native backbone structures. The highest average sequence identity accuracy achieved by current protein‐design programs in this problem is around 30%, achieved by our previous system, SPIN. SPIN is a program that predicts sequences compatible with a provided structure using a neural network with fragment‐based local and energy‐based nonlocal profiles. Our new model, SPIN2, uses a deep neural network and additional structural features to improve on SPIN. SPIN2 achieves over 34% in sequence recovery in 10‐fold cross‐validation and independent tests, a 4% improvement over the previous version. The sequence profiles generated from SPIN2 are expected to be useful for improving existing fold recognition and protein design techniques. SPIN2 is available at http://sparks-lab.org.
View less >
View more >Designing protein sequences that can fold into a given structure is a well‐known inverse protein‐folding problem. One important characteristic to attain for a protein design program is the ability to recover wild‐type sequences given their native backbone structures. The highest average sequence identity accuracy achieved by current protein‐design programs in this problem is around 30%, achieved by our previous system, SPIN. SPIN is a program that predicts sequences compatible with a provided structure using a neural network with fragment‐based local and energy‐based nonlocal profiles. Our new model, SPIN2, uses a deep neural network and additional structural features to improve on SPIN. SPIN2 achieves over 34% in sequence recovery in 10‐fold cross‐validation and independent tests, a 4% improvement over the previous version. The sequence profiles generated from SPIN2 are expected to be useful for improving existing fold recognition and protein design techniques. SPIN2 is available at http://sparks-lab.org.
View less >
Journal Title
Proteins: Structure, Function and Bioinformatics
Volume
86
Copyright Statement
© 2018 Wiley Periodicals, Inc. This is the peer reviewed version of the following article: SPIN2: Predicting sequence profiles from protein structures using deep neural networks, Proteins: Structure, Function, and Bioinformatics, Volume86, Issue6, June 2018, Pages 629-633, which has been published in final form at 10.1002/prot.25489. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving (http://olabout.wiley.com/WileyCDA/Section/id-828039.html)
Subject
Mathematical sciences
Biological sciences
Other biological sciences not elsewhere classified