Prediction and validation of the unexplored RNA-binding protein atlas of the human proteome

Loading...
Thumbnail Image
File version
Author(s)
Zhao, Huiying
Yang, Yuedong
Janga, Sarath Chandra
Kao, C Cheng
Zhou, Yaoqi
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2014
Size

417709 bytes

File type(s)

application/pdf

Location
License
Abstract

Detecting protein-RNA interactions is challenging both experimentally and computationally because RNAs are large in number, diverse in cellular location and function, and flexible in structure. As a result, many RNA-binding proteins (RBPs) remain to be identified. Here, a template-based, function-prediction technique SPOT-Seq for RBPs is applied to human proteome and its result is validated by a recent proteomic experimental discovery of 860 mRNA-binding proteins (mRBPs). The coverage (or sensitivity) is 42.6% for 1217 known RBPs annotated in the Gene Ontology and 43.6% for 860 newly discovered human mRBPs. Consistent sensitivity indicates the robust performance of SPOT-Seq for predicting RBPs. More importantly, SPOT-Seq detects 2418 novel RBPs in human proteome, 291 of which were validated by the newly discovered mRBP set. Among 291 validated novel RBPs, 61 are not homologous to any known RBPs. Successful validation of predicted novel RBPs permits us to further analysis of their phenotypic roles in disease pathways. The dataset of 2418 predicted novel RBPs along with confidence levels and complex structures is available at http://sparks-lab.org (in publications) for experimental confirmations and hypothesis generation.

Journal Title

Proteins: Structure, Function, Bioinformatics

Conference Title
Book Title
Edition
Volume

82

Issue

4

Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© 2014 Wiley Periodicals, Inc. This is the accepted version of the following article: Prediction and validation of the unexplored RNA-binding protein atlas of the human proteome, Proteins: Structure, Function, and Bioinformatics, Vol. 82(4), 2014, pp. 640-647, which has been published in final form at dx.doi.org/10.1002/prot.24441.

Item Access Status
Note
Access the data
Related item(s)
Subject

Mathematical sciences

Biological sciences

Information and computing sciences

Persistent link to this record
Citation
Collections