Genome-scale characterization of RNA tertiary structures and their functional impact by RNA solvent accessibility prediction

Thumbnail Image
File version
Version of Record (VoR)
Yang, Yuedong
Li, Xiaomei
Zhao, Huiying
Zhan, Jian
Wang, Jihua
Zhou, Yaoqi
Griffith University Author(s)
Primary Supervisor
Other Supervisors
File type(s)

As most RNA structures are elusive to structure determination, obtaining solvent accessible surface areas (ASAs) of nucleotides in an RNA structure is an important first step to characterize potential functional sites and core structural regions. Here, we developed RNAsnap, the first machine-learning method trained on protein-bound RNA structures for solvent accessibility prediction. Built on sequence profiles from multiple sequence alignment (RNAsnap-prof), the method provided robust prediction in fivefold cross-validation and an independent test (Pearson correlation coefficients, r, between predicted and actual ASA values are 0.66 and 0.63, respectively). Application of the method to 6178 mRNAs revealed its positive correlation to mRNA accessibility by dimethyl sulphate (DMS) experimentally measured in vivo (r = 0.37) but not in vitro (r = 0.07), despite the lack of training on mRNAs and the fact that DMS accessibility is only an approximation to solvent accessibility. We further found strong association across coding and noncoding regions between predicted solvent accessibility of the mutation site of a single nucleotide variant (SNV) and the frequency of that variant in the population for 2.2 million SNVs obtained in the 1000 Genomes Project. Moreover, mapping solvent accessibility of RNAs to the human genome indicated that introns, 5′ cap of 5′ and 3′ cap of 3′ untranslated regions, are more solvent accessible, consistent with their respective functional roles. These results support conformational selections as the mechanism for the formation of RNA–protein complexes and highlight the utility of genome-scale characterization of RNA tertiary structures by RNAsnap. The server and its stand-alone downloadable version are available at

Journal Title
Conference Title
Book Title
Thesis Type
Degree Program
Publisher link
Patent number
Grant identifier(s)
Rights Statement
Rights Statement
© 2016 Yang et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society. This article, published in RNA, is available under a Creative Commons License (Attribution 4.0 International), as described at
Item Access Status
Access the data
Related item(s)
Biochemistry and cell biology
Genetics not elsewhere classified
Persistent link to this record