3D RNA from evolutionary couplings

Caleb Weinreb; Torsten Gross; Chris Sander; Debora S. Marks

doi:10.1101/028456

Abstract

Non-protein-coding RNAs are ubiquitous in cell physiology, with a diverse repertoire of known functions. In fact, the majority of the eukaryotic genome does not code for proteins, and thousands of conserved long non-protein-coding RNAs of currently unkown function have been identified. When available, knowledge of their 3D structure is very helpful in elucidating the function of these RNAs. However, despite some outstanding structure elucidation of RNAs using X-ray crystallography, NMR and cryoEM, learning RNA 3D structures remains low-throughput. RNA structure prediction in silico is a promising alternative approach and works well for double-helical stems, but full 3D structure determination requires tertiary contacts outside of secondary structures that are difficult to infer from sequence information. Here, based only on information from RNA multiple sequence alignments, we use a global statistical sequence probability model of co-variation in a pairs of nucleotide positions to detect 3D contacts, in analogy to recently developed breakthrough methods for computational protein folding. In blinded tests on 22 known RNA structures ranging in size from 65 to 1800 nucleotides, the predicted contacts matched physical nucleotide interactions with 65-95% true positive prediction accuracy. Importantly, we infer many long-range tertiary contacts, including non-Watson-Crick interactions, where secondary structure elements assemble in 3D. When used as restraints in molecular dynamics simulations, the inferred contacts improve RNA 3D structure prediction to a coordinate error as low as 6 – 10 Å rmsd deviation in atom positions, with potential for further refinement by molecular dynamics. These contacts include functionally important interactions, such as those that distinguish the active and inactive conformations of four riboswitches. In blind prediction mode, we present evolutionary couplings suitable for folding simulations for 180 RNAs of unknown structure, available at https://marks.hms.harvard.edu/ev_rna/. We anticipate that this approach can help shed light on the structure and function of non-protein-coding RNAs as well as 3D-structured mRNAs.

Introduction

Structured RNAs play diverse cellular roles, including the regulation of mRNA splicing, degradation and localization^1-5, delivery of viral genome⁶, X-inactivation⁷, as well as its more well known roles in translation and ribosomal processing⁸ This probably represents only a fraction of RNA’s functional repertoire. High-throughput approaches have revealed pervasive transcription throughout the non-coding genome. Although much of this transcription may be artifactual or biological noise⁹, it has nevertheless revealed some functional RNAs, including long noncoding RNAs¹⁰ that may have three-dimensional structures that can act can act – for example – as protein scaffolds¹¹. Other large-scale evidence of structure within the coding region of mRNAs has emerged from new high-throughput approaches that detect base-pairing in vivo^12-15.

Many of these newly discovered RNAs act by adopting specific 3D structures^16,17 but high-resolution structure determination remains labor intensive. Thus, there is a renewed interest in computational approaches to structure prediction. Though 3D structure prediction for smaller RNAs has become moderately successful ^18-22, predicting the structure of large (> 70 nt) RNAs without experimental information remains challenging²³ as the RNAs with multiple helical segments can assume diverse folds, producing a conformational search space that is impossibly large. Information about contacts between bases that are distant in the secondary structure but close in the 3D structure (tertiary contacts) substantially narrows the conformational search space²⁴. Biochemical probing studies can provide evidence for low-resolution tertiary contacts^25,26, and 3D structures computed using extensive experimental chemical mapping information have achieved accuracy of ~ 8Å positional root mean square deviation (rmsd) from the known crystal structures. However, predicting RNA 3D structure completely in silico has been challenging²⁷.

The recent explosion of available homologous sequences for RNAs provides an exciting opportunity to detect tertiary contacts from sequence co-variation alone. However, existing methods – designed to detect local (pairwise) patterns of sequence co-variation – have had limited success^28,29,30. Since RNA tertiary contacts often form complex networks³¹, it is possible that overlapping patterns of sequence constraint interfere with each other, obscuring true correlations and producing spurious transitive correlations when multiple contacts are chained together. A similar problem stymied protein structure prediction until the emergence of global maximum entropy models that could de-convolve the underlying network of residue-residue interactions^32-34. Although local, non-global co-variation models have successfully predicted RNA secondary structure, we hypothesized that global maximum entropy models such as those used in predicting protein 3D structure^35-38 would enhance prediction of 3D contacts in RNA, including tertiary contacts such as non-WC base pairs. Here we adapted the maxent model to RNA sequences alignments and tested the ability of the approach to predict tertiary structure contacts on 22 known RNA structures. In genuine prediction mode, we infer 3D contacts for 180 known functional RNAs with no known 3D structure from mulitple sequence alignments alone.

Results

Evolutionary couplings accurately predict 3D contacts

We adapted the evolutionary couplings model used for proteins to predict ECs for RNA and evaluated the ability of ECs to recover RNA 3D contacts for a specific RNA from each of the 22 RFAM families that had a known 3D structure (Supplementary Table 1). To assess the accuracy of predicted contacts, we defined contacts as true positives when their minimumatom-distance was < 8 Å. All 22 RNAs had a true positive rate above 70% for L/2 ECs (where L is the number of nucleotides) and 8 had over 80%. ECs predicted contacts with greater accuracy than mutual information (MI), which has been widely used for RNA secondary structure prediction^39,40 (Fig. 1A). This was true even when using enhanced MI (MIe) that implements two features of the EC statistical model: (1) Down-weighting of similar sequences to avoid spurious correlations from phylogeny; (2) An average product correction (APC)⁴¹. We refer to MI without these modifications as raw MI (MI_R).

Figure 1: Comparison of EC to MI: Summary of 22 alignments

(A) EC predicts structural contacts with an overall higher accuracy than MI_E or MI_R. To see if EC and MI are sensitive to different types of contacts, we classified contacts based on their distance in the secondary structure and defined three categories: secondary structure contacts (d_ss = 1); short-range contacts (d_ss < 5); and long-range contacts (d_ss ≥ 5). (B) Whereas EC and MI detect a similar number of secondary structure contacts, ECs are significantly enriched with long-range contacts. (C) The long-range contacts detected by ECs represent a variety of biochemical interactions, as annotated from the crystal structure. Notably, 51% of these contacts have no annotation. Understanding the biochemical basis for these interactions is a promising area for future research.

ECs detect long-range interactions

Though overall accuracy is important – not all contacts are created equal. Often, complex RNA folds are stabilized by a small number of critical long-range contacts that bridge distant parts of the secondary structure. Detecting these contacts computationally would be a significant advance since they provide crucial structural information that cannot be inferred from the secondary structure alone. To test whether ECs could detect long-range contacts, we define the secondary-structure-distance (d_ss) between two bases as the length of the shortest path between them in a graph where nodes are bases and edges are either WC base pairs of instances of adjacency on the RNA chain. True positive predicted contacts can then be divided into three categories: WC base pairs (d_ss = 1); short-range contacts (1 < d_ss < 5); and long-range contacts (d_ss ≥ 5). Compared to MI_E, EC-predicted contacts are highly enriched with long-range contacts (0.66 – 8 fold; mean fold-change = 2.4; p ≤ 10⁻³; Fig. 1B). Indeed, we were able to robustly predict long-range contacts across the 22 alignments, obtaining on average 0.05 * L contacts for an RNA of length L, with some notable outliers. For example, for Ribonuclease P, a ribozyme of length L = 334, we correctly predicted 53 long-range contacts using ECs, compared to 9 for MI_E and 10 for MI_R. The long-range contacts predicted by ECs represent a variety of biochemical interactions, including base pairing, base stacking and base-backbone interactions (Fig. 1C). Remarkably, 51% of long-range contacts had no annotation in the crystal structure. These contacts may arise from un-annotated interactions such cooperative binding of an ion, ligand, or protein, or through patterns of hydrogen bonding that are not currently characterized. Understanding the biochemical basis for these interactions is a interesting area for future research.

ECs detect non-WC base pairs

Pairs of RNA bases often form contacts through hydrogen bonding, assuming geometrical configurations that can broadly be divided into Watson-Crick (WC) base pairs and non-WC base pairs. Previous work has suggested that non-WC base pairs cannot be predicted from sequence co-variation alone⁴², since they do not sufficiently coevolve. Others have hypothesized that non-WC base pairs cannot be predicted computationally since they often participate in interaction networks, with overlapping patterns of sequence constraint obscuring the underlying pairwise co-variation³¹. Evolutionary couplings derived from global maximum entropy models have successfully deconvolved similar networks of constraints within proteins, and may be useful in this context too. We found that ECs detect non-WC base pairs with unprecedented sensitivity, detecting substantially more pairs than MI_E (0.5 – 8.5 fold; mean fold-change = 1.92; p ≤ 10⁻³). Overall, ECs captured 16% of all annotated non-WC base pairs across the 22 structures, which may represent a high level of sensitivity, since not all non-WC base pairs coevolve. We expect that ECs will complement existing approaches for detecting non-WC base pairs that rely on the concept of isostericity³⁰, since they can focus attention on interactions with the strongest co-evolutionary signal.

ECs reveal contacts in the eukaryotic ribosome

ECs may be sensitive to interacting nucleotides in large RNAs that form topologically complex folds with abundant long-range contacts. ECs computed on a full alignment of eukaryotic ribosomal sequences (RF01960) are over 90% accurate for the top 900 (L/2) contacts and predict substantially more WC and non-WC base pairs than MI_E (Fig. 2A-C). ECs also predict many long-range contacts between nucleotides (Fig. 2D). However, some contacting nucleotides are not predicted, including a large pseudo-knot between the 5-prime and 3-prime regions. ECs may not capture these contacts due to a lack of sensitivity. Alternatively, these regions may not actually be co-evolving and their proximities could be a simple consequence of the geometric constraints from the remainder of the molecule. Distinguishing contacts that genuinely coevolve from those that form due to geometrical constraints is important because coevolving contacts are conserved across the alignment and likely to be functionally important.

Figure 2: ECs detect ribosomal contacts

ECs may be sensitive for large RNAs that form topologically complex folds with abundant long-range contacts. We tested this notion on the 40S ribosome (RF01960), and found a dramatic difference between ECs and MI, with ECs detecting more overall true positives (A), more Watson-Crick base pairs (B) and more non-WC base pairs (C). ECs also detect ~50 long-range contacts that bridge distance parts of the secondary structure (D).

Evolutionary couplings reveal functionally important contacts in riboswitches

To probe whether evolutionarily coupled tertiary contacts are functionally important, we investigated the top scoring ECs in riboswitches, which are cis-acting regulatory RNA genes that undergo ligand or temperature⁴⁵ dependent conformational changes between at least two mutually exclusive functional states^43-45. In four riboswitches from our dataset – S-adenosylmethionine (SAM), active vitamin B₁ (TPP), active folate vitamin (THF), and Adenine-sensing ( Purine) – we found a cluster of tertiary contacts that stabilize the ligand bound conformation, but are broken in the unbound conformation (Fig.3). For instance, in the TPP riboswitch, ECs between the L5 loop and J3-2 helix (69A-37G, 69A-23C and 70A-22C, in numbering from 2gdi.pdb⁴⁶) have stacking interactions when the TPP ligand binds⁴⁶, Fig. 3B. The EC predicted contacts that are part of the ligand bound conformations are not present in the secondary structure or detected by MI_E. Similarly, very high ranking ECs between nucleotides between P2-P4 and P1-P4 in the SAM riboswitch form conditionally on binding of S-adenosylmethionine, (C25-G89, U26-A88, G27-C87, G28-C86, A9-A84, A10-A84 in numbering from 4kqy.pdb⁴⁷). Some of these EC predicted contacts are also detected by the MI_E but there is a cluster between P1-P4 that are not, including base stacking between A9-A84. In addition to tertiary contacts from the known ligand-bound conformation, ECs could be used to confirm the existence of hypothesized contacts in the unbound conformation, which has no known structure. Unfortunately, many of these contacts involve nucleotides that are not currently represented in the RFAM family. We look forward to analyzing these more complete alignments.

Figure 3: ECs identify functional interactions in riboswitches

3D contacts detected through using ECs are conserved across the RNA family, and may therefore be functionally important. In four riboswitches, the most significant tertiary interactions revealed by ECs are critical for stabilizing the ligand-bound conformation. In each example, a contact map (left) shows the top L/2 contacts. The circled contacts – which are highlighted red on the 3D structures (middle) – are formed in the ligand-bound state, but violated in the unbound state. This is illustrated by the schematics (right), which were reproduced from prior studies.

Proof of principle-folding

To understand the function of non-coding RNAs, it is critical to know their structure. Currently, a major obstacle in RNA 3D structure prediction is the limited knowledge of long-range contacts that bridge distant parts of the secondary structure. We hypothesized that ECs could provide the critical tertiary contacts that are necessary to fold RNAs. Using coarse-grained molecular dynamics implemented in NAST³¹ followed by simulated annealing with XPLOR⁴⁸, we predicted the all-atom structure for four RNA families (Fig. 4). Our predicted structures had an all atom RMSD of 6 – 12 Å, which is comparable to the state of the art for RNA structure predictions that have tertiary contacts derived from biochemical probing²³.

Figure 4: Blinded structure prediction

EC contacts provide enough structural information to predict the 3D structure of medium sized (70 – 120 nt) RNAs. The results from four predictions are shown here, with the true structure shown above (gray) and the predicted structure shown below (red).

Contact Predictions of RNAs of unknown 3D structure

We predicted the 3D contacts for 160 RNA genes represented in RFAM that have no 3D structure known for any member of the family. This includes members of the Group-II catalytic introns whose 3-dimensional arrangement has been elusive since their discovery over 25 years ago⁴⁹ Not surprisingly, the high ECs (red dots, Fig. 5A) capture the predicted secondary structure (green dots, Fig. 5A) but they also capture clusters of contacts that connect together the predicted helices into a more compact structure (black circles, Fig5A). Specifically, the EC pairs connect the end of stem loop 1 with the start of stem loop 3, end of stem loop 2 with end of stem loop 4 etc. Similar patterns in all the Group II intron predictions suggest compact globular structures that are independently known to exist. ECs for the small nucleolar RNA 25 connect the two prominent stem loops, predicting a compact folded RNA (Fig. 5B right panel). The prediction of the archeal RNAseP (left panel, Fig.5B) shows a similar 3D arrangement of the stem loops to those known for the other RNaseP families, though they are not sequences related enough to be put in same RFAM family, they are grouped into the same clan and their similar function suggests that the prediction is correct. Similarly, ECs of the SAM-IV riboswitch are similar to known functionally related SAM riboswitches, indirectly supporting the predicted 3D contacts (Fig. 5B middle panel).

Figure 5: Predicted contacts for six RNA families

We use ECs to predict contacts for 160 RNA families with no known structure. Six families are presented here, selected for the presence of interesting tertiary contacts, which are highlighted with black circles. Many of these contacts appear to bridge the loops at the end of hairpins, and may be informative for 3D structure prediction.

Many more riboswitches are thought to exist than are currently known, but detecting them computationally has been challenging as they are often distant in primary sequence though machine learning based approaches represent a promising advance^50,51. Here, we have discovered a conserved co-evolutionary motif among riboswitches in which tertiary contacts between stem-loops stabilize the ligand bound conformation and are broken in the unbound conformation. Searching for this pattern of evolutionary couplings could complement existing approaches for riboswitch detection.

Discussion

We show that the global statistical models used for protein contact prediction give more accurate contacts than standard approaches and more importantly reveal nucleotide contacts that are important for the RNA function. In particular, the inferred evolutionary couplings (ECs) reveal clusters of contacts between loops and helices that are strategically located for the formation of 3D structures. In many cases the co-evolutionary signal from these interactions is drowned out by transitivity when using a purely local correlation model such as mutual information (MI), which is independently calculated for each pair of positions.

This work uses RNA alignments available in the RFAM database and we expect that the information content of the alignments could be improved with algorithmic refinements, especially by careful extension beyond the boundaries of current RFAM domains. Fortunately, ECs for RNA can be calculated with far fewer sequences than for proteins, since the number of parameters in the model scales with the square of the number of residue states, which is just four for RNA as opposed to twenty.

RNA secondary structure, which is formed by strings of hydrogen-bonded base pair interactions, can be well predicted by a variety of methods, including local co-variation methods such as MI. However, RNA gene products can form other kinds of inter-nucleotide contacts that are important for its 3D structure and function. With such non-secondary structure contacts derived from maximum entropy co-variation analysis, we perform proof-of-principle folding that uses only ECs and secondary structure prediction from RFAM. The resulting structures in some cases are better than those computed using detailed experimental constraints, even without the use of fragments or canonical helical constructs.

For practical reasons, we have focused attention on small and medium size RNA species in the range of tens to a few hundred nucleotides in length. For the largest molecule analyzed here, the small ribosomal subunit with about 1800 nucleotides, we derived predicted contacts, outside of secondary structure helices, at a 2:1 correct:false ratio, with broad distribution over the known tertiary structure contacts. However, complete folding of this and the large subunit of the ribosome probably requires assistance of ribosomal proteins. Evolutionary coupling analysis between RNA and proteins is of interest in this context and beyond the scope of this report.

The inferred evolutionary couplings, for each pair of residue positions, have a numerical value reflecting the strength of the interaction that the particular pair contributes to the mutational correlations observed for all pairs. Plausibly, the extent of translational propagation of the basic direct interactions in the entire system is stronger in proteins than in RNA molecules. We were therefore surprised that the maximum entropy method for extracting causative interactions works well in about one third of the currently analyzed sequence-rich RNA families and leads to reasonably accurate predictions of 3D tertiary contacts and 3D atomic structures. With the rapid acquisition of genetic sequences, we expect many more functional RNA molecules to reveal interesting 3D structures via maximum entropy contact analysis.

Methods

Selection of RFAM families

To test whether ECs could predict tertiary structure contacts, we used RNA multiple sequence alignments from the RFAM 11.0 database⁵², removing columns with > 50% gaps. We restricted to families where the effective number of sequences (Meff, see below) was greater than 0.5L, where L is the number of columns in the alignment, yielding 244 families (see Supplementary Table 1). Of these, 21 aligned to a known structure in the PDB ⁵³. Data on these 21 structures are presented in Figure 1. For blind structure prediction, we removed structures with a length > 200 nt, as well as structures where the RNA of interest is part of a larger complex. Data on the remaining 13 structures are presented in Figure 3.

Computing ECs

We applied a maximum entropy model to identify evolutionarily coupled pairs of columns in the alignments as described previously³⁶. We inferred the parameters of our model using penalized Maximum Likelihood with a pseudo-likelihood approximation (pseudo-likelihood maximization; PLM)^36,54-57 rather than with a previously applied mean-field approximation^32,33,35,37. This method assumes that sequences are independent draws from an underlying distribution over sequence space. However, this assumption does not hold in reality, since many sequences are related by phylogeny. To account for this, we reweighted sequences in inverse proportion to their number over 80% similar neighbors. The sum of the resulting weights represents the effective number sequences (M_eff).

For regularization, we used an L2 penalty with λ_h = 0.01 for single column fields and λ_e = 20.0 for the pair couplings. We also applied an average product correction (APC)⁴¹ to account for differences in the entropy of each column.

Computing MI

To investigate how ECs compare to previous measures of co-evolution, we computed two versions of mutual information (MI). First we computed the raw MI (MI_R) as shown below, where f_i(A) = P(S_i = A) and f_ij(A, B) = P(S_i = A, S_j = B) for a sequence S in the alignment. EC scores differ from MI_R in three ways: (1) They rely on a global maximum entropy model; (2) They down-weight sequences with a greater phylogenetic representation in the alignment; (3) They include an APC correction. Since feature (1) is the focus of this study, we also computed an enhanced MI score (MI_E), which incorporates features (2) and (3), as has been done in previous work on RNA co-evolution⁴¹.

Annotating interactions

For each alignment, we investigated the top L/2 contacts with a chain-distance > 4. We first classified contacts as true-positives if the minimum-atom-distance from the crystal structure was < 8 Å These were classified according secondary structure distance (d_ss) and biochemical interaction type. The d_ss for a pair of bases is the length of the shortest path between them in a graph where nodes are bases and edges are either secondary-structure contacts or instances of adjacency on the chain. To compute d_ss, we used the consensus secondary-structure provided by RFAM, which is inferred using a profile stochastic context-free grammar⁵⁸. To classify contacts create random unfolded by their biochemical interaction type, we used crystal structure annotations from FR3D³¹ which were downloaded from RNA3DHub (http://rna.bgsu.edu/rna3dhub/).

3D structure prediction

We performed blind structure prediction using the Nucleic Acid Simulation Tool (NAST)⁵⁹. NAST is a coarse-grained modeling tool that uses a combination secondary structure and tertiary contacts as inputs. For each RNA family, we generated 200 random unfolded structures that satisfied the secondary structure constraints (Fig. 6A). Next, we performed molecular dynamics using tertiary structure restraints to generate candidate models (Fig. 6B). To obtain these tertiary structure restraints, we used the ¾*L contacts with the top EC scores, where L represents the length of the RNA. Since the resulting lists contained many intra-helical contacts that are not useful for folding, we removed all contacts with d_ss < 5. We next sought to eliminate tertiary structure restraints derived from false-positive contacts. To that end, we performed molecular dynamics using weak constraints to iteratively remove restraints that were consistently violated by the resulting structures (Fig. 6C), removing at most 15% of the using simulated annealing contacts in any one round. Contacts were defined to be violated when the average distance between the corresponding bases was > pipeline 15 Å. Once all contacts were satisfied, we clustered the 20% of structures with the lowest NAST energy (Fig. 6D) using Biopython, producing n = 4 clusters. From each cluster, we chose a representative with the lowest NAST energy and then created an all-atom structure by assembling fragments from the ribosome (Fig. 6E). Finally, we refined the all-atom models by simulated annealing with XPLOR (Fig. 6F).

Figure 6: Overview of folding pipeline

References

1.↵
Warf, M. B. & Berglund, J. A. Role of RNA structure in regulating pre-mRNA splicing. Trends in Biochemical Sciences 35, 169–178, doi:http://dx.doi.org/10.1016/j.tibs.2009.10.004 (2010).
OpenUrl CrossRef PubMed Web of Science
2.
McManus, C. J. & Graveley, B. R. RNA structure and the mechanisms of alternative splicing. Curr Opin Genet Dev 21, 373–379, doi:10.1016/j.gde.2011.04.001 (2011).
OpenUrl CrossRef PubMed
3.
Martin, K. C. & Ephrussi, A. mRNA localization: gene expression in the spatial dimension. Cell 136, 719–730, doi:10.1016/j.cell.2009.01.044 (2009).
OpenUrl CrossRef PubMed Web of Science
4.
Garneau, N. L., Wilusz, J. & Wilusz, C. J. The highways and byways of mRNA decay. Nat Rev Mol Cell Biol 8, 113–126, doi:10.1038/nrm2104 (2007).
OpenUrl CrossRef PubMed
5.↵
He, L. & Hannon, G. J. MicroRNAs: small RNAs with a big role in gene regulation. Nat Rev Genet 5, 522–531, doi:10.1038/nrg1379 (2004).
OpenUrl CrossRef PubMed
6.↵
Olsen, H. S., Nelbock, P., Cochrane, A. W. & Rosen, C. A. Secondary structure is the major determinant for interaction of HIV rev protein with RNA. Science 247, 845–848 (1990).
OpenUrl Abstract/FREE Full Text
7.↵
Plath, K., Mlynarczyk-Evans, S., Nusinow, D. A. & Panning, B. Xist RNA and the mechanism of X chromosome inactivation. Annu Rev Genet 36, 233–278, doi:10.1146/annurev.genet.36.042902.092433 (2002).
OpenUrl CrossRef PubMed Web of Science
8.↵
Maxwell, E. S. & Fournier, M. J. The small nucleolar RNAs. Annu Rev Biochem 64, 897–934, doi:10.1146/annurev.bi.64.070195.004341 (1995).
OpenUrl CrossRef PubMed Web of Science
9.↵
Eddy, S. R. Computational analysis of conserved RNA secondary structure in transcriptomes and genomes. Annu Rev Biophys 43, 433–456, doi:10.1146/annurev-biophys-051013-022950 (2014).
OpenUrl CrossRef PubMed
10.↵
Rinn, J. L. & Chang, H. Y. Genome regulation by long noncoding RNAs. Annu Rev Biochem 81, 145–166, doi:10.1146/annurev-biochem-051410-092902 (2012).
OpenUrl CrossRef PubMed Web of Science
11.↵
Quinodoz, S. & Guttman, M. Long noncoding RNAs: an emerging link between gene regulation and nuclear organization. Trends Cell Biol 24, 651–663, doi:10.1016/j.tcb.2014.08.009 (2014).
OpenUrl CrossRef
12.↵
Wan, Y. et al. Landscape and variation of RNA secondary structure across the human transcriptome. Nature 505, 706–709, doi:10.1038/nature12946 (2014).
OpenUrl CrossRef PubMed Web of Science
13.
Spitale, R. C. et al. Structural imprints in vivo decode RNA regulatory mechanisms. Nature 519, 486–490, doi:10.1038/nature14263 (2015).
OpenUrl CrossRef PubMed
14.
Ding, Y. et al. In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features. Nature 505, 696–700, doi:10.1038/nature12756 (2014).
OpenUrl CrossRef PubMed Web of Science
15.↵
Rouskin, S., Zubradt, M., Washietl, S., Kellis, M. & Weissman, J. S. Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo. Nature 505, 701–705, doi:10.1038/nature12894 (2014).
OpenUrl CrossRef PubMed Web of Science
16.↵
Mortimer, S. A., Kidwell, M. A. & Doudna, J. A. Insights into RNA structure and function from genome-wide studies. Nat Rev Genet 15, 469–479, doi:10.1038/nrg3681 (2014).
OpenUrl CrossRef PubMed
17.↵
Novikova, I. V., Hennelly, S. P. & Sanbonmatsu, K. Y. Sizing up long non-coding RNAs: do lncRNAs have secondary and tertiary structure? Bioarchitecture 2, 189–199, doi:10.4161/bioa.22592 (2012).
OpenUrl CrossRef PubMed
18.↵
Parisien, M. & Major, F. The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data. Nature 452, 51–55, doi:10.1038/nature06684 (2008).
OpenUrl CrossRef PubMed Web of Science
19.
Frellsen, J. et al. A probabilistic model of RNA conformational space. PLoS Comput Biol 5, e1000406, doi:10.1371/journal.pcbi.1000406 (2009).
OpenUrl CrossRef PubMed
20.
Das, R., Karanicolas, J. & Baker, D. Atomic accuracy in predicting and designing noncanonical RNA structure. Nat Methods 7, 291–294, doi:10.1038/nmeth.1433 (2010).
OpenUrl CrossRef PubMed Web of Science
21.
Das, R. & Baker, D. Automated de novo prediction of native-like RNA tertiary structures. Proc Natl Acad Sci U S A 104, 14664–14669, doi:10.1073/pnas.0703836104 (2007).
OpenUrl Abstract/FREE Full Text
22.↵
Cao, S. & Chen, S.-J. Physics-based de novo prediction of RNA 3D structures. The journal of physical chemistry. B 115, 4216–4226, doi:10.1021/jp112059y (2011).
OpenUrl CrossRef PubMed
23.↵
Miao, Z. et al. RNA-Puzzles Round II: assessment of RNA structure prediction programs applied to three large RNA structures. RNA 21, 1066–1084, doi:10.1261/rna.049502.114 (2015).
OpenUrl Abstract/FREE Full Text
24.↵
Magnus, M. et al. Computational modeling of RNA 3D structures, with the aid of experimental restraints. RNA Biol 11, 522–536, doi:10.4161/rna.28826 (2014).
OpenUrl CrossRef PubMed
25.↵
Ramani, V., Qiu, R. & Shendure, J. High-throughput determination of RNA structure by proximity ligation. Nat Biotechnol, doi:10.1038/nbt.3289 (2015).
OpenUrl CrossRef PubMed
26.↵
Cheng, C. Y. et al. Consistent global structures of complex RNA states through multidimensional chemical mapping. Elife 4, e07600, doi:10.7554/eLife.07600 (2015).
OpenUrl CrossRef PubMed
27.↵
Laing, C. & Schlick, T. Computational approaches to 3D modeling of RNA. J Phys Condens Matter 22, 283101, doi:10.1088/0953-8984/22/28/283101 (2010).
OpenUrl CrossRef PubMed
29.↵
Shang, L., Xu, W., Ozer, S. & Gutell, R. R. Structural constraints identified with covariation analysis in ribosomal RNA. PLoS One 7, e39383, doi:10.1371/journal.pone.0039383 (2012).
OpenUrl CrossRef PubMed
29.↵
Pang, P. S., Jankowsky, E., Wadley, L. M. & Pyle, A. M. Prediction of functional tertiary interactions and intermolecular interfaces from primary sequence data. J Exp Zool B Mol Dev Evol 304, 50–63, doi:10.1002/jez.b.21024 (2005).
OpenUrl CrossRef
30.↵
Mokdad, A. & Frankel, A. D. ISFOLD: structure prediction of base pairs in non-helical RNA motifs from isostericity signatures in their sequence alignments. J Biomol Struct Dyn 25, 467–472, doi:10.1080/07391102.2008.10531239 (2008).
OpenUrl CrossRef PubMed
31.↵
Butcher, S. E. & Pyle, A. M. The molecular interactions that stabilize RNA tertiary structure: RNA motifs, patterns, and networks. Acc Chem Res 44, 1302–1311, doi:10.1021/ar200098t (2011).
OpenUrl CrossRef PubMed Web of Science
32.↵
Marks, D. S., Hopf, T. A. & Sander, C. Protein structure prediction from sequence variation. Nat Biotechnol 30, 1072–1080, doi:10.1038/nbt.2419 (2012).
OpenUrl CrossRef PubMed
33.↵
Morcos, F. et al. Direct-coupling analysis of residue coevolution captures native contacts across many protein families. Proc Natl Acad Sci U S A 108, E1293–1301, doi:10.1073/pnas.1111471108 (2011).
OpenUrl Abstract/FREE Full Text
34.↵
Weigt, M., White, R. A., Szurmant, H., Hoch, J. A. & Hwa, T. Identification of direct residue contacts in protein-protein interaction by message passing. Proc Natl Acad Sci US A 106, 67–72, doi:10.1073/pnas.0805923106 (2009).
OpenUrl Abstract/FREE Full Text
35.↵
Hopf, T. A. et al. Three-dimensional structures of membrane proteins from genomic sequencing. Cell 149, 1607–1621, doi:10.1016/j.cell.2012.04.012 (2012).
OpenUrl CrossRef PubMed Web of Science
36.↵
Hopf, T. A. et al. Sequence co-evolution gives 3D contacts and structures of protein complexes. Elife 3, doi:10.7554/eLife.03430 (2014).
OpenUrl CrossRef PubMed
37.↵
Marks, D. S. et al. Protein 3D structure computed from evolutionary sequence variation. PLoS One 6, e28766, doi:10.1371/journal.pone.0028766 (2011).
OpenUrl CrossRef PubMed
38.↵
Ovchinnikov, S., Kamisetty, H. & Baker, D. Robust and accurate prediction of residue-residue interactions across protein interfaces using evolutionary information. Elife 3, e02030, doi:10.7554/eLife.02030 (2014).
OpenUrl CrossRef PubMed
39.↵
Freyhult, E., Moulton, V. & Gardner, P. Predicting RNA structure using mutual information. Appl Bioinformatics 4, 53–59 (2005).
OpenUrl CrossRef PubMed
40.↵
Gutell, R. R., Power, A., Hertz, G. Z., Putz, E. J. & Stormo, G. D. Identifying constraints on the higher-order structure of RNA: continued development and application of comparative sequence analysis methods. Nucleic Acids Res 20, 5785–5795 (1992).
OpenUrl CrossRef PubMed Web of Science
41.↵
Dunn, S. D., Wahl, L. M. & Gloor, G. B. Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction. Bioinformatics 24, 333–340, doi:10.1093/bioinformatics/btm604 (2008).
OpenUrl CrossRef PubMed Web of Science
42.↵
Dutheil, J. Y., Jossinet, F. & Westhof, E. Base pairing constraints drive structural epistasis in ribosomal RNA sequences. Mol Biol Evol 27, 1868–1876, doi:10.1093/molbev/msq069 (2010).
OpenUrl CrossRef PubMed Web of Science
43.↵
Garst, A. D., Edwards, A. L. & Batey, R. T. Riboswitches: structures and mechanisms. Cold Spring Harb Perspect Biol 3, doi:10.1101/cshperspect.a003533 (2011).
OpenUrl Abstract/FREE Full Text
44.
Reining, A. et al. Three-state mechanism couples ligand and temperature sensing in riboswitches. Nature 499, 355–359, doi:10.1038/nature12378 (2013).
OpenUrl CrossRef PubMed Web of Science
45.↵
Serganov, A. & Patel, D. J. Molecular recognition and function of riboswitches. Curr Opin Struct Biol 22, 279–286, doi:10.1016/j.sbi.2012.04.005 (2012).
OpenUrl CrossRef PubMed
46.↵
Serganov, A., Polonskaia, A., Phan, A. T., Breaker, R. R. & Patel, D. J. Structural basis for gene regulation by a thiamine pyrophosphate-sensing riboswitch. Nature 441, 1167–1171, doi:10.1038/nature04740 (2006).
OpenUrl CrossRef PubMed Web of Science
47.↵
Lu, C. et al. SAM recognition and conformational switching mechanism in the Bacillus subtilis yitJ S box/SAM-I riboswitch. J Mol Biol 404, 803–818, doi:10.1016/j.jmb.2010.09.059 (2010).
OpenUrl CrossRef PubMed Web of Science
48.↵
Schwieters, C. D., Kuszewski, J. J., Tjandra, N. & Clore, G. M. The Xplor-NIH NMR molecular structure determination package. JMagn Reson 160, 65–73 (2003).
OpenUrl CrossRef PubMed Web of Science
49.↵
Michel, F., Umesono, K. & Ozeki, H. Comparative and functional anatomy of group II catalytic introns--a review. Gene 82, 5–30 (1989).
OpenUrl CrossRef PubMed Web of Science
50.↵
Havill, J. T., Bhatiya, C., Johnson, S. M., Sheets, J. D. & Thompson, J. S. A new approach for detecting riboswitches in DNA sequences. Bioinformatics 30, 3012–3019, doi:10.1093/bioinformatics/btu479 (2014).
OpenUrl CrossRef PubMed
51.↵
Chang, T. H. et al. Computational identification of riboswitches based on RNA conserved functional sequences and conformations. RNA 15, 1426–1430, doi:10.1261/rna.1623809 (2009).
OpenUrl Abstract/FREE Full Text
52.↵
Burge, S. W. et al. Rfam 11.0: 10 years of RNA families. Nucleic Acids Res 41, D226–232, doi:10.1093/nar/gks1005 (2013).
OpenUrl CrossRef PubMed Web of Science
53.↵
Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res 28, 235–242 (2000).
OpenUrl CrossRef PubMed Web of Science
54.↵
Balakrishnan, S., Kamisetty, H., Carbonell, J. G., Lee, S. I. & Langmead, C. J. Learning generative models for protein fold families. Proteins 79, 1061–1078, doi:10.1002/prot.22934 (2011).
OpenUrl CrossRef PubMed Web of Science
55.
Ekeberg, M., Lovkvist, C., Lan, Y., Weigt, M. & Aurell, E. Improved contact prediction in proteins: using pseudolikelihoods to infer Potts models. Phys Rev E Stat Nonlin Soft Matter Phys 87, 012707 (2013).
OpenUrl CrossRef PubMed
56.
Aurell, E. & Ekeberg, M. Inverse Ising inference using all the data. Phys Rev Lett 108, 090201 (2012).
OpenUrl CrossRef PubMed
57.↵
Kamisetty, H., Ovchinnikov, S. & Baker, D. Assessing the utility of coevolution-based residue-residue contact predictions in a sequence- and structure-rich era. Proc Natl Acad Sci U S A 110, 15674–15679, doi:10.1073/pnas.1314045110 (2013).
OpenUrl Abstract/FREE Full Text
58.↵
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935, doi:10.1093/bioinformatics/btt509 (2013).
OpenUrl CrossRef PubMed Web of Science
59.↵
Jonikas, M. A. et al. Coarse-grained modeling of large RNA molecules with knowledge-based potentials and structural filters. RNA 15, 189–199, doi:10.1261/rna.1270809 (2009).
OpenUrl Abstract/FREE Full Text

View the discussion thread.

Posted October 06, 2015.

Download PDF

Citation Tools

Subject Area

Bioinformatics

Subject Areas

All Articles

Animal Behavior and Cognition (5204)
Biochemistry (11718)
Bioengineering (8724)
Bioinformatics (29132)
Biophysics (14937)
Cancer Biology (12052)
Cell Biology (17362)
Clinical Trials (138)
Developmental Biology (9407)
Ecology (14146)
Epidemiology (2067)
Evolutionary Biology (18270)
Genetics (12223)
Genomics (16768)
Immunology (11844)
Microbiology (28016)
Molecular Biology (11560)
Neuroscience (60841)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10405)
Scientific Communication and Education (1681)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] 1.↵
Warf, M. B. & Berglund, J. A. Role of RNA structure in regulating pre-mRNA splicing. Trends in Biochemical Sciences 35, 169–178, doi:http://dx.doi.org/10.1016/j.tibs.2009.10.004 (2010).
OpenUrl CrossRef PubMed Web of Science

[2] 2.
McManus, C. J. & Graveley, B. R. RNA structure and the mechanisms of alternative splicing. Curr Opin Genet Dev 21, 373–379, doi:10.1016/j.gde.2011.04.001 (2011).
OpenUrl CrossRef PubMed

[3] 3.
Martin, K. C. & Ephrussi, A. mRNA localization: gene expression in the spatial dimension. Cell 136, 719–730, doi:10.1016/j.cell.2009.01.044 (2009).
OpenUrl CrossRef PubMed Web of Science

[4] 4.
Garneau, N. L., Wilusz, J. & Wilusz, C. J. The highways and byways of mRNA decay. Nat Rev Mol Cell Biol 8, 113–126, doi:10.1038/nrm2104 (2007).
OpenUrl CrossRef PubMed

[5] 5.↵
He, L. & Hannon, G. J. MicroRNAs: small RNAs with a big role in gene regulation. Nat Rev Genet 5, 522–531, doi:10.1038/nrg1379 (2004).
OpenUrl CrossRef PubMed

[6] 6.↵
Olsen, H. S., Nelbock, P., Cochrane, A. W. & Rosen, C. A. Secondary structure is the major determinant for interaction of HIV rev protein with RNA. Science 247, 845–848 (1990).
OpenUrl Abstract/FREE Full Text

[7] 7.↵
Plath, K., Mlynarczyk-Evans, S., Nusinow, D. A. & Panning, B. Xist RNA and the mechanism of X chromosome inactivation. Annu Rev Genet 36, 233–278, doi:10.1146/annurev.genet.36.042902.092433 (2002).
OpenUrl CrossRef PubMed Web of Science

[8] 8.↵
Maxwell, E. S. & Fournier, M. J. The small nucleolar RNAs. Annu Rev Biochem 64, 897–934, doi:10.1146/annurev.bi.64.070195.004341 (1995).
OpenUrl CrossRef PubMed Web of Science

[9] 9.↵
Eddy, S. R. Computational analysis of conserved RNA secondary structure in transcriptomes and genomes. Annu Rev Biophys 43, 433–456, doi:10.1146/annurev-biophys-051013-022950 (2014).
OpenUrl CrossRef PubMed

[10] 10.↵
Rinn, J. L. & Chang, H. Y. Genome regulation by long noncoding RNAs. Annu Rev Biochem 81, 145–166, doi:10.1146/annurev-biochem-051410-092902 (2012).
OpenUrl CrossRef PubMed Web of Science

[11] 11.↵
Quinodoz, S. & Guttman, M. Long noncoding RNAs: an emerging link between gene regulation and nuclear organization. Trends Cell Biol 24, 651–663, doi:10.1016/j.tcb.2014.08.009 (2014).
OpenUrl CrossRef

[12] 12.↵
Wan, Y. et al. Landscape and variation of RNA secondary structure across the human transcriptome. Nature 505, 706–709, doi:10.1038/nature12946 (2014).
OpenUrl CrossRef PubMed Web of Science

[13] 13.
Spitale, R. C. et al. Structural imprints in vivo decode RNA regulatory mechanisms. Nature 519, 486–490, doi:10.1038/nature14263 (2015).
OpenUrl CrossRef PubMed

[14] 14.
Ding, Y. et al. In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features. Nature 505, 696–700, doi:10.1038/nature12756 (2014).
OpenUrl CrossRef PubMed Web of Science

[15] 15.↵
Rouskin, S., Zubradt, M., Washietl, S., Kellis, M. & Weissman, J. S. Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo. Nature 505, 701–705, doi:10.1038/nature12894 (2014).
OpenUrl CrossRef PubMed Web of Science

[16] 16.↵
Mortimer, S. A., Kidwell, M. A. & Doudna, J. A. Insights into RNA structure and function from genome-wide studies. Nat Rev Genet 15, 469–479, doi:10.1038/nrg3681 (2014).
OpenUrl CrossRef PubMed

[17] 17.↵
Novikova, I. V., Hennelly, S. P. & Sanbonmatsu, K. Y. Sizing up long non-coding RNAs: do lncRNAs have secondary and tertiary structure? Bioarchitecture 2, 189–199, doi:10.4161/bioa.22592 (2012).
OpenUrl CrossRef PubMed

[18] 18.↵
Parisien, M. & Major, F. The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data. Nature 452, 51–55, doi:10.1038/nature06684 (2008).
OpenUrl CrossRef PubMed Web of Science

[19] 19.
Frellsen, J. et al. A probabilistic model of RNA conformational space. PLoS Comput Biol 5, e1000406, doi:10.1371/journal.pcbi.1000406 (2009).
OpenUrl CrossRef PubMed

[20] 20.
Das, R., Karanicolas, J. & Baker, D. Atomic accuracy in predicting and designing noncanonical RNA structure. Nat Methods 7, 291–294, doi:10.1038/nmeth.1433 (2010).
OpenUrl CrossRef PubMed Web of Science

[21] 21.
Das, R. & Baker, D. Automated de novo prediction of native-like RNA tertiary structures. Proc Natl Acad Sci U S A 104, 14664–14669, doi:10.1073/pnas.0703836104 (2007).
OpenUrl Abstract/FREE Full Text

[22] 22.↵
Cao, S. & Chen, S.-J. Physics-based de novo prediction of RNA 3D structures. The journal of physical chemistry. B 115, 4216–4226, doi:10.1021/jp112059y (2011).
OpenUrl CrossRef PubMed

[23] 23.↵
Miao, Z. et al. RNA-Puzzles Round II: assessment of RNA structure prediction programs applied to three large RNA structures. RNA 21, 1066–1084, doi:10.1261/rna.049502.114 (2015).
OpenUrl Abstract/FREE Full Text

[24] 24.↵
Magnus, M. et al. Computational modeling of RNA 3D structures, with the aid of experimental restraints. RNA Biol 11, 522–536, doi:10.4161/rna.28826 (2014).
OpenUrl CrossRef PubMed

[25] 25.↵
Ramani, V., Qiu, R. & Shendure, J. High-throughput determination of RNA structure by proximity ligation. Nat Biotechnol, doi:10.1038/nbt.3289 (2015).
OpenUrl CrossRef PubMed

[26] 26.↵
Cheng, C. Y. et al. Consistent global structures of complex RNA states through multidimensional chemical mapping. Elife 4, e07600, doi:10.7554/eLife.07600 (2015).
OpenUrl CrossRef PubMed

[27] 27.↵
Laing, C. & Schlick, T. Computational approaches to 3D modeling of RNA. J Phys Condens Matter 22, 283101, doi:10.1088/0953-8984/22/28/283101 (2010).
OpenUrl CrossRef PubMed

[28] 29.↵
Shang, L., Xu, W., Ozer, S. & Gutell, R. R. Structural constraints identified with covariation analysis in ribosomal RNA. PLoS One 7, e39383, doi:10.1371/journal.pone.0039383 (2012).
OpenUrl CrossRef PubMed

[29] 29.↵
Pang, P. S., Jankowsky, E., Wadley, L. M. & Pyle, A. M. Prediction of functional tertiary interactions and intermolecular interfaces from primary sequence data. J Exp Zool B Mol Dev Evol 304, 50–63, doi:10.1002/jez.b.21024 (2005).
OpenUrl CrossRef

[30] 30.↵
Mokdad, A. & Frankel, A. D. ISFOLD: structure prediction of base pairs in non-helical RNA motifs from isostericity signatures in their sequence alignments. J Biomol Struct Dyn 25, 467–472, doi:10.1080/07391102.2008.10531239 (2008).
OpenUrl CrossRef PubMed

[31] 31.↵
Butcher, S. E. & Pyle, A. M. The molecular interactions that stabilize RNA tertiary structure: RNA motifs, patterns, and networks. Acc Chem Res 44, 1302–1311, doi:10.1021/ar200098t (2011).
OpenUrl CrossRef PubMed Web of Science

[32] 32.↵
Marks, D. S., Hopf, T. A. & Sander, C. Protein structure prediction from sequence variation. Nat Biotechnol 30, 1072–1080, doi:10.1038/nbt.2419 (2012).
OpenUrl CrossRef PubMed

[33] 33.↵
Morcos, F. et al. Direct-coupling analysis of residue coevolution captures native contacts across many protein families. Proc Natl Acad Sci U S A 108, E1293–1301, doi:10.1073/pnas.1111471108 (2011).
OpenUrl Abstract/FREE Full Text

[34] 34.↵
Weigt, M., White, R. A., Szurmant, H., Hoch, J. A. & Hwa, T. Identification of direct residue contacts in protein-protein interaction by message passing. Proc Natl Acad Sci US A 106, 67–72, doi:10.1073/pnas.0805923106 (2009).
OpenUrl Abstract/FREE Full Text

[35] 35.↵
Hopf, T. A. et al. Three-dimensional structures of membrane proteins from genomic sequencing. Cell 149, 1607–1621, doi:10.1016/j.cell.2012.04.012 (2012).
OpenUrl CrossRef PubMed Web of Science

[36] 36.↵
Hopf, T. A. et al. Sequence co-evolution gives 3D contacts and structures of protein complexes. Elife 3, doi:10.7554/eLife.03430 (2014).
OpenUrl CrossRef PubMed

[37] 37.↵
Marks, D. S. et al. Protein 3D structure computed from evolutionary sequence variation. PLoS One 6, e28766, doi:10.1371/journal.pone.0028766 (2011).
OpenUrl CrossRef PubMed

[38] 38.↵
Ovchinnikov, S., Kamisetty, H. & Baker, D. Robust and accurate prediction of residue-residue interactions across protein interfaces using evolutionary information. Elife 3, e02030, doi:10.7554/eLife.02030 (2014).
OpenUrl CrossRef PubMed

[39] 39.↵
Freyhult, E., Moulton, V. & Gardner, P. Predicting RNA structure using mutual information. Appl Bioinformatics 4, 53–59 (2005).
OpenUrl CrossRef PubMed

[40] 40.↵
Gutell, R. R., Power, A., Hertz, G. Z., Putz, E. J. & Stormo, G. D. Identifying constraints on the higher-order structure of RNA: continued development and application of comparative sequence analysis methods. Nucleic Acids Res 20, 5785–5795 (1992).
OpenUrl CrossRef PubMed Web of Science

[41] 41.↵
Dunn, S. D., Wahl, L. M. & Gloor, G. B. Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction. Bioinformatics 24, 333–340, doi:10.1093/bioinformatics/btm604 (2008).
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
Dutheil, J. Y., Jossinet, F. & Westhof, E. Base pairing constraints drive structural epistasis in ribosomal RNA sequences. Mol Biol Evol 27, 1868–1876, doi:10.1093/molbev/msq069 (2010).
OpenUrl CrossRef PubMed Web of Science

[43] 43.↵
Garst, A. D., Edwards, A. L. & Batey, R. T. Riboswitches: structures and mechanisms. Cold Spring Harb Perspect Biol 3, doi:10.1101/cshperspect.a003533 (2011).
OpenUrl Abstract/FREE Full Text

[44] 44.
Reining, A. et al. Three-state mechanism couples ligand and temperature sensing in riboswitches. Nature 499, 355–359, doi:10.1038/nature12378 (2013).
OpenUrl CrossRef PubMed Web of Science

[45] 45.↵
Serganov, A. & Patel, D. J. Molecular recognition and function of riboswitches. Curr Opin Struct Biol 22, 279–286, doi:10.1016/j.sbi.2012.04.005 (2012).
OpenUrl CrossRef PubMed

[46] 46.↵
Serganov, A., Polonskaia, A., Phan, A. T., Breaker, R. R. & Patel, D. J. Structural basis for gene regulation by a thiamine pyrophosphate-sensing riboswitch. Nature 441, 1167–1171, doi:10.1038/nature04740 (2006).
OpenUrl CrossRef PubMed Web of Science

[47] 47.↵
Lu, C. et al. SAM recognition and conformational switching mechanism in the Bacillus subtilis yitJ S box/SAM-I riboswitch. J Mol Biol 404, 803–818, doi:10.1016/j.jmb.2010.09.059 (2010).
OpenUrl CrossRef PubMed Web of Science

[48] 48.↵
Schwieters, C. D., Kuszewski, J. J., Tjandra, N. & Clore, G. M. The Xplor-NIH NMR molecular structure determination package. JMagn Reson 160, 65–73 (2003).
OpenUrl CrossRef PubMed Web of Science

[49] 49.↵
Michel, F., Umesono, K. & Ozeki, H. Comparative and functional anatomy of group II catalytic introns--a review. Gene 82, 5–30 (1989).
OpenUrl CrossRef PubMed Web of Science

[50] 50.↵
Havill, J. T., Bhatiya, C., Johnson, S. M., Sheets, J. D. & Thompson, J. S. A new approach for detecting riboswitches in DNA sequences. Bioinformatics 30, 3012–3019, doi:10.1093/bioinformatics/btu479 (2014).
OpenUrl CrossRef PubMed

[51] 51.↵
Chang, T. H. et al. Computational identification of riboswitches based on RNA conserved functional sequences and conformations. RNA 15, 1426–1430, doi:10.1261/rna.1623809 (2009).
OpenUrl Abstract/FREE Full Text

[52] 52.↵
Burge, S. W. et al. Rfam 11.0: 10 years of RNA families. Nucleic Acids Res 41, D226–232, doi:10.1093/nar/gks1005 (2013).
OpenUrl CrossRef PubMed Web of Science

[53] 53.↵
Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res 28, 235–242 (2000).
OpenUrl CrossRef PubMed Web of Science

[54] 54.↵
Balakrishnan, S., Kamisetty, H., Carbonell, J. G., Lee, S. I. & Langmead, C. J. Learning generative models for protein fold families. Proteins 79, 1061–1078, doi:10.1002/prot.22934 (2011).
OpenUrl CrossRef PubMed Web of Science

[55] 55.
Ekeberg, M., Lovkvist, C., Lan, Y., Weigt, M. & Aurell, E. Improved contact prediction in proteins: using pseudolikelihoods to infer Potts models. Phys Rev E Stat Nonlin Soft Matter Phys 87, 012707 (2013).
OpenUrl CrossRef PubMed

[56] 56.
Aurell, E. & Ekeberg, M. Inverse Ising inference using all the data. Phys Rev Lett 108, 090201 (2012).
OpenUrl CrossRef PubMed

[57] 57.↵
Kamisetty, H., Ovchinnikov, S. & Baker, D. Assessing the utility of coevolution-based residue-residue contact predictions in a sequence- and structure-rich era. Proc Natl Acad Sci U S A 110, 15674–15679, doi:10.1073/pnas.1314045110 (2013).
OpenUrl Abstract/FREE Full Text

[58] 58.↵
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935, doi:10.1093/bioinformatics/btt509 (2013).
OpenUrl CrossRef PubMed Web of Science

[59] 59.↵
Jonikas, M. A. et al. Coarse-grained modeling of large RNA molecules with knowledge-based potentials and structural filters. RNA 15, 189–199, doi:10.1261/rna.1270809 (2009).
OpenUrl Abstract/FREE Full Text