1. A Polymer Physics Framework for the Entropy of Arbitrary Pseudoknots
- Author
-
Michael Brenner, Ofer Kimchi, Lucy J. Colwell, and Tristan Cragnolini
- Subjects
0303 health sciences ,Polymers ,Computer science ,Entropy ,Biophysics ,Energy landscape ,Articles ,Nucleic acid secondary structure ,03 medical and health sciences ,0302 clinical medicine ,Loop entropy ,Enumeration ,Nucleic Acid Conformation ,RNA ,Thermodynamics ,Polymer physics ,Entropy (information theory) ,Heuristics ,Algorithm ,Algorithms ,030217 neurology & neurosurgery ,030304 developmental biology ,Parametric statistics - Abstract
The accurate prediction of RNA secondary structure from primary sequence has had enormous impact on research from the past 40 years. Although many algorithms are available to make these predictions, the inclusion of non-nested loops, termed pseudoknots, still poses challenges arising from two main factors: 1) no physical model exists to estimate the loop entropies of complex intramolecular pseudoknots, and 2) their NP-complete enumeration has impeded their study. Here, we address both challenges. First, we develop a polymer physics model that can address arbitrarily complex pseudoknots using only two parameters corresponding to concrete physical quantities—over an order of magnitude fewer than the sparsest state-of-the-art phenomenological methods. Second, by coupling this model to exhaustive enumeration of the set of possible structures, we compute the entire free energy landscape of secondary structures resulting from a primary RNA sequence. We demonstrate that for RNA structures of ∼80 nucleotides, with minimal heuristics, the complete enumeration of possible secondary structures can be accomplished quickly despite the NP-complete nature of the problem. We further show that despite our loop entropy model’s parametric sparsity, it performs better than or on par with previously published methods in predicting both pseudoknotted and non-pseudoknotted structures on a benchmark data set of RNA structures of ≤80 nucleotides. We suggest ways in which the accuracy of the model can be further improved.
- Published
- 2019