41 results on '"Huang, Fenix W."'
Search Results
2. The energy-spectrum of bicompatible sequences
- Author
-
Huang, Fenix W., Barrett, Christopher L., and Reidys, Christian M.
- Subjects
Quantitative Biology - Biomolecules ,Mathematics - Combinatorics - Abstract
Background: Genotype-phenotype maps provide a meaningful filtration of sequence space and RNA secondary structures are particular such phenotypes. Compatible sequences i.e.~sequences that satisfy the base pairing constraints of a given RNA structure play an important role in the context of neutral networks and inverse folding. Sequences satisfying the constraints of two structures simultaneously are called bicompatible and phenotypic change, induced by erroneously replicating populations of RNA sequences, is closely connected to bicompatibility. Furthermore, bicompatible sequences are relevant for riboswitch sequences, beacons of evolution, realizing two distinct phenotypes. Results: We present a full loop energy model Boltzmann sampler of bicompatible sequences for pairs of structures. The novel dynamic programming algorithm is based on a topological framework encapsulating the relations between loops. We utilize our sequence sampler to study the energy spectra and density of bicompatible sequences, the rankings of the structures and key properties for evolutionary transitions. Conclusion: Our analysis of riboswitch sequences shows that key properties of bicompatible sequences depend on the particular pair of structures. While there always exist bicompatible sequences for random structure pairs, they are less suited to facilitate transitions. We show that native riboswitch sequences exhibit a distinct signature with regards to the ranking of their two phenotypes relative to the minimum free energy, suggesting a new criterion for identifying native sequences and sequences subjected to evolutionary pressure., Comment: 20 pages, 10 Figures
- Published
- 2019
3. Genetic robustness of let-7 miRNA sequence-structure pairs
- Author
-
He, Qijun, Huang, Fenix W., Barrett, Christopher, and Reidys, Christian M.
- Subjects
Quantitative Biology - Populations and Evolution ,Mathematics - Combinatorics ,05A99 - Abstract
Genetic robustness, the preservation of evolved phenotypes against genotypic mutations, is one of the central concepts in evolution. In recent years a large body of work has focused on the origins, mechanisms, and consequences of robustness in a wide range of biological systems. In particular, research on ncRNAs studied the ability of sequences to maintain folded structures against single-point mutations. In these studies, the structure is merely a reference. However, recent work revealed evidence that structure itself contributes to the genetic robustness of ncRNAs. We follow this line of thought and consider sequence-structure pairs as the unit of evolution and introduce the spectrum of inverse folding rates (IFR-spectrum) as a measurement of genetic robustness. Our analysis of the miRNA let-7 family captures key features of structure-modulated evolution and facilitates the study of robustness against multiple-point mutations., Comment: 9 pages 7 figures
- Published
- 2018
4. An efficient dual sampling algorithm with Hamming distance filtration
- Author
-
Huang, Fenix W., He, Qijun, Barrett, Christopher, and Reidys, Christian M.
- Subjects
Quantitative Biology - Biomolecules ,Mathematics - Combinatorics - Abstract
Recently, a framework considering RNA sequences and their RNA secondary structures as pairs, led to some information-theoretic perspectives on how the semantics encoded in RNA sequences can be inferred. In this context, the pairing arises naturally from the energy model of RNA secondary structures. Fixing the sequence in the pairing produces the RNA energy landscape, whose partition function was discovered by McCaskill. Dually, fixing the structure induces the energy landscape of sequences. The latter has been considered for designing more efficient inverse folding algorithms. We present here the Hamming distance filtered, dual partition function, together with a Boltzmann sampler using novel dynamic programming routines for the loop-based energy model. The time complexity of the algorithm is $O(h^2n)$, where $h,n$ are Hamming distance and sequence length, respectively, reducing the time complexity of samplers, reported in the literature by $O(n^2)$. We then present two applications, the first being in the context of the evolution of natural sequence-structure pairs of microRNAs and the second constructing neutral paths. The former studies the inverse fold rate (IFR) of sequence-structure pairs, filtered by Hamming distance, observing that such pairs evolve towards higher levels of robustness, i.e.,~increasing IFR. The latter is an algorithm that constructs neutral paths: given two sequences in a neutral network, we employ the sampler in order to construct short paths connecting them, consisting of sequences all contained in the neutral network., Comment: 8 pages 6 figures
- Published
- 2017
5. Topological language for RNA
- Author
-
Huang, Fenix W. D. and Reidys, Christian M.
- Subjects
Computer Science - Formal Languages and Automata Theory ,Quantitative Biology - Biomolecules - Abstract
In this paper we introduce a novel, context-free grammar, {\it RNAFeatures$^*$}, capable of generating any RNA structure including pseudoknot structures (pk-structure). We represent pk-structures as orientable fatgraphs, which naturally leads to a filtration by their topological genus. Within this framework, RNA secondary structures correspond to pk-structures of genus zero. {\it RNAFeatures$^*$} acts on formal, arc-labeled RNA secondary structures, called $\lambda$-structures. $\lambda$-structures correspond one-to-one to pk-structures together with some additional information. This information consists of the specific rearrangement of the backbone, by which a pk-structure can be made cross-free. {\it RNAFeatures$^*$} is an extension of the grammar for secondary structures and employs an enhancement by labelings of the symbols as well as the production rules. We discuss how to use {\it RNAFeatures$^*$} to obtain a stochastic context-free grammar for pk-structures, using data of RNA sequences and structures. The induced grammar facilitates fast Boltzmann sampling and statistical analysis. As a first application, we present an $O(n log(n))$ runtime algorithm which samples pk-structures based on ninety tRNA sequences and structures from the Nucleic Acid Database (NDB)., Comment: 29 pages, 13 figures, 1 table
- Published
- 2016
6. Sequence-structure relations of biopolymers
- Author
-
Barrett, Christopher, Huang, Fenix W., and Reidys, Christian M.
- Subjects
Mathematics - Combinatorics ,Physics - Biological Physics ,Quantitative Biology - Biomolecules - Abstract
Motivation: DNA data is transcribed into single-stranded RNA, which folds into specific molecular structures. In this paper we pose the question to what extent sequence- and structure-information correlate. We view this correlation as structural semantics of sequence data that allows for a different interpretation than conventional sequence alignment. Structural semantics could enable us to identify more general embedded "patterns" in DNA and RNA sequences. Results: We compute the partition function of sequences with respect to a fixed structure and connect this computation to the mutual information of a sequence-structure pair for RNA secondary structures. We present a Boltzmann sampler and obtain the a priori probability of specific sequence patterns. We present a detailed analysis for the three PDB-structures, 2JXV (hairpin), 2N3R (3-branch multi-loop) and 1EHZ (tRNA). We localize specific sequence patterns, contrast the energy spectrum of the Boltzmann sampled sequences versus those sequences that refold into the same structure and derive a criterion to identify native structures. We illustrate that there are multiple sequences in the partition function of a fixed structure, each having nearly the same mutual information, that are nevertheless poorly aligned. This indicates the possibility of the existence of relevant patterns embedded in the sequences that are not discoverable using alignments., Comment: 8 pages, 13 figures
- Published
- 2015
7. A topological framework for signed permutations
- Author
-
Huang, Fenix W. D. and Reidys, Christian M.
- Subjects
Mathematics - Combinatorics ,57M15 - Abstract
In this paper we present a topological framework for studying signed permutations and their reversal distance. As a result we can give an alternative approach and interpretation of the Hannenhalli-Pevzner formula for the reversal distance of signed permutations. Our approach utlizes the Poincar\'e dual, upon which reversals act in a particular way and obsoletes the notion of "padding" of the signed permutations. To this end we construct a bijection between signed permutations and an equivalence class of particular fatgraphs, called $\pi$-maps, and analyze the action of reversals on the latter. We show that reversals act via either slicing, gluing or half-flipping of external vertices, which implies that any reversal changes the topological genus by at most one. Finally we revisit the Hannenhalli-Pevzner formula employing orientable and non-orientable, irreducible, $\pi$-maps., Comment: 37 pages, 16 figures
- Published
- 2014
8. Shapes of topological RNA structures
- Author
-
Huang, Fenix W. D. and Reidys, Christian M.
- Subjects
Mathematics - Combinatorics ,Quantitative Biology - Biomolecules ,05C85 - Abstract
A topological RNA structure is derived from a diagram and its shape is obtained by collapsing the stacks of the structure into single arcs and by removing any arcs of length one. Shapes contain key topological, information and for fixed topological genus there exist only finitely many such shapes. We shall express topological RNA structures as unicellular maps, i.e. graphs together with a cyclic ordering of their half-edges. In this paper we prove a bijection of shapes of topological RNA structures. We furthermore derive a linear time algorithm generating shapes of fixed topological genus. We derive explicit expressions for the coefficients of the generating polynomial of these shapes and the generating function of RNA structures of genus $g$. Furthermore we outline how shapes can be used in order to extract essential information of RNA structure databases., Comment: 27 pages, 11 figures, 2 tables. arXiv admin note: text overlap with arXiv:1304.7397
- Published
- 2014
9. Uniform generation of RNA pseudoknot structures with genus filtration
- Author
-
Huang, Fenix W. D., Nebel, Markus E., and Reidys, Christian M.
- Subjects
Computer Science - Computational Engineering, Finance, and Science ,Mathematics - Combinatorics ,Quantitative Biology - Biomolecules ,05C85 - Abstract
In this paper we present a sampling framework for RNA structures of fixed topological genus. We introduce a novel, linear time, uniform sampling algorithm for RNA structures of fixed topological genus $g$, for arbitrary $g>0$. Furthermore we develop a linear time sampling algorithm for RNA structures of fixed topological genus $g$ that are weighted by a simplified, loop-based energy functional. For this process the partition function of the energy functional has to be computed once, which has $O(n^2)$ time complexity., Comment: 11 figures, 25 pages
- Published
- 2013
10. The energy-spectrum of bicompatible sequences
- Author
-
Huang, Fenix W., Barrett, Christopher L., and Reidys, Christian M.
- Published
- 2021
- Full Text
- View/download PDF
11. On the combinatorics of sparsification
- Author
-
Huang, Fenix W. D. and Reidys, Christian M.
- Subjects
Mathematics - Combinatorics ,32Q55 - Abstract
Background: We study the sparsification of dynamic programming folding algorithms of RNA structures. Sparsification applies to the mfe-folding of RNA structures and can lead to a significant reduction of time complexity. Results: We analyze the sparsification of a particular decomposition rule, $\Lambda^*$, that splits an interval for RNA secondary and pseudoknot structures of fixed topological genus. Essential for quantifying the sparsification is the size of its so called candidate set. We present a combinatorial framework which allows by means of probabilities of irreducible substructures to obtain the expected size of the set of $\Lambda^*$-candidates. We compute these expectations for arc-based energy models via energy-filtered generating functions (GF) for RNA secondary structures as well as RNA pseudoknot structures. For RNA secondary structures we also consider a simplified loop-energy model. This combinatorial analysis is then compared to the expected number of $\Lambda^*$-candidates obtained from folding mfe-structures. In case of the mfe-folding of RNA secondary structures with a simplified loop energy model our results imply that sparsification provides a reduction of time complexity by a constant factor of 91% (theory) versus a 96% reduction (experiment). For the "full" loop-energy model there is a reduction of 98% (experiment)., Comment: 27 pages, 12 figures
- Published
- 2011
12. Topology of RNA-RNA interaction structures
- Author
-
Andersen, Jørgen E., Huang, Fenix W. D., Penner, Robert C., and Reidys, Christian M.
- Subjects
Mathematics - Combinatorics ,32Q55 - Abstract
The topological filtration of interacting RNA complexes is studied and the role is analyzed of certain diagrams called irreducible shadows, which form suitable building blocks for more general structures. We prove that for two interacting RNAs, called interaction structures, there exist for fixed genus only finitely many irreducible shadows. This implies that for fixed genus there are only finitely many classes of interaction structures. In particular the simplest case of genus zero already provides the formalism for certain types of structures that occur in nature and are not covered by other filtrations. This case of genus zero interaction structures is already of practical interest, is studied here in detail and found to be expressed by a multiple context-free grammar extending the usual one for RNA secondary structures. We show that in $O(n^6)$ time and $O(n^4)$ space complexity, this grammar for genus zero interaction structures provides not only minimum free energy solutions but also the complete partition function and base pairing probabilities., Comment: 40 pages 15 figures
- Published
- 2011
13. On the uniform generation of modular diagrams
- Author
-
Huang, Fenix W. D. and Reidys, Christian M.
- Subjects
Mathematics - Combinatorics ,05B30 - Abstract
In this paper we present an algorithm that generates $k$-noncrossing, $\sigma$-modular diagrams with uniform probability. A diagram is a labeled graph of degree $\le 1$ over $n$ vertices drawn in a horizontal line with arcs $(i,j)$ in the upper half-plane. A $k$-crossing in a diagram is a set of $k$ distinct arcs $(i_1, j_1), (i_2, j_2),\ldots,(i_k, j_k)$ with the property $i_1 < i_2 < \ldots < i_k < j_1 < j_2 < \ldots< j_k$. A diagram without any $k$-crossings is called a $k$-noncrossing diagram and a stack of length $\sigma$ is a maximal sequence $((i,j),(i+1,j-1),\dots,(i+(\sigma-1),j-(\sigma-1)))$. A diagram is $\sigma$-modular if any arc is contained in a stack of length at least $\sigma$. Our algorithm generates after $O(n^k)$ preprocessing time, $k$-noncrossing, $\sigma$-modular diagrams in $O(n)$ time and space complexity., Comment: 21 pages, 7 figures
- Published
- 2010
14. RNA-RNA interaction prediction: partition function and base pair pairing probabilities
- Author
-
Huang, Fenix W. D., Qin, Jing, Reidys, Christian M., and Stadler, Peter F.
- Subjects
Mathematics - Combinatorics ,05A16 - Abstract
In this paper, we study the interaction of an antisense RNA and its target mRNA, based on the model introduced by Alkan {\it et al.} (Alkan {\it et al.}, J. Comput. Biol., Vol:267--282, 2006). Our main results are the derivation of the partition function \cite{Backhofen} (Chitsaz {\it et al.}, Bioinformatics, to appear, 2009), based on the concept of tight-structure and the computation of the base pairing probabilities. This paper contains the folding algorithm {\sf rip} which computes the partition function as well as the base pairing probabilities in $O(N^4M^2)+O(N^2M^4)$ time and $O(N^2M^2)$ space, where $N,M$ denote the lengths of the interacting sequences., Comment: 32 pages, 28 figures
- Published
- 2009
15. Folding 3-noncrossing RNA pseudoknot structures
- Author
-
Huang, Fenix W. D., Peng, Wade W. J., and Reidys, Christian M.
- Subjects
Mathematics - Combinatorics ,05B30 - Abstract
In this paper we present a selfcontained analysis and description of the novel {\it ab initio} folding algorithm {\sf cross}, which generates the minimum free energy (mfe), 3-noncrossing, $\sigma$-canonical RNA structure. Here an RNA structure is 3-noncrossing if it does not contain more than three mutually crossing arcs and $\sigma$-canonical, if each of its stacks has size greater or equal than $\sigma$. Our notion of mfe-structure is based on a specific concept of pseudoknots and respective loop-based energy parameters. The algorithm decomposes into three parts: the first is the inductive construction of motifs and shadows, the second is the generation of the skeleta-trees rooted in irreducible shadows and the third is the saturation of skeleta via context dependent dynamic programming routines., Comment: 32pages, 27figures
- Published
- 2008
16. Motifs in SARS-CoV-2 evolution
- Author
-
Barrett, Christopher L., primary, Bura, Andrei C., additional, He, Qijun, additional, Huang, Fenix W., additional, Li, Thomas J.X., additional, and Reidys, Christian M., additional
- Published
- 2023
- Full Text
- View/download PDF
17. MOTIFS IN SARS-COV-2 EVOLUTION
- Author
-
Barrett, Christopher, primary, Bura, Andrei C., additional, He, Qijun, additional, Huang, Fenix W., additional, Li, Thomas J. X., additional, and Reidys, Christian M., additional
- Published
- 2023
- Full Text
- View/download PDF
18. Motifs in SARS-CoV-2 evolution
- Author
-
Barrett, Christopher, Bura, Andrei C., He, Qijun, Huang, Fenix W., Li, Thomas J.X., and Reidys, Christian M.
- Abstract
We present a novel framework enhancing the prediction of whether novel lineage poses the threat of eventually dominating the viral population. The framework is based purely on genomic sequence data, without requiring prior established biological analysis. Its building blocks are sets of coevolving sites in the alignment (motifs), identified via coevolutionary signals. The collection of such motifs forms a relational structure over the polymorphic sites. Motifs are constructed using distances quantifying the coevolutionary coupling of pairs and manifest as coevolving clusters of sites. We present an approach to genomic surveillance based on this notion of relational structure. Our system will issue an alert regarding a lineage, based on its contribution to drastic changes in the relational structure. We then conduct a comprehensive retrospective analysis of the COVID-19 pandemic based on SARS-CoV-2 genomic sequence data in GISAID from October 2020 to September 2022, across 21 lineages and 27 countries with weekly resolution. We investigate the performance of this surveillance system in terms of its accuracy, timeliness, and robustness. Lastly, we study how well each lineage is classified by such a system.
- Published
- 2024
- Full Text
- View/download PDF
19. Buying time: detecting Vocs in SARS-CoV-2 via co-evolutionary signals
- Author
-
Barrett, Christopher, primary, Bura, Andrei C., additional, He, Qijun, additional, Huang, Fenix W., additional, Li, Thomas J. X., additional, and Reidys, Christian M., additional
- Published
- 2022
- Full Text
- View/download PDF
20. Snord116 Post-transcriptionally Increases Nhlh2 mRNA Stability: Implications for Human Prader-Willi Syndrome
- Author
-
Kocher, Matthew A., Huang, Fenix W., Le, Erin, Good, Deborah J., Kocher, Matthew A., Huang, Fenix W., Le, Erin, and Good, Deborah J.
- Abstract
The smallest genomic region causing Prader-Willi Syndrome (PWS) deletes the non-coding RNA SNORD116 cluster; however, the function of SNORD116 remains a mystery. Previous work in the field revealed the tantalizing possibility that expression of NHLH2, a gene previously implicated in both obesity and hypogonadism, was downregulated in PWS patients and differentiated stem cells. In silico RNA: RNA modeling identified several potential interaction domains between SNORD116 and NHLH2 mRNA. One of these interaction domains was highly conserved in most vertebrate NHLH2 mRNAs examined. A construct containing the Nhlh2 mRNA, including its 3'-UTR, linked to a c-myc tag was transfected into a hypothalamic neuron cell line in the presence and absence of exogenously-expressed Snord116. Nhlh2 mRNA expression was upregulated in the presence of Snord116 dependent on the length and type of 3'UTR used on the construct. Furthermore, use of actinomycin D to stop new transcription in N29/2 cells demonstrated that the upregulation occurred through increased stability of the Nhlh2 mRNA in the 45 minutes immediately following transcription. In silico modeling also revealed that a single nucleotide variant (SNV) in the NHLH2 mRNA could reduce the predicted interaction strength of the NHLH2:SNORD116 diad. Indeed, use of an Nhlh2 mRNA construct containing this SNV significantly reduces the ability of Snord116 to increase Nhlh2 mRNA levels. For the first time, these data identify a motif and mechanism for SNORD116-mediated regulation of NHLH2, clarifying the mechanism by which deletion of the SNORD116 snoRNAs locus leads to PWS phenotypes.
- Published
- 2021
- Full Text
- View/download PDF
21. Snord116 Post-transcriptionally Increases Nhlh2 mRNA Stability: Implications for Human Prader-Willi Syndrome
- Author
-
Kocher, Matthew A, primary, Huang, Fenix W, additional, Le, Erin, additional, and Good, Deborah J, additional
- Published
- 2021
- Full Text
- View/download PDF
22. Multiscale Feedback Loops in SARS-CoV-2 Viral Evolution
- Author
-
Barrett, Christopher, primary, Bura, Andrei C., additional, He, Qijun, additional, Huang, Fenix W., additional, Li, Thomas J.X., additional, Waterman, Michael S., additional, and Reidys, Christian M., additional
- Published
- 2021
- Full Text
- View/download PDF
23. Addendum: topology and prediction of RNA pseudoknots
- Author
-
Reidys, Christian M., Huang, Fenix W. D., Andersen, Jørgen E., Penner, Robert C., Stadler, Peter F., and Nebel, Markus E.
- Published
- 2012
- Full Text
- View/download PDF
24. Topology and prediction of RNA pseudoknots
- Author
-
Reidys, Christian M., Huang, Fenix W. D., Andersen, Jørgen E., Penner, Robert C., Stadler, Peter F., and Nebel, Markus E.
- Published
- 2011
- Full Text
- View/download PDF
25. Target prediction and a statistical sampling algorithm for RNA–RNA interaction
- Author
-
Huang, Fenix W., Qin, Jing, Reidys, Christian M., and Stadler, Peter F.
- Published
- 2010
26. Partition function and base pairing probabilities for RNA–RNA interaction prediction
- Author
-
Huang, Fenix W. D., Qin, Jing, Reidys, Christian M., and Stadler, Peter F.
- Published
- 2009
27. Genetic robustness of let-7 miRNA sequence–structure pairs
- Author
-
He, Qijun, primary, Huang, Fenix W., additional, Barrett, Christopher, additional, and Reidys, Christian M., additional
- Published
- 2019
- Full Text
- View/download PDF
28. A Boltzmann Sampler for 1-Pairs with Double Filtration
- Author
-
Barrett, Christopher, primary, He, Qijun, additional, Huang, Fenix W., additional, and Reidys, Christian M., additional
- Published
- 2019
- Full Text
- View/download PDF
29. An Efficient Dual Sampling Algorithm with Hamming Distance Filtration
- Author
-
Barrett, Christopher, primary, He, Qijun, additional, Huang, Fenix W., additional, and Reidys, Christian M., additional
- Published
- 2018
- Full Text
- View/download PDF
30. Sequence–structure relations of biopolymers
- Author
-
Barrett, Christopher, primary, Huang, Fenix W, additional, and Reidys, Christian M, additional
- Published
- 2016
- Full Text
- View/download PDF
31. Sequence-structure relations of biopolymers.
- Author
-
Barrett, Christopher, Huang, Fenix W., and Reidys, Christian M.
- Subjects
- *
NON-coding RNA , *NUCLEOTIDE sequence , *BOLTZMANN'S equation , *PARTITION functions , *NUCLEOTIDE sequencing - Abstract
Motivation: DNA data is transcribed into single-stranded RNA, which folds into specific molecular structures. In this paper we pose the question to what extent sequence- and structure-information correlate. We view this correlation as structural semantics of sequence data that allows for a different interpretation than conventional sequence alignment. Structural semantics could enable us to identify more general embedded 'patterns' in DNA and RNA sequences. Results: We compute the partition function of sequences with respect to a fixed structure and connect this computation to the mutual information of a sequence-structure pair for RNA secondary structures. We present a Boltzmann sampler and obtain the a priori probability of specific sequence patterns. We present a detailed analysis for the three PDB-structures, 2JXV (hairpin), 2N3R (3-branch multi-loop) and 1EHZ (tRNA). We localize specific sequence patterns, contrast the energy spectrum of the Boltzmann sampled sequences versus those sequences that refold into the same structure and derive a criterion to identify native structures. We illustrate that there are multiple sequences in the partition function of a fixed structure, each having nearly the same mutual information, that are nevertheless poorly aligned. This indicates the possibility of the existence of relevant patterns embedded in the sequences that are not discoverable using alignments. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
32. Addendum: topology and prediction of RNA pseudoknots
- Author
-
Reidys, Christian M., primary, Huang, Fenix W. D., additional, Andersen, Jørgen E., additional, Penner, Robert C., additional, Stadler, Peter F., additional, and Nebel, Markus E., additional
- Published
- 2011
- Full Text
- View/download PDF
33. Target prediction and a statistical sampling algorithm for RNA–RNA interaction
- Author
-
Huang, Fenix W. D., primary, Qin, Jing, additional, Reidys, Christian M., additional, and Stadler, Peter F., additional
- Published
- 2009
- Full Text
- View/download PDF
34. On the combinatorics of sparsification.
- Author
-
Huang, Fenix W. D. and Reidys, Christian M.
- Subjects
- *
NUCLEIC acids , *MOLECULAR biology , *RNA , *ALLOYS , *GENETICS - Abstract
Background: We study the sparsification of dynamic programming based on folding algorithms of RNA structures. Sparsification is a method that improves significantly the computation of minimum free energy (mfe) RNA structures. Results: We provide a quantitative analysis of the sparsification of a particular decomposition rule, Λ*. This rule splits an interval of RNA secondary and pseudoknot structures of fixed topological genus. Key for quantifying sparsifications is the size of the so called candidate sets. Here we assume mfe-structures to be specifically distributed (see Assumption 1) within arbitrary and irreducible RNA secondary and pseudoknot structures of fixed topological genus. We then present a combinatorial framework which allows by means of probabilities of irreducible sub-structures to obtain the expectation of the Λ*-candidate set w.r.t. a uniformly random input sequence. We compute these expectations for arc-based energy models via energy-filtered generating functions (GF) in case of RNA secondary structures as well as RNA pseudoknot structures. Furthermore, for RNA secondary structures we also analyze a simplified loop-based energy model. Our combinatorial analysis is then compared to the expected number of Λ*-candidates obtained from the folding mfe-structures. In case of the mfe-folding of RNA secondary structures with a simplified loop-based energy model our results imply that sparsification provides a significant, constant improvement of 91% (theory) to be compared to an 96% (experimental, simplified arc-based model) reduction. However, we do not observe a linear factor improvement. Finally, in case of the "full" loop-energy model we can report a reduction of 98% (experiment). Conclusions: Sparsification was initially attributed a linear factor improvement. This conclusion was based on the so called polymer-zeta property, which stems from interpreting polymer chains as self-avoiding walks. Subsequent findings however reveal that the O(n) improvement is not correct. The combinatorial analysis presented here shows that, assuming a specific distribution (see Assumption 1), of mfe-structures within irreducible and arbitrary structures, the expected number of Λ*-candidates is Θ(n2). However, the constant reduction is quite significant, being in the range of 96%. We furthermore show an analogous result for the sparsification of the Λ*-decomposition rule for RNA pseudoknotted structures of genus one. Finally we observe that the effect of sparsification is sensitive to the employed energy model. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
35. Sequence-structure relations of pseudoknot RNA.
- Author
-
Huang, Fenix W. D., Li, Linda Y. M., and Reidys, Christian M.
- Subjects
- *
RNA , *ALGORITHMS , *NUCLEOTIDE sequence , *DNA , *CATALYTIC RNA , *PROTEINS - Abstract
Background: The analysis of sequence-structure relations of RNA is based on a specific notion and folding of RNA structure. The notion of coarse grained structure employed here is that of canonical RNA pseudoknot contact-structures with at most two mutually crossing bonds (3-noncrossing). These structures are folded by a novel, ab initio prediction algorithm, cross, capable of searching all 3-noncrossing RNA structures. The algorithm outputs the minimum free energy structure. Results: After giving some background on RNA pseudoknot structures and providing an outline of the folding algorithm being employed, we present in this paper various, statistical results on the mapping from RNA sequences into 3-noncrossing RNA pseudoknot structures. We study properties, like the fraction of pseudoknot structures, the dominant pseudoknot-shapes, neutral walks, neutral neighbors and local connectivity. We then put our results into context of molecular evolution of RNA. Conclusion: Our results imply that, in analogy to RNA secondary structures, 3-noncrossing pseudoknot RNA represents a molecular phenotype that is well suited for molecular and in particular neutral evolution. We can conclude that extended, percolating neutral networks of pseudoknot RNA exist. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
36. Topological language for RNA.
- Author
-
Huang FW and Reidys CM
- Subjects
- Models, Theoretical, RNA chemistry
- Abstract
In this paper we introduce a novel, context-free grammar, RNAFeatures
* , capable of generating any RNA structure including pseudoknot structures (pk-structure). We represent pk-structures as orientable fatgraphs, which naturally leads to a filtration by their topological genus. Within this framework, RNA secondary structures correspond to pk-structures of genus zero. RNAFeatures* acts on formal, arc-labeled RNA secondary structures, called λ-structures. λ-structures correspond one-to-one to pk-structures together with some additional information. This information consists of the specific rearrangement of the backbone, by which a pk-structure can be made cross-free. RNAFeatures* is an extension of the grammar for secondary structures and employs an enhancement by labelings of the symbols as well as the production rules. We discuss how to use RNAFeatures* to obtain a stochastic context-free grammar for pk-structures, using data of RNA sequences and structures. The induced grammar facilitates fast Boltzmann sampling and statistical analysis. As a first application, we present an O(nlog (n)) runtime algorithm which samples pk-structures based on ninety tRNA sequences and structures from the Nucleic Acid Database (NDB)., Availability: the source code for simulation results is available at http://staff.vbi.vt.edu/fenixh/TPstructure.zip. The code is written in C and compiled by Xcode., (Copyright © 2016 Elsevier Inc. All rights reserved.)- Published
- 2016
- Full Text
- View/download PDF
37. Shapes of topological RNA structures.
- Author
-
Huang FW and Reidys CM
- Subjects
- Algorithms, Databases, Nucleic Acid, Mathematical Concepts, Models, Molecular, Nucleic Acid Conformation, RNA chemistry
- Abstract
A topological RNA structure is derived by fattening the edges of a contact structure into ribbons. The shape of a topological RNA structure is obtained by collapsing the stacks of the structure into single arcs and by removing any arcs of length one, as well as isolated vertices. A shape contains the key topological information of the molecular conformation and for fixed topological genus there exist only finitely many such shapes. In this paper we compute the generating polynomial of shapes of fixed topological genus g. We furthermore derive an algorithm having O(glog g) time complexity uniformly generating shapes of genus g and discuss some applications in the context of databases of RNA pseudoknot structures., (Published by Elsevier Inc.)
- Published
- 2015
- Full Text
- View/download PDF
38. Generation of RNA pseudoknot structures with topological genus filtration.
- Author
-
Huang FW, Nebel ME, and Reidys CM
- Subjects
- Algorithms, Computational Biology, Mathematical Concepts, Models, Molecular, Nucleic Acid Conformation, RNA chemistry
- Abstract
In this paper we present a sampling framework for RNA structures of fixed topological genus. We introduce a novel, linear time, uniform sampling algorithm for RNA structures of fixed topological genus g, for arbitrary g>0. Furthermore we develop a linear time sampling algorithm for RNA structures of fixed topological genus g that are weighted by a simplified, loop-based energy functional. For this process the partition function of the energy functional has to be computed once, which has O(n(2)) time complexity., (Copyright © 2013 Elsevier Inc. All rights reserved.)
- Published
- 2013
- Full Text
- View/download PDF
39. Topology of RNA-RNA interaction structures.
- Author
-
Andersen JE, Huang FW, Penner RC, and Reidys CM
- Subjects
- Models, Theoretical, Thermodynamics, Algorithms, Nucleic Acid Conformation, RNA chemistry
- Abstract
The topological filtration of interacting RNA complexes is studied, and the role is analyzed of certain diagrams called irreducible shadows, which form suitable building blocks for more general structures. We prove that, for two interacting RNAs, called interaction structures, there exist for fixed genus only finitely many irreducible shadows. This implies that, for fixed genus, there are only finitely many classes of interaction structures. In particular, the simplest case of genus zero already provides the formalism for certain types of structures that occur in nature and are not covered by other filtrations. This case of genus zero interaction structures is already of practical interest, is studied here in detail, and is found to be expressed by a multiple context-free grammar that extends the usual one for RNA secondary structures. We show that, in O(n(6)) time and O(n(4)) space complexity, this grammar for genus zero interaction structures provides not only minimum free energy solutions but also the complete partition function and base pairing probabilities.
- Published
- 2012
- Full Text
- View/download PDF
40. Folding 3-noncrossing RNA pseudoknot structures.
- Author
-
Huang FW, Peng WW, and Reidys CM
- Subjects
- Algorithms, RNA, Transfer chemistry, Regulatory Sequences, Ribonucleic Acid genetics, Nucleic Acid Conformation, RNA chemistry
- Abstract
In this article, we present the novel ab initio folding algorithm cross, which generates minimum free energy (mfe), 3-noncrossing, canonical RNA structures. Here an RNA structure is 3-noncrossing if it does not contain three or more mutually crossing arcs and canonical, if each of its stacks has size greater or equal than two. Our notion of mfe-structure is based on a specific concept of pseudoknots and respective loop-based energy parameters. The algorithm decomposes into three subroutines: first the inductive construction of motifs and their associated shadows, second the generation of the (rooted) skeleta-trees and third the saturation of the skeleta via context dependent dynamic programming routines.
- Published
- 2009
- Full Text
- View/download PDF
41. Statistics of canonical RNA pseudoknot structures.
- Author
-
Huang FW and Reidys CM
- Subjects
- Algorithms, Animals, Models, Molecular, Models, Genetic, Nucleic Acid Conformation, RNA genetics
- Abstract
In this paper we study canonical RNA pseudoknot structures. We prove central limit theorems for the distributions of the arc-numbers of k-noncrossing RNA structures with given minimum stack-size tau over n nucleotides. Furthermore we compare the space of all canonical structures with canonical minimum free energy pseudoknot structures. Our results generalize the analysis of Schuster et al. obtained for RNA secondary structures [Hofacker, I.L., Schuster, P., Stadler, P.F., 1998. Combinatorics of RNA secondary structures. Discrete Appl. Math. 88, 207-237; Jin, E.Y., Reidys, C.M., 2007b. Central and local limit theorems for RNA structures. J. Theor. Biol. 250 (2008), 547-559; 2007a. Asymptotic enumeration of RNA structures with pseudoknots. Bull. Math. Biol., 70 (4), 951-970] to k-noncrossing RNA structures. Here k2 and tau are arbitrary natural numbers. We compare canonical pseudoknot structures to arbitrary structures and show that canonical pseudoknot structures exhibit significantly smaller exponential growth rates. We then compute the asymptotic distribution of their arc-numbers. Finally, we analyze how the minimum stack-size and crossing number factor into the distributions.
- Published
- 2008
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.