Remote homology search with hidden Potts models.
- Source :
- PLoS Computational Biology, Vol 16, Iss 11, p e1008085 (2020)
- Publication Year :
- 2020
- Publisher :
- Public Library of Science (PLoS), 2020.
Abstract
- Most methods for biological sequence homology search and alignment work with primary sequence alone, neglecting higher-order correlations. Recently, statistical physics models called Potts models have been used to infer all-by-all pairwise correlations between sites in deep multiple sequence alignments, and these pairwise couplings have improved 3D structure predictions. Here we extend the use of Potts models from structure prediction to sequence alignment and homology search by developing what we call a hidden Potts model (HPM), which merges a Potts emission process with a generative probability model of insertion and deletion. Because an HPM is incompatible with efficient dynamic programming alignment algorithms, we develop an approximate algorithm based on importance sampling, using simpler probabilistic models as proposal distributions. We test an HPM implementation on RNA structure homology search benchmarks, where we can compare directly to exact alignment methods that capture nested RNA base-pairing correlations (stochastic context-free grammars). HPMs perform promisingly in these proof-of-principle experiments.
- Subjects :
- Biology (General)
- QH301-705.5
- Language :
- English
- ISSN :
- 1553-734X and 1553-7358
- Volume :
- 16
- Issue :
- 11
- Database :
- Directory of Open Access Journals
- Journal :
- PLoS Computational Biology
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.32d74d632ea1404cba458eb2dcd1408e
- Document Type :
- article
- Full Text :
- https://doi.org/10.1371/journal.pcbi.1008085
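
The abstract's central idea, importance sampling with a simpler model as the proposal distribution, can be illustrated on a toy problem. The sketch below is not the authors' HPM algorithm: it has no insertions, deletions, or alignment step. It only assumes a tiny random Potts model (fields `h`, couplings `J`) and estimates its log partition function by sampling from an independent-site proposal that ignores the couplings, then reweighting each sample by the ratio of the unnormalized Potts probability to the proposal probability. All function names, model sizes, and parameter values are hypothetical choices made for illustration.

```python
import itertools
import math
import random


def make_toy_potts(n_sites, n_states, rng, coupling_scale=0.3):
    """Random fields h[i][a] and couplings J[(i, j)][a][b] for i < j (toy values)."""
    h = [[rng.gauss(0.0, 1.0) for _ in range(n_states)] for _ in range(n_sites)]
    J = {
        (i, j): [[rng.gauss(0.0, coupling_scale) for _ in range(n_states)]
                 for _ in range(n_states)]
        for i in range(n_sites) for j in range(i + 1, n_sites)
    }
    return h, J


def log_unnormalized(x, h, J):
    """log p*(x) = sum_i h_i(x_i) + sum_{i<j} J_ij(x_i, x_j)."""
    score = sum(h[i][a] for i, a in enumerate(x))
    score += sum(J[(i, j)][x[i]][x[j]] for (i, j) in J)
    return score


def site_proposal(h):
    """Independent-site proposal q_i(a) proportional to exp(h_i(a)); ignores J."""
    probs = []
    for fields in h:
        weights = [math.exp(v) for v in fields]
        total = sum(weights)
        probs.append([w / total for w in weights])
    return probs


def exact_log_z(h, J, n_states):
    """Brute-force log Z by full enumeration; feasible only for tiny models."""
    configs = itertools.product(range(n_states), repeat=len(h))
    return math.log(sum(math.exp(log_unnormalized(x, h, J)) for x in configs))


def importance_log_z(h, J, n_samples, rng):
    """Estimate log Z as log of the mean weight p*(x)/q(x), with x drawn from q."""
    probs = site_proposal(h)
    states = list(range(len(h[0])))
    log_weights = []
    for _ in range(n_samples):
        x = [rng.choices(states, weights=p)[0] for p in probs]
        log_q = sum(math.log(probs[i][a]) for i, a in enumerate(x))
        log_weights.append(log_unnormalized(x, h, J) - log_q)
    # Log-sum-exp keeps the average numerically stable.
    m = max(log_weights)
    return m + math.log(sum(math.exp(w - m) for w in log_weights) / n_samples)


# Compare the sampled estimate to the exact value on a 4-site, 3-state model.
demo_rng = random.Random(0)
demo_h, demo_J = make_toy_potts(n_sites=4, n_states=3, rng=demo_rng)
print("exact log Z:   ", exact_log_z(demo_h, demo_J, 3))
print("sampled log Z: ", importance_log_z(demo_h, demo_J, 20000, demo_rng))
```

The estimate is accurate here because the couplings are weak, so the independent-site proposal stays close to the Potts distribution. As the couplings grow, the importance weights become more variable and more samples are needed, which is the general trade-off when a simpler model stands in for a more expressive one.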