Back to Search Start Over

Remote homology search with hidden Potts models.

Authors :
Grey W Wilburn
Sean R Eddy
Source :
PLoS Computational Biology, Vol 16, Iss 11, p e1008085 (2020)
Publication Year :
2020
Publisher :
Public Library of Science (PLoS), 2020.

Abstract

Most methods for biological sequence homology search and alignment work with primary sequence alone, neglecting higher-order correlations. Recently, statistical physics models called Potts models have been used to infer all-by-all pairwise correlations between sites in deep multiple sequence alignments, and these pairwise couplings have improved 3D structure predictions. Here we extend the use of Potts models from structure prediction to sequence alignment and homology search by developing what we call a hidden Potts model (HPM) that merges a Potts emission process to a generative probability model of insertion and deletion. Because an HPM is incompatible with efficient dynamic programming alignment algorithms, we develop an approximate algorithm based on importance sampling, using simpler probabilistic models as proposal distributions. We test an HPM implementation on RNA structure homology search benchmarks, where we can compare directly to exact alignment methods that capture nested RNA base-pairing correlations (stochastic context-free grammars). HPMs perform promisingly in these proof of principle experiments.

Subjects

Subjects :
Biology (General)
QH301-705.5

Details

Language :
English
ISSN :
1553734X and 15537358
Volume :
16
Issue :
11
Database :
Directory of Open Access Journals
Journal :
PLoS Computational Biology
Publication Type :
Academic Journal
Accession number :
edsdoj.32d74d632ea1404cba458eb2dcd1408e
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pcbi.1008085