Back to Search
Start Over
Orthology and paralogy constraints: satisfiability and consistency
- Source :
- BMC Genomics
- Publisher :
- Springer Nature
-
Abstract
- A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer orthology and paralogy relations between genes of a given gene family . But is a given set of orthology/paralogy constraints possible, i.e., can they simultaneously co-exist in an evolutionary history for ? While previous studies have focused on full sets of constraints, here we consider the general case where does not necessarily involve a constraint for each pair of genes. The problem is subdivided in two parts: (1) Is satisfiable, i.e. can we find an event-labeled gene tree G inducing ? (2) Is there such a G which is consistent, i.e., such that all displayed triplet phylogenies are included in a species tree? Previous results on the Graph sandwich problem can be used to answer to (1), and we provide polynomial-time algorithms for satisfiability and consistency with a given species tree. We also describe a new polynomial-time algorithm for the case of consistency with an unknown species tree and full knowledge of pairwise orthology/paralogy relationships, as well as a branch-and-bound algorithm in the case when unknown relations are present. We show that our algorithms can be used in combination with ProteinOrtho, a sequence similarity-based orthology detection tool, to extract a set of robust orthology/paralogy relationships.
- Subjects :
- Theoretical computer science
0206 medical engineering
02 engineering and technology
Biology
computer.software_genre
species tree
Evolution, Molecular
Set (abstract data type)
03 medical and health sciences
Consistency (database systems)
Similarity (network science)
Genetics
paralogy
Phylogeny
030304 developmental biology
Synteny
0303 health sciences
Sequence
consistency
Models, Genetic
Research
Tree (data structure)
gene tree
satisfiability
Graph sandwich problem
Multigene Family
Pairwise comparison
Data mining
orthology
computer
Algorithms
020602 bioinformatics
Biotechnology
Subjects
Details
- Language :
- English
- ISSN :
- 14712164
- Volume :
- 15
- Issue :
- Suppl 6
- Database :
- OpenAIRE
- Journal :
- BMC Genomics
- Accession number :
- edsair.doi.dedup.....84c507dc7ed281aba7e2348befb290b4
- Full Text :
- https://doi.org/10.1186/1471-2164-15-s6-s12