Back to Search Start Over

Simultaneous Reconstruction of Duplication Episodes and Gene-Species Mappings

Authors :
Paweł Górecki and Natalia Rutecka and Agnieszka Mykowiecka and Jarosław Paszek
Górecki, Paweł
Rutecka, Natalia
Mykowiecka, Agnieszka
Paszek, Jarosław
Paweł Górecki and Natalia Rutecka and Agnieszka Mykowiecka and Jarosław Paszek
Górecki, Paweł
Rutecka, Natalia
Mykowiecka, Agnieszka
Paszek, Jarosław
Publication Year :
2023

Abstract

We present a novel problem, called MetaEC, which aims to infer gene-species assignments in a collection of gene trees with missing labels by minimizing the size of duplication episode clustering (EC). This problem is particularly relevant in metagenomics, where incomplete data often poses a challenge in the accurate reconstruction of gene histories. To solve MetaEC, we propose a polynomial time dynamic programming (DP) formulation that verifies the existence of a set of duplication episodes from a predefined set of episode candidates. We then demonstrate how to use DP to design an algorithm that solves MetaEC. Although the algorithm is exponential in the worst case, we introduce a heuristic modification of the algorithm that provides a solution with the knowledge that it is exact. To evaluate our method, we perform two computational experiments on simulated and empirical data containing whole genome duplication events, showing that our algorithm is able to accurately infer the corresponding events.

Details

Database :
OAIster
Notes :
application/pdf, English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1402193964
Document Type :
Electronic Resource
Full Text :
https://doi.org/10.4230.LIPIcs.WABI.2023.6