Back to Search Start Over

Sorting Permutations by Intergenic Operations

Authors :
Guillaume Fertin
Zanoni Dias
Andre Rodrigues Oliveira
Klairton Lima Brito
Ulisses Dias
Géraldine Jean
Universidade Estadual de Campinas (UNICAMP)
Laboratoire des Sciences du Numérique de Nantes (LS2N)
IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique)
Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST)
Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)
Combinatoire et Bioinformatique (COMBI)
Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique)
Source :
IEEE/ACM Transactions on Computational Biology and Bioinformatics, IEEE/ACM Transactions on Computational Biology and Bioinformatics, Institute of Electrical and Electronics Engineers, 2021, pp.1-1. ⟨10.1109/TCBB.2021.3077418⟩
Publication Year :
2021
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2021.

Abstract

Genome Rearrangements are events that affect large stretches of genomes during evolution. Many mathematical models have been used to estimate the evolutionary distance between two genomes based on genome rearrangements. However, most of them focused on the (order of the) genes of a genome, disregarding other important elements in it. Recently, researchers have shown that considering regions between each pair of genes, called intergenic regions, can enhance distance estimation in realistic data. Two of the most studied genome rearrangements are the reversal, which inverts a sequence of genes, and the transposition, which occurs when two adjacent gene sequences swap their positions inside the genome. In this work, we study the transposition distance between two genomes, but we also consider intergenic regions, a problem we name Sorting by Intergenic Transpositions. We show that this problem is NP-hard and propose two approximation algorithms, with factors 3.5 and 2.5, considering two distinct definitions for the problem. We also investigate the signed reversal and transposition distance between two genomes considering their intergenic regions. This second problem is called Sorting by Signed Intergenic Reversals and Intergenic Transpositions. We show that this problem is NP-hard and develop two approximation algorithms, with factors 3 and 2.5. We check how these algorithms behave when assigning weights for genome rearrangements. Finally, we implemented all these algorithms and tested them on real and simulated data.

Details

ISSN :
23740043, 15455963, and 15579964
Volume :
18
Database :
OpenAIRE
Journal :
IEEE/ACM Transactions on Computational Biology and Bioinformatics
Accession number :
edsair.doi.dedup.....1d0044c5e95c1c373e5ce25b112bd689
Full Text :
https://doi.org/10.1109/tcbb.2021.3077418