Back to Search Start Over

A 2-approximation algorithm for the contig-based genomic scaffold filling problem.

Authors :
Jiang H
Qingge L
Zhu D
Zhu B
Source :
Journal of bioinformatics and computational biology [J Bioinform Comput Biol] 2018 Dec; Vol. 16 (6), pp. 1850022.
Publication Year :
2018

Abstract

The genomic scaffold filling problem has attracted a lot of attention recently. The problem is on filling an incomplete sequence (scaffold) I into I ' , with respect to a complete reference genome G , such that the number of common/shared adjacencies between G and I ' is maximized. The problem is NP-complete, and admits a constant-factor approximation. However, the sequence input I is not quite practical and does not fit most of the real datasets (where a scaffold is more often given as a list of contigs). In this paper, we revisit the genomic scaffold filling problem by considering this important case when a scaffold S is given, the missing genes can only be inserted in between the contigs, and the objective is to maximize the number of common adjacencies between G and the filled genome S ' . For this problem, we present a simple NP-completeness proof, we then present a factor-2 approximation algorithm.

Details

Language :
English
ISSN :
1757-6334
Volume :
16
Issue :
6
Database :
MEDLINE
Journal :
Journal of bioinformatics and computational biology
Publication Type :
Academic Journal
Accession number :
30616473
Full Text :
https://doi.org/10.1142/S0219720018500221