PileLine: a toolbox to handle genome position information in next-generation sequencing studies

Authors :
Fdez-Riverola Florentino
Reboiro-Jato Miguel
Gómez-López Gonzalo
Glez-Peña Daniel
Pisano David G
Source :
BMC Bioinformatics, Vol 12, Iss 1, p 31 (2011)
Publication Year :
2011
Publisher :
BMC, 2011.

Abstract

Background: Genomic position (GP) files currently used in next-generation sequencing (NGS) studies are always difficult to manipulate due to their huge size and the lack of appropriate tools to properly manage them. The structure of these flat files is based on representing one line per position that has been covered by at least one aligned read, imposing significant restrictions from a computational performance perspective.

Results: PileLine implements a flexible command-line toolkit providing specific support to the management, filtering, comparison and annotation of GP files produced by NGS experiments. PileLine tools are coded in Java and run on both UNIX (Linux, Mac OS) and Windows platforms. The set of tools comprising PileLine are designed to be memory efficient by performing fast seek on-disk operations over sorted GP files.

Conclusions: Our novel toolbox has been extensively tested taking into consideration performance issues. It is publicly available at http://sourceforge.net/projects/pilelinetools under the GNU LGPL license. Full documentation including common use cases and guided analysis workflows is available at http://sing.ei.uvigo.es/pileline.
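
The idea of seeking on disk over a sorted GP file can be illustrated with a small sketch. This is not PileLine's code (PileLine is written in Java, and its internals are not described in this record). It is a minimal Python illustration under stated assumptions: a hypothetical tab-separated layout with the sequence name in column 1 and the position in column 2, no header or blank lines, and rows sorted by sequence name (bytewise) and then by numeric position. The function names `find_position`, `_key` and `_line_at_or_after` are invented for this example.

```python
import os
import tempfile


def _key(line: bytes):
    """Parse the (sequence name, position) sort key from one tab-separated line."""
    fields = line.split(b"\t")
    return fields[0], int(fields[1])


def _line_at_or_after(fh, offset: int) -> bytes:
    """Return the first full line starting at byte offset >= `offset` (b"" at EOF)."""
    if offset == 0:
        fh.seek(0)
    else:
        # Step back one byte and consume the rest of that line, so that a line
        # starting exactly at `offset` is not skipped.
        fh.seek(offset - 1)
        fh.readline()
    return fh.readline()


def find_position(path: str, seq: str, pos: int):
    """Return the line for (seq, pos) as text, or None if that position is absent.

    Binary search over byte offsets: only O(log n) lines are read, so the file
    is never loaded into memory. Assumes the file is sorted as described above.
    """
    target = (seq.encode(), pos)
    with open(path, "rb") as fh:
        lo, hi = 0, os.path.getsize(path)
        # Find the smallest offset whose following line has key >= target.
        while lo < hi:
            mid = (lo + hi) // 2
            line = _line_at_or_after(fh, mid)
            if not line or _key(line) >= target:
                hi = mid
            else:
                lo = mid + 1
        line = _line_at_or_after(fh, lo)
    if line and _key(line) == target:
        return line.decode().rstrip("\n")
    return None


if __name__ == "__main__":
    # Hypothetical example data, written to a temporary file.
    rows = ["chr1\t5\tA\t10", "chr1\t7\tC\t3", "chr2\t1\tT\t2"]
    with tempfile.NamedTemporaryFile("w", suffix=".tsv", delete=False) as tmp:
        tmp.write("\n".join(rows) + "\n")
    print(find_position(tmp.name, "chr1", 7))
    os.remove(tmp.name)
```

Because each lookup reads only a logarithmic number of lines, memory use stays constant regardless of file size, which is the kind of trade-off the abstract alludes to. The sketch depends entirely on the sort order: on an unsorted file it would silently miss positions.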

Details

Language :
English
ISSN :
1471-2105
Volume :
12
Issue :
1
Database :
Directory of Open Access Journals
Journal :
BMC Bioinformatics
Publication Type :
Academic Journal
Accession number :
edsdoj.4bb4ff37bdb5475bbbf1b959a625ebbd
Document Type :
article
Full Text :
https://doi.org/10.1186/1471-2105-12-31