51. Pgltools: a genomic arithmetic tool suite for manipulation of Hi-C peak and other chromatin interaction data
- Author
- Erin N. Smith, Kelly A. Frazer, Paola Benaglio, He Li, Naoki Nariai, and William W. Greenwald
- Subjects
0301 basic medicine, Source code, Paired-genomic-loci, Computer science, computer.software_genre, Genome, Biochemistry, Mathematical Sciences, 0302 clinical medicine, Structural Biology, Cross-platform, lcsh:QH301-705.5, media_common, computer.programming_language, Programming language, Applied Mathematics, Suite, High-Throughput Nucleotide Sequencing, Genomics, Biological Sciences, File format, Chromatin, Computer Science Applications, Networking and Information Technology R&D, Hi-C, ChIA-PET, lcsh:R858-859.7, Data mining, Biotechnology, Chromatin Immunoprecipitation, Bioinformatics, media_common.quotation_subject, lcsh:Computer applications to medicine. Medical informatics, Data type, 03 medical and health sciences, Information and Computing Sciences, Genetics, Molecular Biology, Unix, Chromatin conformation capture, Human Genome, Python (programming language), Bedtools, 030104 developmental biology, ComputingMethodologies_PATTERNRECOGNITION, lcsh:Biology (General), Tool suite, Genetic Loci, Genomic arithmetic, computer, 030217 neurology & neurosurgery, Software, Peak
- Abstract
Genomic interaction studies use next-generation sequencing (NGS) to examine the interactions between two loci on the genome, with subsequent bioinformatics analyses typically including annotation, intersection, and merging of data from multiple experiments. While many file types and analysis tools exist for storing and manipulating single-locus NGS data, there is currently no file standard or analysis tool suite for storing and manipulating paired-genomic-loci, the data type resulting from “genomic interaction” studies. As genomic interaction sequencing data become prevalent, a standard file format and tools for working with these data conveniently and efficiently are needed. This article details a file standard and novel software tool suite for working with paired-genomic-loci data. We present the paired-genomic-loci (PGL) file standard for genomic-interaction data, and the accompanying analysis tool suite “pgltools”: a cross-platform, PyPy-compatible Python package available both as an easy-to-use UNIX package and as a Python module, for integration into pipelines of paired-genomic-loci analyses. Pgltools is a freely available, open-source tool suite for manipulating paired-genomic-loci data. Source code, an in-depth manual, and a tutorial are publicly available at www.github.com/billgreenwald/pgltools, and a Python module of the operations can be installed from PyPI via the PyGLtools module.
- Published
- 2017