1. Discovering hotspots in functional genomic data superposed on 3D chromatin configuration reconstructions.
- Author
-
Capurso D, Bengtsson H, and Segal MR
- Subjects
- Chromatin chemistry, Chromatin metabolism, Chromatin Immunoprecipitation, Chromosome Mapping, Cytochrome P-450 Enzyme System genetics, Cytochrome P-450 Enzyme System metabolism, High-Throughput Nucleotide Sequencing, Intracellular Signaling Peptides and Proteins genetics, Intracellular Signaling Peptides and Proteins metabolism, Models, Molecular, Molecular Conformation, Saccharomyces cerevisiae metabolism, Saccharomyces cerevisiae ultrastructure, Saccharomyces cerevisiae Proteins genetics, Saccharomyces cerevisiae Proteins metabolism, Transcription Factors genetics, Transcription Factors metabolism, Algorithms, Chromatin ultrastructure, Gene Expression Regulation, Fungal, Genome, Fungal, Saccharomyces cerevisiae genetics
- Abstract
The spatial organization of the genome influences cellular function, notably gene regulation. Recent studies have assessed the three-dimensional (3D) co-localization of functional annotations (e.g. centromeres, long terminal repeats) using 3D genome reconstructions from Hi-C (genome-wide chromosome conformation capture) data; however, corresponding assessments for continuous functional genomic data (e.g. chromatin immunoprecipitation-sequencing (ChIP-seq) peak height) are lacking. Here, we demonstrate that applying bump hunting via the patient rule induction method (PRIM) to ChIP-seq data superposed on a Saccharomyces cerevisiae 3D genome reconstruction can discover 'functional 3D hotspots', regions in 3-space for which the mean ChIP-seq peak height is significantly elevated. For the transcription factor Swi6, the top hotspot by P-value contains MSB2 and ERG11 - known Swi6 target genes on different chromosomes. We verify this finding in a number of ways. First, this top hotspot is relatively stable under PRIM across parameter settings. Second, this hotspot is among the top hotspots by mean outcome identified by an alternative algorithm, k-Nearest Neighbor (k-NN) regression. Third, the distance between MSB2 and ERG11 is smaller than expected (by resampling) in two other 3D reconstructions generated via different normalization and reconstruction algorithms. This analytic approach can discover functional 3D hotspots and potentially reveal novel regulatory interactions., (© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2016
- Full Text
- View/download PDF