Back to Search
Start Over
The Role of Genome Accessibility in Transcription Factor Binding in Bacteria
- Source :
- PLoS Computational Biology, PLoS Computational Biology, Vol 12, Iss 4, p e1004891 (2016)
- Publication Year :
- 2016
- Publisher :
- Public Library of Science (PLoS), 2016.
-
Abstract
- ChIP-seq enables genome-scale identification of regulatory regions that govern gene expression. However, the biological insights generated from ChIP-seq analysis have been limited to predictions of binding sites and cooperative interactions. Furthermore, ChIP-seq data often poorly correlate with in vitro measurements or predicted motifs, highlighting that binding affinity alone is insufficient to explain transcription factor (TF)-binding in vivo. One possibility is that binding sites are not equally accessible across the genome. A more comprehensive biophysical representation of TF-binding is required to improve our ability to understand, predict, and alter gene expression. Here, we show that genome accessibility is a key parameter that impacts TF-binding in bacteria. We developed a thermodynamic model that parameterizes ChIP-seq coverage in terms of genome accessibility and binding affinity. The role of genome accessibility is validated using a large-scale ChIP-seq dataset of the M. tuberculosis regulatory network. We find that accounting for genome accessibility led to a model that explains 63% of the ChIP-seq profile variance, while a model based in motif score alone explains only 35% of the variance. Moreover, our framework enables de novo ChIP-seq peak prediction and is useful for inferring TF-binding peaks in new experimental conditions by reducing the need for additional experiments. We observe that the genome is more accessible in intergenic regions, and that increased accessibility is positively correlated with gene expression and anti-correlated with distance to the origin of replication. Our biophysically motivated model provides a more comprehensive description of TF-binding in vivo from first principles towards a better representation of gene regulation in silico, with promising applications in systems biology.<br />Author Summary A quantitative description of transcription factor (TF) binding in vivo is critical for our understanding of gene regulation. Chromatin Immunoprecipitation followed by sequencing (ChIP-seq) provides a genome-scale map of TF-binding. However, a quantitative characterization of the impact of genome accessibility on TF-binding in bacteria remains elusive. In order to help recruit or block gene expression, TFs must have physical access to regulatory regions. This paper presents a thermodynamics model that describes TF-binding in terms of genome accessibility and binding site affinity. We apply this model in a ChIP-seq dataset for Mycobacterium tuberculosis and observed that genome accessibility is critical to our understanding of TF-binding in vivo. This new model provides practical applications, such as de novo prediction of TF-binding peaks and a framework to measure DNA accessibility from ChIP-seq data. Our model enables us to quantify the relationship of genome accessibility with genomic features and suggest mechanisms that influence genome accessibility in vivo (e.g. distance to oriC). The model proposed in this study gives new perspective for ChIP-seq analysis in bacteria towards an improved description of gene regulation in silico.
- Subjects :
- 0301 basic medicine
Gene regulatory network
Gene Expression
Biochemistry
Genome
0302 clinical medicine
Bacterial genetics
Invertebrate Genomics
Pathology
Gene Regulatory Networks
lcsh:QH301-705.5
Regulation of gene expression
Genetics
Bacterial Genomics
Ecology
Systems Biology
Physics
Microbial Genetics
Genomics
Synthetic genomics
Computational Theory and Mathematics
Regulatory sequence
Modeling and Simulation
Physical Sciences
Thermodynamics
Engineering and Technology
Synthetic Biology
Sequence Analysis
Protein Binding
Research Article
Chromatin Immunoprecipitation
In silico
Systems biology
Microbial Genomics
Computational biology
Biology
Research and Analysis Methods
Models, Biological
Microbiology
Biophysical Phenomena
03 medical and health sciences
Cellular and Molecular Neuroscience
Sequence Motif Analysis
DNA-binding proteins
Transcription factors
Gene Regulation
natural sciences
Molecular Biology Techniques
Sequencing Techniques
Molecular Biology
Transcription factor
Ecology, Evolution, Behavior and Systematics
Genetic regulation
Bacteria
Computational Biology
Biology and Life Sciences
Proteins
Bacteriology
Mycobacterium tuberculosis
Sequence Analysis, DNA
Synthetic Genomics
Regulatory Proteins
030104 developmental biology
lcsh:Biology (General)
Animal Genomics
FOS: Biological sciences
Linear Models
Genome, Bacterial
030217 neurology & neurosurgery
Subjects
Details
- ISSN :
- 15537358
- Volume :
- 12
- Database :
- OpenAIRE
- Journal :
- PLOS Computational Biology
- Accession number :
- edsair.doi.dedup.....bef4da4e93074507979727b514e427a1
- Full Text :
- https://doi.org/10.1371/journal.pcbi.1004891