Back to Search Start Over

Inferred regulons are consistent with regulator binding sequences in E. coli.

Authors :
Qiu S
Wan X
Liang Y
Lamoureux CR
Akbari A
Palsson BO
Zielinski DC
Source :
PLoS computational biology [PLoS Comput Biol] 2024 Jan 22; Vol. 20 (1), pp. e1011824. Date of Electronic Publication: 2024 Jan 22 (Print Publication: 2024).
Publication Year :
2024

Abstract

The transcriptional regulatory network (TRN) of E. coli consists of thousands of interactions between regulators and DNA sequences. Regulons are typically determined either from resource-intensive experimental measurement of functional binding sites, or inferred from analysis of high-throughput gene expression datasets. Recently, independent component analysis (ICA) of RNA-seq compendia has shown to be a powerful method for inferring bacterial regulons. However, it remains unclear to what extent regulons predicted by ICA structure have a biochemical basis in promoter sequences. Here, we address this question by developing machine learning models that predict inferred regulon structures in E. coli based on promoter sequence features. Models were constructed successfully (cross-validation AUROC > = 0.8) for 85% (40/47) of ICA-inferred E. coli regulons. We found that: 1) The presence of a high scoring regulator motif in the promoter region was sufficient to specify regulatory activity in 40% (19/47) of the regulons, 2) Additional features, such as DNA shape and extended motifs that can account for regulator multimeric binding, helped to specify regulon structure for the remaining 60% of regulons (28/47); 3) investigating regulons where initial machine learning models failed revealed new regulator-specific sequence features that improved model accuracy. Finally, we found that strong regulatory binding sequences underlie both the genes shared between ICA-inferred and experimental regulons as well as genes in the E. coli core pan-regulon of Fur. This work demonstrates that the structure of ICA-inferred regulons largely can be understood through the strength of regulator binding sites in promoter regions, reinforcing the utility of top-down inference for regulon discovery.<br />Competing Interests: The authors declare that there are no competing interests.<br /> (Copyright: © 2024 Qiu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)

Details

Language :
English
ISSN :
1553-7358
Volume :
20
Issue :
1
Database :
MEDLINE
Journal :
PLoS computational biology
Publication Type :
Academic Journal
Accession number :
38252668
Full Text :
https://doi.org/10.1371/journal.pcbi.1011824