Back to Search Start Over

Identification of putative essential protein domains from high-density transposon insertion sequencing

Authors :
A. S. M. Zisanur Rahman
Lukas Timmerman
Flyn Gallardo
Silvia T. Cardona
Source :
Scientific Reports, Scientific Reports, Vol 12, Iss 1, Pp 1-10 (2022)
Publication Year :
2022
Publisher :
Nature Publishing Group UK, 2022.

Abstract

A first clue to gene function can be obtained by examining whether a gene is required for life in certain standard conditions, that is, whether a gene is essential. In bacteria, essential genes are usually identified by high-density transposon mutagenesis followed by sequencing of insertion sites (Tn-seq). These studies assign the term “essential” to whole genes rather than the protein domain sequences that encode the essential functions. However, genes can code for multiple protein domains that evolve their functions independently. Therefore, when essential genes code for more than one protein domain, only one of them could be essential. In this study, we defined this subset of genes as “essential domain-containing” (EDC) genes. Using a Tn-seq data set built-in Burkholderia cenocepacia K56-2, we developed an in silico pipeline to identify EDC genes and the essential protein domains they encode. We found forty candidate EDC genes and demonstrated growth defect phenotypes using CRISPR interference (CRISPRi). This analysis included two knockdowns of genes encoding the protein domains of unknown function DUF2213 and DUF4148. These putative essential domains are conserved in more than two hundred bacterial species, including human and plant pathogens. Together, our study suggests that essentiality should be assigned to individual protein domains rather than genes, contributing to a first functional characterization of protein domains of unknown function.

Details

Language :
English
ISSN :
20452322
Volume :
12
Database :
OpenAIRE
Journal :
Scientific Reports
Accession number :
edsair.doi.dedup.....2f190a7e48b6df97151793333de5e350