Descriptor: "Pan-genome" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Pan-genome"' showing total 2,533 results

Start Over Descriptor "Pan-genome"

2,533 results on '"Pan-genome"'

1. Harnessing the predicted maize pan-interactome for putative gene function prediction and prioritization of candidate genes for important traits

Author: Poretsky, Elly, Cagirici, H Busra, Andorf, Carson M, and Sen, Taner Z
Subjects: Biological Sciences, Bioinformatics and Computational Biology, Genetics, Biotechnology, Human Genome, Networking and Information Technology R&D (NITRD), Generic health relevance, Plant Genetics and Genomics, pan-genome, predicted protein-protein interactions, interactome, protein function, gene candidate prioritization, PPI, Pan-genome, Biochemistry and cell biology, Statistics
Abstract: The recent assembly and annotation of the 26 maize nested association mapping (NAM) population founder inbreds have enabled large-scale pan-genomic comparative studies. These studies have expanded our understanding of agronomically important traits by integrating pan-transcriptomic data with trait-specific gene candidates from previous association mapping results. In contrast to the availability of pan-transcriptomic data, obtaining reliable protein-protein interaction (PPI) data has remained a challenge due to its high cost and complexity. We generated predicted PPI networks for each of the 26 genomes using the established STRING database. The individual genome-interactomes were then integrated to generate core- and pan-interactomes. We deployed the PPI clustering algorithm ClusterONE to identify numerous PPI clusters that were functionally annotated using gene ontology (GO) functional enrichment, demonstrating a diverse range of enriched GO terms across different clusters. Additional cluster annotations were generated by integrating gene co-expression data and gene description annotations, providing additional useful information. We show that the functionally annotated PPI clusters establish a useful framework for protein function prediction and prioritization of candidate genes of interest. Our study not only provides a comprehensive resource of predicted PPI networks for 26 maize genomes, but also offers annotated interactome clusters for predicting protein functions and prioritizing gene candidates. The source code for the Python implementation of the analysis workflow and a standalone web application for accessing the analysis results are available at https://github.com/eporetsky/PanPPI.
Published: 2024

2. Probiotic and anti-inflammatory properties of Lactiplantibacillus plantarum MKTJ24 isolated from an artisanal fermented fish of North-east India.

Author: Joishy, Tulsi K., Bhattacharya, Anupam, Singh, Chingtham Thanil, Mukherjee, Ashis K., and Khan, Mojibur R.
Subjects: *FERMENTED fish, *WHOLE genome sequencing, *REACTIVE oxygen species, *PAN-genome, *SMALL-scale fisheries, *LACTOBACILLUS plantarum, *OPERONS, *LACTIC acid bacteria
Abstract: The study aimed to isolate and characterize lactic acid bacteria from various traditional fermented fish products from North East India, including Xindol , Hentak, and Ngari, which hold significant dietary importance for the indigenous tribes. Additionally, the study sought to examine their untargeted metabolomic profiles. A total of 43 strains of Bacillus , Priestia, Staphylococcus, Pediococcus, and Lactiplantibacillus were isolated, characterized by 16 S rRNA gene and tested for probiotic properties. Five strains passed pH and bile salt tests with strain dependent antimicrobial activity, which exhibited moderate autoaggregation and hydrophobicity properties. Lactiplantibacillus plantarum MKTJ24 exhibited the highest hydrophobicity (42 %), which was further confirmed by adhesion assay in HT-29 cell lines (100 %). Lactiplantibacillus plantarum MKTJ24 treatment in LPS-stimulated HT-29 cells up-regulated expression of mucin genes compared to LPS-treated cells. Treatment of RAW 264.7 cells with Lactiplantibacillus plantarum MKTJ24 decreased LPS-induced reactive oxygen species (ROS) and nitric oxide (NO) productions. Further, genome analysis of Lactiplantibacillus plantarum MKTJ24 revealed the presence of several probiotic markers and immunomodulatory genes. The genome was found to harbor plantaricin operon involved in bacteriocin production. A pangenome analysis using all the publicly available L. plantarum genomes specifically isolated from fermented fish products identified 120 unique genes in Lactiplantibacillus plantarum MKTJ24. Metabolomic analysis indicated dominance of ascorbic acids, pentafluropropionate, cyclopropaneacetic acid, florobenzylamine, and furanone in Xindol. This study suggests that Lactiplantibacillus plantarum MKTJ24 has potential probiotic and immunomodulatory properties that could be used in processing traditional fermented fish products on an industrial scale to improve their quality and enhance functional properties. [Display omitted] • Immunomodulatory properties of Lactiplantibacillus plantarum MKTJ24 showed excellent probiotic properties was evaluated. • Lactiplantibacillus plantarum MKTJ23 and MKTJ24 were found in Xindol along with different strains of Pediococcus pentosaceus. • L. plantarum MKTJ24 reduced the production of reactive oxygen species and nitric oxide in LPS treated RAW264.7 cell lines. • Whole genome analysis confirmed the presence of probiotic and immune associated genes in the genome of L. plantarum MKTJ24. • Pangenome analysis has depicted the presence of diverse genomic content among the L. plantarum strains. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

3. Wheat genomics: genomes, pangenomes, and beyond.

Author: Tiwari, Vijay K., Saripalli, Gautam, Sharma, Parva K., and Poland, Jesse
Subjects: *PLANT breeding, *FUNCTIONAL genomics, *PAN-genome, *CROP improvement, *GERMPLASM
Abstract: As a major staple food crop for the global human population, climate-resilient wheat is the key to global food security. In this context, there is an urgent need to combine advances in genomic technologies and cutting-edge tools in the crop breeding pipelines. The genetic improvement of wheat starts with a clear understanding of genes and genomic components. The past 4–5 years have seen unprecedented growth in wheat genomes, pangenomes, and functional genomics studies. More than 50 reference-level genome assemblies are available for wheat and its wild and related species. Wheat's wild and related species provide a rich source of new genes and alleles to improve wheat. Reference genome assemblies of wheat's wild and related progenitor species have provided a new, helpful resource for targeted gene discovery for wheat improvement. This review explores the journey of wheat genomics starting in 2003. There is an urgent need to improve wheat for upcoming challenges, including biotic and abiotic stresses. Sustainable wheat improvement requires the introduction of new genes and alleles in high-yielding wheat cultivars. Using new approaches, tools, and technologies to identify and introduce new genes in wheat cultivars is critical. High-quality genomes, transcriptomes, and pangenomes provide essential resources and tools to examine wheat closely to identify and manipulate new and targeted genes and alleles. Wheat genomics has improved excellently in the past 5 years, generating multiple genomes, pangenomes, and transcriptomes. Leveraging these resources allows us to accelerate our crop improvement pipelines. This review summarizes the progress made in wheat genomics and trait discovery in the past 5 years. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

4. Pan‐genome analysis of 13 Spinacia accessions reveals structural variations associated with sex chromosome evolution and domestication traits in spinach.

Author: She, Hongbing, Liu, Zhiyuan, Xu, Zhaosheng, Zhang, Helong, Wu, Jian, Cheng, Feng, Wang, Xiaowu, and Qian, Wei
Subjects: *Y chromosome, *SEX chromosomes, *GENETIC variation, *PLANT breeding, *GENE expression
Abstract: Summary: Structural variations (SVs) are major genetic variants that can be involved in the origin, adaptation and domestication of species. However, the identification and characterization of SVs in Spinacia species are rare due to the lack of a pan‐genome. Here, we report eight chromosome‐scale assemblies of cultivated spinach and its two wild species. After integration with five existing assemblies, we constructed a comprehensive Spinacia pan‐genome and identified 193 661 pan‐SVs, which were genotyped in 452 Spinacia accessions. Our pan‐SVs enabled genome‐wide association study identified signals associated with sex and clarified the evolutionary direction of spinach. Most sex‐linked SVs (86%) were biased to occur on the Y chromosome during the evolution of the sex‐linked region, resulting in reduced Y‐linked gene expression. The frequency of pan‐SVs among Spinacia accessions further illustrated the contribution of these SVs to domestication, such as bolting time and seed dormancy. Furthermore, compared with SNPs, pan‐SVs act as efficient variants in genomic selection (GS) because of their ability to capture missing heritability information and higher prediction accuracy. Overall, this study provides a valuable resource for spinach genomics and highlights the potential utility of pan‐SV in crop improvement and breeding programmes. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. A de novo germline pathogenic BRCA1 variant identified following an osteosarcoma pangenomic molecular analysis.

Author: Mouren, Adrien, Chansavang, Albain, Hamzaoui, Nadim, Srikaran, Arunya, Laurent-Puig, Pierre, Marisa, Laetitia, De Percin, Sixtine, Lupo, Audrey, Larousserie, Frédérique, Blons, Hélène, L'Haridon, Anais, Burnichon, Nelly, Pasmant, Eric, and Tlemsani, Camille
Subjects: HOMOLOGOUS recombination, WHOLE genome sequencing, PAN-genome, BRCA genes, MOSAICISM
Abstract: De novo germline pathogenic variants (gPV) of the BReast CAncer 1 (BRCA1) gene are very rare. Only a few have been described up to date, usually in patients with a history of ovarian or breast cancer. Here, we report the first case of an incidental de novo BRCA1 germline pathogenic variant which was identified within the framework of the Plan France Médecine Génomique (PFMG) 2025 French national tumor sequencing program. The proband was a 29-year-old man diagnosed with metastatic osteosarcoma. Tumor whole exome sequencing identified a BRCA1 c.3756_3759del p.(Ser1253Argfs*10) pathogenic variant without loss-of-heterozygosity. A low genomic instability score and the absence of single base substitution signatures of homologous recombination deficiency suggested that the BRCA1 variant was not driver in the osteosarcoma tumorigenesis. Germline whole genome sequencing asserted the germline nature of this variant, with a 36% allele frequency, suggesting a mosaicism caused by a post-zygotic mutational event. The proband's family (parents and siblings) were not carriers of this variant confirming the de novo occurrence. Tumor sequencing programs like the French PFMG 2025 have been implemented worldwide and may help identify new gPV, including de novo variants. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

6. Large-scale genomic analysis of Elizabethkingia anophelis.

Author: Andriyanov, Pavel, Zhurilov, Pavel, Menshikova, Alena, Tutrina, Anastasia, Yashin, Ivan, and Kashina, Daria
Subjects: *HORIZONTAL gene transfer, *GENOMICS, *DRUG resistance in microorganisms, *PAN-genome, *INFECTIOUS disease transmission, *MOBILE genetic elements
Abstract: The recent emergence of Elizabethkingia anophelis as a human pathogen is a major concern for global public health. This organism has the potential to cause severe infections and has inherent antimicrobial resistance. The potential for widespread outbreaks and rapid global spread highlights the critical importance of understanding the biology and transmission dynamics of this infectious agent. We performed a large-scale analysis of available 540 E. anophelis, including one novel strain isolated from raw milk and sequenced in this study. Pan-genome analysis revealed an open and diverse pan-genome in this species, characterized by the presence of many accessory genes. This suggests that the species has a high level of adaptability and can thrive in a variety of environments. Phylogenetic analysis has also revealed a complex population structure, with limited source-lineage correlation. We identified diverse antimicrobial resistance factors, including core-genome and accessory ones often associated with mobile genetic elements within specific lineages. Mobilome analysis revealed a dynamic landscape primarily composed of genetic islands, integrative and conjugative elements, prophage elements, and small portion of plasmids emphasizing a complex mechanism of horizontal gene transfer. Our study underscores the adaptability of E. anophelis, characterized by a diverse range of antimicrobial resistance genes, putative virulence factors, and genes enhancing fitness. This adaptability is also supported by the organism's ability to acquire genetic material through horizontal gene transfer, primarily facilitated by mobile genetic elements such as integrative and conjugative elements (ICEs). The potential for rapid evolution of this emerging pathogen poses a significant challenge to public health efforts. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

7. Pangenome analysis of five representative Tropheryma whipplei strains following multiepitope-based vaccine design via immunoinformatic approaches.

Author: Hasan, Ahmad, Ibrahim, Muhammad, Alonazi, Wadi B., Yu, Rongrong, and Li, Bin
Subjects: *MOLECULAR dynamics, *VACCINE approval, *CENTRAL nervous system, *CARDIOVASCULAR system, *PAN-genome
Abstract: Whipple disease caused by Tropheryma whipplei a gram-positive bacterium is a systemic disorder that impacts not only the gastrointestinal tract but also the vascular system, joints, central nervous system, and cardiovascular system. Due to the lack of an approved vaccine, this study aimed to utilize immunoinformatic approaches to design multiepitope -based vaccine by utilizing the proteomes of five representative T. whipplei strains. The genomes initially comprised a total of 4,844 proteins ranging from 956 to 1012 proteins per strain. We collected 829 nonredundant lists of core proteins, that were shared among all the strains. Following subtractive proteomics, one extracellular protein, WP_033800108.1, a WhiB family transcriptional regulator, was selected for the chimeric-based multiepitope vaccine. Five immunodominant epitopes were retrieved from the WhiB family transcriptional regulator protein, indicating MHC-I and MHC-II with a global population coverage of 70.61%. The strong binding affinity, high solubility, nontoxicity, nonallergenic properties and high antigenicity scores make the selected epitopes more appropriate. Integration of the epitopes into a chimeric vaccine was carried out by applying appropriate adjuvant molecules and linkers, leading to the vaccine construct having enhanced immunogenicity and successfully eliciting both innate and adaptive immune responses. Moreover, the abilityof the vaccine to bind TLR4, a core innate immune receptor, was confirmed. Molecular dynamics simulations have also revealed the promising potential stability of the designed vaccine at 400 ns. In summary, we have designed a potential vaccine construct that has the ability not only to induce targeted immunogenicity for one strain but also for global T. whipplei strains. This study proposes a potential universal vaccine, reducing Whipple's disease risk and laying the groundwork for future research on multi-strain pathogens. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

8. Pangenome graph analysis reveals extensive effector copy-number variation in spinach downy mildew.

Author: Skiadas, Petros, Vidal, Sofía Riera, Dommisse, Joris, Mendel, Melanie N., Elberse, Joyce, Van den Ackerveken, Guido, de Jonge, Ronnie, and Seidl, Michael F.
Subjects: *BIOLOGICAL evolution, *PHYTOPATHOGENIC microorganisms, *CHROMOSOME structure, *PAN-genome, *PLANT evolution
Abstract: Plant pathogens adapt at speeds that challenge contemporary disease management strategies like the deployment of disease resistance genes. The strong evolutionary pressure to adapt, shapes pathogens' genomes, and comparative genomics has been instrumental in characterizing this process. With the aim to capture genomic variation at high resolution and study the processes contributing to adaptation, we here leverage an innovative, multi-genome method to construct and annotate the first pangenome graph of an oomycete plant pathogen. We expand on this approach by analysing the graph and creating synteny based single-copy orthogroups for all genes. We generated telomere-to-telomere genome assemblies of six genetically diverse isolates of the oomycete pathogen Peronospora effusa, the economically most important disease in cultivated spinach worldwide. The pangenome graph demonstrates that P. effusa genomes are highly conserved, both in chromosomal structure and gene content, and revealed the continued activity of transposable elements which are directly responsible for 80% of the observed variation between the isolates. While most genes are generally conserved, virulence related genes are highly variable between the isolates. Most of the variation is found in large gene clusters resulting from extensive copy-number expansion. Pangenome graph-based discovery can thus be effectively used to capture genomic variation at exceptional resolution, thereby providing a framework to study the biology and evolution of plant pathogens. Author summary: Plant pathogens are known to evolve rapidly and overcome disease resistance of newly introduced crop varieties. This swift adaptation is visible in the genomes of these pathogens, which can be highly variable. Such genomic variation cannot be captured with contemporary comparative genomic methods that rely on a single reference genome or focus solely on protein coding genes. To overcome these limitations and compare multiple genomes in a robust and scalable method, we constructed the first pangenome graph for an oomycete filamentous plant pathogen with six telomere-to-telomere genome assemblies of Peronospora effusa. This high-resolution pangenomic framework enabled detailed comparisons of the genomes at any level, from the nucleotide to the chromosome, and for any subset of protein-coding genes or transposable elements, to discover novel biology and potential mechanisms for the rapid evolution of this devastating pathogen. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

9. Analyzes of pan-genome and resequencing atlas unveil the genetic basis of jujube domestication.

Author: Guo, Mingxin, Lian, Qun, Mei, Ye, Yang, Wangwang, Zhao, Suna, Zhang, Siyuan, Xing, Xinfeng, Zhang, Haixiang, Gao, Keying, He, Wentong, Wang, Zhitong, Wang, Huan, Zhou, Jun, Cheng, Lin, Bao, Zhigui, Huang, Sanwen, Yan, Jianbin, and Zhao, Xusheng
Subjects: JUJUBE (Plant), FUNCTIONAL genomics, PAN-genome, FRUIT ripening, GENETIC variation
Abstract: Jujube (Ziziphus jujuba Mill.), belonging to the Rhamnaceae family, is gaining increasing prominence as a perennial fruit crop with significant economic and medicinal values. Here, we conduct de novo assembly of four reference-grade genomes, encompassing one wild and three cultivated jujube accessions. We present insights into the population structure, genetic diversity, and genomic variations within a diverse collection of 1059 jujube accessions. Analyzes of the jujube pan-genome, based on our four assemblies and four previously released genomes, reveal extensive genomic variations within domestication-associated regions, potentially leading to the discovery of a candidate gene that regulates flowering and fruit ripening. By leveraging the pan-genome and a large-scale resequencing population, we identify two candidate genes involved in domestication traits, including the seed-setting rate, the bearing-shoot length and the leaf size in jujube. These genomic resources will accelerate evolutionary and functional genomics studies of jujube. Jujube is a perennial fruit crop with significant economic and medicinal values. Here, the author report genome assemblies of four jujube accessions, construct pan-genome together with four previously published genomes, and generate resequencing data to reveal genetic basis of domestication traits. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. A pan-genomic analysis based multi-epitope vaccine development by targeting Stenotrophomonas maltophilia using reverse vaccinology method: an in-silico approach.

Author: Shovon, Md. Hasan Jafre, Imtiaz, Md., Biswas, Partha, Tareq, Md. Mohaimenul Islam, Zilani, Md Nazmul Hasan, and Hasan, Md.Nazmul
Subjects: *STENOTROPHOMONAS maltophilia, *NOSOCOMIAL infections, *GENOMICS, *VACCINE development, *RESPIRATORY organs
Abstract: Antibiotic resistance in bacteria leads to high mortality rates and healthcare costs, a significant concern for public health. A colonizer of the human respiratory system, Stenotrophomonas maltophilia is frequently associated with hospital-acquired infections in individuals with cystic fibrosis, cancer, and other chronic illnesses. The importance of this study is underscored by its capacity to meet the critical demand for effective preventive strategies against this pathogen, particularly among susceptible groups of cystic fibrosis and those undergoing cancer treatment. In this study, we engineered a multi-epitope vaccine targeting S. maltophilia through genomic analysis, reverse vaccination strategies, and immunoinformatic techniques by examining a total of 81 complete genomes of S. maltophilia strains. Our investigation revealed 1945 core protein-coding genes alongside their corresponding proteomic sequences, with 191 of these genes predicted to exhibit virulence characteristics. Out of the filtered proteins, three best antigenic proteins were selected for epitope prediction while seven epitopes each from CTL, HTL, and B cell were chosen for vaccine development. The vaccine was refined and validated, showing highly antigenic and desirable physicochemical features. Molecular docking assessments revealed stable binding with TLR-4. Molecular dynamic simulation demonstrated stable dynamics with minor alterations. The originality of this investigation is rooted in the thorough techniques aimed at designing a vaccine that directly targets S. maltophilia, a microorganism of considerable clinical relevance that currently lacks an available vaccine. This study not only responds to a pressing public health crisis but also lays the groundwork for subsequent research endeavors focused on the prevention of S. maltophilia outbreaks. Further evidence from studies in mice models is needed to confirm immune protection against S. maltophilia. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. A unified framework to analyze transposable element insertion polymorphisms using graph genomes.

Author: Groza, Cristian, Chen, Xun, Wheeler, Travis J., Bourque, Guillaume, and Goubert, Clément
Subjects: MOBILE genetic elements, DROSOPHILA melanogaster, PAN-genome, CANNABIS (Genus), GRAPH algorithms
Abstract: Transposable elements are ubiquitous mobile DNA sequences generating insertion polymorphisms, contributing to genomic diversity. We present GraffiTE, a flexible pipeline to analyze polymorphic mobile elements insertions. By integrating state-of-the-art structural variant detection algorithms and graph genomes, GraffiTE identifies polymorphic mobile elements from genomic assemblies or long-read sequencing data, and genotypes these variants using short or long read sets. Benchmarking on simulated and real datasets reports high precision and recall rates. GraffiTE is designed to allow non-expert users to perform comprehensive analyses, including in models with limited transposable element knowledge and is compatible with various sequencing technologies. Here, we demonstrate the versatility of GraffiTE by analyzing human, Drosophila melanogaster, maize, and Cannabis sativa pangenome data. These analyses reveal the landscapes of polymorphic mobile elements and their frequency variations across individuals, strains, and cultivars. Transposable element (TE) activity affects genome structure. Here, authors present GraffiTE, a framework for analysing polymorphic TEs in long reads or assemblies. It combines state-of-the-art variant search, TE annotation, and graph-genotyping, and has proven versatile across eukaryotic models. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

12. Genomic Diversity of Streptomyces clavuligerus : Implications for Clavulanic Acid Biosynthesis and Industrial Hyperproduction.

Author: Ríos-Fernández, Paula, Caicedo-Montoya, Carlos, and Ríos-Estepa, Rigoberto
Subjects: *GENOMICS, *CLAVULANIC acid, *PAN-genome, *METABOLISM, *BIOSYNTHESIS
Abstract: Streptomyces clavuligerus is a species used worldwide to industrially produce clavulanic acid (CA), a molecule that enhances antibiotic effectiveness against β-lactamase-producing bacterial strains. Despite its low inherent CA production, hyper-producing strains have been developed. However, genomic analyses specific to S. clavuligerus and CA biosynthesis are limited. Genomic variations that may influence CA yield were explored using S. clavuligerus strain genomes from diverse sources. Despite the slight differences obtained by similarity index calculation, pan-genome estimation revealed that only half of the genes identified were present in all strains. As expected, core genes were associated with primary metabolism, while the remaining genes were linked to secondary metabolism. Differences at the sequence level were more likely to be found in regions close to the tips of the linear chromosome. Wild-type strains preserved larger chromosomal and plasmid regions compared to industrial and/or hyper-producing strains; such a grouping pattern was also found through refined phylogenetic analyses. These results provide essential insights for the development of hyper-producing S. clavuligerus strains, attending to the critical demand for this antibiotic enhancer and contributing to future strategies for CA production optimization. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

13. Genome characterization of Shewanella algae in Hainan Province, China.

Author: Licheng Wang, Shaojin Chen, Mei Xing, Lingzhi Dong, Huaxiong Zhu, Yujin Lin, Jinyi Li, Tuo Sun, Xiong Zhu, and Xiaoxia Wang
Subjects: MICROBIAL sensitivity tests, SHEWANELLA, PAN-genome, MULTIDRUG resistance, DRUG resistance in bacteria
Abstract: Shewanella algae is an emerging marine zoonotic pathogen. In this study, we first reported the Shewanella algae infections in patients and animals in Hainan Province, China. Currently, there is still relatively little known about the whole-genome characteristics of Shewanella algae in most tropical regions, including in southern China. Here, we sequenced the 62 Shewanella algae strains isolated from Hainan Province and combined with the whole genomes sequences of 144 Shewanella algae genomes from public databases to analyze genomic features. Phylogenetic analysis revealed that Shewanella algae is widely distributed in the marine environments of both temperate and tropical countries, exhibiting close phylogenetic relationships with genomes isolated from patients, animals, and plants. Thereby confirming that exposure to marine environments is a risk factor for Shewanella algae infections. Average nucleotide identity analysis indicated that the clonally identical genomes could be isolated from patients with different sample types at different times. Pan-genome analysis identified a total of 21,909 genes, including 1,563 core genes, 8,292 strainspecific genes, and 12,054 accessory genes. Multiple putative virulence-associated genes were identified, encompassing 14 categories and 16 subcategories, with 171 distinct virulence factors. Three different plasmid replicon types were detected in 33 genomes. Eleven classes of antibiotic resistance genes and 352 integrons were identified. Antimicrobial susceptibility testing revealed a high resistance rate to imipenem and colistin among the strains studied, with 5 strains exhibiting multidrug resistance. However, they were all sensitive to amikacin, minocycline, and tigecycline. Our findings clarify the genomic characteristics and population structure of Shewanella algae in Hainan Province. The results offer insights into the genetic basis of pathogenicity in Shewanella algae and enhance our understanding of its global phylogeography. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

14. Campylobacter coli of porcine origin exhibits an open pan-genome within a single clonal complex: insights from comparative genomic analysis.

Author: Ghatak, Sandeep, Prince Milton, Arockiasamy Arun, Das, Samir, Momin, Kasanchi M., Srinivas, Kandhan, Pyngrope, Daniel Aibor, and Priya, G. Bhuvana
Subjects: MOBILE genetic elements, CAMPYLOBACTER coli, GENOMICS, PAN-genome, FOOD pathogens
Abstract: Introduction: Although Campylobacter spp., including Campylobacter coli, have emerged as important zoonotic foodborne pathogens globally, the understanding of the genomic epidemiology of C. coli of porcine origin is limited. Methods: As pigs are an important reservoir of C. coli, we analyzed C. coli genomes that were isolated (n = 3) from pigs and sequenced (this study) them along with all other C. coli genomes for which pig intestines, pig feces, and pigs were mentioned as sources in the NCBI database up to January 6, 2023. In this paper, we report the pan-genomic features, the multi-locus sequence types, the resistome, virulome, and mobilome, and the phylogenomic analysis of these organisms that were obtained from pigs. Results and discussion: Our analysis revealed that, in addition to having an open pan-genome, majority (63%) of the typeable isolates of C. coli of pig origin belonged to a single clonal complex, ST-828. The resistome of these C. coli isolates was predominated by the genes tetO (53%), blaOXA-193 (49%), and APH (3')-IIIa (21%); however, the virulome analysis revealed a core set of 37 virulence genes. Analysis of the mobile genetic elements in the genomes revealed wide diversity of the plasmids and bacteriophages, while 30 transposons were common to all genomes of C. coli of porcine origin. Phylogenomic analysis showed two discernible clusters comprising isolates originating from Japan and another set of isolates comprising mostly copies of a type strain stored in three different culture collections. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

15. Pan-genome analysis of invasive Streptococcus mutans strains.

Author: Sujitha, Srinivasan, Gunasekaran, Paramasamy, and Rajendhran, Jeyaprakash
Subjects: *EXTRACELLULAR matrix proteins, *PAN-genome, *STREPTOCOCCUS mutans, *PROTEIN-protein interactions, *ENDOTHELIAL cells
Abstract: Streptococcus mutans is responsible for dental problems and is associated with cardiovascular co-morbidities. Only a few selected strains can adhere to and invade endothelial cells. To ascertain which strains have the capability to invade cardiovascular cells, in silico PCR was performed on all the 193 available strains. The genome sequences were screened for collagen-binding genes cnm and cbm. Among the 193 strains tested, only 4 showed the presence of collagen-binding gene. BPGA tool was used for pan-genome analysis of invasive strains. Results indicated an almost closed pan-genome for S. mutans comprising 45,654 core genes, 29,452 accessory genes and 232 unique genes. Most of the unique genes belonged to only 5 genomes amongst the 42 invasive genomes analysed. These five genomes were screened for the presence of virulence genes using the MP3 software. Protein--protein interactions between the pathogenic proteins and extracellular matrix components were analysed using HPIDB. Surface-localized proteins were predicted to interact with the human tumour suppressor gene. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

16. Complete genome sequences of two Pantoea stewartii strains ATCC 8199 from maize and PSCN1 from sugarcane.

Author: Chu, Na, Liu, Tian-Tian, Zhang, Hui-Li, Cui, Dong, Huang, Mei-Ting, Fu, Hua-Ying, Su, Jun-Bo, and Gao, San-Ji
Subjects: *BIOLOGICAL classification, *BACTERIAL genomes, *WHOLE genome sequencing, *PAN-genome, *CORN diseases
Abstract: Objectives: The pathogen of Pantoea stewartii (Ps) is the causal agent of bacterial disease in corn and various graminaceous plants. Ps has two subspecies, Pantoea stewartii subsp. stewartia (Pss) and Pantoea stewartii subsp. indologenes (Psi). This study presents two complete genomes of Ps strains including ATCC 8199 isolated from maize and PSCN1 causing bacterial wilt in sugarcane. The two bacterial genomes information will be helpful for taxonomy analysis in this genus Pantoea at whole-genome levels and accurately discriminated the two subspecies of Pss and Psi. Data description: The reference strain ATCC 8199 isolated from maize was purchased from Beijing Biobw Biotechnology Co., Ltd. (China) and the strain of PSCN1 was isolated from sugarcane cultivar YZ08-1095 in Zhanjiang, Guangdong province of China. Two complete genomes were sequenced using Illumina Hiseq (second-generation) and Oxford Nanopore (third-generation) platforms. The genome of the strain ATCC 8199 comprised of 4.78 Mb with an average GC content of 54.03%, along with five plasmids, encoding a total of 4,846 gene with an average gene length of 827 bp. The genome of PSCN1 comprised of 5.03 Mb with an average GC content of 53.78%, along with two plasmids, encoding a total of 4,725 gene with an average gene length of 913 bp. The bacterial pan-genome analysis highlighted the strain ATCC 8199 was clustered into a subgroup with a Pss strain CCUG 26,359 from USA, while the strain PSCN1 was clustered into another subgroup with a Ps strain NRRLB-133 from USA. These findings will serve as a useful resource for further analyses of the evolution of Ps strains and corresponding disease epidemiology worldwide. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

17. Chromosomal gene order defines several structural classes of Staphylococcus epidermidis genomes.

Author: Nagy, Naya and Hodor, Paul
Subjects: *PAN-genome, *STAPHYLOCOCCUS epidermidis, *GENOMES, *GENES, *SPECIES
Abstract: The original methodology for describing the pangenome of a prokaryotic species is based on modeling genomes as unordered sets of genes. More recent findings have underlined the importance of considering the ordering of genes along the genetic material as well, when making comparisons among genomes. To further investigate the benefits of gene order when describing genomes of a given species, we applied two distance metrics on a dataset of 84 genomes of Staphylococcus epidermidis. The first metric, GeLev, depends on the order of genes and is a derivative of the Levenshtein distance. The second, the Jaccard distance, depends on gene sets only. The application of these distances reveals information about the global structure of the genomes, and allows clustering of the genomes into classes. The main biological result is that, while genomes within the same class are structurally similar, genomes of different classes have an additional characteristic. Between genomes in different classes we can discover instances where a large segment of the first genome appears in reverse order in the second. This feature suggests that genome rearrangements in S. epidermidis happen on a large scale, while micro-rearrangements of single or a small number of genes are rare. Thus, this paper describes a straight-forward method to classify genomes into structural classes with the same order of genes and makes it possible to visualize reversed segments in pairs of genomes. The method can be readily applied to other species. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

18. Comparative genomics of zoonotic pathogen Clostridioides difficile of animal origin to understand its diversity.

Author: Karthik, Kumaragurubaran, Anbazhagan, Subbaiyan, Priyadharshini, Murugaiyan Latha Mala, Sharma, Rajeev Kumar, and Manoharan, Seeralan
Subjects: *WHOLE genome sequencing, *CLOSTRIDIOIDES difficile, *COMPARATIVE genomics, *PAN-genome, *CD26 antigen
Abstract: Clostridioides difficile, a zoonotic pathogen causing enteric diseases in different animals and humans. A comprehensive study on the presence of toxin genes and antimicrobial resistance genes based on genome data of C. difficile in animals is scanty. In the present study, a total of 15 C. difficile isolates were recovered from dogs and isolates with toxin genes (D1, CD15 and CD26) along with two other non-toxigenic strains (CD28, CD32) were used for whole genome sequencing and comparative genomics. Sequence type-based clustering was noted in the whole genome phylogeny with 4 known multi-locus sequence typing (MLST) clades namely I, II, IV, and V and a cryptic clade. ST11 and ST54 were reported for the 2nd time worldwide in dogs. Out of 109 genomes used in the study, 29 genomes were predicted with all four toxin genes (toxA, toxB, cdtA, cdtB) while 22 did not have any of the toxin genes. ST11 of MLST clade V had the maximum number of 46 genomes predicted with at least one toxin gene. Among the genomes sequenced in this study, CD26 had a maximum of 5 AMR genes (aac(6′)-aph(2″), ant(6)-Ia, catP, erm(B)_18, and tet(M)_11) and CD15 was predicted with 2 AMR genes (aac(6′)-aph(2″), erm(B)_18). Tetracycline resistance genes were predicted most in the ST11 genome. Of the 22 non-toxigenic strains, 9 genomes (ST48 = 5, ST3 = 2, ST109 = 1, ST15 = 1) were predicted with a minimum of one AMR gene. Pangenome analysis indicated that the Bpan value is 0.12 showing that C. difficile has an open pangenome structure. This indicates that the organism can evolve by the addition of new genes. This study reports the circulation of clinically important ST11 and multidrug-resistant non-toxigenic strains among animals. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

19. The goat pan-genome reveals patterns of gene loss during domestication.

Author: Liu, Jiaxin, Shi, Yilong, Mo, Dongxin, Luo, Lingyun, Xu, Songsong, and Lv, Fenghua
Subjects: *GOAT breeds, *GERMPLASM conservation, *GENETIC variation, *PAN-genome, *GOATS
Abstract: Background: Unveiling genetic diversity features and understanding the genetic mechanisms of diverse goat phenotypes are pivotal in facilitating the preservation and utilization of these genetic resources. However, the total genetic diversity within a species can't be captured by the reference genome of a single individual. The pan-genome is a collection of all the DNA sequences that occur in a species, and it is expected to capture the total genomic diversity of the specific species. Results: We constructed a goat pan-genome using map-to-pan assemble based on 813 individuals, including 723 domestic goats and 90 samples from their wild relatives, which presented a broad regional and global representation. In total, 146 Mb sequences and 974 genes were identified as absent from the reference genome (ARS1.2; GCF_001704415.2). We identified 3,190 novel single nucleotide polymorphisms (SNPs) using the pan-genome analysis. These novel SNPs could properly reveal the population structure of domestic goats and their wild relatives. Presence/absence variation (PAV) analysis revealed gene loss and intense negative selection during domestication and improvement. Conclusions: Our research highlights the importance of the goat pan-genome in capturing the missing genetic variations. It reveals the changes in genomic architecture during goat domestication and improvement, such as gene loss. This improves our understanding of the evolutionary and breeding history of goats. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. Pangenome Data Analysis Reveals Characteristics of Resistance Gene Analogs Associated with Sclerotinia sclerotiorum Resistance in Sunflower.

Author: Lu, Yan, Huang, Jiaying, Liu, Dongqi, Kong, Xiangjiu, Song, Yang, and Jing, Lan
Subjects: *LOCUS (Genetics), *PAN-genome, *SCLEROTINIA sclerotiorum, *PRODUCTION losses, *NATURAL immunity
Abstract: The sunflower, an important oilseed crop and food source across the world, is susceptible to several pathogens, which cause severe losses in sunflower production. The utilization of genetic resistance is the most economical, effective measure to prevent infectious diseases. Based on the sunflower pangenome, in this study, we explored the variability of resistance gene analogs (RGAs) within the species. According to a comparative analysis of RGA candidates in the sunflower pangenome using the RGAugury pipeline, a total of 1344 RGAs were identified, comprising 1107 conserved, 199 varied, and 38 rare RGAs. We also identified RGAs associated with resistance against Sclerotinia sclerotiorum (S. sclerotiorum) in sunflower at the quantitative trait locus (QTL). A total of 61 RGAs were found to be located at four quantitative trait loci (QTLs). Through a detailed expression analysis of RGAs in one susceptible and two tolerant sunflower inbred lines (ILs) across various time points post inoculation, we discovered that 348 RGAs exhibited differential expression in response to Sclerotinia head rot (SHR), with 17 of these differentially expressed RGAs being situated within the QTL regions. In addition, 15 RGA candidates had gene introgression. Our data provide a better understanding of RGAs, which facilitate genomics-based improvements in disease resistance in sunflower. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

21. Elastic-Degenerate String Matching with 1 Error or Mismatch.

Author: Bernardini, Giulia, Gabory, Esteban, Pissis, Solon P., Stougie, Leen, Sweering, Michelle, and Zuba, Wiktor
Subjects: *HAMMING distance, *COMPUTATIONAL geometry, *MATRIX multiplications, *STATISTICAL decision making, *PAN-genome
Abstract: An elastic-degenerate (ED) string is a sequence of n finite sets of strings of total length N, introduced to represent a set of related DNA sequences, also known as a pangenome. The ED string matching (EDSM) problem consists in reporting all occurrences of a pattern of length m in an ED text. The EDSM problem has recently received some attention by the combinatorial pattern matching community, culminating in an O ~ (n m ω - 1) + O (N) -time algorithm [Bernardini et al., SIAM J. Comput. 2022], where ω denotes the matrix multiplication exponent and the O ~ (·) notation suppresses polylog factors. In the k-EDSM problem, the approximate version of EDSM, we are asked to report all pattern occurrences with at most k errors. k-EDSM can be solved in O (k 2 m G + k N) time, under edit distance, or O (k m G + k N) time, under Hamming distance, where G denotes the total number of strings in the ED text [Bernardini et al., Theor. Comput. Sci. 2020]. Unfortunately, G is only bounded by N, and so even for k = 1 , the existing algorithms run in Ω (m N) time in the worst case. In this paper we make progress in this direction. We show that 1-EDSM can be solved in O ((n m 2 + N) log m) or O (n m 3 + N) time under edit distance. For the decision version of the problem, we present a faster O (n m 2 log m + N log log m) -time algorithm. We also show that 1-EDSM can be solved in O (n m 2 + N log m) time under Hamming distance. Our algorithms for edit distance rely on non-trivial reductions from 1-EDSM to special instances of classic computational geometry problems (2d rectangle stabbing or 2d range emptiness), which we show how to solve efficiently. In order to obtain an even faster algorithm for Hamming distance, we rely on employing and adapting the k-errata trees for indexing with errors [Cole et al., STOC 2004]. This is an extended version of a paper presented at LATIN 2022. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

22. Contribution of the Mobilome to the Configuration of the Resistome of Corynebacterium striatum.

Author: Urrutia, Catherine, Leyton-Carcaman, Benjamin, and Abanto Marin, Michel
Subjects: *NASAL mucosa, *PAN-genome, *NOSOCOMIAL infections, *INTEGRONS, *DRUG resistance in microorganisms, *MOBILE genetic elements
Abstract: Corynebacterium striatum, present in the microbiota of human skin and nasal mucosa, has recently emerged as a causative agent of hospital-acquired infections, notable for its resistance to multiple antimicrobials. Its mobilome comprises several mobile genetic elements, such as plasmids, transposons, insertion sequences and integrons, which contribute to the acquisition of antimicrobial resistance genes. This study analyzes the contribution of the C. striatum mobilome in the transfer and dissemination of resistance genes. In addition, integrative and conjugative elements (ICEs), essential in the dissemination of resistance genes between bacterial populations, whose role in C. striatum has not yet been studied, are examined. This study examined 365 C. striatum genomes obtained from the NCBI Pathogen Detection database. Phylogenetic and pangenome analyses were performed, the resistance profile of the bacterium was recognized, and mobile elements, including putative ICE, were detected. Bioinformatic analyses identified 20 antimicrobial resistance genes in this species, with the Ermx gene being the most predominant. Resistance genes were mainly associated with plasmid sequence regions and class 1 integrons. Although an ICE was detected, no resistance genes linked to this element were found. This study provided valuable information on the geographic spread and prevalence of outbreaks observed through phylogenetic and pangenome analyses, along with identifying antimicrobial resistance genes and mobile genetic elements that carry many of the resistance genes and may be the subject of future research and therapeutic approaches. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

23. Identification of key drivers of antimicrobial resistance in Enterococcus using machine learning.

Author: Kim, Jee In, Manuele, Alexander, Maguire, Finlay, Zaheer, Rahat, McAllister, Tim A., and Beiko, Robert G.
Subjects: *MACHINE learning, *ENTEROCOCCUS faecium, *ENTEROCOCCUS faecalis, *PAN-genome, *DRUG resistance in microorganisms
Abstract: With antimicrobial resistance (AMR) rapidly evolving in pathogens, quick and accurate identification of genetic determinants of phenotypic resistance is essential for improving surveillance, stewardship, and clinical mitigation. Machine learning (ML) models show promise for AMR prediction in diagnostics but require a deep understanding of internal processes to use effectively. Our study utilised AMR gene, pangenomic, and predicted plasmid features from 647 Enterococcus faecium and Enterococcus faecalis genomes across the One Health continuum, along with corresponding resistance phenotypes, to develop interpretive ML classifiers. Vancomycin resistance could be predicted with 99% accuracy with AMR gene features, 98% with pangenome features, and 96% with plasmid clusters. Top pangenome features overlapped with the resistance genes of the vanA operon, which are often laterally transmitted via plasmids. Doxycycline resistance prediction achieved approximately 92% accuracy with pangenome features, with the top feature being elements of Tn916 conjugative transposon, a tet(M) carrier. Erythromycin resistance prediction models achieved about 90% accuracy, but top features were negatively correlated with resistance due to the confounding effect of population structure. This work demonstrates the importance of reviewing ML models' features to discern biological relevance even when achieving high-performance metrics. Our workflow offers the potential to propose hypotheses for experimental testing, enhancing the understanding of AMR mechanisms, which are crucial for combating the AMR crisis. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

24. Toward building a comprehensive human pan-genome: The SEN-GENOME project.

Author: Gaye, Amadou, Sene, Andrea Regina G., Gadji, Macoura, Deme, Alioune, Cisse, Aynina, and Ndiaye, Rokhaya
Subjects: *HUMAN genome, *GENETIC variation, *PAN-genome, *ETHNOLINGUISTIC groups, *DATA harmonization
Abstract: The human reference genome (GRCh38), primarily sourced from individuals of European descent, falls short in capturing the vast genetic diversity across global populations. Efforts to diversify the reference genome face challenges in accessibility and representation, exacerbating the scarcity of African genomic data crucial for studying diseases prevalent in these populations. Sherman et al. proposed constructing reference genomes tailored to distinct human sub-populations. Their African Pan-Genome initiative highlighted substantial genetic variation missing from the GRCh38 human reference genome, emphasizing the necessity for population-specific genomes. In response, local initiatives like the Senegalese Genome project (SEN-GENOME) have emerged to document the genomes of historically overlooked populations. SEN-GENOME embodies community-driven decentralized research. With meticulous recruitment criteria and ethical practices, it aims to sequence 1,000 genomes from 31 ethnolinguistic groups, in the fourteen administrative regions of Senegal, fostering local genomic research tailored to the region. The key to SEN-GENOME's success is its commitment to local governance of data, capacity building, and integration with broader pan-genome projects in Africa. Despite the complexities of data harmonization and sharing, our collaborative efforts are aligned with common goals, ensuring steady progress toward a comprehensive human pan-genome. We invite and welcome collaboration with other research entities to achieve this shared vision. In summary, local initiatives such as SEN-GENOME are pivotal in bridging genomic disparities, offering pathways to equitable and inclusive genomic research. Collaborative endeavors guided by a collective vision for human health will propel us toward a more encompassing understanding of the human genome and better health through genomic medicine. The Senegalese Genome project (SEN-GENOME) aims to map the genomes of underrepresented Senegalese populations, enhancing local genomic research and precision medicine. By focusing on local governance and community-driven research, SEN-GENOME contributes to a comprehensive and inclusive human pan-genome, addressing genetic diversity and promoting health equity. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

25. Investigating Anthrax-Associated Virulence Genes among Archival and Contemporary Bacillus cereus Group Genomes.

Author: Sabin, Susanna J., Beesley, Cari A., Marston, Chung K., Paisie, Taylor K., Gulvik, Christopher A., Sprenger, Gregory A., Gee, Jay E., Traxler, Rita M., Bell, Melissa E., McQuiston, John R., and Weiner, Zachary P.
Subjects: WHOLE genome sequencing, BACILLUS anthracis, BACILLUS cereus, PSEUDOGENES, PAN-genome
Abstract: Bacillus anthracis causes anthrax through virulence factors encoded on two plasmids. However, non-B. anthracis organisms within the closely related, environmentally ubiquitous Bacillus cereus group (BCG) may cause an anthrax-like disease in humans through the partial adoption of anthrax-associated virulence genes, challenging the definition of anthrax disease. To elucidate these phenomena and their evolutionary past, we performed whole-genome sequencing on non-anthracis BCG isolates, including 93 archival (1967–2003) and 5 contemporary isolates (2019–2023). We produced annotated genomic assemblies and performed a pan-genome analysis to identify evidence of virulence gene homology and virulence gene acquisition by linear inheritance or horizontal gene transfer. At least one anthrax-associated virulence gene was annotated in ten isolates. Most homologous sequences in archival isolates showed evidence of pseudogenization and subsequent gene loss. The presence or absence of accessory genes, including anthrax-associated virulence genes, aligned with the phylogenetic structure of the BCG core genome. These findings support the hypothesis that anthrax-associated virulence genes were inherited from a common ancestor in the BCG and were retained or lost across different lineages, and contribute to a growing body of work informing public health strategies related to anthrax surveillance and identification. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

26. Retrospect and prospect of Nicotiana tabacum genome sequencing.

Author: Zhijun Tong, Yujie Huang, Qian-Hao Zhu, Longjiang Fan, Bingguang Xiao, and Enhui Shen
Subjects: PLANT genomes, PAN-genome, FUNCTIONAL genomics, TOBACCO, GENOMES
Abstract: Investigating plant genomes offers crucial foundational resources for exploring various aspects of plant biology and applications, such as functional genomics and breeding practices. With the development in sequencing and assembly technology, several Nicotiana tabacum genomes have been published. In this paper, we reviewed the progress on N. tabacum genome assembly and quality, from the initial draft genomes to the recent high-quality chromosome-level assemblies. The application of long-read sequencing, optical mapping, and Hi-C technologies has significantly improved the contiguity and completeness of N. tabacum genome assemblies, with the latest assemblies having a contig N50 size over 50 Mb. Despite these advancements, further improvements are still required and possible, particularly on the development of pan-genome and telomere-totelomere (T2T) genomes. These new genomes will capture the genomic diversity and variations among different N. tabacum cultivars and species, and provide a comprehensive view of the N. tabacum genome structure and gene content, so to deepen our understanding of the N. tabacum genome and facilitate precise breeding and functional genomics. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

27. Realizing visionary goals for the International Year of Millet (IYoM): accelerating interventions through advances in molecular breeding and multiomics resources.

Author: Chandra, Tilak, Jaiswal, Sarika, Tomar, Rukam Singh, Iquebal, Mir Asif, and Kumar, Dinesh
Abstract: Main conclusion: Leveraging advanced breeding and multi-omics resources is vital to position millet as an essential “nutricereal resource,” aligning with IYoM goals, alleviating strain on global cereal production, boosting resilience to climate change, and advancing sustainable crop improvement and biodiversity. The global challenges of food security, nutrition, climate change, and agrarian sustainability demand the adoption of climate-resilient, nutrient-rich crops to support a growing population amidst shifting environmental conditions. Millets, also referred to as “Shree Anna,” emerge as a promising solution to address these issues by bolstering food production, improving nutrient security, and fostering biodiversity conservation. Their resilience to harsh environments, nutritional density, cultural significance, and potential to enhance dietary quality index made them valuable assets in global agriculture. Recognizing their pivotal role, the United Nations designated 2023 as the “International Year of Millets (IYoM 2023),” emphasizing their contribution to climate-resilient agriculture and nutritional enhancement. Scientific progress has invigorated efforts to enhance millet production through genetic and genomic interventions, yielding a wealth of advanced molecular breeding technologies and multi-omics resources. These advancements offer opportunities to tackle prevailing challenges in millet, such as anti-nutritional factors, sensory acceptability issues, toxin contamination, and ancillary crop improvements. This review provides a comprehensive overview of molecular breeding and multi-omics resources for nine major millet species, focusing on their potential impact within the framework of IYoM. These resources include whole and pan-genome, elucidating adaptive responses to abiotic stressors, organelle-based studies revealing evolutionary resilience, markers linked to desirable traits for efficient breeding, QTL analysis facilitating trait selection, functional gene discovery for biotechnological interventions, regulatory ncRNAs for trait modulation, web-based platforms for stakeholder communication, tissue culture techniques for genetic modification, and integrated omics approaches enabled by precise application of CRISPR/Cas9 technology. Aligning these resources with the seven thematic areas outlined by IYoM catalyzes transformative changes in millet production and utilization, thereby contributing to global food security, sustainable agriculture, and enhanced nutritional consequences. The portrayal of millets in the context of the International Year of Millets (IYoM) framework includes a roadmap for their enhancement through future interventions, utilizing multi-omics resources. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

28. Pangenomic and biochemical analyses of Helcococcus ovis reveal widespread tetracycline resistance and a novel bacterial species, Helcococcus bovis.

Author: Cunha, Federico, Yuting Zhai, Casaro, Segundo, Jones, Kristi L., Hernandez, Modesto, Bisinotto, Rafael S., Kariyawasam, Subhashinie, Brown, Mary B., Phillips, Ashley, Jeong, Kwangcheol C., and Galvão, Klibs N.
Subjects: BIOLOGICAL extinction, WHOLE genome sequencing, DRUG resistance in bacteria, PAN-genome, PHENOTYPES
Abstract: Helcococcus ovis (H. ovis) is an opportunistic bacterial pathogen of a wide range of animal hosts including domestic ruminants, swine, avians, and humans. In this study, we sequenced the genomes of 35 Helcococcus sp. clinical isolates from the uterus of dairy cows and explored their antimicrobial resistance and biochemical phenotypes in vitro. Phylogenetic and average nucleotide identity analyses classified four Helcococcus isolates within a cryptic clade representing an undescribed species, for which we propose the name Helcococcus bovis sp. nov. By establishing this new species clade, we also resolve the longstanding question of the classification of the Tongji strain responsible for a confirmed human conjunctival infection. This strain did not neatly fit into H. ovis and is instead a member of H. bovis. We applied whole genome comparative analyses to explore the pangenome, resistome, virulome, and taxonomic diversity of the remaining 31 H. ovis isolates. An overwhelming 97% of H. ovis strains (30 out of 31) harbor mobile tetracycline resistance genes and displayed significantly increased minimum inhibitory concentrations of tetracyclines in vitro. The high prevalence of mobile tetracycline resistance genes makes H. ovis a significant antimicrobial resistance gene reservoir in our food chain. Finally, the phylogenetic distribution of co-occurring high-virulence determinant genes of H. ovis across unlinked and distant loci highlights an instance of convergent gene loss in the species. In summary, this study showed that mobile genetic element-mediated tetracycline resistance is widespread in H. ovis, and that there is evidence of cooccurring virulence factors across clades suggesting convergent gene loss in the species. Finally, we introduced a novel Helcococcus species closely related to H. ovis, called H. bovis sp. nov., which has been reported to cause infection in humans. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

29. Unravelling the main genomic features of Mycoplasma equirhinis.

Author: Martineau, Matthieu, Ambroset, Chloé, Lefebvre, Stéphanie, Kokabi, Éléna, Léon, Albertine, and Tardy, Florence
Subjects: *BACTERIAL genomes, *PAN-genome, *MYCOPLASMATALES, *GENOMES, *SPECIES
Abstract: Background: Mycoplasma spp. are wall-less bacteria with small genomes (usually 0.5–1.5 Mb). Many Mycoplasma (M.) species are known to colonize the respiratory tract of both humans and livestock animals, where they act as primary pathogens or opportunists. M. equirhinis was described for the first time in 1975 in horses but has been poorly studied since, despite regular reports of around 14% prevalence in equine respiratory disorders. We recently showed that M. equirhinis is not a primary pathogen but could play a role in co-infections of the respiratory tract. This study was a set up to propose the first genomic characterization to better our understanding of the M. equirhinis species. Results: Four circularized genomes, two of which were generated here, were compared in terms of synteny, gene content, and specific features associated with virulence or genome plasticity. An additional 20 scaffold-level genomes were used to analyse intra-species diversity through a pangenome phylogenetic approach. The M. equirhinis species showed consistent genomic homogeneity, pointing to potential clonality of isolates despite their varied geographical origins (UK, Japan and various places in France). Three different classes of mobile genetic elements have been detected: insertion sequences related to the IS1634 family, a putative prophage related to M. arthritidis and integrative conjugative elements related to M. arginini. The core genome harbours the typical putative virulence-associated genes of mycoplasmas mainly involved in cytoadherence and immune escape. Conclusion: M. equirhinis is a highly syntenic, homogeneous species with a limited repertoire of mobile genetic elements and putative virulence genes. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

30. Unveiling the Brazilian kefir microbiome: discovery of a novel Lactobacillus kefiranofaciens (LkefirU) genome and in silico prospection of bioactive peptides with potential anti-Alzheimer properties.

Author: Silva, Matheus H., Batista, Letícia L., Malta, Serena M., Santos, Ana C. C., Mendes-Silva, Ana P., Bonetti, Ana M., Ueira-Vieira, Carlos, and dos Santos, Anderson R.
Subjects: *DIETARY bioactive peptides, *ALZHEIMER'S disease, *PAN-genome, *MOLECULAR docking, *KEFIR
Abstract: Background: Kefir is a complex microbial community that plays a critical role in the fermentation and production of bioactive peptides, and has health-improving properties. The composition of kefir can vary by geographic localization and weather, and this paper focuses on a Brazilian sample and continues previous work that has successful anti-Alzheimer properties. In this study, we employed shotgun metagenomics and peptidomics approaches to characterize Brazilian kefir further. Results: We successfully assembled the novel genome of Lactobacillus kefiranofaciens (LkefirU) and conducted a comprehensive pangenome analysis to compare it with other strains. Furthermore, we performed a peptidome analysis, revealing the presence of bioactive peptides encrypted by L. kefiranofaciens in the Brazilian kefir sample, and utilized in silico prospecting and molecular docking techniques to identify potential anti-Alzheimer peptides, targeting β-amyloid (fibril and plaque), BACE, and acetylcholinesterase. Through this analysis, we identified two peptides that show promise as compounds with anti-Alzheimer properties. Conclusions: These findings not only provide insights into the genome of L. kefiranofaciens but also serve as a promising prototype for the development of novel anti-Alzheimer compounds derived from Brazilian kefir. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

31. Insights into group-specific pattern of secondary metabolite gene cluster in Burkholderia genus.

Author: Byeollee Kim, So-Ra Han, Hyun Lee, and Tae-Jin Oh
Subjects: GENE clusters, PRINCIPAL components analysis, PAN-genome, BURKHOLDERIA, SIDEROPHORES
Abstract: Burkholderia is a versatile strain that has expanded into several genera. It has been steadily reported that the genome features of Burkholderia exhibit activities ranging from plant growth promotion to pathogenicity across various isolation areas. The objective of this study was to investigate the secondary metabolite patterns of 366 Burkholderia species through comparative genomics. Samples were selected based on assembly quality assessment and similarity below 80% in average nucleotide identity. Duplicate samples were excluded. Samples were divided into two groups using FastANI analysis. Group A included B. pseudomallei complex. Group B included B. cepacia complex. The limitations of MLST were proposed. The detection of genes was performed, including environmental and virulence-related genes. In the pan-genome analysis, each complex possessed a similar pattern of cluster for orthologous groups. Group A (n = 185) had 14,066 cloud genes, 2,465 shell genes, 682 soft-core genes, and 2,553 strict-core genes. Group B (n = 181) had 39,867 cloud genes, 4,986 shell genes, 324 soft-core genes, 222 core genes, and 2,949 strict-core genes. AntiSMASH was employed to analyze the biosynthetic gene cluster (BGC). The results were then utilized for network analysis using BiG-SCAPE and CORASON. Principal component analysis was conducted and a table was constructed using the results obtained from antiSMASH. The results were divided into Group A and Group B. We expected the various species to show similar patterns of secondary metabolite gene clusters. For in-depth analysis, a network analysis of secondary metabolite gene clusters was conducted, exemplified by BiG-SCAPE analysis. Depending on the species and complex, Burkholderia possessed several kinds of siderophore. Among them, ornibactin was possessed in most Burkholderia and was clustered into 4,062 clans. There was a similar pattern of gene clusters depending on the species. NRPS_04014 belonged to siderophore BGCs including ornibactin and indigoidine. However, it was observed that each family included a similar species. This suggests that, besides siderophores being species-specific, the ornibactin gene cluster itself might also be species-specific. The results suggest that siderophores are associated with environmental adaptation, possessing a similar pattern of siderophore gene clusters among species, which could provide another perspective on speciesspecific environmental adaptation mechanisms. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

32. Haplotype-based pangenomes reveal genetic variations and climate adaptations in moso bamboo populations.

Author: Hou, Yinguang, Gan, Junwei, Fan, Zeyu, Sun, Lei, Garg, Vanika, Wang, Yu, Li, Shanying, Bao, Pengfei, Cao, Bingchen, Varshney, Rajeev K., and Zhao, Hansheng
Subjects: CLIMATE change adaptation, CLIMATE change mitigation, GENETIC variation, HAPLOTYPES, PAN-genome
Abstract: Moso bamboo (Phyllostachys edulis), an ecologically and economically important forest species in East Asia, plays vital roles in carbon sequestration and climate change mitigation. However, intensifying climate change threatens moso bamboo survival. Here we generate high-quality haplotype-based pangenome assemblies for 16 representative moso bamboo accessions and integrated these assemblies with 427 previously resequenced accessions. Characterization of the haplotype-based pangenome reveals extensive genetic variation, predominantly between haplotypes rather than within accessions. Many genes with allele-specific expression patterns are implicated in climate responses. Integrating spatiotemporal climate data reveals more than 1050 variations associated with pivotal climate factors, including temperature and precipitation. Climate-associated variations enable the prediction of increased genetic risk across the northern and western regions of China under future emissions scenarios, underscoring the threats posed by rising temperatures. Our integrated haplotype-based pangenome elucidates moso bamboo's local climate adaptation mechanisms and provides critical genomic resources for addressing intensifying climate pressures on this essential bamboo. More broadly, this study demonstrates the power of long-read sequencing in dissecting adaptive traits in climate-sensitive species, advancing evolutionary knowledge to support conservation. Moso bamboo is a critical species for carbon sequestration and mitigating climate change. This study presents a haplotype-based pangenome that uncovers substantial genetic diversity associated with climate adaptation and enables predictions of genetic vulnerability under future emission scenarios. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

33. Genomes of diverse Actinidia species provide insights into cis-regulatory motifs and genes associated with critical traits.

Author: Li, Xiaolong, Huo, Liuqing, Li, Xinyi, Zhang, Chaofan, Gu, Miaofeng, Fan, Jialu, Xu, Changbin, Gong, Jinli, Hu, Xiaoli, Zheng, Yi, and Sun, Xuepeng
Subjects: *GENE families, *COMPARATIVE genomics, *PAN-genome, *VITAMIN C, *FRUIT development, *KIWIFRUIT
Abstract: Background: Kiwifruit, belonging to the genus Actinidia, represents a unique fruit crop characterized by its modern cultivars being genetically diverse and exhibiting remarkable variations in morphological traits and adaptability to harsh environments. However, the genetic mechanisms underlying such morphological diversity remain largely elusive. Results: We report the high-quality genomes of five Actinidia species, including Actinidia longicarpa, A. macrosperma, A. polygama, A. reticulata, and A. rufa. Through comparative genomics analyses, we identified three whole genome duplication events shared by the Actinidia genus and uncovered rapidly evolving gene families implicated in the development of characteristic kiwifruit traits, including vitamin C (VC) content and fruit hairiness. A range of structural variations were identified, potentially contributing to the phenotypic diversity in kiwifruit. Notably, phylogenomic analyses revealed 76 cis-regulatory elements within the Actinidia genus, predominantly associated with stress responses, metabolic processes, and development. Among these, five motifs did not exhibit similarity to known plant motifs, suggesting the presence of possible novel cis-regulatory elements in kiwifruit. Construction of a pan-genome encompassing the nine Actinidia species facilitated the identification of gene DTZ79_23g14810 specific to species exhibiting extraordinarily high VC content. Expression of DTZ79_23g14810 is significantly correlated with the dynamics of VC concentration, and its overexpression in the transgenic roots of kiwifruit plants resulted in increased VC content. Conclusions: Collectively, the genomes and pan-genome of diverse Actinidia species not only enhance our understanding of fruit development but also provide a valuable genomic resource for facilitating the genome-based breeding of kiwifruit. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

34. Evolutionary dynamics of the successful expansion of pandemic Vibrio parahaemolyticus ST3 in Latin America.

Author: Campbell, Amy Marie, Gavilan, Ronnie G., Abanto Marin, Michel, Yang, Chao, Hauton, Chris, van Aerle, Ronny, and Martinez-Urtaza, Jaime
Subjects: VIBRIO parahaemolyticus, BIOLOGICAL fitness, MARINE west coast climate, PAN-genome, GENOMES
Abstract: The underlying evolutionary mechanisms driving global expansions of pathogen strains are poorly understood. Vibrio parahaemolyticus is one of only two marine pathogens where variants have emerged in distinct climates globally. The success of a Vibrio parahaemolyticus clone (VpST3) in Latin America- the first spread identified outside its endemic region of tropical Asia- provided an invaluable opportunity to investigate mechanisms of VpST3 expansion into a distinct marine climate. A global collection of VpST3 isolates and novel Latin American isolates were used for evolutionary population genomics, pangenome analysis and combined with oceanic climate data. We found a VpST3 population (LatAm-VpST3) introduced in Latin America well before the emergence of this clone in India, previously considered the onset of the VpST3 epidemic. LatAm-VpST3 underwent successful adaptation to local conditions over its evolutionary divergence from Asian VpST3 isolates, to become dominant in Latin America. Selection signatures were found in genes providing resilience to the distinct marine climate. Core genome mutations and accessory gene presences that promoted survival over long dispersals or increased environmental fitness were associated with environmental conditions. These results provide novel insights into the global expansion of this successful V. parahaemolyticus clone into regions with different climate scenarios. Vibrio parahaemolyticus Sequence Type 3 was first reported in India and expanded globally to areas with distinct climates including Latin America. In this study, the authors investigate the evolutionary mechanisms driving emergence of the strain in Latin America using 280 publicly available genomes and 32 samples newly sequenced in this study. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

35. Pan-genome analysis reveals novel chromosomal markers for multiplex PCR-based specific detection of Bacillus anthracis.

Author: Zorigt, Tuvshinzaya, Furuta, Yoshikazu, Paudel, Atmika, Kamboyi, Harvey Kakoma, Shawa, Misheck, Chuluun, Mungunsar, Sugawara, Misa, Enkhtsetseg, Nyamdorj, Enkhtuya, Jargalsaikhan, Battsetseg, Badgar, Munyeme, Musso, Hang'ombe, Bernard M., and Higashi, Hideaki
Subjects: *WHOLE genome sequencing, *ANTHRAX vaccines, *BACILLUS anthracis, *POLYMERASE chain reaction, *GENETIC markers
Abstract: Background: Bacillus anthracis is a highly pathogenic bacterium that can cause lethal infection in animals and humans, making it a significant concern as a pathogen and biological agent. Consequently, accurate diagnosis of B. anthracis is critically important for public health. However, the identification of specific marker genes encoded in the B. anthracis chromosome is challenging due to the genetic similarity it shares with B. cereus and B. thuringiensis. Methods: The complete genomes of B. anthracis, B. cereus, B. thuringiensis, and B. weihenstephanensis were de novo annotated with Prokka, and these annotations were used by Roary to produce the pan-genome. B. anthracis exclusive genes were identified by Perl script, and their specificity was examined by nucleotide BLAST search. A local BLAST alignment was performed to confirm the presence of the identified genes across various B. anthracis strains. Multiplex polymerase chain reactions (PCR) were established based on the identified genes. Result: The distribution of genes among 151 whole-genome sequences exhibited three distinct major patterns, depending on the bacterial species and strains. Further comparative analysis between the three groups uncovered thirty chromosome-encoded genes exclusively present in B. anthracis strains. Of these, twenty were found in known lambda prophage regions, and ten were in previously undefined region of the chromosome. We established three distinct multiplex PCRs for the specific detection of B. anthracis by utilizing three of the identified genes, BA1698, BA5354, and BA5361. Conclusion: The study identified thirty chromosome-encoded genes specific to B.anthracis, encompassing previously described genes in known lambda prophage regions and nine newly discovered genes from an undefined gene region to the best of our knowledge. Three multiplex PCR assays offer an accurate and reliable alternative method for detecting B. anthracis. Furthermore, these genetic markers have value in anthrax vaccine development, and understanding the pathogenicity of B. anthracis. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

36. Fusarium Protein Toolkit: a web-based resource for structural and variant analysis of Fusarium species.

Author: Kim, Hye-Seon, Haley, Olivia C., Portwood II, John L., Harding, Stephen, Proctor, Robert H., Woodhouse, Margaret R., Sen, Taner Z., and Andorf, Carson M.
Abstract: Background: The genus Fusarium poses significant threats to food security and safety worldwide because numerous species of the fungus cause destructive diseases and/or mycotoxin contamination in crops. The adverse effects of climate change are exacerbating some existing threats and causing new problems. These challenges highlight the need for innovative solutions, including the development of advanced tools to identify targets for control strategies. Description: In response to these challenges, we developed the Fusarium Protein Toolkit (FPT), a web-based tool that allows users to interrogate the structural and variant landscape within the Fusarium pan-genome. The tool displays both AlphaFold and ESMFold-generated protein structure models from six Fusarium species. The structures are accessible through a user-friendly web portal and facilitate comparative analysis, functional annotation inference, and identification of related protein structures. Using a protein language model, FPT predicts the impact of over 270 million coding variants in two of the most agriculturally important species, Fusarium graminearum and F. verticillioides. To facilitate the assessment of naturally occurring genetic variation, FPT provides variant effect scores for proteins in a Fusarium pan-genome based on 22 diverse species. The scores indicate potential functional consequences of amino acid substitutions and are displayed as intuitive heatmaps using the PanEffect framework. Conclusion: FPT fills a knowledge gap by providing previously unavailable tools to assess structural and missense variation in proteins produced by Fusarium. FPT has the potential to deepen our understanding of pathogenic mechanisms in Fusarium, and aid the identification of genetic targets for control strategies that reduce crop diseases and mycotoxin contamination. Such targets are vital to solving the agricultural problems incited by Fusarium, particularly evolving threats resulting from climate change. Thus, FPT has the potential to contribute to improving food security and safety worldwide. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

37. Genome-wide survey of KT/HAK/KUP genes in the genus Citrullus and analysis of their involvement in K+-deficiency and drought stress responses in between C. lanatus and C. amarus.

Author: Cheng, Rui, Zhao, Zhengxiang, Tang, Yan, Gu, Yan, Chen, Guodong, Sun, Yudong, and Wang, Xuezheng
Subjects: *HORTICULTURAL crops, *GENE expression, *GENE families, *PAN-genome, *PLANT growth, *WATERMELONS
Abstract: Background: The KT/HAK/KUP is the largest K+ transporter family in plants, playing crucial roles in K+ absorption, transport, and defense against environmental stress. Sweet watermelon is an economically significant horticultural crop belonging to the genus Citrullus, with a high demand for K+ during its growth process. However, a comprehensive analysis of the KT/HAK/KUP gene family in watermelon has not been reported. Results: 14 KT/HAK/KUP genes were identified in the genomes of each of seven Citrullus species. These KT/HAK/KUPs in watermelon were unevenly distributed across seven chromosomes. Segmental duplication is the primary driving force behind the expansion of the KT/HAK/KUP family, subjected to purifying selection during domestication (Ka/Ks < 1), and all KT/HAK/KUPs exhibit conserved motifs and could be phylogenetically classified into four groups. The promoters of KT/HAK/KUPs contain numerous cis-regulatory elements related to plant growth and development, phytohormone response, and stress response. Under K+ deficiency, the growth of watermelon seedlings was significantly inhibited, with cultivated watermelon experiencing greater impacts (canopy width, redox enzyme activity) compared to the wild type. All KT/HAK/KUPs in C. lanatus and C. amarus exhibit specific expression responses to K+-deficiency and drought stress by qRT-PCR. Notably, ClG42_07g0120700/CaPI482276_07g014010 were predominantly expressed in roots and were further induced by K+-deficiency and drought stress. Additionally, the K+ transport capacity of ClG42_07g0120700 under low K+ stress was confirmed by yeast functional complementation assay. Conclusions: KT/HAK/KUP genes in watermelon were systematically identified and analyzed at the pangenome level and provide a foundation for understanding the classification and functions of the KT/HAK/KUPs in watermelon plants. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

38. Genomic analyses of agronomic traits in tea plants and related Camellia species.

Author: Shengchang Duan, Liang Yan, Zongfang Shen, Xuzhen Li, Baozheng Chen, Dawei Li, Hantao Qin, Meegahakumbura, Muditha K., Wambulwa, Moses C., Lianming Gao, Wei Chen, Yang Dong, and Jun Sheng
Subjects: CLONORCHIS sinensis, GENOMICS, PAN-genome, TEA, CAMELLIAS
Abstract: The genus Camellia contains three types of domesticates that meet various needs of ancient humans: the ornamental C. japonica, the edible oil-producing C. oleifera, and the beverage-purposed tea plant C. sinensis. The genomic drivers of the functional diversification of Camellia domesticates remain unknown. Here, we present the genomic variations of 625 Camellia accessions based on a new genome assembly of C. sinensis var. assamica ('YK10'), which consists of 15 pseudo-chromosomes with a total length of 3.35 Gb and a contig N50 of 816,948 bp. These accessions were mainly distributed in East Asia, South Asia, Southeast Asia, and Africa. We profiled the population and subpopulation structure in tea tree Camellia to find new evidence for the parallel domestication of C. sinensis var. assamica (CSA) and C. sinensis var. sinensis (CSS). We also identified candidate genes associated with traits differentiating CSA, CSS, oilseed Camellia, and ornamental Camellia cultivars. Our results provide a unique global view of the genetic diversification of Camellia domesticates and provide valuable resources for ongoing functional and molecular breeding research. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

39. Seven quick tips for gene-focused computational pangenomic analysis.

Author: Bonnici, Vincenzo and Chicco, Davide
Subjects: *BACTERIAL genomes, *PAN-genome, *COMPUTATIONAL biology, *GENOMICS, *GENOMES
Abstract: Pangenomics is a relatively new scientific field which investigates the union of all the genomes of a clade. The word pan means everything in ancient Greek; the term pangenomics originally regarded genomes of bacteria and was later intended to refer to human genomes as well. Modern bioinformatics offers several tools to analyze pangenomics data, paving the way to an emerging field that we can call computational pangenomics. Current computational power available for the bioinformatics community has made computational pangenomic analyses easy to perform, but this higher accessibility to pangenomics analysis also increases the chances to make mistakes and to produce misleading or inflated results, especially by beginners. To handle this problem, we present here a few quick tips for efficient and correct computational pangenomic analyses with a focus on bacterial pangenomics, by describing common mistakes to avoid and experienced best practices to follow in this field. We believe our recommendations can help the readers perform more robust and sound pangenomic analyses and to generate more reliable results. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

40. Strain heterogeneity in a non-pathogenic Aspergillus fungus highlights factors associated with virulence.

Author: Rinker, David C., Sauters, Thomas J. C., Steffen, Karin, Gumilang, Adiyantara, Raja, Huzefa A., Rangel-Grimaldo, Manuel, Pinzan, Camila Figueiredo, de Castro, Patrícia Alves, dos Reis, Thaila Fernanda, Delbaje, Endrews, Houbraken, Jos, Goldman, Gustavo H., Oberlies, Nicholas H., and Rokas, Antonis
Subjects: *ASPERGILLUS fumigatus, *PULMONARY aspergillosis, *ASPERGILLUS, *HETEROGENEITY, *METABOLITES, *PAN-genome
Abstract: Fungal pathogens exhibit extensive strain heterogeneity, including variation in virulence. Whether closely related non-pathogenic species also exhibit strain heterogeneity remains unknown. Here, we comprehensively characterized the pathogenic potentials (i.e., the ability to cause morbidity and mortality) of 16 diverse strains of Aspergillus fischeri, a non-pathogenic close relative of the major pathogen Aspergillus fumigatus. In vitro immune response assays and in vivo virulence assays using a mouse model of pulmonary aspergillosis showed that A. fischeri strains varied widely in their pathogenic potential. Furthermore, pangenome analyses suggest that A. fischeri genomic and phenotypic diversity is even greater. Genomic, transcriptomic, and metabolic profiling identified several pathways and secondary metabolites associated with variation in virulence. Notably, strain virulence was associated with the simultaneous presence of the secondary metabolites hexadehydroastechrome and gliotoxin. We submit that examining the pathogenic potentials of non-pathogenic close relatives is key for understanding the origins of fungal pathogenicity. Examination of Aspergillus fischeri, a non-pathogenic close relative of the major fungal pathogen Aspergillus fumigatus, revealed extensive strain heterogeneity in virulence and identified an association with specific metabolic differences. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

41. Digital Microbe: a genome-informed data integration framework for team science on emerging model organisms.

Author: Veseli, Iva, DeMers, Michelle A., Cooper, Zachary S., Schechter, Matthew S., Miller, Samuel, Weber, Laura, Smith, Christa B., Rodriguez, Lidimarie T., Schroer, William F., McIlvin, Matthew R., Lopez, Paloma Z., Saito, Makoto, Dyhrman, Sonya, Eren, A. Murat, Moran, Mary Ann, and Braakman, Rogier
Subjects: DATA integration, SCIENTIFIC models, PAN-genome, MARINE bacteria, MICROORGANISMS
Abstract: The remarkable pace of genomic data generation is rapidly transforming our understanding of life at the micron scale. Yet this data stream also creates challenges for team science. A single microbe can have multiple versions of genome architecture, functional gene annotations, and gene identifiers; additionally, the lack of mechanisms for collating and preserving advances in this knowledge raises barriers to community coalescence around shared datasets. "Digital Microbes" are frameworks for interoperable and reproducible collaborative science through open source, community-curated data packages built on a (pan)genomic foundation. Housed within an integrative software environment, Digital Microbes ensure real-time alignment of research efforts for collaborative teams and facilitate novel scientific insights as new layers of data are added. Here we describe two Digital Microbes: 1) the heterotrophic marine bacterium Ruegeria pomeroyi DSS-3 with > 100 transcriptomic datasets from lab and field studies, and 2) the pangenome of the cosmopolitan marine heterotroph Alteromonas containing 339 genomes. Examples demonstrate how an integrated framework collating public (pan)genome-informed data can generate novel and reproducible findings. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

42. Comparative genomics of dusky kob (Argyrosomus japonicus, Sciaenidae) conspecifics: Evidence for speciation and the genetic mechanisms underlying traits.

Author: Jackson, Tassin Kim and Rhode, Clint
Subjects: *GENETIC speciation, *PAN-genome, *COMPARATIVE genomics, *LIFE history theory, *SCIAENIDAE
Abstract: Dusky kob (Argyrosomus japonicus) is a commercially important finfish, indigenous to South Africa, Australia, and China. Previous studies highlighted differences in genetic composition, life history, and morphology of the species across geographic regions. A draft genome sequence of 0.742 Gb (N50 = 5.49 Mb; BUSCO completeness = 97.8%) and 22,438 predicted protein‐coding genes was generated for the South African (SA) conspecific. A comparison with the Chinese (CN) conspecific revealed a core set of 32,068 orthologous protein clusters across both genomes. The SA genome exhibited 440 unique clusters compared to 1928 unique clusters in the CN genome. Transportation and immune response processes were overrepresented among the SA accessory genome, whereas the CN accessory genome was enriched for immune response, DNA transposition, and sensory detection (FDR‐adjusted p < 0.01). These unique clusters may represent an adaptive component of the species' pangenome that could explain population divergence due to differential environmental specialisation. Furthermore, 700 single‐copy orthologues (SCOs) displayed evidence of positive selection between the SA and CN genomes, and globally these genomes shared only 92% similarity, suggesting they might be distinct species. These genes primarily play roles in metabolism and digestion, illustrating the evolutionary pathways that differentiate the species. Understanding these genomic mechanisms underlying adaptation and evolution within and between species provides valuable insights into growth and maturation of kob, traits that are particularly relevant to commercial aquaculture. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

43. CELEBRIMBOR: core and accessory genes from metagenomes.

Author: Hellewell, Joel, Horsfield, Samuel T, Wachsmann, Johanna von, Gurbich, Tatiana A, Finn, Robert D, Iqbal, Zamin, Roberts, Leah W, and Lees, John A
Subjects: *PAN-genome, *GENE frequency, *MICROORGANISM populations, *GENOMES, *GENES
Abstract: Motivation Metagenome-Assembled Genomes (MAGs) or Single-cell Amplified Genomes (SAGs) are often incomplete, with sequences missing due to errors in assembly or low coverage. This presents a particular challenge for the identification of true gene frequencies within a microbial population, as core genes missing in only a few assemblies will be mischaracterized by current pangenome approaches. Results Here, we present CELEBRIMBOR, a Snakemake pangenome analysis pipeline which uses a measure of genome completeness to automatically adjust the frequency threshold at which core genes are identified, enabling accurate core gene identification in MAGs and SAGs. Availability and implementation CELEBRIMBOR is published under open source Apache 2.0 licence at https://github.com/bacpop/CELEBRIMBOR and is available as a Docker container from this repository. Supplementary material is available in the online version of the article. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

44. Nematode-resistance loci in upland cotton genomes are associated with structural differences.

Author: Cohen, Zachary P, Perkin, Lindsey C, Wagner, Tanya A, Liu, Jinggao, Bell, Alois A, Arick, Mark A, Grover, Corrinne E, Yu, John Z, Udall, Joshua A, and Suh, Charles P C
Subjects: *SEA Island cotton, *COTTON, *COTTON fibers, *ARID regions, *PAN-genome
Abstract: Reniform and root-knot nematode are two of the most destructive pests of conventional upland cotton, Gossypium hirsutum L. and continue to be a major threat to cotton fiber production in semiarid regions of the Southern United States and Central America. Fortunately, naturally occurring tolerance to these nematodes has been identified in the Pima cotton species (Gossypium barbadense) and several upland cotton varieties (G. hirsutum), which has led to a robust breeding program that has successfully introgressed and stacked these independent resistant traits into several upland cotton lineages with superior agronomic traits, e.g. BAR 32-30 and BARBREN-713. This work identifies the genomic variations of these nematode-tolerant accessions by comparing their respective genomes to the susceptible, high-quality fiber-producing parental line of this lineage: Phytogen 355 (PSC355). We discover several large genomic differences within marker regions that harbor putative resistance genes as well as expression mechanisms shared by the two resistant lines, with respect to the susceptible PSC355 parental line. This work emphasizes the utility of whole-genome comparisons as a means of elucidating large and small nuclear differences by lineage and phenotype. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

45. Pangenome Identification and Analysis of Terpene Synthase Gene Family Members in Gossypium.

Author: Song, Yueqin, Han, Shengjie, Wang, Mengting, Ni, Xueqi, Huang, Xinzheng, and Zhang, Yongjun
Subjects: *GENETIC load, *GENE families, *HELICOVERPA armigera, *PAN-genome, *GENE regulatory networks
Abstract: Terpene synthases (TPSs), key gatekeepers in the biosynthesis of herbivore-induced terpenes, are pivotal in the diversity of terpene chemotypes across and within plant species. Here, we constructed a gene-based pangenome of the Gossypium genus by integrating the genomes of 17 diploid and 10 tetraploid species. Within this pangenome, 208 TPS syntelog groups (SGs) were identified, comprising 2 core SGs (TPS5 and TPS42) present in all 27 analyzed genomes, 6 softcore SGs (TPS11, TPS12, TPS13, TPS35, TPS37, and TPS47) found in 25 to 26 genomes, 131 dispensable SGs identified in 2 to 24 genomes, and 69 private SGs exclusive to a single genome. The mutational load analysis of these identified TPS genes across 216 cotton accessions revealed a great number of splicing variants and complex splicing patterns. The nonsynonymous/synonymous Ka/Ks value for all 52 analyzed TPS SGs was less than one, indicating that these genes were subject to purifying selection. Of 208 TPS SGs encompassing 1795 genes, 362 genes derived from 102 SGs were identified as atypical and truncated. The structural analysis of TPS genes revealed that gene truncation is a major mechanism contributing to the formation of atypical genes. An integrated analysis of three RNA-seq datasets from cotton plants subjected to herbivore infestation highlighted nine upregulated TPSs, which included six previously characterized TPSs in G. hirsutum (AD1_TPS10, AD1_TPS12, AD1_TPS40, AD1_TPS42, AD1_TPS89, and AD1_TPS104), two private TPSs (AD1_TPS100 and AD2_TPS125), and one atypical TPS (AD2_TPS41). Also, a TPS-associated coexpression module of eight genes involved in the terpenoid biosynthesis pathway was identified in the transcriptomic data of herbivore-infested G. hirsutum. These findings will help us understand the contributions of TPS family members to interspecific terpene chemotypes within Gossypium and offer valuable resources for breeding insect-resistant cotton cultivars. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

46. Pan-Genome Analysis of TRM Gene Family and Their Expression Pattern under Abiotic and Biotic Stresses in Cucumber.

Author: Zhao, Lili, Wang, Ke, Wang, Zimo, Chu, Shunpeng, Chen, Chunhua, Wang, Lina, and Ren, Zhonghai
Subjects: GENE expression, GENE families, DOWNY mildew diseases, POWDERY mildew diseases, PLANT genes, CUCUMBERS
Abstract: Cucumber (Cucumis sativus L.) is a vital economic vegetable crop, and the TONNEAU1 Recruiting Motif (TRM) gene plays a key role in cucumber organ growth. However, the pan-genomic characteristics of the TRM gene family and their expression patterns under different stresses have not been reported in cucumber. In this study, we identified 29 CsTRMs from the pan-genomes of 13 cucumber accessions, with CsTRM29 existing only in PI183967. Most CsTRM proteins exhibited differences in sequence length, except five CsTRMs having consistent protein sequence lengths among the 13 accessions. All CsTRM proteins showed amino acid variations. An analysis of CsTRM gene expression patterns revealed that six CsTRM genes strongly changed in short-fruited lines compared with long-fruited lines. And four CsTRM genes strongly responded to salt and heat stress, while CsTRM14 showed responses to salt stress, powdery mildew, gray mold, and downy mildew. Some CsTRM genes were induced or suppressed at different treatment timepoints, suggesting that cucumber TRM genes may play different roles in responses to different stresses, with expression patterns varying with stress changes. Remarkably, the expression of CsTRM21 showed considerable change between long and short fruits and in responses to abiotic stresses (salt stress and heat stress), as well as biotic stresses (powdery mildew and gray mold), suggesting a dual role of CsTRM21 in both fruit shape determination and stress resistance. Collectively, this study provided a base for the further functional identification of CsTRM genes in cucumber plant growth and stress resistance. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

47. Genomic Analysis of Antimicrobial Resistance in Pseudomonas aeruginosa from a "One Health" Perspective.

Author: García-Rivera, Celia, Molina-Pardines, Carmen, Haro-Moreno, José M., Parra Grande, Mónica, Rodríguez, Juan Carlos, and López-Pérez, Mario
Subjects: GENOMICS, DRUG resistance in microorganisms, DRUG resistance in bacteria, PSEUDOMONAS aeruginosa, PAN-genome
Abstract: The "One Health" approach provides a comprehensive framework for understanding antimicrobial resistance. This perspective is of particular importance in the study of Pseudomonas aeruginosa, as it is not only a pathogen that affects humans but also persists in environmental reservoirs. To assess evolutionary selection for niche-specific traits, a genomic comparison of 749 P. aeruginosa strains from three environments (clinical, aquatic, and soil) was performed. The results showed that the environment does indeed exert selective pressure on specific traits. The high percentage of persistent genome, the lack of correlation between phylogeny and origin of the isolate, and the high intrinsic resistance indicate that the species has a high potential for pathogenicity and resistance, regardless of the reservoir. The flexible genome showed an enrichment of metal resistance genes, which could act as a co-selection of antibiotic resistance genes. In the plasmids, resistance genes were found in multigenic clusters, with the presence of a mobile integron being prominent. This integron was identified in several pathogenic strains belonging to distantly related taxa with a worldwide distribution, showing the risk of rapid evolution of resistance. These results provide a more complete understanding of the evolution of P. aeruginosa, which could help develop new prevention strategies. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

48. Comparative Analysis of Transposable Elements in the Genomes of Citrus and Citrus -Related Genera.

Author: Wu, Yilei, Wang, Fusheng, Lyu, Keliang, and Liu, Renyi
Subjects: PLANT genomes, GENOME size, PAN-genome, PLANT diversity, GENE ontology
Abstract: Transposable elements (TEs) significantly contribute to the evolution and diversity of plant genomes. In this study, we explored the roles of TEs in the genomes of Citrus and Citrus-related genera by constructing a pan-genome TE library from 20 published genomes of Citrus and Citrus-related accessions. Our results revealed an increase in TE content and the number of TE types compared to the original annotations, as well as a decrease in the content of unclassified TEs. The average length of TEs per assembly was approximately 194.23 Mb, representing 41.76% (Murraya paniculata) to 64.76% (Citrus gilletiana) of the genomes, with a mean value of 56.95%. A significant positive correlation was found between genome size and both the number of TE types and TE content. Consistent with the difference in mean whole-genome size (39.83 Mb) between Citrus and Citrus-related genera, Citrus genomes contained an average of 34.36 Mb more TE sequences than Citrus-related genomes. Analysis of the estimated insertion time and half-life of long terminal repeat retrotransposons (LTR-RTs) suggested that TE removal was not the primary factor contributing to the differences among genomes. These findings collectively indicate that TEs are the primary determinants of genome size and play a major role in shaping genome structures. Principal coordinate analysis (PCoA) of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) identifiers revealed that the fragmented TEs were predominantly derived from ancestral genomes, while intact TEs were crucial in the recent evolutionary diversification of Citrus. Moreover, the presence or absence of intact TEs near the AdhE superfamily was closely associated with the bitterness trait in the Citrus species. Overall, this study enhances TE annotation in Citrus and Citrus-related genomes and provides valuable data for future genetic breeding and agronomic trait research in Citrus. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

49. 泛基因组——理解生物遗传多样性的新视域.

Author: 林依萍 and 黄浩
Abstract: Copyright of Biology Teaching is the property of East China Normal University and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024

50. Genomic Analyses of Methicillin-Susceptible and Methicillin-Resistant Staphylococcus pseudintermedius Strains Involved in Canine Infections: A Comprehensive Genotypic Characterization.

Author: da Silva, Maria Eduarda Rocha Jacques, Breyer, Gabriela Merker, da Costa, Mateus Matiuzzi, Brenig, Bertram, Azevedo, Vasco Ariston de Carvalho, Cardoso, Marisa Ribeiro de Itapema, and Siqueira, Franciele Maboni
Subjects: GENOMICS, WHOLE genome sequencing, PAN-genome, HUMAN origins, BACTERIAL diseases
Abstract: Staphylococcus pseudintermedius is frequently associated with several bacterial infections in dogs, highlighting a One Health concern due to the zoonotic potential. Given the clinical significance of this pathogen, we performed comprehensive genomic analyses of 28 S. pseudintermedius strains isolated from canine infections throughout whole-genome sequencing using Illumina HiSeq, and compared the genetic features between S. pseudintermedius methicillin-resistant (MRSP) and methicillin-susceptible (MSSP) strains. Our analyses determined that MRSP genomes are larger than MSSP strains, with significant changes in antimicrobial resistance genes and virulent markers, suggesting differences in the pathogenicity of MRSP and MSSP strains. In addition, the pangenome analysis of S. pseudintermedius from canine and human origins identified core and accessory genomes with 1847 and 3037 genes, respectively, which indicates that most of the S. pseudintermedius genome is highly variable. Furthermore, phylogenomic analysis clearly separated MRSP from MSSP strains, despite their infection sites, showing phylogenetic differences according to methicillin susceptibility. Altogether our findings underscore the importance of studying the evolutionary dynamics of S. pseudintermedius, which is crucial for the development of effective prevention and control strategies of resistant S. pseudintermedius infections. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

2,533 results on '"Pan-genome"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources