53 results on '"Dohm JC"'
Search Results
2. Quantitative proteomics reveals cellular responses to individual mAb expression and tunicamycin in CHO cells.
- Author
-
Sulaj E, Schwaigerlehner L, Sandell FL, Dohm JC, Marzban G, and Kunert R
- Subjects
- CHO Cells, Animals, Proteome, Cricetinae, Cricetulus, Tunicamycin pharmacology, Antibodies, Monoclonal biosynthesis, Proteomics methods, Endoplasmic Reticulum Stress drug effects, Unfolded Protein Response drug effects
- Abstract
Chinese hamster ovary (CHO) cells are popular in the pharmaceutical industry for their ability to produce high concentrations of antibodies and their resemblance to human cells in terms of protein glycosylation patterns. Current data indicate the relevance of CHO cells in the biopharmaceutical industry, with a high number of product commendations and a significant market share for monoclonal antibodies. To enhance the production capabilities of CHO cells, a deep understanding of their cellular and molecular composition is crucial. Genome sequencing and proteomic analysis have provided valuable insights into the impact of the bioprocessing conditions, productivity, and product quality. In our investigation, we conducted a comparative analysis of proteomic profiles in high and low monoclonal antibody-producing cell lines and studied the impact of tunicamycin (TM)-induced endoplasmic reticulum (ER) stress. We examined the expression levels of different proteins including unfolded protein response (UPR) target genes by using label-free quantification techniques for protein abundance. Our results show the upregulation of proteins associated with protein folding mechanisms in low producer vs. high producer cell line suggesting a form of ER stress related to specific protein production. Further, Hspa9 and Dnaja3 are notable candidates activated by the mitochondria UPR and play important roles in protein folding processes in mitochondria. We identified significant upregulation of Nedd8 and Lgmn proteins in similar levels which may contribute to UPR stress. Interestingly, the downregulation of Hspa5/Bip and Pdia4 in response to tunicamycin treatment suggests a low-level UPR activation. KEY POINTS: • Proteome profiling of recombinant CHO cells under mild TM treatment. • Identified protein clusters are associated with the unfolded protein response (UPR). • The compared cell lines revealed noticeable disparities in protein expression levels., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
3. Genomic basis of seed colour in quinoa inferred from variant patterns using extreme gradient boosting.
- Author
-
Sandell FL, Holzweber T, Street NR, Dohm JC, and Himmelbauer H
- Subjects
- Color, Genome-Wide Association Study, Betalains metabolism, Genomics, Seeds genetics, Chenopodium quinoa genetics, Chenopodium quinoa metabolism
- Abstract
Quinoa is an agriculturally important crop species originally domesticated in the Andes of central South America. One of its most important phenotypic traits is seed colour. Seed colour variation is determined by contrasting abundance of betalains, a class of strong antioxidant and free radicals scavenging colour pigments only found in plants of the order Caryophyllales. However, the genetic basis for these pigments in seeds remains to be identified. Here we demonstrate the application of machine learning (extreme gradient boosting) to identify genetic variants predictive of seed colour. We show that extreme gradient boosting outperforms the classical genome-wide association approach. We provide re-sequencing and phenotypic data for 156 South American quinoa accessions and identify candidate genes potentially controlling betalain content in quinoa seeds. Genes identified include novel cytochrome P450 genes and known members of the betalain synthesis pathway, as well as genes annotated as being involved in seed development. Our work showcases the power of modern machine learning methods to extract biologically meaningful information from large sequencing data sets., (© 2023 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.)
- Published
- 2024
- Full Text
- View/download PDF
4. Spinach genomes reveal migration history and candidate genes for important crop traits.
- Author
-
Nguyen-Hoang A, Sandell FL, Himmelbauer H, and Dohm JC
- Abstract
Spinach ( Spinacia oleracea ) is an important leafy crop possessing notable economic value and health benefits. Current genomic resources include reference genomes and genome-wide association studies. However, the worldwide genetic relationships and the migration history of the crop remained uncertain, and genome-wide association studies have produced extensive gene lists related to agronomic traits. Here, we re-analysed the sequenced genomes of 305 cultivated and wild spinach accessions to unveil the phylogeny and history of cultivated spinach and to explore genetic variation in relation to phenotypes. In contrast to previous studies, we employed machine learning methods (based on Extreme Gradient Boosting, XGBoost) to detect variants that are collectively associated with agronomic traits. Variant-based cluster analyses revealed three primary spinach groups in the Middle East, Asia and Europe/US. Combining admixture analysis and allele-sharing statistics, migration routes of spinach from the Middle East to Europe and Asia are presented. Using XGBoost machine learning models we predict genomic variants influencing bolting time, flowering time, petiole color, and leaf surface texture and propose candidate genes for each trait. This study enhances our understanding of the history and phylogeny of domesticated spinach and provides valuable information on candidate genes for future genetic improvement of the crop., (© The Author(s) 2024. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.)
- Published
- 2024
- Full Text
- View/download PDF
5. Author Correction: Genomic distances reveal relationships of wild and cultivated beets.
- Author
-
Sandell FL, Stralis-Pavese N, McGrath JM, Schulz B, Himmelbauer H, and Dohm JC
- Published
- 2024
- Full Text
- View/download PDF
6. Genomic variation in the genus Beta based on 656 sequenced beet genomes.
- Author
-
Felkel S, Dohm JC, and Himmelbauer H
- Subjects
- Crops, Agricultural genetics, Base Sequence, Genomics, Sugars, Beta vulgaris genetics
- Abstract
Cultivated beets (Beta vulgaris ssp. vulgaris) constitute important crop plants, in particular sugar beet as an indispensable source of sucrose. Several species of wild beets of the genus Beta with distribution along the European Atlantic coast, Macaronesia, and throughout the Mediterranean area exist. Thorough characterization of beet genomes is required for straightforward access to genes promoting genetic resistance against biotic and abiotic stress. Analysing short-read data of 656 sequenced beet genomes, we identified 10 million variant positions in comparison to the sugar beet reference genome RefBeet-1.2. The main groups of species and subspecies were distinguishable based on shared variation, and the separation of sea beets (Beta vulgaris ssp. maritima) into a Mediterranean and an Atlantic subgroup as suggested by previous studies could be confirmed. Complementary approaches of variant-based clustering were employed based on PCA, genotype likelihoods, tree calculations, and admixture analysis. Outliers suggested the occurrence of inter(sub)specific hybridisation, independently confirmed by different analyses. Screens for regions under artificial selection in the sugar beet genome identified 15 Mbp of the genome as variation-poor, enriched for genes involved in shoot system development, stress response, and carbohydrate metabolism. The resources presented herein will be valuable for crop improvement and wild species monitoring and conservation efforts, and for studies on beet genealogy, population structure and population dynamics. Our study provides a wealth of data for in-depth analyses of further aspects of the beet genome towards a thorough understanding of the biology of this important complex of a crop species and its wild relatives., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
7. Putting hornets on the genomic map.
- Author
-
Favreau E, Cini A, Taylor D, Câmara Ferreira F, Bentley MA, Cappa F, Cervo R, Privman E, Schneider J, Thiéry D, Mashoodh R, Wyatt CDR, Brown RL, Bodrug-Schepers A, Stralis-Pavese N, Dohm JC, Mead D, Himmelbauer H, Guigo R, and Sumner S
- Subjects
- Animals, Introduced Species, Reproduction, Wasps genetics
- Abstract
Hornets are the largest of the social wasps, and are important regulators of insect populations in their native ranges. Hornets are also very successful as invasive species, with often devastating economic, ecological and societal effects. Understanding why these wasps are such successful invaders is critical to managing future introductions and minimising impact on native biodiversity. Critical to the management toolkit is a comprehensive genomic resource for these insects. Here we provide the annotated genomes for two hornets, Vespa crabro and Vespa velutina. We compare their genomes with those of other social Hymenoptera, including the northern giant hornet Vespa mandarinia. The three hornet genomes show evidence of selection pressure on genes associated with reproduction, which might facilitate the transition into invasive ranges. Vespa crabro has experienced positive selection on the highest number of genes, including those putatively associated with molecular binding and olfactory systems. Caste-specific brain transcriptomic analysis also revealed 133 differentially expressed genes, some of which are associated with olfactory functions. This report provides a spring-board for advancing our understanding of the evolution and ecology of hornets, and opens up opportunities for using molecular methods in the future management of both native and invasive populations of these over-looked insects., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
8. Genome-environment associations along elevation gradients in two snowbed species of the North-Eastern Calcareous Alps.
- Author
-
Felkel S, Tremetsberger K, Moser D, Dohm JC, Himmelbauer H, and Winkler M
- Subjects
- Temperature, Genomics, Adaptation, Physiological, Climate Change, Ecosystem, Gene Flow
- Abstract
Background: Anthropogenic climate change leads to increasing temperatures and altered precipitation and snowmelt patterns, especially in alpine ecosystems. To understand species' responses to climate change, assessment of genetic structure and diversity is crucial as the basis for the evaluation of migration patterns, genetic adaptation potential as well as the identification of adaptive alleles., Results: We studied genetic structure, diversity and genome-environment associations of two snowbed species endemic to the Eastern Alps with a large elevational range, Achillea clusiana Tausch and Campanula pulla L. Genotyping-by-sequencing was employed to assemble loci de novo, call variants and perform population genetic analyses. Populations of either species were distinguishable by mountain, and to some extent by elevation. We found evidence for gene flow between elevations. Results of genome-environment associations suggested similar selective pressures acting on both species, emanating mainly from precipitation and exposition rather than temperature., Conclusions: Given their genetic structure and amount of gene flow among populations the two study species are suitable to serve as a model for genetic monitoring of climate change adaptation along an elevation gradient. Consequences of climate change will predominantly manifest via changes in precipitation and, thus, duration of snow cover in the snowbeds and indirectly via shrub encroachment accompanied by increasing shading of snowbeds at lower range margins. Assembling genomes of the study species and studying larger sample sizes and time series will be necessary to functionally characterize and validate the herein identified genomic loci putatively involved in adaptive processes., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
9. Storage media and RNA extraction approaches substantially influence the recovery and integrity of livestock fecal microbial RNA.
- Author
-
Koorakula R, Ghanbari M, Schiavinato M, Wegl G, Dohm JC, and Domig KJ
- Subjects
- Swine, Animals, Livestock genetics, RNA genetics, Feces microbiology, Microbiota genetics, Gastrointestinal Microbiome genetics
- Abstract
Background: There is growing interest in understanding gut microbiome dynamics, to increase the sustainability of livestock production systems and to better understand the dynamics that regulate antibiotic resistance genes ( i.e. , the resistome). High-throughput sequencing of RNA transcripts (RNA-seq) from microbial communities (metatranscriptome) allows an unprecedented opportunity to analyze the functional and taxonomical dynamics of the expressed microbiome and emerges as a highly informative approach. However, the isolation and preservation of high-quality RNA from livestock fecal samples remains highly challenging. This study aimed to determine the impact of the various sample storage and RNA extraction strategies on the recovery and integrity of microbial RNA extracted from selected livestock (chicken and pig) fecal samples., Methods: Fecal samples from pigs and chicken were collected from conventional slaughterhouses. Two different storage buffers were used at two different storage temperatures. The extraction of total RNA was done using four different commercially available kits and RNA integrity/quality and concentration were measured using a Bioanalyzer 2100 system with RNA 6000 Nano kit (Agilent, Santa Clara, CA, USA). In addition, RT-qPCR was used to assess bacterial RNA quality and the level of host RNA contamination., Results: The quantity and quality of RNA differed by sample type ( i.e. , either pig or chicken) and most significantly by the extraction kit, with differences in the extraction method resulting in the least variability in pig feces samples and the most variability in chicken feces. Considering a tradeoff between the RNA yield and the RNA integrity and at the same time minimizing the amount of host RNA in the sample, a combination of storing the fecal samples in RNALater at either 4 °C (for 24 h) or -80 °C (up to 2 weeks) with extraction with PM kit (RNEasy Power Microbiome Kit) had the best performance for both chicken and pig samples., Conclusion: Our findings provided a further emphasis on using a consistent methodology for sample storage, duration as well as a compatible RNA extraction approach. This is crucial as the impact of these technical steps can be potentially large compared with the real biological variability to be explained in microbiome and resistome studies., Competing Interests: Mahdi Ghanbari and Gertrude Wegl are employed by Biomin Holding GmbH. The authors declare that they have no competing interests., (© 2022 Koorakula et al.)
- Published
- 2022
- Full Text
- View/download PDF
10. Prediction of NB-LRR resistance genes based on full-length sequence homology.
- Author
-
Andolfo G, Dohm JC, and Himmelbauer H
- Subjects
- Disease Resistance genetics, Plant Breeding, Plant Diseases genetics, Plant Proteins genetics, Plant Proteins metabolism, Sequence Homology, Genes, Plant genetics, Solanum lycopersicum genetics, Solanum lycopersicum metabolism
- Abstract
The activation of plant immunity is mediated by resistance (R)-gene receptors, also known as nucleotide-binding leucine-rich repeat (NB-LRR) genes, which in turn trigger the authentic defense response. R-gene identification is a crucial goal for both classic and modern plant breeding strategies for disease resistance. The conventional method identifies NB-LRR genes using a protein motif/domain-based search (PDS) within an automatically predicted gene set of the respective genome assembly. PDS proved to be imprecise since repeat masking prior to automatic genome annotation unwittingly prevented comprehensive NB-LRR gene detection. Furthermore, R-genes have diversified in a species-specific manner, so that NB-LRR gene identification cannot be universally standardized. Here, we present the full-length Homology-based R-gene Prediction (HRP) method for the comprehensive identification and annotation of a genome's R-gene repertoire. Our method has substantially addressed the complex genomic organization of tomato (Solanum lycopersicum) NB-LRR gene loci, proving to be more performant than the well-established RenSeq approach. HRP efficiency was also tested on three differently assembled and annotated Beta sp. genomes. Indeed, HRP identified up to 45% more full-length NB-LRR genes compared to previous approaches. HRP also turned out to be a more refined strategy for R-gene allele mining, testified by the identification of hitherto undiscovered Fom-2 homologs in five Cucurbita sp. genomes. In summary, our high-performance method for full-length NB-LRR gene discovery will propel the identification of novel R-genes towards development of improved cultivars., (© 2022 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.)
- Published
- 2022
- Full Text
- View/download PDF
11. Genotypic and phenotypic diversity among Komagataella species reveals a hidden pathway for xylose utilization.
- Author
-
Heistinger L, Dohm JC, Paes BG, Koizar D, Troyer C, Ata Ö, Steininger-Mairinger T, and Mattanovich D
- Subjects
- Carbon metabolism, Phenotype, Pichia metabolism, Yeasts, Saccharomycetales genetics, Xylose metabolism
- Abstract
Background: The yeast genus Komagataella currently consists of seven methylotrophic species isolated from tree environments. Well-characterized strains of K. phaffii and K. pastoris are important hosts for biotechnological applications, but the potential of other species from the genus remains largely unexplored. In this study, we characterized 25 natural isolates from all seven described Komagataella species to identify interesting traits and provide a comprehensive overview of the genotypic and phenotypic diversity available within this genus., Results: Growth tests on different carbon sources and in the presence of stressors at two different temperatures allowed us to identify strains with differences in tolerance to high pH, high temperature, and growth on xylose. As Komagataella species are generally not considered xylose-utilizing yeasts, xylose assimilation was characterized in detail. Growth assays, enzyme activity measurements and
13 C labeling confirmed the ability of K. phaffii to utilize D-xylose via the oxidoreductase pathway. In addition, we performed long-read whole-genome sequencing to generate genome assemblies of all Komagataella species type strains and additional K. phaffii and K. pastoris isolates for comparative analysis. All sequenced genomes have a similar size and share 83-99% average sequence identity. Genome structure analysis showed that K. pastoris and K. ulmi share the same rearrangements in difference to K. phaffii, while the genome structure of K. kurtzmanii is similar to K. phaffii. The genomes of the other, more distant species showed a larger number of structural differences. Moreover, we used the newly assembled genomes to identify putative orthologs of important xylose-related genes in the different Komagataella species., Conclusions: By characterizing the phenotypes of 25 natural Komagataella isolates, we could identify strains with improved growth on different relevant carbon sources and stress conditions. Our data on the phenotypic and genotypic diversity will provide the basis for the use of so-far neglected Komagataella strains with interesting characteristics and the elucidation of the genetic determinants of improved growth and stress tolerance for targeted strain improvement., (© 2022. The Author(s).)- Published
- 2022
- Full Text
- View/download PDF
12. Genomic distances reveal relationships of wild and cultivated beets.
- Author
-
Sandell FL, Stralis-Pavese N, McGrath JM, Schulz B, Himmelbauer H, and Dohm JC
- Subjects
- Crops, Agricultural genetics, Genome, Plant genetics, Genomics, Sugars, Beta vulgaris genetics
- Abstract
Cultivated beets (Beta vulgaris ssp. vulgaris), including sugar beet, rank among the most important crops. The wild ancestor of beet crops is the sea beet Beta vulgaris ssp. maritima. Species and subspecies of wild beets are readily crossable with cultivated beets and are thus available for crop improvement. To study genomic relationships in the genus Beta, we sequence and analyse 606 beet genomes, encompassing sugar beet, sea beet, B. v. adanensis, B. macrocarpa, and B. patula. We observe two genetically distinct groups of sea beets, one from the Atlantic coast and the other from the Mediterranean area. Genomic comparisons based on k-mers identify sea beets from Greece as the closest wild relatives of sugar beet, suggesting that domestication of the ancestors of sugar beet may be traced to this area. Our work provides comprehensive insight into the phylogeny of wild and cultivated beets and establishes a framework for classification of further accessions of unknown (sub-)species assignment., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
13. Metatranscriptomic Analysis of the Chicken Gut Resistome Response to In-Feed Antibiotics and Natural Feed Additives.
- Author
-
Koorakula R, Schiavinato M, Ghanbari M, Wegl G, Grabner N, Koestelbauer A, Klose V, Dohm JC, and Domig KJ
- Abstract
The emergence of resistance against common antibiotics in the gut microbiota is a major issue for both human and livestock health. This highlights the need for understanding the impact of such application on the reservoir of antibiotic resistance genes in poultry gut and devising means to circumvent the potential resistome expansion. Phytogenic feed additives (PFAs) are potential natural alternative to antibiotic to improve animal health and performance, supposedly via positively affecting the gut microbial ecosystem, but there is little systematic information available. In this time-course study, we applied a shotgun meta-transcriptomics approach to investigate the impact of a PFA product as well as the commonly used antibiotic, zinc bacitracin either at AGP concentration or therapeutic concentration on the gut microbiome and resistome of broiler chickens raised for 35 days. Over the course of the trial, PFA treatments increased the abundance of Firmicutes such as Lactobacillus and resulted in a lower abundance of Escherichia , while the latter group increased significantly in the feces of chickens that received either AGP or AB doses of bacitracin. Tetracycline resistance and aminoglycoside resistance were the predominant antibiotic resistance gene (ARG) classes found, regardless of the treatment. PFA application resulted in a decrease in abundance of ARGs compared to those in the control group and other antibiotic treatment groups. In summary, the findings from this study demonstrate the potential of phytogenic feed additives could be an alternative to antibiotics in poultry farming, with the added benefit of counteracting antimicrobial resistance development., Competing Interests: MG, GW, NG, AK, and VK are employed by DSM Animal Nutrition & Health, which provided support in the form of salaries for the authors but did not have the main role in the experimental design, data collection and analysis, decision to publish, or preparation of the manuscript. DSM Animal Nutrition & Health is involved in natural feed additive development and research in natural alternatives to in-feed medication in livestock production. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2022 Koorakula, Schiavinato, Ghanbari, Wegl, Grabner, Koestelbauer, Klose, Dohm and Domig.)
- Published
- 2022
- Full Text
- View/download PDF
14. Quinoa genome assembly employing genomic variation for guided scaffolding.
- Author
-
Bodrug-Schepers A, Stralis-Pavese N, Buerstmayr H, Dohm JC, and Himmelbauer H
- Subjects
- Arabidopsis genetics, Bolivia, Chile, Contig Mapping, Genetic Markers, Genetics, Population, Haplotypes, Peru, Chenopodium quinoa genetics, Genetic Variation, Genome, Plant
- Abstract
Key Message: We propose to use the natural variation between individuals of a population for genome assembly scaffolding. In today's genome projects, multiple accessions get sequenced, leading to variant catalogs. Using such information to improve genome assemblies is attractive both cost-wise as well as scientifically, because the value of an assembly increases with its contiguity. We conclude that haplotype information is a valuable resource to group and order contigs toward the generation of pseudomolecules. Quinoa (Chenopodium quinoa) has been under cultivation in Latin America for more than 7500 years. Recently, quinoa has gained increasing attention due to its stress resistance and its nutritional value. We generated a novel quinoa genome assembly for the Bolivian accession CHEN125 using PacBio long-read sequencing data (assembly size 1.32 Gbp, initial N50 size 608 kbp). Next, we re-sequenced 50 quinoa accessions from Peru and Bolivia. This set of accessions differed at 4.4 million single-nucleotide variant (SNV) positions compared to CHEN125 (1.4 million SNV positions on average per accession). We show how to exploit variation in accessions that are distantly related to establish a genome-wide ordered set of contigs for guided scaffolding of a reference assembly. The method is based on detecting shared haplotypes and their expected continuity throughout the genome (i.e., the effect of linkage disequilibrium), as an extension of what is expected in mapping populations where only a few haplotypes are present. We test the approach using Arabidopsis thaliana data from different populations. After applying the method on our CHEN125 quinoa assembly we validated the results with mate-pairs, genetic markers, and another quinoa assembly originating from a Chilean cultivar. We show consistency between these information sources and the haplotype-based relations as determined by us and obtain an improved assembly with an N50 size of 1079 kbp and ordered contig groups of up to 39.7 Mbp. We conclude that haplotype information in distantly related individuals of the same species is a valuable resource to group and order contigs according to their adjacency in the genome toward the generation of pseudomolecules., (© 2021. The Author(s).)
- Published
- 2021
- Full Text
- View/download PDF
15. Assembly and characterization of the genome of chard (Beta vulgaris ssp. vulgaris var. cicla).
- Author
-
Lehner R, Blazek L, Minoche AE, Dohm JC, and Himmelbauer H
- Subjects
- Crops, Agricultural genetics, Genomics, Retroelements, Beta vulgaris genetics, Genome, Plant
- Abstract
Chard (Beta vulgaris ssp. vulgaris var. cicla) is a member of one of four different cultigroups of beets. While the genome of sugar beet, the most prominent beet crop, has been studied extensively, molecular data on other beet cultivars is scant. Here, we present a genome assembly of chard, a vegetable crop grown for its fleshy leaves. We report a de novo genome assembly of 604 Mbp, slightly larger than sugar beet assemblies presented so far. About 57 % of the assembly was annotated as repetitive sequence, of which LTR retrotransposons were the most abundant. Based on the presence of conserved genes, the chard assembly was estimated to be at least 96 % complete regarding its gene space. We predicted 34,521 genes of which 27,582 genes were supported by evidence from transcriptomic sequencing reads, and 5503 of the evidence-supported genes had multiple isoforms. We compared the chard gene set with gene sets from sugar beet and two wild beets (i.e. Beta vulgaris ssp. maritima and Beta patula) to find orthology relationships and identified genome-wide syntenic regions between chard and sugar beet. Lastly, we determined genomic variants that distinguish sugar beet and chard. Assessing the variation distribution along the chard chromosomes, we found extensive haplotype sharing between the two cultivars. In summary, our work provides a foundation for the molecular analysis of Beta vulgaris cultigroups as a basis for chard genomics and to unravel the domestication history of beet crops., (Copyright © 2021 The Author(s). Published by Elsevier B.V. All rights reserved.)
- Published
- 2021
- Full Text
- View/download PDF
16. Subgenome evolution in allotetraploid plants.
- Author
-
Schiavinato M, Bodrug-Schepers A, Dohm JC, and Himmelbauer H
- Subjects
- Brassica genetics, Chenopodium quinoa genetics, Domestication, Hybridization, Genetic genetics, Plants genetics, Nicotiana genetics, Evolution, Molecular, Genome, Plant genetics, Tetraploidy
- Abstract
Polyploidization is a well-known speciation and adaptation mechanism. Traces of former polyploidization events were discovered within many genomes, and especially in plants. Allopolyploidization by interspecific hybridization between two species is common. Among hybrid plants, many are domesticated species of agricultural interest and many of their genomes and of their presumptive parents have been sequenced. Hybrid genomes remain challenging to analyse because of the presence of multiple subgenomes. The genomes of hybrids often undergo rearrangement and degradation over time. Based on 10 hybrid plant genomes from six different genera, with hybridization dating from 10,000 to 5 million years ago, we assessed subgenome degradation, subgenomic intermixing and biased subgenome fractionation. The restructuring of hybrid genomes does not proceed proportionally with the age of the hybrid. The oldest hybrids in our data set display completely different fates: whereas the subgenomes of the tobacco plant Nicotiana benthamiana are in an advanced stage of degradation, the subgenomes of quinoa (Chenopodium quinoa) are exceptionally well conserved by structure and sequence. We observed statistically significant biased subgenome fractionation in seven out of 10 hybrids, which had different ages and subgenomic intermixing levels. Hence, we conclude that no correlation exists between biased fractionation and subgenome intermixing. Lastly, domestication may encourage or hinder subgenome intermixing, depending on the evolutionary context. In summary, comparative analysis of hybrid genomes and their presumptive parents allowed us to determine commonalities and differences between their evolutionary fates. In order to facilitate the future analysis of further hybrid genomes, we automated the analysis steps within manticore, which is publicly available at https://github.com/MatteoSchiavinato/manticore.git., (© 2021 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.)
- Published
- 2021
- Full Text
- View/download PDF
17. Adaptive Evolution in Producing Microtiter Cultivations Generates Genetically Stable Escherichia coli Production Hosts for Continuous Bioprocessing.
- Author
-
Schuller A, Cserjan-Puschmann M, Köppl C, Grabherr R, Wagenknecht M, Schiavinato M, Dohm JC, Himmelbauer H, and Striedner G
- Subjects
- Green Fluorescent Proteins genetics, Promoter Regions, Genetic genetics, Recombinant Proteins genetics, Escherichia coli genetics
- Abstract
The production of recombinant proteins usually reduces cell fitness and the growth rate of producing cells. The growth disadvantage favors faster-growing non-producer mutants. Therefore, continuous bioprocessing is hardly feasible in Escherichia coli due to the high escape rate. The stability of E. coli expression systems under long-term production conditions and how metabolic load triggered by recombinant gene expression influences the characteristics of mutations are investigated. Iterated fed-batch-like microbioreactor cultivations are conducted under production conditions. The easy-to-produce green fluorescent protein (GFP) and a challenging antigen-binding fragment (Fab) are used as model proteins, and BL21(DE3) and BL21
Q strains as expression hosts. In comparative whole-genome sequencing analyses, mutations that allowed cells to grow unhindered despite recombinant protein production are identified. A T7 RNA polymerase expression system is only conditionally suitable for long-term cultivation under production conditions. Mutations leading to non-producers occur in either the T7 RNA polymerase gene or the T7 promoter. The host RNA polymerase-based BL21Q expression system remains stable in the production of GFP in long-term cultivations. For the production of Fab, mutations in lacI of the BL21Q derivatives have positive effects on long-term stability. The results indicate that adaptive evolution carried out with genome-integrated E. coli expression systems in microtiter cultivations under industrial-relevant production conditions is an efficient strain development tool for production hosts., (© 2020 The Authors. Biotechnology Journal published by Wiley-VCH GmbH.)- Published
- 2021
- Full Text
- View/download PDF
18. Benchmarking of long-read correction methods.
- Author
-
Dohm JC, Peters P, Stralis-Pavese N, and Himmelbauer H
- Abstract
Third-generation sequencing technologies provided by Pacific Biosciences and Oxford Nanopore Technologies generate read lengths in the scale of kilobasepairs. However, these reads display high error rates, and correction steps are necessary to realize their great potential in genomics and transcriptomics. Here, we compare properties of PacBio and Nanopore data and assess correction methods by Canu, MARVEL and proovread in various combinations. We found total error rates of around 13% in the raw datasets. PacBio reads showed a high rate of insertions (around 8%) whereas Nanopore reads showed similar rates for substitutions, insertions and deletions of around 4% each. In data from both technologies the errors were uniformly distributed along reads apart from noisy 5' ends, and homopolymers appeared among the most over-represented kmers relative to a reference. Consensus correction using read overlaps reduced error rates to about 1% when using Canu or MARVEL after patching. The lowest error rate in Nanopore data (0.45%) was achieved by applying proovread on MARVEL-patched data including Illumina short-reads, and the lowest error rate in PacBio data (0.42%) was the result of Canu correction with minimap2 alignment after patching. Our study provides valuable insights and benchmarks regarding long-read data and correction methods., (© The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.)
- Published
- 2020
- Full Text
- View/download PDF
19. Parental origin of the allotetraploid tobacco Nicotiana benthamiana.
- Author
-
Schiavinato M, Marcet-Houben M, Dohm JC, Gabaldón T, and Himmelbauer H
- Subjects
- Chromosomes, Plant genetics, Evolution, Molecular, Genomics, Hybridization, Genetic, Phylogeny, Nicotiana genetics, Nicotiana metabolism
- Abstract
Nicotiana section Suaveolentes is an almost all-Australian clade of allopolyploid tobacco species including the important plant model Nicotiana benthamiana. The homology relationships of this clade and its formation are not completely understood. To address this gap, we assessed phylogenies of all individual genes of N. benthamiana and the well studied N. tabacum (section Nicotiana) and their homologues in six diploid Nicotiana species. We generated sets of 44 424 and 65 457 phylogenetic trees of N. benthamiana and N. tabacum genes, respectively, each collectively called a phylome. Members of Nicotiana sections Noctiflorae and Sylvestres were represented as the species closest to N. benthamiana in most of the gene trees. Analyzing the gene trees of the phylome we: (i) dated the hybridization event giving rise to N. benthamiana to 4-5 MyA, and (ii) separated the subgenomes. We assigned 1.42 Gbp of the genome sequence to section Noctiflorae and 0.97 Gbp to section Sylvestres based on phylome analysis. In contrast, read mapping of the donor species did not succeed in separating the subgenomes of N. benthamiana. We show that the maternal progenitor of N. benthamiana was a member of section Noctiflorae, and confirm a member of section Sylvestres as paternal subgenome donor. We also demonstrate that the advanced stage of long-term genome diploidization in N. benthamiana is reflected in its subgenome organization. Taken together, our results underscore the usefulness of phylome analysis for subgenome characterization in hybrid species., (© 2019 The Authors The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.)
- Published
- 2020
- Full Text
- View/download PDF
20. Comparative genome characterization of the periodontal pathogen Tannerella forsythia.
- Author
-
Zwickl NF, Stralis-Pavese N, Schäffer C, Dohm JC, and Himmelbauer H
- Subjects
- Codon Usage, Genomic Islands, Glycosylation, Phylogeny, Tannerella forsythia classification, Tannerella forsythia pathogenicity, Virulence Factors genetics, Genome, Bacterial, Tannerella forsythia genetics
- Abstract
Background: Tannerella forsythia is a bacterial pathogen implicated in periodontal disease. Numerous virulence-associated T. forsythia genes have been described, however, it is necessary to expand the knowledge on T. forsythia's genome structure and genetic repertoire to further elucidate its role within pathogenesis. Tannerella sp. BU063, a putative periodontal health-associated sister taxon and closest known relative to T. forsythia is available for comparative analyses. In the past, strain confusion involving the T. forsythia reference type strain ATCC 43037 led to discrepancies between results obtained from in silico analyses and wet-lab experimentation., Results: We generated a substantially improved genome assembly of T. forsythia ATCC 43037 covering 99% of the genome in three sequences. Using annotated genomes of ten Tannerella strains we established a soft core genome encompassing 2108 genes, based on orthologs present in > = 80% of the strains analysed. We used a set of known and hypothetical virulence factors for comparisons in pathogenic strains and the putative periodontal health-associated isolate Tannerella sp. BU063 to identify candidate genes promoting T. forsythia's pathogenesis. Searching for pathogenicity islands we detected 38 candidate regions in the T. forsythia genome. Only four of these regions corresponded to previously described pathogenicity islands. While the general protein O-glycosylation gene cluster of T. forsythia ATCC 43037 has been described previously, genes required for the initiation of glycan synthesis are yet to be discovered. We found six putative glycosylation loci which were only partially conserved in other bacteria. Lastly, we performed a comparative analysis of translational bias in T. forsythia and Tannerella sp. BU063 and detected highly biased genes., Conclusions: We provide resources and important information on the genomes of Tannerella strains. Comparative analyses enabled us to assess the suitability of T. forsythia virulence factors as therapeutic targets and to suggest novel putative virulence factors. Further, we report on gene loci that should be addressed in the context of elucidating T. forsythia's protein O-glycosylation pathway. In summary, our work paves the way for further molecular dissection of T. forsythia biology in general and virulence of this species in particular.
- Published
- 2020
- Full Text
- View/download PDF
21. Genomes of the wild beets Beta patula and Beta vulgaris ssp. maritima.
- Author
-
Rodríguez Del Río Á, Minoche AE, Zwickl NF, Friedrich A, Liedtke S, Schmidt T, Himmelbauer H, and Dohm JC
- Subjects
- Beta vulgaris virology, Chromosomes genetics, Crops, Agricultural genetics, Genetic Variation, Genomics, High-Throughput Nucleotide Sequencing, In Situ Hybridization, Fluorescence, Karyotype, Phylogeny, Plant Diseases virology, Synteny genetics, Beta vulgaris genetics, Genome, Plant, Plant Diseases genetics
- Abstract
We present draft genome assemblies of Beta patula, a critically endangered wild beet endemic to the Madeira archipelago, and of the closely related Beta vulgaris ssp. maritima (sea beet). Evidence-based reference gene sets for B. patula and sea beet were generated, consisting of 25 127 and 27 662 genes, respectively. The genomes and gene sets of the two wild beets were compared with their cultivated sister taxon B. vulgaris ssp. vulgaris (sugar beet). Large syntenic regions were identified, and a display tool for automatic genome-wide synteny image generation was developed. Phylogenetic analysis based on 9861 genes showing 1:1:1 orthology supported the close relationship of B. patula to sea beet and sugar beet. A comparative analysis of the Rz2 locus, responsible for rhizomania resistance, suggested that the sequenced B. patula accession was rhizomania susceptible. Reference karyotypes for the two wild beets were established, and genomic rearrangements were detected. We consider our data as highly valuable and comprehensive resources for wild beet studies, B. patula conservation management, and sugar beet breeding research., (© 2019 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.)
- Published
- 2019
- Full Text
- View/download PDF
22. Genome and transcriptome characterization of the glycoengineered Nicotiana benthamiana line ΔXT/FT.
- Author
-
Schiavinato M, Strasser R, Mach L, Dohm JC, and Himmelbauer H
- Subjects
- Genetic Variation, Molecular Sequence Annotation, Transgenes genetics, Gene Expression Profiling, Genomics, Glycoproteins genetics, Protein Engineering, Nicotiana genetics
- Abstract
Background: The allotetraploid tobacco species Nicotiana benthamiana native to Australia has become a popular host for recombinant protein production. Although its usage grows every year, little is known on this plant's genomic and transcriptomic features. Most N. benthamiana accessions currently used in research lack proper documentation of their breeding history and provenance. One of these, the glycoengineered N. benthamiana line ΔXT/FT is increasingly used for the production of biopharmaceutical proteins., Results: Based on an existing draft assembly of the N. benthamiana genome we predict 50,516 protein -encoding genes (62,216 transcripts) supported by expression data derived from 2.35 billion mRNA-seq reads. Using single-copy core genes we show high completeness of the predicted gene set. We functionally annotate more than two thirds of the gene set through sequence homology to genes from other Nicotiana species. We demonstrate that the expression profiles from leaf tissue of ΔXT/FT and its wild type progenitor only show minimal differences. We identify the transgene insertion sites in ΔXT/FT and show that one of the transgenes was inserted inside another predicted gene that most likely lost its function upon insertion. Based on publicly available mRNA-seq data, we confirm that the N. benthamiana accessions used by different research institutions most likely derive from a single source., Conclusions: This work provides gene annotation of the N. benthamiana genome, a genomic and transcriptomic characterization of a transgenic N. benthamiana line in comparison to its wild-type progenitor, and sheds light onto the relatedness of N. benthamiana accessions that are used in laboratories around the world.
- Published
- 2019
- Full Text
- View/download PDF
23. A General Protein O- Glycosylation Gene Cluster Encodes the Species-Specific Glycan of the Oral Pathogen Tannerella forsythia : O -Glycan Biosynthesis and Immunological Implications.
- Author
-
Tomek MB, Maresch D, Windwarder M, Friedrich V, Janesch B, Fuchs K, Neumann L, Nimeth I, Zwickl NF, Dohm JC, Everest-Dass A, Kolarich D, Himmelbauer H, Altmann F, and Schäffer C
- Abstract
The cell surface of the oral pathogen Tannerella forsythia is heavily glycosylated with a unique, complex decasaccharide that is O -glycosidically linked to the bacterium's abundant surface (S-) layer, as well as other proteins. The S-layer glycoproteins are virulence factors of T. forsythia and there is evidence that protein O- glycosylation underpins the bacterium's pathogenicity. To elucidate the protein O -glycosylation pathway, genes suspected of encoding pathway components were first identified in the genome sequence of the ATCC 43037 type strain, revealing a 27-kb gene cluster that was shown to be polycistronic. Using a gene deletion approach targeted at predicted glycosyltransferases (Gtfs) and methyltransferases encoded in this gene cluster, in combination with mass spectrometry of the protein-released O- glycans, we show that the gene cluster encodes the species-specific part of the T. forsythia ATCC 43037 decasaccharide and that this is assembled step-wise on a pentasaccharide core. The core was previously proposed to be conserved within the Bacteroidetes phylum, to which T. forsythia is affiliated, and its biosynthesis is encoded elsewhere on the bacterial genome. Next, to assess the prevalence of protein O- glycosylation among Tannerella sp., the publicly available genome sequences of six T. forsythia strains were compared, revealing gene clusters of similar size and organization as found in the ATCC 43037 type strain. The corresponding region in the genome of a periodontal health-associated Tannerella isolate showed a different gene composition lacking most of the genes commonly found in the pathogenic strains. Finally, we investigated whether differential cell surface glycosylation impacts T. forsythia 's overall immunogenicity. Release of proinflammatory cytokines by dendritic cells (DCs) upon stimulation with defined Gtf-deficient mutants of the type strain was measured and their T cell-priming potential post-stimulation was explored. This revealed that the O- glycan is pivotal to modulating DC effector functions, with the T. forsythia -specific glycan portion suppressing and the pentasaccharide core activating a Th17 response. We conclude that complex protein O- glycosylation is a hallmark of pathogenic T. forsythia strains and propose it as a valuable target for the design of novel antimicrobials against periodontitis.
- Published
- 2018
- Full Text
- View/download PDF
24. Crop wild relative populations of Beta vulgaris allow direct mapping of agronomically important genes.
- Author
-
Capistrano-Gossmann GG, Ries D, Holtgräwe D, Minoche A, Kraft T, Frerichmann SLM, Rosleff Soerensen T, Dohm JC, González I, Schilhabel M, Varrelmann M, Tschoep H, Uphoff H, Schütze K, Borchardt D, Toerjek O, Mechelke W, Lein JC, Schechert AW, Frese L, Himmelbauer H, Weisshaar B, and Kopisch-Obuch FJ
- Subjects
- Alleles, Crops, Agricultural genetics, Disease Resistance genetics, Ecosystem, Genetic Association Studies, Genetic Variation, Genome, Plant, Geography, Hybridization, Genetic, Open Reading Frames, Phenotype, Plant Breeding, Plant Diseases genetics, Polymorphism, Single Nucleotide, RNA Interference, Beta vulgaris genetics, Contig Mapping, Gene Editing, Genes, Plant
- Abstract
Rapid identification of agronomically important genes is of pivotal interest for crop breeding. One source of such genes are crop wild relative (CWR) populations. Here we used a CWR population of <200 wild beets (B. vulgaris ssp. maritima), sampled in their natural habitat, to identify the sugar beet (Beta vulgaris ssp. vulgaris) resistance gene Rz2 with a modified version of mapping-by-sequencing (MBS). For that, we generated a draft genome sequence of the wild beet. Our results show the importance of preserving CWR in situ and demonstrate the great potential of CWR for rapid discovery of causal genes relevant for crop improvement. The candidate gene for Rz2 was identified by MBS and subsequently corroborated via RNA interference (RNAi). Rz2 encodes a CC-NB-LRR protein. Access to the DNA sequence of Rz2 opens the path to improvement of resistance towards rhizomania not only by marker-assisted breeding but also by genome editing.
- Published
- 2017
- Full Text
- View/download PDF
25. Dissecting the Effect of Genetic Variation on the Hepatic Expression of Drug Disposition Genes across the Collaborative Cross Mouse Strains.
- Author
-
Nachshon A, Abu-Toamih Atamni HJ, Steuerman Y, Sheikh-Hamed R, Dorman A, Mott R, Dohm JC, Lehrach H, Sultan M, Shamir R, Sauer S, Himmelbauer H, Iraqi FA, and Gat-Viks I
- Abstract
A central challenge in pharmaceutical research is to investigate genetic variation in response to drugs. The Collaborative Cross (CC) mouse reference population is a promising model for pharmacogenomic studies because of its large amount of genetic variation, genetic reproducibility, and dense recombination sites. While the CC lines are phenotypically diverse, their genetic diversity in drug disposition processes, such as detoxification reactions, is still largely uncharacterized. Here we systematically measured RNA-sequencing expression profiles from livers of 29 CC lines under baseline conditions. We then leveraged a reference collection of metabolic biotransformation pathways to map potential relations between drugs and their underlying expression quantitative trait loci (eQTLs). By applying this approach on proximal eQTLs, including eQTLs acting on the overall expression of genes and on the expression of particular transcript isoforms, we were able to construct the organization of hepatic eQTL-drug connectivity across the CC population. The analysis revealed a substantial impact of genetic variation acting on drug biotransformation, allowed mapping of potential joint genetic effects in the context of individual drugs, and demonstrated crosstalk between drug metabolism and lipid metabolism. Our findings provide a resource for investigating drug disposition in the CC strains, and offer a new paradigm for integrating biotransformation reactions to corresponding variations in DNA sequences.
- Published
- 2016
- Full Text
- View/download PDF
26. Oligoadenylation of 3' decay intermediates promotes cytoplasmic mRNA degradation in Drosophila cells.
- Author
-
Harnisch C, Cuzic-Feltens S, Dohm JC, Götze M, Himmelbauer H, and Wahle E
- Subjects
- Animals, Cells, Cultured, Drosophila melanogaster, Hydrolysis, Polynucleotide Adenylyltransferase metabolism, Adenine Nucleotides metabolism, Cytoplasm metabolism, Oligoribonucleotides metabolism, RNA, Messenger metabolism
- Abstract
Post-transcriptional 3' end addition of nucleotides is important in a variety of RNA decay pathways. We have examined the 3' end addition of nucleotides during the decay of the Hsp70 mRNA and a corresponding reporter RNA in Drosophila S2 cells by conventional sequencing of cDNAs obtained after mRNA circularization and by deep sequencing of dedicated libraries enriched for 3' decay intermediates along the length of the mRNA. Approximately 5%-10% of 3' decay intermediates carried nonencoded oligo(A) tails with a mean length of 2-3 nucleotides. RNAi experiments showed that the oligoadenylated RNA fragments were intermediates of exosomal decay and the noncanonical poly(A) polymerase Trf4-1 was mainly responsible for A addition. A hot spot of A addition corresponded to an intermediate of 3' decay that accumulated upon inhibition of decapping, and knockdown of Trf4-1 increased the abundance of this intermediate, suggesting that oligoadenylation facilitates 3' decay. Oligoadenylated 3' decay intermediates were found in the cytoplasmic fraction in association with ribosomes, and fluorescence microscopy revealed a cytoplasmic localization of Trf4-1. Thus, oligoadenylation enhances exosomal mRNA degradation in the cytoplasm., (© 2016 Harnisch et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.)
- Published
- 2016
- Full Text
- View/download PDF
27. Genome and transcriptome analysis of the Mesoamerican common bean and the role of gene duplications in establishing tissue and temporal specialization of genes.
- Author
-
Vlasova A, Capella-Gutiérrez S, Rendón-Anaya M, Hernández-Oñate M, Minoche AE, Erb I, Câmara F, Prieto-Barja P, Corvelo A, Sanseverino W, Westergaard G, Dohm JC, Pappas GJ Jr, Saburido-Alvarez S, Kedra D, Gonzalez I, Cozzuto L, Gómez-Garrido J, Aguilar-Morón MA, Andreu N, Aguilar OM, Garcia-Mas J, Zehnsdorf M, Vázquez MP, Delgado-Salinas A, Delaye L, Lowy E, Mentaberry A, Vianello-Brondani RP, García JL, Alioto T, Sánchez F, Himmelbauer H, Santalla M, Notredame C, Gabaldón T, Herrera-Estrella A, and Guigó R
- Subjects
- DNA, Plant genetics, Gene Duplication, Gene Expression Profiling, Genotype, Humans, Phylogeny, Seeds genetics, Sequence Analysis, DNA, Genome, Plant, Microsatellite Repeats genetics, Phaseolus genetics, Transcriptome genetics
- Abstract
Background: Legumes are the third largest family of angiosperms and the second most important crop class. Legume genomes have been shaped by extensive large-scale gene duplications, including an approximately 58 million year old whole genome duplication shared by most crop legumes., Results: We report the genome and the transcription atlas of coding and non-coding genes of a Mesoamerican genotype of common bean (Phaseolus vulgaris L., BAT93). Using a comprehensive phylogenomics analysis, we assessed the past and recent evolution of common bean, and traced the diversification of patterns of gene expression following duplication. We find that successive rounds of gene duplications in legumes have shaped tissue and developmental expression, leading to increased levels of specialization in larger gene families. We also find that many long non-coding RNAs are preferentially expressed in germ-line-related tissues (pods and seeds), suggesting that they play a significant role in fruit development. Our results also suggest that most bean-specific gene family expansions, including resistance gene clusters, predate the split of the Mesoamerican and Andean gene pools., Conclusions: The genome and transcriptome data herein generated for a Mesoamerican genotype represent a counterpart to the genomic resources already available for the Andean gene pool. Altogether, this information will allow the genetic dissection of the characters involved in the domestication and adaptation of the crop, and their further implementation in breeding strategies for this important crop.
- Published
- 2016
- Full Text
- View/download PDF
28. Diversification, evolution and methylation of short interspersed nuclear element families in sugar beet and related Amaranthaceae species.
- Author
-
Schwichtenberg K, Wenke T, Zakrzewski F, Seibt KM, Minoche A, Dohm JC, Weisshaar B, Himmelbauer H, and Schmidt T
- Subjects
- DNA Methylation genetics, Amaranthaceae genetics, Beta vulgaris genetics, Evolution, Molecular, Genetic Variation, Genome, Plant genetics, Short Interspersed Nucleotide Elements genetics
- Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous non-long terminal repeat retrotransposons which are widely distributed in eukaryotic organisms. While SINEs have been intensively studied in animals, only limited information is available about plant SINEs. We analysed 22 SINE families from seven genomes of the Amaranthaceae family and identified 34 806 SINEs, including 19 549 full-length copies. With the focus on sugar beet (Beta vulgaris), we performed a comparative analysis of the diversity, genomic and chromosomal organization and the methylation of SINEs to provide a detailed insight into the evolution and age of Amaranthaceae SINEs. The lengths of consensus sequences of SINEs range from 113 nucleotides (nt) up to 224 nt. The SINEs show dispersed distribution on all chromosomes but were found with higher incidence in subterminal euchromatic chromosome regions. The methylation of SINEs is increased compared with their flanking regions, and the strongest effect is visible for cytosines in the CHH context, indicating an involvement of asymmetric methylation in the silencing of SINEs., (© 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.)
- Published
- 2016
- Full Text
- View/download PDF
29. Exploiting single-molecule transcript sequencing for eukaryotic gene prediction.
- Author
-
Minoche AE, Dohm JC, Schneider J, Holtgräwe D, Viehöver P, Montfort M, Sörensen TR, Weisshaar B, and Himmelbauer H
- Subjects
- Beta vulgaris genetics, DNA, Complementary chemistry, Genes, Plant, Molecular Sequence Data, Spinacia oleracea genetics, Gene Expression Profiling methods, Sequence Analysis, RNA methods
- Abstract
We develop a method to predict and validate gene models using PacBio single-molecule, real-time (SMRT) cDNA reads. Ninety-eight percent of full-insert SMRT reads span complete open reading frames. Gene model validation using SMRT reads is developed as automated process. Optimized training and prediction settings and mRNA-seq noise reduction of assisting Illumina reads results in increased gene prediction sensitivity and precision. Additionally, we present an improved gene set for sugar beet (Beta vulgaris) and the first genome-wide gene set for spinach (Spinacia oleracea). The workflow and guidelines are a valuable resource to obtain comprehensive gene sets for newly sequenced genomes of non-model eukaryotes.
- Published
- 2015
- Full Text
- View/download PDF
30. Profiling of extensively diversified plant LINEs reveals distinct plant-specific subclades.
- Author
-
Heitkam T, Holtgräwe D, Dohm JC, Minoche AE, Himmelbauer H, Weisshaar B, and Schmidt T
- Subjects
- DNA, Plant genetics, Evolution, Molecular, Genetic Variation, Genome, Plant genetics, Retroelements genetics
- Abstract
A large fraction of eukaryotic genomes is made up of long interspersed nuclear elements (LINEs). Due to their capability to create novel copies via error-prone reverse transcription, they generate multiple families and reach high copy numbers. Although mammalian LINEs have been well described, plant LINEs have been only poorly investigated. Here, we present a systematic cross-species survey of LINEs in higher plant genomes shedding light on plant LINE evolution as well as diversity, and facilitating their annotation in genome projects. Applying a Hidden Markov Model (HMM)-based analysis, 59 390 intact LINE reverse transcriptases (RTs) were extracted from 23 plant genomes. These fall in only two out of 28 LINE clades (L1 and RTE) known in eukaryotes. While plant RTE LINEs are highly homogenous and mostly constitute only a single family per genome, plant L1 LINEs are extremely diverse and form numerous families. Despite their heterogeneity, all members across the 23 species fall into only seven L1 subclades, some of them defined here. Exemplarily focusing on the L1 LINEs of a basal reference plant genome (Beta vulgaris), we show that the subclade classification level does not only reflect RT sequence similarity, but also mirrors structural aspects of complete LINE retrotransposons, like element size, position and type of encoded enzymatic domains. Our comprehensive catalogue of plant LINE RTs serves the classification of highly diverse plant LINEs, while the provided subclade-specific HMMs facilitate their annotation., (© 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.)
- Published
- 2014
- Full Text
- View/download PDF
31. The CHH motif in sugar beet satellite DNA: a modulator for cytosine methylation.
- Author
-
Zakrzewski F, Schubert V, Viehoever P, Minoche AE, Dohm JC, Himmelbauer H, Weisshaar B, and Schmidt T
- Subjects
- Chromosomes, Plant, Epigenesis, Genetic, Genome, Plant, Nucleotide Motifs, Repetitive Sequences, Nucleic Acid, Sequence Analysis, DNA, Beta vulgaris genetics, Cytosine metabolism, DNA Methylation, DNA, Plant chemistry
- Abstract
Methylation of DNA is important for the epigenetic silencing of repetitive DNA in plant genomes. Knowledge about the cytosine methylation status of satellite DNAs, a major class of repetitive DNA, is scarce. One reason for this is that arrays of tandemly arranged sequences are usually collapsed in next-generation sequencing assemblies. We applied strategies to overcome this limitation and quantified the level of cytosine methylation and its pattern in three satellite families of sugar beet (Beta vulgaris) which differ in their abundance, chromosomal localization and monomer size. We visualized methylation levels along pachytene chromosomes with respect to small satellite loci at maximum resolution using chromosome-wide fluorescent in situ hybridization complemented with immunostaining and super-resolution microscopy. Only reduced methylation of many satellite arrays was obtained. To investigate methylation at the nucleotide level we performed bisulfite sequencing of 1569 satellite sequences. We found that the level of methylation of cytosine strongly depends on the sequence context: cytosines in the CHH motif show lower methylation (44-52%), while CG and CHG motifs are more strongly methylated. This affects the overall methylation of satellite sequences because CHH occurs frequently while CG and CHG are rare or even absent in the satellite arrays investigated. Evidently, CHH is the major target for modulation of the cytosine methylation level of adjacent monomers within individual arrays and contributes to their epigenetic function. This strongly indicates that asymmetric cytosine methylation plays a role in the epigenetic modification of satellite repeats in plant genomes., (© 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.)
- Published
- 2014
- Full Text
- View/download PDF
32. Differential expression patterns of non-symbiotic hemoglobins in sugar beet (Beta vulgaris ssp. vulgaris).
- Author
-
Leiva-Eriksson N, Pin PA, Kraft T, Dohm JC, Minoche AE, Himmelbauer H, and Bülow L
- Subjects
- Amino Acid Sequence, Beta vulgaris physiology, Flowers genetics, Flowers physiology, Gene Expression Regulation, Developmental, Genes, Plant, Hemoglobins chemistry, Hemoglobins metabolism, Molecular Sequence Data, Photoperiod, Plant Proteins chemistry, Plant Proteins metabolism, Protein Transport, Sequence Alignment, Subcellular Fractions metabolism, Beta vulgaris genetics, Gene Expression Profiling, Gene Expression Regulation, Plant, Hemoglobins genetics, Plant Proteins genetics, Symbiosis genetics
- Abstract
Biennial sugar beet (Beta vulgaris spp. vulgaris) is a Caryophyllidae that has adapted its growth cycle to the seasonal temperature and daylength variation of temperate regions. This is the first time a holistic study of the expression pattern of non-symbiotic hemoglobins (nsHbs) is being carried out in a member of this group and under two essential environmental conditions for flowering, namely vernalization and length of photoperiod. BvHb genes were identified by sequence homology searches against the latest draft of the sugar beet genome. Three nsHb genes (BvHb1.1, BvHb1.2 and BvHb2) and one truncated Hb gene (BvHb3) were found in the genome of sugar beet. Gene expression profiling of the nsHb genes was carried out by quantitative PCR in different organs and developmental stages, as well as during vernalization and under different photoperiods. BvHb1.1 and BvHb2 showed differential expression during vernalization as well as during long and short days. The high expression of BvHb2 indicates that it has an active role in the cell, maybe even taking over some BvHb1.2 functions, except during germination where BvHb1.2 together with BvHb1.1-both Class 1 nsHbs-are highly expressed. The unprecedented finding of a leader peptide at the N-terminus of BvHb1.1, for the first time in an nsHb from higher plants, together with its observed expression indicate that it may have a very specific role due to its suggested location in chloroplasts. Our findings open up new possibilities for research, breeding and engineering since Hbs could be more involved in plant development than previously was anticipated.
- Published
- 2014
- Full Text
- View/download PDF
33. The genome of the recently domesticated crop plant sugar beet (Beta vulgaris).
- Author
-
Dohm JC, Minoche AE, Holtgräwe D, Capella-Gutiérrez S, Zakrzewski F, Tafer H, Rupp O, Sörensen TR, Stracke R, Reinhardt R, Goesmann A, Kraft T, Schulz B, Stadler PF, Schmidt T, Gabaldón T, Lehrach H, Weisshaar B, and Himmelbauer H
- Subjects
- Biofuels supply & distribution, Carbohydrate Metabolism, Chromosomes, Plant genetics, Ethanol metabolism, Genomics, In Situ Hybridization, Fluorescence, Molecular Sequence Data, Phylogeny, Sequence Analysis, DNA, Spinacia oleracea genetics, Beta vulgaris genetics, Crops, Agricultural genetics, Genome, Plant genetics
- Abstract
Sugar beet (Beta vulgaris ssp. vulgaris) is an important crop of temperate climates which provides nearly 30% of the world's annual sugar production and is a source for bioethanol and animal feed. The species belongs to the order of Caryophylalles, is diploid with 2n = 18 chromosomes, has an estimated genome size of 714-758 megabases and shares an ancient genome triplication with other eudicot plants. Leafy beets have been cultivated since Roman times, but sugar beet is one of the most recently domesticated crops. It arose in the late eighteenth century when lines accumulating sugar in the storage root were selected from crosses made with chard and fodder beet. Here we present a reference genome sequence for sugar beet as the first non-rosid, non-asterid eudicot genome, advancing comparative genomics and phylogenetic reconstructions. The genome sequence comprises 567 megabases, of which 85% could be assigned to chromosomes. The assembly covers a large proportion of the repetitive sequence content that was estimated to be 63%. We predicted 27,421 protein-coding genes supported by transcript data and annotated them on the basis of sequence homology. Phylogenetic analyses provided evidence for the separation of Caryophyllales before the split of asterids and rosids, and revealed lineage-specific gene family expansions and losses. We sequenced spinach (Spinacia oleracea), another Caryophyllales species, and validated features that separate this clade from rosids and asterids. Intraspecific genomic variation was analysed based on the genome sequences of sea beet (Beta vulgaris ssp. maritima; progenitor of all beet crops) and four additional sugar beet accessions. We identified seven million variant positions in the reference genome, and also large regions of low variability, indicating artificial selection. The sugar beet genome sequence enables the identification of genes affecting agronomically relevant traits, supports molecular breeding and maximizes the plant's potential in energy biotechnology.
- Published
- 2014
- Full Text
- View/download PDF
34. Cytosine methylation of an ancient satellite family in the wild beet Beta procumbens.
- Author
-
Schmidt M, Hense S, Minoche AE, Dohm JC, Himmelbauer H, Schmidt T, and Zakrzewski F
- Subjects
- Base Sequence, Centromere genetics, Chromosomes, Plant genetics, CpG Islands genetics, Molecular Sequence Data, Beta vulgaris genetics, Beta vulgaris metabolism, Cytosine metabolism, DNA Methylation genetics, DNA, Satellite genetics
- Abstract
DNA methylation is an essential epigenetic feature for the regulation and maintenance of heterochromatin. Satellite DNA is a repetitive sequence component that often occurs in large arrays in heterochromatin of subtelomeric, intercalary and centromeric regions. Knowledge about the methylation status of satellite DNA is important for understanding the role of repetitive DNA in heterochromatization. In this study, we investigated the cytosine methylation of the ancient satellite family pEV in the wild beet Beta procumbens. The pEV satellite is widespread in species-specific pEV subfamilies in the genus Beta and most likely originated before the radiation of the Betoideae and Chenopodioideae. In B. procumbens, the pEV subfamily occurs abundantly and spans intercalary and centromeric regions. To uncover its cytosine methylation, we performed chromosome-wide immunostaining and bisulfite sequencing of pEV satellite repeats. We found that CG and CHG sites are highly methylated while CHH sites show only low levels of methylation. As a consequence of the low frequency of CG and CHG sites and the preferential occurrence of most cytosines in the CHH motif in pEV monomers, this satellite family displays only low levels of total cytosine methylation., (© 2014 S. Karger AG, Basel.)
- Published
- 2014
- Full Text
- View/download PDF
35. Microarray and deep sequencing cross-platform analysis of the mirRNome and isomiR variation in response to epidermal growth factor.
- Author
-
Llorens F, Hummel M, Pantano L, Pastor X, Vivancos A, Castillo E, Mattlin H, Ferrer A, Ingham M, Noguera M, Kofler R, Dohm JC, Pluvinet R, Bayés M, Himmelbauer H, del Rio JA, Martí E, and Sumoy L
- Subjects
- Gene Silencing, Gene Targeting, HeLa Cells, Humans, Sequence Analysis, RNA, Signal Transduction drug effects, Signal Transduction genetics, Transcription, Genetic drug effects, Transcription, Genetic genetics, Epidermal Growth Factor pharmacology, Genomics methods, High-Throughput Nucleotide Sequencing methods, MicroRNAs genetics, Oligonucleotide Array Sequence Analysis methods
- Abstract
Background: Epidermal Growth Factor (EGF) plays an important function in the regulation of cell growth, proliferation, and differentiation by binding to its receptor (EGFR) and providing cancer cells with increased survival responsiveness. Signal transduction carried out by EGF has been extensively studied at both transcriptional and post-transcriptional levels. Little is known about the involvement of microRNAs (miRNAs) in the EGF signaling pathway. miRNAs have emerged as major players in the complex networks of gene regulation, and cancer miRNA expression studies have evidenced a direct involvement of miRNAs in cancer progression., Results: In this study, we have used an integrative high content analysis approach to identify the specific miRNAs implicated in EGF signaling in HeLa cells as potential mediators of cancer mediated functions. We have used microarray and deep-sequencing technologies in order to obtain a global view of the EGF miRNA transcriptome with a robust experimental cross-validation. By applying a procedure based on Rankprod tests, we have delimited a solid set of EGF-regulated miRNAs. After validating regulated miRNAs by reverse transcription quantitative PCR, we have derived protein networks and biological functions from the predicted targets of the regulated miRNAs to gain insight into the potential role of miRNAs in EGF-treated cells. In addition, we have analyzed sequence heterogeneity due to editing relative to the reference sequence (isomiRs) among regulated miRNAs., Conclusions: We propose that the use of global genomic miRNA cross-validation derived from high throughput technologies can be used to generate more reliable datasets inferring more robust networks of co-regulated predicted miRNA target genes.
- Published
- 2013
- Full Text
- View/download PDF
36. Highly diverse chromoviruses of Beta vulgaris are classified by chromodomains and chromosomal integration.
- Author
-
Weber B, Heitkam T, Holtgräwe D, Weisshaar B, Minoche AE, Dohm JC, Himmelbauer H, and Schmidt T
- Abstract
Background: Chromoviruses are one of the three genera of Ty3-gypsy long terminal repeat (LTR) retrotransposons, and are present in high copy numbers in plant genomes. They are widely distributed within the plant kingdom, with representatives even in lower plants such as green and red algae. Their hallmark is the presence of a chromodomain at the C-terminus of the integrase. The chromodomain exhibits structural characteristics similar to proteins of the heterochromatin protein 1 (HP1) family, which mediate the binding of each chromovirus type to specific histone variants. A specific integration via the chromodomain has been shown for only a few chromoviruses. However, a detailed study of different chromoviral clades populating a single plant genome has not yet been carried out., Results: We conducted a comprehensive survey of chromoviruses within the Beta vulgaris (sugar beet) genome, and found a highly diverse chromovirus population, with significant differences in element size, primarily caused by their flanking LTRs. In total, we identified and annotated full-length members of 16 families belonging to the four plant chromoviral clades: CRM, Tekay, Reina, and Galadriel. The families within each clade are structurally highly conserved; in particular, the position of the chromodomain coding region relative to the polypurine tract is clade-specific. Two distinct groups of chromodomains were identified. The group II chromodomain was present in three chromoviral clades, whereas families of the CRM clade contained a more divergent motif. Physical mapping using representatives of all four clades identified a clade-specific integration pattern. For some chromoviral families, we detected the presence of expressed sequence tags, indicating transcriptional activity., Conclusions: We present a detailed study of chromoviruses, belonging to the four major clades, which populate a single plant genome. Our results illustrate the diversity and family structure of B. vulgaris chromoviruses, and emphasize the role of chromodomains in the targeted integration of these viruses. We suggest that the diverse sets of plant chromoviruses with their different localization patterns might help to facilitate plant-genome organization in a structural and functional manner.
- Published
- 2013
- Full Text
- View/download PDF
37. Evolutionary reshuffling in the Errantivirus lineage Elbe within the Beta vulgaris genome.
- Author
-
Wollrab C, Heitkam T, Holtgräwe D, Weisshaar B, Minoche AE, Dohm JC, Himmelbauer H, and Schmidt T
- Subjects
- Amino Acid Sequence, Beta vulgaris virology, Chromosomes, Plant genetics, Computational Biology methods, Conserved Sequence, Genetic Variation, Molecular Sequence Data, Nucleotide Motifs, Open Reading Frames, Physical Chromosome Mapping, Plant Viruses classification, Sequence Alignment, Species Specificity, Beta vulgaris genetics, Evolution, Molecular, Genome, Plant, Plant Viruses genetics, Recombination, Genetic, Retroelements
- Abstract
LTR retrotransposons and retroviruses are closely related. Although a viral envelope gene is found in some LTR retrotransposons and all retroviruses, only the latter show infectivity. The identification of Ty3-gypsy-like retrotransposons possessing putative envelope-like open reading frames blurred the taxonomical borders and led to the establishment of the Errantivirus, Metavirus and Chromovirus genera within the Metaviridae. Only a few plant Errantiviruses have been described, and their evolutionary history is not well understood. In this study, we investigated 27 retroelements of four abundant Elbe retrotransposon families belonging to the Errantiviruses in Beta vulgaris (sugar beet). Retroelements of the Elbe lineage integrated between 0.02 and 5.59 million years ago, and show family-specific variations in autonomy and degree of rearrangements: while Elbe3 members are highly fragmented, often truncated and present in a high number of solo LTRs, Elbe2 members are mainly autonomous. We observed extensive reshuffling of structural motifs across families, leading to the formation of new retrotransposon families. Elbe retrotransposons harbor a typical envelope-like gene, often encoding transmembrane domains. During the course of Elbe evolution, the additional open reading frames have been strongly modified or independently acquired. Taken together, the Elbe lineage serves as retrotransposon model reflecting the various stages in Errantivirus evolution, and allows a detailed analysis of retrotransposon family formation., (© 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.)
- Published
- 2012
- Full Text
- View/download PDF
38. The role of a pseudo-response regulator gene in life cycle adaptation and domestication of beet.
- Author
-
Pin PA, Zhang W, Vogt SH, Dally N, Büttner B, Schulze-Buxloh G, Jelly NS, Chia TY, Mutasa-Göttgens ES, Dohm JC, Himmelbauer H, Weisshaar B, Kraus J, Gielen JJ, Lommel M, Weyens G, Wahl B, Schechert A, Nilsson O, Jung C, Kraft T, and Müller AE
- Subjects
- Adaptation, Biological genetics, Amino Acid Sequence, Amplified Fragment Length Polymorphism Analysis, Base Sequence, Beta vulgaris genetics, Chromosome Mapping, Chromosomes, Artificial, Bacterial genetics, Cloning, Molecular, DNA Primers genetics, Flowers genetics, Genetic Markers genetics, Haplotypes genetics, Immunoblotting, Models, Biological, Molecular Sequence Data, Phenotype, Photoperiod, Phylogeny, Seasons, Selection, Genetic, Sequence Alignment, Sequence Analysis, DNA, Adaptation, Biological physiology, Agriculture methods, Beta vulgaris physiology, Biological Evolution, Flowers physiology, Genes, Regulator genetics, Plant Proteins genetics
- Abstract
Life cycle adaptation to latitudinal and seasonal variation in photoperiod and temperature is a major determinant of evolutionary success in flowering plants. Whereas the life cycle of the dicotyledonous model species Arabidopsis thaliana is controlled by two epistatic genes, FLOWERING LOCUS C and FRIGIDA, three unrelated loci (VERNALIZATION) determine the spring and winter habits of monocotyledonous plants such as temperate cereals. In the core eudicot species Beta vulgaris, whose lineage diverged from that leading to Arabidopsis shortly after the monocot-dicot split 140 million years ago, the bolting locus B is a master switch distinguishing annuals from biennials. Here, we isolated B and show that the pseudo-response regulator gene BOLTING TIME CONTROL 1 (BvBTC1), through regulation of the FLOWERING LOCUS T genes, is absolutely necessary for flowering and mediates the response to both long days and vernalization. Our results suggest that domestication of beets involved the selection of a rare partial loss-of-function BvBTC1 allele that imparts reduced sensitivity to photoperiod that is restored by vernalization, thus conferring bienniality, and illustrate how evolutionary plasticity at a key regulatory point can enable new life cycle strategies., (Copyright © 2012 Elsevier Ltd. All rights reserved.)
- Published
- 2012
- Full Text
- View/download PDF
39. Maturation of mammalian H/ACA box snoRNAs: PAPD5-dependent adenylation and PARN-dependent trimming.
- Author
-
Berndt H, Harnisch C, Rammelt C, Stöhr N, Zirkel A, Dohm JC, Himmelbauer H, Tavanez JP, Hüttelmaier S, and Wahle E
- Subjects
- Base Sequence, Catalysis, Cell Nucleolus metabolism, Coiled Bodies metabolism, Exoribonucleases genetics, Exosome Multienzyme Ribonuclease Complex, Humans, Nuclear Proteins metabolism, Nucleotide Motifs, Polyadenylation, Protein Transport, RNA Editing, Exoribonucleases metabolism, RNA Nucleotidyltransferases metabolism, RNA, Small Nucleolar chemistry, RNA, Small Nucleolar metabolism
- Abstract
Small nucleolar and small Cajal body RNAs (snoRNAs and scaRNAs) of the H/ACA box and C/D box type are generated by exonucleolytic shortening of longer precursors. Removal of the last few nucleotides at the 3' end is known to be a distinct step. We report that, in human cells, knock-down of the poly(A) specific ribonuclease (PARN), previously implicated only in mRNA metabolism, causes the accumulation of oligoadenylated processing intermediates of H/ACA box but not C/D box RNAs. In agreement with a role of PARN in snoRNA and scaRNA processing, the enzyme is concentrated in nucleoli and Cajal bodies. Oligo(A) tails are attached to a short stub of intron sequence remaining beyond the mature 3' end of the snoRNAs. The noncanonical poly(A) polymerase PAPD5 is responsible for addition of the oligo(A) tails. We suggest that deadenylation is coupled to clean 3' end trimming, which might serve to enhance snoRNA stability.
- Published
- 2012
- Full Text
- View/download PDF
40. Palaeohexaploid ancestry for Caryophyllales inferred from extensive gene-based physical and genetic mapping of the sugar beet genome (Beta vulgaris).
- Author
-
Dohm JC, Lange C, Holtgräwe D, Sörensen TR, Borchardt D, Schulz B, Lehrach H, Weisshaar B, and Himmelbauer H
- Subjects
- Base Sequence, Chromosomes, Artificial, Bacterial, DNA, Plant genetics, Expressed Sequence Tags, Genes, Plant genetics, Genetic Linkage, Genetic Markers genetics, Magnoliopsida genetics, Physical Chromosome Mapping, Polymorphism, Single Nucleotide genetics, Polyploidy, Sequence Analysis, DNA, Synteny genetics, Beta vulgaris genetics, Chromosome Mapping, Evolution, Molecular, Genome, Plant genetics, Genomics
- Abstract
Sugar beet (Beta vulgaris) is an important crop plant that accounts for 30% of the world's sugar production annually. The genus Beta is a distant relative of currently sequenced taxa within the core eudicotyledons; the genomic characterization of sugar beet is essential to make its genome accessible to molecular dissection. Here, we present comprehensive genomic information in genetic and physical maps that cover all nine chromosomes. Based on this information we identified the proposed ancestral linkage groups of rosids and asterids within the sugar beet genome. We generated an extended genetic map that comprises 1127 single nucleotide polymorphism markers prepared from expressed sequence tags and bacterial artificial chromosome (BAC) end sequences. To construct a genome-wide physical map, we hybridized gene-derived oligomer probes against two BAC libraries with 9.5-fold cumulative coverage of the 758 Mbp genome. More than 2500 probes and clones were integrated both in genetic maps and the physical data. The final physical map encompasses 535 chromosomally anchored contigs that contains 8361 probes and 22 815 BAC clones. By using the gene order established with the physical map, we detected regions of synteny between sugar beet (order Caryophyllales) and rosid species that involves 1400-2700 genes in the sequenced genomes of Arabidopsis, poplar, grapevine, and cacao. The data suggest that Caryophyllales share the palaeohexaploid ancestor proposed for rosids and asterids. Taken together, we here provide extensive molecular resources for sugar beet and enable future high-resolution trait mapping, gene identification, and cross-referencing to regions sequenced in other plant species., (© 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.)
- Published
- 2012
- Full Text
- View/download PDF
41. Survey of sugar beet (Beta vulgaris L.) hAT transposons and MITE-like hATpin derivatives.
- Author
-
Menzel G, Krebs C, Diez M, Holtgräwe D, Weisshaar B, Minoche AE, Dohm JC, Himmelbauer H, and Schmidt T
- Subjects
- Amino Acid Sequence, Base Sequence, Chromosomes, Artificial, Bacterial, Chromosomes, Plant, Gene Library, Genome, Plant, In Situ Hybridization, Fluorescence, Molecular Sequence Data, Phylogeny, Physical Chromosome Mapping, Transposases genetics, Beta vulgaris genetics, DNA Transposable Elements
- Abstract
Genome-wide analyses of repetitive DNA suggest a significant impact particularly of transposable elements on genome size and evolution of virtually all eukaryotic organisms. In this study, we analyzed the abundance and diversity of the hAT transposon superfamily of the sugar beet (B. vulgaris) genome, using molecular, bioinformatic and cytogenetic approaches. We identified 81 transposase-coding sequences, three of which are part of structurally intact but nonfunctional hAT transposons (BvhAT), in a B. vulgaris BAC library as well as in whole genome sequencing-derived data sets. Additionally, 116 complete and 497 truncated non-autonomous BvhAT derivatives lacking the transposase gene were in silico-detected. The 116 complete derivatives were subdivided into four BvhATpin groups each characterized by a distinct terminal inverted repeat motif. Both BvhAT and BvhATpin transposons are specific for species of the genus Beta and closely related species, showing a localization on B. vulgaris chromosomes predominantely in euchromatic regions. The lack of any BvhAT transposase function together with the high degree of degeneration observed for the BvhAT and the BvhATpin genomic fraction contrasts with the abundance and activity of autonomous and non-autonomous hAT transposons revealed in other plant species. This indicates a possible genus-specific structural and functional repression of the hAT transposon superfamily during Beta diversification and evolution.
- Published
- 2012
- Full Text
- View/download PDF
42. Identification of mediator complex 26 (Crsp7) gametologs on platypus X1 and Y5 sex chromosomes: a candidate testis-determining gene in monotremes?
- Author
-
Tsend-Ayush E, Kortschak RD, Bernard P, Lim SL, Ryan J, Rosenkranz R, Borodina T, Dohm JC, Himmelbauer H, Harley VR, and Grützner F
- Subjects
- Animals, Chromosomes, Artificial, Bacterial genetics, Chromosomes, Mammalian genetics, Evolution, Molecular, Gene Expression Regulation, Genes, Reporter, Genes, sry, HEK293 Cells, Humans, Kidney cytology, Kidney metabolism, Male, Mice, Phylogeny, Physical Chromosome Mapping, SOX9 Transcription Factor genetics, Sex Determination Processes, Testis cytology, Transfection, Mediator Complex genetics, Platypus genetics, Testis physiology, X Chromosome genetics, Y Chromosome genetics
- Abstract
The basal lineage of monotremes features an extraordinarily complex sex chromosome system which has provided novel insights into the evolution of mammalian sex chromosomes. Recently, sequence information from autosomes, X chromosomes, and XY-shared pseudoautosomal regions has become available. However, no gene has so far been described on any of the Y chromosome-specific regions. We analyzed sequences derived from Y-specific BAC clones to identify genes with potentially male-specific function. Here, we report the identification and characterization of the mediator complex protein gametologs on platypus Y5 (Crspy). We also identified the X-chromosomal copy which unexpectedly maps to X1 (Crspx). Sequence comparison shows extensive divergence between the X and Y copy, but we found no significant positive selection on either gametolog. Expression analysis shows widespread expression of Crspx. Crspy is expressed exclusively in males with particularly strong expression in testis and kidney. Reporter gene assays to investigate whether Crspx/y can act on the recently discovered mouse Sox9 testis-specific enhancer element did reveal a modest effect together with mouse Sox9 + Sf1, but showed overall no significant upregulation of the reporter gene. This is the first report of a differentiated functional male-specific gene on platypus Y chromosomes, providing new insights into sex chromosome evolution and a candidate gene for male-specific function in monotremes.
- Published
- 2012
- Full Text
- View/download PDF
43. Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and genome analyzer systems.
- Author
-
Minoche AE, Dohm JC, and Himmelbauer H
- Subjects
- Artifacts, Automation, Laboratory, Base Composition genetics, Base Sequence, Genomics, High-Throughput Nucleotide Sequencing instrumentation, High-Throughput Nucleotide Sequencing statistics & numerical data, Molecular Sequence Data, Mutagenesis, Insertional, Polymorphism, Genetic, Reproducibility of Results, Sensitivity and Specificity, Sequence Analysis, DNA instrumentation, Sequence Analysis, DNA statistics & numerical data, Sequence Deletion, Arabidopsis genetics, Bacteriophage phi X 174 genetics, Beta vulgaris genetics, High-Throughput Nucleotide Sequencing methods, Sequence Analysis, DNA methods
- Abstract
Background: The generation and analysis of high-throughput sequencing data are becoming a major component of many studies in molecular biology and medical research. Illumina's Genome Analyzer (GA) and HiSeq instruments are currently the most widely used sequencing devices. Here, we comprehensively evaluate properties of genomic HiSeq and GAIIx data derived from two plant genomes and one virus, with read lengths of 95 to 150 bases., Results: We provide quantifications and evidence for GC bias, error rates, error sequence context, effects of quality filtering, and the reliability of quality values. By combining different filtering criteria we reduced error rates 7-fold at the expense of discarding 12.5% of alignable bases. While overall error rates are low in HiSeq data we observed regions of accumulated wrong base calls. Only 3% of all error positions accounted for 24.7% of all substitution errors. Analyzing the forward and reverse strands separately revealed error rates of up to 18.7%. Insertions and deletions occurred at very low rates on average but increased to up to 2% in homopolymers. A positive correlation between read coverage and GC content was found depending on the GC content range., Conclusions: The errors and biases we report have implications for the use and the interpretation of Illumina sequencing data. GAIIx and HiSeq data sets show slightly different error profiles. Quality filtering is essential to minimize downstream analysis artifacts. Supporting previous recommendations, the strand-specificity provides a criterion to distinguish sequencing errors from low abundance polymorphisms.
- Published
- 2011
- Full Text
- View/download PDF
44. Epigenetic profiling of heterochromatic satellite DNA.
- Author
-
Zakrzewski F, Weisshaar B, Fuchs J, Bannack E, Minoche AE, Dohm JC, Himmelbauer H, and Schmidt T
- Subjects
- Blotting, Southern, Centromere genetics, Centromere metabolism, Chromosomes, Plant genetics, Chromosomes, Plant metabolism, Cluster Analysis, DNA Methylation, DNA, Satellite genetics, DNA, Satellite metabolism, Euchromatin genetics, Euchromatin metabolism, Heterochromatin genetics, Heterochromatin metabolism, Histones genetics, Histones metabolism, In Situ Hybridization, Fluorescence, RNA, Small Interfering genetics, Sequence Analysis, DNA, Small Molecule Libraries chemistry, Beta vulgaris genetics, Beta vulgaris metabolism, Centromere chemistry, Chromosomes, Plant chemistry, DNA, Satellite chemistry, Epigenomics methods, Euchromatin chemistry, Heterochromatin chemistry, RNA, Small Interfering chemistry
- Abstract
Sugar beet (Beta vulgaris) chromosomes consist of large heterochromatic blocks in pericentromeric, centromeric, and intercalary regions comprised of two different highly abundant DNA satellite families. To investigate DNA methylation at single base resolution at heterochromatic regions, we applied a method for strand-specific bisulfite sequencing of more than 1,000 satellite monomers followed by statistical analyses. As a result, we uncovered diversity in the distribution of different methylation patterns in both satellite families. Heavily methylated CG and CHG (H=A, T, or C) sites occur more frequently in intercalary heterochromatin, while CHH sites, with the exception of CAA, are only sparsely methylated, in both intercalary and pericentromeric/centromeric heterochromatin. We show that the difference in DNA methylation intensity is correlated to unequal distribution of heterochromatic histone H3 methylation marks. While clusters of H3K9me2 were absent from pericentromeric heterochromatin and restricted only to intercalary heterochromatic regions, H3K9me1 and H3K27me1 were observed in all types of heterochromatin. By sequencing of a small RNA library consisting of 6.76 million small RNAs, we identified small interfering RNAs (siRNAs) of 24 nucleotides in size which originated from both strands of the satellite DNAs. We hypothesize an involvement of these siRNAs in the regulation of DNA and histone methylation for maintaining heterochromatin.
- Published
- 2011
- Full Text
- View/download PDF
45. Multiple platform assessment of the EGF dependent transcriptome by microarray and deep tag sequencing analysis.
- Author
-
Llorens F, Hummel M, Pastor X, Ferrer A, Pluvinet R, Vivancos A, Castillo E, Iraola S, Mosquera AM, González E, Lozano J, Ingham M, Dohm JC, Noguera M, Kofler R, del Río JA, Bayés M, Himmelbauer H, and Sumoy L
- Subjects
- HeLa Cells, Humans, Meta-Analysis as Topic, Metabolic Networks and Pathways genetics, Metallothionein genetics, Metallothionein metabolism, Signal Transduction, Software, Epidermal Growth Factor pharmacology, Gene Expression Profiling, Oligonucleotide Array Sequence Analysis methods, Sequence Analysis, DNA methods
- Abstract
Background: Epidermal Growth Factor (EGF) is a key regulatory growth factor activating many processes relevant to normal development and disease, affecting cell proliferation and survival. Here we use a combined approach to study the EGF dependent transcriptome of HeLa cells by using multiple long oligonucleotide based microarray platforms (from Agilent, Operon, and Illumina) in combination with digital gene expression profiling (DGE) with the Illumina Genome Analyzer., Results: By applying a procedure for cross-platform data meta-analysis based on RankProd and GlobalAncova tests, we establish a well validated gene set with transcript levels altered after EGF treatment. We use this robust gene list to build higher order networks of gene interaction by interconnecting associated networks, supporting and extending the important role of the EGF signaling pathway in cancer. In addition, we find an entirely new set of genes previously unrelated to the currently accepted EGF associated cellular functions., Conclusions: We propose that the use of global genomic cross-validation derived from high content technologies (microarrays or deep sequencing) can be used to generate more reliable datasets. This approach should help to improve the confidence of downstream in silico functional inference analyses based on high content data.
- Published
- 2011
- Full Text
- View/download PDF
46. Whole-genome sequencing identifies recurrent mutations in chronic lymphocytic leukaemia.
- Author
-
Puente XS, Pinyol M, Quesada V, Conde L, Ordóñez GR, Villamor N, Escaramis G, Jares P, Beà S, González-Díaz M, Bassaganyas L, Baumann T, Juan M, López-Guerra M, Colomer D, Tubío JM, López C, Navarro A, Tornador C, Aymerich M, Rozman M, Hernández JM, Puente DA, Freije JM, Velasco G, Gutiérrez-Fernández A, Costa D, Carrió A, Guijarro S, Enjuanes A, Hernández L, Yagüe J, Nicolás P, Romeo-Casabona CM, Himmelbauer H, Castillo E, Dohm JC, de Sanjosé S, Piris MA, de Alava E, San Miguel J, Royo R, Gelpí JL, Torrents D, Orozco M, Pisano DG, Valencia A, Guigó R, Bayés M, Heath S, Gut M, Klatt P, Marshall J, Raine K, Stebbings LA, Futreal PA, Stratton MR, Campbell PJ, Gut I, López-Guillermo A, Estivill X, Montserrat E, López-Otín C, and Campo E
- Subjects
- Amino Acid Sequence, Animals, Carrier Proteins genetics, DNA Mutational Analysis, Humans, Karyopherins genetics, Molecular Sequence Data, Myeloid Differentiation Factor 88 chemistry, Myeloid Differentiation Factor 88 genetics, Receptor, Notch1 genetics, Receptors, Cytoplasmic and Nuclear genetics, Reproducibility of Results, Exportin 1 Protein, Genome, Human genetics, Leukemia, Lymphocytic, Chronic, B-Cell genetics, Mutation genetics
- Abstract
Chronic lymphocytic leukaemia (CLL), the most frequent leukaemia in adults in Western countries, is a heterogeneous disease with variable clinical presentation and evolution. Two major molecular subtypes can be distinguished, characterized respectively by a high or low number of somatic hypermutations in the variable region of immunoglobulin genes. The molecular changes leading to the pathogenesis of the disease are still poorly understood. Here we performed whole-genome sequencing of four cases of CLL and identified 46 somatic mutations that potentially affect gene function. Further analysis of these mutations in 363 patients with CLL identified four genes that are recurrently mutated: notch 1 (NOTCH1), exportin 1 (XPO1), myeloid differentiation primary response gene 88 (MYD88) and kelch-like 6 (KLHL6). Mutations in MYD88 and KLHL6 are predominant in cases of CLL with mutated immunoglobulin genes, whereas NOTCH1 and XPO1 mutations are mainly detected in patients with unmutated immunoglobulins. The patterns of somatic mutation, supported by functional and clinical analyses, strongly indicate that the recurrent NOTCH1, MYD88 and XPO1 mutations are oncogenic changes that contribute to the clinical evolution of the disease. To our knowledge, this is the first comprehensive analysis of CLL combining whole-genome sequencing with clinical characteristics and clinical outcomes. It highlights the usefulness of this approach for the identification of clinically relevant mutations in cancer., (©2011 Macmillan Publishers Limited. All rights reserved)
- Published
- 2011
- Full Text
- View/download PDF
47. High-throughput identification of genetic markers using representational oligonucleotide microarray analysis.
- Author
-
Lange C, Mittermayr L, Dohm JC, Holtgräwe D, Weisshaar B, and Himmelbauer H
- Subjects
- Beta vulgaris growth & development, Chromosome Mapping, Genetic Linkage, Molecular Sequence Data, Beta vulgaris genetics, Biomarkers metabolism, Gene Expression Profiling, Genetic Markers genetics, Oligonucleotide Array Sequence Analysis
- Abstract
We describe a novel approach for high-throughput development of genetic markers using representational oligonucleotide microarray analysis. We test the performance of the method in sugar beet (Beta vulgaris L.) as a model for crop plants with little sequence information available. Genomic representations of both parents of a mapping population were hybridized on microarrays containing in total 146,554 custom made oligonucleotides based on sugar beet bacterial artificial chromosome (BAC) end sequences and expressed sequence tags (ESTs). Oligonucleotides showing a signal with one parental line only, were selected as potential marker candidates and placed onto an array, designed for genotyping of 184 F(2) individuals from the mapping population. Utilizing known co-dominant anchor markers we obtained 511 new dominant markers (392 derived from BAC end sequences, and 119 from ESTs) distributed over all nine sugar beet linkage groups and calculated genetic maps. Further improvements for large-scale application of the approach are discussed and its feasibility for the cost-effective and flexible generation of genetic markers is presented.
- Published
- 2010
- Full Text
- View/download PDF
48. Strand-specific deep sequencing of the transcriptome.
- Author
-
Vivancos AP, Güell M, Dohm JC, Serrano L, and Himmelbauer H
- Subjects
- Algorithms, Animals, Bacterial Proteins genetics, Cytoskeletal Proteins genetics, DNA, Bacterial analysis, DNA, Complementary analysis, Genetic Association Studies methods, Mice, Models, Biological, Multigene Family genetics, Oligonucleotide Array Sequence Analysis methods, Pneumonia, Mycoplasma genetics, Sequence Analysis, DNA methods, Substrate Specificity genetics, DNA, Single-Stranded analysis, Gene Expression Profiling methods, Polymerase Chain Reaction methods
- Abstract
Several studies support that antisense-mediated regulation may affect a large proportion of genes. Using the Illumina next-generation sequencing platform, we developed DSSS (direct strand specific sequencing), a strand-specific protocol for transcriptome sequencing. We tested DSSS with RNA from two samples, prokaryotic (Mycoplasma pneumoniae) as well as eukaryotic (Mus musculus), and obtained data containing strand-specific information, using single-read and paired-end sequencing. We validated our results by comparison with a strand-specific tiling array data set for strain M129 of the simple prokaryote M. pneumoniae, and by quantitative PCR (qPCR). The results of DSSS were very well supported by the results from tiling arrays and qPCR. Moreover, DSSS provided higher dynamic range and single-base resolution, thus enabling efficient antisense detection and the precise mapping of transcription start sites and untranslated regions. DSSS data for mouse confirmed strand specificity of the protocol and the general applicability of the approach to studying eukaryotic transcription. We propose DSSS as a simple and efficient strategy for strand-specific transcriptome sequencing and as a tool for genome annotation exploiting the increased read lengths that next-generation sequencing technology now is capable to deliver.
- Published
- 2010
- Full Text
- View/download PDF
49. Haplotype divergence in Beta vulgaris and microsynteny with sequenced plant genomes.
- Author
-
Dohm JC, Lange C, Reinhardt R, and Himmelbauer H
- Subjects
- Base Composition, Chromosomes, Artificial, Bacterial, Chromosomes, Plant, DNA, Plant genetics, Gene Library, Gene Order, Genes, Plant, Genotype, Polymorphism, Genetic, Sequence Alignment, Sequence Analysis, DNA, Beta vulgaris genetics, Genome, Plant, Haplotypes, Synteny
- Abstract
We characterized two overlapping sugar beet (Beta vulgaris) bacterial artificial chromosome (BAC) clones representing different haplotypes. A total of 254 kbp of the genomic sequence was determined, of which the two BACs share 92 kbp. Eleven of 15 genes discovered in the sequenced interval locate to the overlap region. The haplotypes differ in exons by 1% (nucleotide level) and in non-coding regions by 9% (6% mismatches, 3% gaps; alignable regions only). Large indels or high sequence divergence comprised 11% of either sequence. Of such indels, 68 and 45%, respectively, could be attributed to haplotype-specific integration of transposable elements. We identified novel repeat candidates by comparing the two BAC sequences to a set of genomic sugar beet sequences. Synteny was found with Arabidopsis chromosome 1 (At1), At2 and At4, Medicago chromosome 7, Vitis chromosome 15 and paralogous regions on poplar chromosomes II and XIV.
- Published
- 2009
- Full Text
- View/download PDF
50. Substantial biases in ultra-short read data sets from high-throughput DNA sequencing.
- Author
-
Dohm JC, Lottaz C, Borodina T, and Himmelbauer H
- Subjects
- Base Composition, Beta vulgaris genetics, DNA chemistry, Helicobacter genetics, Sequence Deletion, Software, Sequence Analysis, DNA standards
- Abstract
Novel sequencing technologies permit the rapid production of large sequence data sets. These technologies are likely to revolutionize genetics and biomedical research, but a thorough characterization of the ultra-short read output is necessary. We generated and analyzed two Illumina 1G ultra-short read data sets, i.e. 2.8 million 27mer reads from a Beta vulgaris genomic clone and 12.3 million 36mers from the Helicobacter acinonychis genome. We found that error rates range from 0.3% at the beginning of reads to 3.8% at the end of reads. Wrong base calls are frequently preceded by base G. Base substitution error frequencies vary by 10- to 11-fold, with A > C transversion being among the most frequent and C > G transversions among the least frequent substitution errors. Insertions and deletions of single bases occur at very low rates. When simulating re-sequencing we found a 20-fold sequencing coverage to be sufficient to compensate errors by correct reads. The read coverage of the sequenced regions is biased; the highest read density was found in intervals with elevated GC content. High Solexa quality scores are over-optimistic and low scores underestimate the data quality. Our results show different types of biases and ways to detect them. Such biases have implications on the use and interpretation of Solexa data, for de novo sequencing, re-sequencing, the identification of single nucleotide polymorphisms and DNA methylation sites, as well as for transcriptome analysis.
- Published
- 2008
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.