44 results on '"Datema E"'
Search Results
2. Two-tiered SARS-CoV-2 seroconversion screening in the Netherlands and stability of nucleocapsid, spike protein domain 1 and neutralizing antibodies
- Author
-
Garritsen, Anja, primary, Scholzen, Anja, additional, van den Nieuwenhof, Daan W.A., additional, Smits, Anke P.F., additional, Datema, E. Suzan, additional, van Galen, Luc S., additional, and Kouwijzer, Milou L.C.E., additional
- Published
- 2020
- Full Text
- View/download PDF
3. Computational pan-genomics: Status, promises and challenges
- Author
-
Marschall, T. (Tanja), Marz, M. (Manja), Abeel, T. (Thomas), Dijkstra, L. (Louis), Dutilh, B.E. (Bas), Ghaffaari, A. (Ali), Kersey, P. (Paul), Kloosterman, W.P. (Wigard), Mäkinen, V. (Veli), Novak, A.M. (Adam M.), Paten, B. (Benedict), Porubsky, D. (David), Rivals, E. (Eric), Alkan, C. (Can), Baaijens, J.A. (Jasmijn A.), Bakker, P.I.W. (Paul) de, Boeva, V. (Valentina), Bonnal, R.J.P. (Raoul J.P.), Chiaromonte, F. (Francesca), Chikhi, R. (Rayan), Ciccarelli, F.D. (Francesca D.), Cijvat, R. (Robin), Datema, E. (Erwin), Duijn, C.M. (Cornelia) van, Eichler, E.E. (Evan), Ernst, C. (Corinna), Eskin, E. (E.), Garrison, E. (Erik), El-Kebir, M. (Mohammed), Klau, G.W. (Gunnar W.), Korbel, J.O. (Jan), Lameijer, E.-W. (Eric-Wubbo), Langmead, B. (Benjamin), Martin, M. (Marcel), Medvedev, P. (Paul), Mu, J.C. (John C.), Neerincx, P.B.T. (Pieter B T), Ouwens, K. (Klaasjan), Peterlongo, P. (Pierre), Pisanti, N. (Nadia), Rahmann, S. (S.), Raphael, B.J. (Benjamin J.), Reinert, K. (Knut), Ridder, D. (Dick) de, de Ridder, J. (Jeroen), Schlesner, M. (Matthias), Schulz-Trieglaff, O. (Ole), Sanders, A.D. (Ashley D.), Sheikhizadeh, S. (Siavash), Shneider, C. (Carl), Smit, S. (Sandra), Valenzuela, D. (Daniel), Wang, J. (Jiayin), Wessels, L. (Lodewyk), Zhang, Y. (Ying), Guryev, V. (Victor), Vandin, F. (Fabio), Ye, K. (Kai), Schönhuth, A. (Alexander), Marschall, T. (Tanja), Marz, M. (Manja), Abeel, T. (Thomas), Dijkstra, L. (Louis), Dutilh, B.E. (Bas), Ghaffaari, A. (Ali), Kersey, P. (Paul), Kloosterman, W.P. (Wigard), Mäkinen, V. (Veli), Novak, A.M. (Adam M.), Paten, B. (Benedict), Porubsky, D. (David), Rivals, E. (Eric), Alkan, C. (Can), Baaijens, J.A. (Jasmijn A.), Bakker, P.I.W. (Paul) de, Boeva, V. (Valentina), Bonnal, R.J.P. (Raoul J.P.), Chiaromonte, F. (Francesca), Chikhi, R. (Rayan), Ciccarelli, F.D. (Francesca D.), Cijvat, R. (Robin), Datema, E. (Erwin), Duijn, C.M. (Cornelia) van, Eichler, E.E. (Evan), Ernst, C. (Corinna), Eskin, E. (E.), Garrison, E. (Erik), El-Kebir, M. (Mohammed), Klau, G.W. (Gunnar W.), Korbel, J.O. (Jan), Lameijer, E.-W. (Eric-Wubbo), Langmead, B. (Benjamin), Martin, M. (Marcel), Medvedev, P. (Paul), Mu, J.C. (John C.), Neerincx, P.B.T. (Pieter B T), Ouwens, K. (Klaasjan), Peterlongo, P. (Pierre), Pisanti, N. (Nadia), Rahmann, S. (S.), Raphael, B.J. (Benjamin J.), Reinert, K. (Knut), Ridder, D. (Dick) de, de Ridder, J. (Jeroen), Schlesner, M. (Matthias), Schulz-Trieglaff, O. (Ole), Sanders, A.D. (Ashley D.), Sheikhizadeh, S. (Siavash), Shneider, C. (Carl), Smit, S. (Sandra), Valenzuela, D. (Daniel), Wang, J. (Jiayin), Wessels, L. (Lodewyk), Zhang, Y. (Ying), Guryev, V. (Victor), Vandin, F. (Fabio), Ye, K. (Kai), and Schönhuth, A. (Alexander)
- Abstract
Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different Computational methods and paradigms are needed.We will witness the rapid extension of Computational pan-genomics, a new sub-area of research in Computational biology. In this article, we generalize existing definitions and understand a pangenome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a Computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations a
- Published
- 2018
- Full Text
- View/download PDF
4. Computational pan-genomics: status, promises and challenges.
- Author
-
Marschall, T., Marz, M., Abeel, T., Dijkstra, L., Dutilh, B.E., Ghaffaari, A., Kersey, P., Kloosterman, W.P., Mäkinen, V., Novak, A.M., Paten, B., Porubsky, D., Rivals, E., Alkan, C., Baaijens, J., Bakker, P.I. de, Boeva, V., Bonnal, R.J., Chiaromonte, F., Chikhi, R., Ciccarelli, F.D., Cijvat, R., Datema, E., Duijn, C.M. van, Eichler, E.E., Ernst, C., Eskin, E., Garrison, E., El-Kebir, M., Klau, G.W., Korbel, J.O., Lameijer, E.W., Langmead, B., Martin, M., Medvedev, P., Mu, J.C., Neerincx, P., Ouwens, K., Peterlongo, P., Pisanti, N., Rahmann, S., Raphael, B., Reinert, K., Ridder, D. de, Ridder, J. de, Schlesner, M., Schulz-Trieglaff, O., Sanders, A.D., Sheikhizadeh, S., Shneider, C., Smit, S., Valenzuela, D., Wang, J, Wessels, L., Zhang, Y, Guryev, V., Vandin, F., Ye, K., Schönhuth, A., Marschall, T., Marz, M., Abeel, T., Dijkstra, L., Dutilh, B.E., Ghaffaari, A., Kersey, P., Kloosterman, W.P., Mäkinen, V., Novak, A.M., Paten, B., Porubsky, D., Rivals, E., Alkan, C., Baaijens, J., Bakker, P.I. de, Boeva, V., Bonnal, R.J., Chiaromonte, F., Chikhi, R., Ciccarelli, F.D., Cijvat, R., Datema, E., Duijn, C.M. van, Eichler, E.E., Ernst, C., Eskin, E., Garrison, E., El-Kebir, M., Klau, G.W., Korbel, J.O., Lameijer, E.W., Langmead, B., Martin, M., Medvedev, P., Mu, J.C., Neerincx, P., Ouwens, K., Peterlongo, P., Pisanti, N., Rahmann, S., Raphael, B., Reinert, K., Ridder, D. de, Ridder, J. de, Schlesner, M., Schulz-Trieglaff, O., Sanders, A.D., Sheikhizadeh, S., Shneider, C., Smit, S., Valenzuela, D., Wang, J, Wessels, L., Zhang, Y, Guryev, V., Vandin, F., Ye, K., and Schönhuth, A.
- Abstract
Contains fulltext : 190288.pdf (publisher's version ) (Open Access)
- Published
- 2018
5. Computational pan-genomics: status, promises and challenges
- Author
-
The Computational Pan-Genomics Consortium, Marschall, T. (Tobias), Marz, M. (Manja), Abeel, T. (Thomas), Dijkstra, L.J. (Louis), Dutilh, B.E. (Bas), Ghaffaari, A. (Ali), Kersey, P. (Paul), Kloosterman, W.P. (Wigard), Mäkinen, V. (Veli), Novak, A.M. (Adam), Paten, B. (Benedict), Porubsky, D. (David), Rivals, E. (Eric), Alkan, C. (Can), Baaijens, J.A. (Jasmijn), Bakker, P.I.W. (Paul) de, Boeva, V. (Valentina), Bonnal, R.J.P. (Raoul), Chiaromonte, F. (Francesca), Chikhi, R. (Rayan), Ciccarelli, F.D. (Francesca), Cijvat, C.P. (Robin), Datema, E. (Erwin), Duijn, C.M. (Cornelia) van, Eichler, E.E. (Evan), Ernst, C. (Corinna), Eskin, E. (Eleazar), Garrison, E. (Erik), El-Kebir, M. (Mohammed), Klau, G.W. (Gunnar), Korbel, J.O. (Jan), Lameijer, E.-W. (Eric-Wubbo), Langmead, B. (Benjamin), Martin, M. (Marcel), Medvedev, P. (Paul), Mu, J.C. (John), Neerincx, P.B.T. (Pieter), Ouwens, K. (Klaasjan), Peterlongo, P. (Pierre), Pisanti, N. (Nadia), Rahmann, S. (Sven), Raphael, B.J. (Benjamin), Reinert, K. (Knut), Ridder, D. (Dick) de, Ridder, J. (Jeroen) de, Schlesner, M. (Matthias), Schulz-Trieglaff, O. (Ole), Sanders, A.D. (Ashley), Sheikhizadeh, S. (Siavash), Shneider, C. (Carl), Smit, S. (Sandra), Valenzuela, D. (Daniel), Wang, J. (Jiayin), Wessels, L.F.A. (Lodewyk), Zhang, Y. (Ying), Guryev, V. (Victor), Vandin, F. (Fabio), Ye, K. (Kai), Schönhuth, A. (Alexander), The Computational Pan-Genomics Consortium, Marschall, T. (Tobias), Marz, M. (Manja), Abeel, T. (Thomas), Dijkstra, L.J. (Louis), Dutilh, B.E. (Bas), Ghaffaari, A. (Ali), Kersey, P. (Paul), Kloosterman, W.P. (Wigard), Mäkinen, V. (Veli), Novak, A.M. (Adam), Paten, B. (Benedict), Porubsky, D. (David), Rivals, E. (Eric), Alkan, C. (Can), Baaijens, J.A. (Jasmijn), Bakker, P.I.W. (Paul) de, Boeva, V. (Valentina), Bonnal, R.J.P. (Raoul), Chiaromonte, F. (Francesca), Chikhi, R. (Rayan), Ciccarelli, F.D. (Francesca), Cijvat, C.P. (Robin), Datema, E. (Erwin), Duijn, C.M. (Cornelia) van, Eichler, E.E. (Evan), Ernst, C. (Corinna), Eskin, E. (Eleazar), Garrison, E. (Erik), El-Kebir, M. (Mohammed), Klau, G.W. (Gunnar), Korbel, J.O. (Jan), Lameijer, E.-W. (Eric-Wubbo), Langmead, B. (Benjamin), Martin, M. (Marcel), Medvedev, P. (Paul), Mu, J.C. (John), Neerincx, P.B.T. (Pieter), Ouwens, K. (Klaasjan), Peterlongo, P. (Pierre), Pisanti, N. (Nadia), Rahmann, S. (Sven), Raphael, B.J. (Benjamin), Reinert, K. (Knut), Ridder, D. (Dick) de, Ridder, J. (Jeroen) de, Schlesner, M. (Matthias), Schulz-Trieglaff, O. (Ole), Sanders, A.D. (Ashley), Sheikhizadeh, S. (Siavash), Shneider, C. (Carl), Smit, S. (Sandra), Valenzuela, D. (Daniel), Wang, J. (Jiayin), Wessels, L.F.A. (Lodewyk), Zhang, Y. (Ying), Guryev, V. (Victor), Vandin, F. (Fabio), Ye, K. (Kai), and Schönhuth, A. (Alexander)
- Abstract
Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations as graphs. We outline how this and other challenges from different application domains translate into common computational problems, point out relevant bioinformatics techniques and identify open problems in computer science. With this review, we aim to increase awareness that a joint approach to computational pan-genomics can help address many of the problems currently faced in various domains.
- Published
- 2018
- Full Text
- View/download PDF
6. Computational pan-genomics: status, promises and challenges
- Author
-
Marschall, T, Marz, M, Abeel, T, Dijkstra, L, Dutilh, BE, Ghaffaari, A, Kersey, P, Kloosterman, WP, Makinen, V, Novak, AM, Paten, B, Porubsky, D, Rivals, E, Alkan, C, Baaijens, J A, de Bakker, PIW, Boeva, V, Bonnal, RJP, Chiaromonte, F, Chikhi, R, Ciccarelli, FD, Cijvat, R, Datema, E, Duijn, Cornelia, Eichler, EE, Ernst, C, Eskin, E, Garrison, E, El-Kebir, M, Klau, GW, Korbel, JO, Lameijer, EW, Langmead, B, Martin, M, Medvedev, P, Mu, JC, Neerincx, P, Ouwens, K, Peterlongo, P, Pisanti, N, Rahmann, S, Raphael, B, Reinert, K, Ridder, D, Ridder, J (Jannemarie), Schlesner, M, Schulz-Trieglaff, O, Sanders, AD, Sheikhizadeh, S, Shneider, C, Smit, S, Valenzuela, D, Wang, JY, Wessels, L, Zhang, Y, Guryev, V, Vandin, F, Ye, K, Schonhuth, A, Marschall, T, Marz, M, Abeel, T, Dijkstra, L, Dutilh, BE, Ghaffaari, A, Kersey, P, Kloosterman, WP, Makinen, V, Novak, AM, Paten, B, Porubsky, D, Rivals, E, Alkan, C, Baaijens, J A, de Bakker, PIW, Boeva, V, Bonnal, RJP, Chiaromonte, F, Chikhi, R, Ciccarelli, FD, Cijvat, R, Datema, E, Duijn, Cornelia, Eichler, EE, Ernst, C, Eskin, E, Garrison, E, El-Kebir, M, Klau, GW, Korbel, JO, Lameijer, EW, Langmead, B, Martin, M, Medvedev, P, Mu, JC, Neerincx, P, Ouwens, K, Peterlongo, P, Pisanti, N, Rahmann, S, Raphael, B, Reinert, K, Ridder, D, Ridder, J (Jannemarie), Schlesner, M, Schulz-Trieglaff, O, Sanders, AD, Sheikhizadeh, S, Shneider, C, Smit, S, Valenzuela, D, Wang, JY, Wessels, L, Zhang, Y, Guryev, V, Vandin, F, Ye, K, and Schonhuth, A
- Published
- 2018
7. The megabase-sized fungal genome of Rhizoctonia solani assembled from nanopore reads only
- Author
-
Blommers L, Datema E, Wittenberg Ah, Hulzink Rj, de Vos M, Espejo Valle-Inclan J, and van Orsouw N
- Subjects
Genetics ,Sequence assembly ,food and beverages ,Genomics ,Biology ,biology.organism_classification ,Genome ,Rhizoctonia solani ,Nanopore ,chemistry.chemical_compound ,chemistry ,Fungal genome ,Length distribution ,DNA - Abstract
The ability to quickly obtain accurate genome sequences of eukaryotic pathogens at low costs provides a tremendous opportunity to identify novel targets for therapeutics, develop pesticides with increased target specificity and breed for resistance in food crops. Here, we present the first report of the ~54 MB eukaryotic genome sequence of Rhizoctonia solani, an important pathogenic fungal species of maize, using nanopore technology. Moreover, we show that optimizing the strategy for wet-lab procedures aimed to isolate high quality and ultra-pure high molecular weight (HMW) DNA results in increased read length distribution and thereby allowing generation of the most contiguous genome assembly for R. solani to date. We further determined sequencing accuracy and compared the assembly to short-read technologies. With the current sequencing technology and bioinformatics tool set, we are able to deliver an eukaryotic fungal genome at low cost within a week. With further improvements of the sequencing technology and increased throughput of the PromethION sequencer we aim to generate near-finished assemblies of large and repetitive plant genomes and cost-efficiently perform de novo sequencing of large collections of microbial pathogens and the microbial communities that surround our crops.
- Published
- 2016
- Full Text
- View/download PDF
8. Correction: The Genomes of the Fungal Plant Pathogens Cladosporium fulvum and Dothistroma septosporum Reveal Adaptation to Different Hosts and Lifestyles But Also Signatures of Common Ancestry [PLoS Genet, 11(12), (2015)]
- Author
-
de Wit, PJGM, van der Burgt, A, Ökmen, B, Stergiopoulos, I, Abd-Elsalam, KA, Aerts, AL, Bahkali, AH, Beenen, HG, Chettri, P, Cox, MP, Datema, E, de Vries, RP, Dhillon, B, Ganley, AR, Griffiths, SA, Guo, Y, Hamelin, RC, Henrissat, B, Shahjahan Kabir, M, Karimi Jashni, M, Kema, G, Klaubauf, S, Lapidus, A, Levasseur, A, Lindquist, E, Mehrabi, R, Ohm, RA, Owen, TJ, Salamov, A, Schwelm, A, Schijlen, E, Sun, H, van den Burg, HA, van Ham, RCHJ, Zhang, S, Goodwin, SB, Grigoriev, IV, Collemare, J, and Bradshaw, RE
- Published
- 2015
9. Collembolan transcriptomes highlight molecular evolution of hexapods and provide clues on the adaptation to terrestrial life
- Author
-
Faddeeva, A., Studer, R.A., Kraaijeveld, K., Sie, D., Ylstra, B., Marien, J., Camp, H.J.M. op den, Datema, E., Dunnen, J.T. den, Straalen, N.M. van, Roelofs, D., Faddeeva, A., Studer, R.A., Kraaijeveld, K., Sie, D., Ylstra, B., Marien, J., Camp, H.J.M. op den, Datema, E., Dunnen, J.T. den, Straalen, N.M. van, and Roelofs, D.
- Abstract
Contains fulltext : 144487.pdf (publisher's version ) (Open Access)
- Published
- 2015
10. Homologues of potato chromosome 5 show variable collinearity in the euchromatin, but dramatic absence of synteny in the pericentromeric heterochromatin
- Author
-
de Boer, J.M., Datema, E., Tang, X., Borm, T.J.A., Bakker, E.H., van Eck, H.J., van Ham, R.C.H.J., de Jong, J.H.S.G.M., Visser, R.G.F., Bachem, C.W.B., de Boer, J.M., Datema, E., Tang, X., Borm, T.J.A., Bakker, E.H., van Eck, H.J., van Ham, R.C.H.J., de Jong, J.H.S.G.M., Visser, R.G.F., and Bachem, C.W.B.
- Abstract
Background In flowering plants it has been shown that de novo genome assemblies of different species and genera show a significant drop in the proportion of alignable sequence. Within a plant species, however, it is assumed that different haplotypes of the same chromosome align well. In this paper we have compared three de novo assemblies of potato chromosome 5 and report on the sequence variation and the proportion of sequence that can be aligned. Results For the diploid potato clone RH89-039-16 (RH) we produced two linkage phase controlled and haplotype-specific assemblies of chromosome 5 based on BAC-by-BAC sequencing, which were aligned to each other and compared to the 52 Mb chromosome 5 reference sequence of the doubled monoploid clone DM 1–3 516 R44 (DM). We identified 17.0 Mb of non-redundant sequence scaffolds derived from euchromatic regions of RH and 38.4 Mb from the pericentromeric heterochromatin. For 32.7 Mb of the RH sequences the correct position and order on chromosome 5 was determined, using genetic markers, fluorescence in situ hybridisation and alignment to the DM reference genome. This ordered fraction of the RH sequences is situated in the euchromatic arms and in the heterochromatin borders. In the euchromatic regions, the sequence collinearity between the three chromosomal homologs is good, but interruption of collinearity occurs at nine gene clusters. Towards and into the heterochromatin borders, absence of collinearity due to structural variation was more extensive and was caused by hemizygous and poorly aligning regions of up to 450 kb in length. In the most central heterochromatin, a total of 22.7 Mb sequence from both RH haplotypes remained unordered. These RH sequences have very few syntenic regions and represent a non-alignable region between the RH and DM heterochromatin haplotypes of chromosome 5. Conclusions Our results show that among homologous potato chromosomes large regions are present with dramatic loss of sequence collinearity
- Published
- 2015
11. Single-molecule real-time sequencing combined with optical mapping yields completely finished fungal genome
- Author
-
Faino, L., Seidl, M.F., Datema, E., van den Berg, G.C.M., Janssen, A., Wittenberg, A.H.J., Thomma, B., Faino, L., Seidl, M.F., Datema, E., van den Berg, G.C.M., Janssen, A., Wittenberg, A.H.J., and Thomma, B.
- Abstract
Next-generation sequencing (NGS) technologies have increased the scalability, speed, and resolution of genomic sequencing and, thus, have revolutionized genomic studies. However, eukaryotic genome sequencing initiatives typically yield considerably fragmented genome assemblies. Here, we assessed various state-of-the-art sequencing and assembly strategies in order to produce a contiguous and complete eukaryotic genome assembly, focusing on the filamentous fungus Verticillium dahliae. Compared with Illumina-based assemblies of the V. dahliae genome, hybrid assemblies that also include PacBio-generated long reads establish superior contiguity. Intriguingly, provided that sufficient sequence depth is reached, assemblies solely based on PacBio reads outperform hybrid assemblies and even result in fully assembled chromosomes. Furthermore, the addition of optical map data allowed us to produce a gapless and complete V. dahliae genome assembly of the expected eight chromosomes from telomere to telomere. Consequently, we can now study genomic regions that were previously not assembled or poorly assembled, including regions that are populated by repetitive sequences, such as transposons, allowing us to fully appreciate an organism’s biological complexity. Our data show that a combination of PacBio-generated long reads and optical mapping can be used to generate complete and gapless assemblies of fungal genomes. IMPORTANCE Studying whole-genome sequences has become an important aspect of biological research. The advent of next-generation sequencing (NGS) technologies has nowadays brought genomic science within reach of most research laboratories, including those that study nonmodel organisms. However, most genome sequencing initiatives typically yield (highly) fragmented genome assemblies. Nevertheless, considerable relevant information related to genome structure and evolution is likely hidden in those nonassembled regions. Here, we investigated a diverse set of strategies to
- Published
- 2015
12. Genome bioinformatics of tomato and potato
- Author
-
Datema, E., Wageningen University, W. Stiekema, and Roeland van Ham
- Subjects
genomen ,genomica ,Bioinformatics ,gewassen ,food and beverages ,bioinformatics ,crops ,nucleotidenvolgordes ,BIOS Applied Bioinformatics ,solanum tuberosum ,solanum lycopersicum ,Bioinformatica ,genomics ,genen ,nucleotide sequences ,EPS ,bio-informatica ,genes ,genomes - Abstract
In the past two decades genome sequencing has developed from a laborious and costly technology employed by large international consortia to a widely used, automated and affordable tool used worldwide by many individual research groups. Genome sequences of many food animals and crop plants have been deciphered and are being exploited for fundamental research and applied to improve their breeding programs. The developments in sequencing technologies have also impacted the associated bioinformatics strategies and tools, both those that are required for data processing, management, and quality control, and those used for interpretation of the data. This thesis focuses on the application of genome sequencing, assembly and annotation to two members of the Solanaceae family, tomato and potato. Potato is the economically most important species within the Solanaceae, and its tubers contribute to dietary intake of starch, protein, antioxidants, and vitamins. Tomato fruits are the second most consumed vegetable after potato, and are a globally important dietary source of lycopene, beta-carotene, vitamin C, and fiber. The chapters in this thesis document the generation, exploitation and interpretation of genomic sequence resources for these two species and shed light on the contents, structure and evolution of their genomes. Chapter 1introduces the concepts of genome sequencing, assembly and annotation, and explains the novel genome sequencing technologies that have been developed in the past decade. These so-called Next Generation Sequencing platforms display considerable variation in chemistry and workflow, and as a consequence the throughput and data quality differs by orders of magnitude between the platforms. The currently available sequencing platforms produce a vast variety of read lengths and facilitate the generation of paired sequences with an approximately fixed distance between them. The choice of sequencing chemistry and platform combined with the type of sequencing template demands specifically adapted bioinformatics for data processing and interpretation. Irrespective of the sequencing and assembly strategy that is chosen, the resulting genome sequence, often represented by a collection of long linear strings of nucleotides, is of limited interest by itself. Interpretation of the genome can only be achieved through sequence annotation – that is, identification and classification of all functional elements in a genome sequence. Once these elements have been annotated, sequence alignments between multiple genomes of related accessions or species can be utilized to reveal the genetic variation on both the nucleotide and the structural level that underlies the difference between these species or accessions. Chapter 2describes BlastIf, a novel software tool that exploits sequence similarity searches with BLAST to provide a straightforward annotation of long nucleotide sequences. Generally, two problems are associated with the alignment of a long nucleotide sequence to a database of short gene or protein sequences: (i) the large number of similar hits that can be generated due to database redundancy; and (ii) the relationships implied between aligned segments within a hit that in fact correspond to distinct elements on the sequence such as genes. BlastIf generates a comprehensible BLAST output for long nucleotide sequences by reducing the number of similar hits while revealing most of the variation present between hits. It is a valuable tool for molecular biologists who wish to get a quick overview of the genetic elements present in a newly sequenced segment of DNA, prior to more elaborate efforts of gene structure prediction and annotation. In Chapter 3 a first genome-wide comparison between the emerging genomic sequence resources of tomato and potato is presented. Large collections of BAC end sequences from both species were annotated through repeat searches, transcript alignments and protein domain identification. In-depth comparisons of the annotated sequences revealed remarkable differences in both gene and repeat content between these closely related genomes. The tomato genome was found to be more repetitive than the potato genome, and substantial differences in the distribution of Gypsy and Copia retrotransposable elements as well as microsatellites were observed between the two genomes. A higher gene content was identified in the potato sequences, and in particular several large gene families including cytochrome P450 mono-oxygenases and serine-threonine protein kinases were significantly overrepresented in potato compared to tomato. Moreover, the cytochrome P450 gene family was found to be expanded in both tomato and potato when compared to Arabidopsis thaliana, suggesting an expanded network of secondary metabolic pathways in the Solanaceae. Together these findings present a first glimpse into the evolution of Solanaceous genomes, both within the family and relative to other plant species. Chapter 4explores the physical and genetic organization of tomato chromosome 6 through integration of BAC sequence analysis, High Information Content Fingerprinting, genetic analysis, and BAC-FISH mapping data. A collection of BACs spanning substantial parts of the short and long arm euchromatin and several dispersed regions of the pericentrometric heterochromatin were sequenced and assembled into several tiling paths spanning approximately 11 Mb. Overall, the cytogenetic order of BACs was in agreement with the order of BACs anchored to the Tomato EXPEN 2000 genetic map, although a few striking discrepancies were observed. The integration of BAC-FISH, sequence and genetic mapping data furthermore provided a clear picture of the borders between eu- and heterochromatin on chromosome 6. Annotation of the BAC sequences revealed that, although the majority of protein-coding genes were located in the euchromatin, the highly repetitive pericentromeric heterochromatin displayed an unexpectedly high gene content. Moreover, the short arm euchromatin was relatively rich in repeats, but the ratio of Gypsy and Copia retrotransposons across the different domains of the chromosome clearly distinguished euchromatin from heterochromatin. The ongoing whole-genome sequencing effort will reveal if these properties are unique for tomato chromosome 6, or a more general property of the tomato genome. Chapter 5presents the potato genome, the first genome sequence of an Asterid. To overcome the problems associated with genome assembly due tothe high level of heterozygosity that is observed in commercial tetraploid potato varieties, a homozygous doubled-monoploid potato clone was exploited to sequence and assemble 86% of the 844 Mb genome. This potato reference genome sequence was complemented with re-sequencing of aheterozygous diploid clone, revealing the form and extent of sequence polymorphism both between different genotypes and within a single heterozygous genotype. Gene presence/absence variants and other potentially deleterious mutations were found to occur frequently in potato and are a likely cause of inbreeding depression. Annotation of the genome was supported by deep transcriptome sequencing of both the doubled-monoploid and the heterozygous potato, resulting in the prediction of more than 39,000 protein coding genes. Transcriptome analysis provided evidence for the contribution of gene family expansion, tissue specific expression, and recruitment of genes to new pathways to the evolution of tuber development. The sequence of the potato genome has provided new insights into Eudicot genome evolution and has provided a solid basis for the elucidation of the evolution of tuberisation. Many traits of interest to plant breeders are quantitative in nature and the potato sequence will simplify both their characterization and deployment to generate novel cultivars. The outstanding challenges in plant genome sequencing are addressed in Chapter 6. The high concentration of repetitive elements and the heterozygosity and polyploidy of many interesting crop plant species currently pose a barrier for the efficient reconstruction of their genome sequences. Nonetheless, the completion of a large number of new genome sequences in recent years and the ongoing advances in sequencing technology provide many excitingopportunities for plant breeding and genome research. Current sequencing platforms are being continuously updated and improved, and novel technologies are being developed and implemented in third-generation sequencing platforms that sequence individual molecules without need for amplification. While these technologies create exciting opportunities for new sequencing applications, they also require robust software tools to process the data produced through them efficiently. The ever increasing amount of available genome sequences creates the need for an intuitive platform for the automated and reproducible interrogation of these data in order to formulate new biologically relevant questions on datasets spanning hundreds or thousands of genome sequences.
- Published
- 2011
13. Collembolan Transcriptomes Highlight Molecular Evolution of Hexapods and Provide Clues on the Adaptation to Terrestrial Life
- Author
-
Faddeeva, A., primary, Studer, R. A., additional, Kraaijeveld, K., additional, Sie, D., additional, Ylstra, B., additional, Mariën, J., additional, op den Camp, H. J. M., additional, Datema, E., additional, den Dunnen, J. T., additional, van Straalen, N. M., additional, and Roelofs, D., additional
- Published
- 2015
- Full Text
- View/download PDF
14. Cross-species BAC-FISH painting of the tomato and potato chromosome 6 reveals undescribed chromosomal rearrangements
- Author
-
Tang, X., Szinay, D., Ramanna, M.S., van der Vossen, E.A.G., Datema, E., Klein Lankhorst, R.M., de Boer, J.M., Peters, S.A., Bachem, C.W.B., Stiekema, W.J., Visser, R.G.F., de Jong, J.H., and Bai, Y.
- Subjects
fish ,lycopersicon-peruvianum ,EPS-2 ,Bioinformatics ,meiotic pachytene ,solanum ,fungi ,nematode-resistance gene ,food and beverages ,arabidopsis-thaliana ,molecular linkage maps ,Laboratorium voor Erfelijkheidsleer ,PRI Bioscience ,PRI Biodiversity and Breeding ,Plant Breeding ,Laboratorium voor Plantenveredeling ,pachytene chromosomes ,PRI Biodiversiteit en Veredeling ,short arm ,Bioinformatica ,Laboratory of Genetics ,genome - Abstract
Ongoing genomics projects of tomato (Solanum lycopersicum ) and potato (Solanum tuberosum) are providing unique tools for comparative mapping studies in Solanaceae. At the chromosomal level, BACs can be positioned on pachytene comple-ments by fluorescent in situ hybridization (FISH) on homoeologous chromosomes of related species. Here we present the first results of such a cross-species multi-colour cytogenetic mapping of tomato BACs on potato chromosome 6 and vice versa. The experiments were performed under low hybridization stringency, while blocking with Cot-100 was essential in suppressing excessive hybridization of repeat signals in both within-species FISH and cross-species FISH of tomato BACs. In the short arm we detected a large paracentric inversion that covers the whole euchromatin part with breakpoints close to the telomeric heterochromatin and at the border of the short arm pericentromere. The long arm BACs revealed no deviation in the co-linearity between tomato and potato. Further comparison between tomato cultivars Cherry VFNT and Heinz 1706 revealed co-linearity of the tested tomato BACs, whereas one of the six potato clones (RH98-856-18) showed minor putative rearrangements within the inversion. Our results present cross-species multi-colour BAC-FISH as a unique tool for comparative genetic studies across Solanum species
- Published
- 2008
15. De novo sequencing, assembly and analysis of the genome of the laboratory strain Saccharomyces cerevisiae CEN.PK113-7D, a model for modern industrial biotechnology
- Author
-
Nijkamp, J.F. (author), Van den Broek, M.A. (author), Datema, E. (author), De Kok, S. (author), Bosman, L. (author), Luttik, M.A.H. (author), Daran-Lapujade, P.A.S. (author), Vongsangnak, W. (author), Nielsen, J. (author), Heijne, W.H.M. (author), Klaassen, P. (author), Paddon, C.J. (author), Platt, D. (author), Kötter, P. (author), Van Ham, R.C. (author), Reinders, M.J.T. (author), Pronk, J.T. (author), De Ridder, D. (author), Daran, J.M. (author), Nijkamp, J.F. (author), Van den Broek, M.A. (author), Datema, E. (author), De Kok, S. (author), Bosman, L. (author), Luttik, M.A.H. (author), Daran-Lapujade, P.A.S. (author), Vongsangnak, W. (author), Nielsen, J. (author), Heijne, W.H.M. (author), Klaassen, P. (author), Paddon, C.J. (author), Platt, D. (author), Kötter, P. (author), Van Ham, R.C. (author), Reinders, M.J.T. (author), Pronk, J.T. (author), De Ridder, D. (author), and Daran, J.M. (author)
- Abstract
Saccharomyces cerevisiae CEN.PK 113-7D is widely used for metabolic engineering and systems biology research in industry and academia. We sequenced, assembled, annotated and analyzed its genome. Single-nucleotide variations (SNV), insertions/deletions (indels) and differences in genome organization compared to the reference strain S. cerevisiae S288C were analyzed. In addition to a few large deletions and duplications, nearly 3000 indels were identified in the CEN.PK113-7D genome relative to S288C. These differences were overrepresented in genes whose functions are related to transcriptional regulation and chromatin remodelling. Some of these variations were caused by unstable tandem repeats, suggesting an innate evolvability of the corresponding genes. Besides a previously characterized mutation in adenylate cyclase, the CEN.PK113-7D genome sequence revealed a significant enrichment of non-synonymous mutations in genes encoding for components of the cAMP signalling pathway. Some phenotypic characteristics of the CEN.PK113-7D strains were explained by the presence of additional specific metabolic genes relative to S288C. In particular, the presence of the BIO1 and BIO6 genes correlated with a biotin prototrophy of CEN.PK113-7D. Furthermore, the copy number, chromosomal location and sequences of the MAL loci were resolved. The assembled sequence reveals that CEN.PK113-7D has a mosaic genome that combines characteristics of laboratory strains and wild-industrial strains., Intelligent Systems, Electrical Engineering, Mathematics and Computer Science
- Published
- 2012
- Full Text
- View/download PDF
16. The Genomes of the Fungal Plant Pathogens Cladosporium fulvum and Dothistroma septosporum Reveal Adaptation to Different Hosts and Lifestyles But Also Signatures of Common Ancestry
- Author
-
de Wit, P.J., van der Burgt, A., Okmen, B., Stergiopoulos, I., Abd-Elsalam, K.A., Aerts, A.L., Bahkali, A.H., Beenen, H.G., Chettri, P., Cox, M.P., Datema, E., de Vries, R.P., Dhillon, B., Ganley, A.R., Griffiths, S.A., Guo, Y., Hamelin, R.C., Henrissat, B., Kabir, M.S., Jashni, M.K., Kema, G., Klaubauf, S., Lapidus, A., Levasseur, A., Lindquist, E., Mehrabi, R., Ohm, R.A., Owen, T.J., Salamov, A., Schwelm, A., Schijlen, E., Sun, H., van den Burg, H.A., van Ham, R.C., Zhang, S., Goodwin, S.B., Grigoriev, I.V., Collemare, J., Bradshaw, R.E., de Wit, P.J., van der Burgt, A., Okmen, B., Stergiopoulos, I., Abd-Elsalam, K.A., Aerts, A.L., Bahkali, A.H., Beenen, H.G., Chettri, P., Cox, M.P., Datema, E., de Vries, R.P., Dhillon, B., Ganley, A.R., Griffiths, S.A., Guo, Y., Hamelin, R.C., Henrissat, B., Kabir, M.S., Jashni, M.K., Kema, G., Klaubauf, S., Lapidus, A., Levasseur, A., Lindquist, E., Mehrabi, R., Ohm, R.A., Owen, T.J., Salamov, A., Schwelm, A., Schijlen, E., Sun, H., van den Burg, H.A., van Ham, R.C., Zhang, S., Goodwin, S.B., Grigoriev, I.V., Collemare, J., and Bradshaw, R.E.
- Published
- 2012
17. The tomato genome sequence provides insights into fleshy fruit evolution
- Author
-
Sato, S., Tabata, S., Hirakawa, H., Klein Lankhorst, R.M., de Jong, H., van Ham, R.C.H.J., Datema, E., Smit, S., Schijlen, E.G.W.M., van Haarst, J.C., Peters, S.A., Henkens, M.H.C., van Staveren, M.J., Mooijman, P.J.W., Hesselink, T., van de Belt, J., Szinay, D., Bai, Y., Visser, R.G.F., Sato, S., Tabata, S., Hirakawa, H., Klein Lankhorst, R.M., de Jong, H., van Ham, R.C.H.J., Datema, E., Smit, S., Schijlen, E.G.W.M., van Haarst, J.C., Peters, S.A., Henkens, M.H.C., van Staveren, M.J., Mooijman, P.J.W., Hesselink, T., van de Belt, J., Szinay, D., Bai, Y., and Visser, R.G.F.
- Abstract
Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera1 and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium2, and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness.
- Published
- 2012
18. De novo sequencing, assembly and analysis of the genome of the laboratory strain Saccharomyces cerevisiae CEN.PK113-7D, a model for modern industrial biotechnology
- Author
-
Nijkamp, J.F., van den Broek, M., Datema, E., de Kok, S., Bosman, L., Luttik, M.A., Daran-Lapujade, P., Vongsangnak, W., Nielsen, J., Heijne, W.H.M., Klaassen, P., Paddon, C.J., Platt, D., Kotter, P., van Ham, R.C.H.J., Reinders, M.J.T., Pronk, J.T., de Ridder, D., Daran, J.M., Nijkamp, J.F., van den Broek, M., Datema, E., de Kok, S., Bosman, L., Luttik, M.A., Daran-Lapujade, P., Vongsangnak, W., Nielsen, J., Heijne, W.H.M., Klaassen, P., Paddon, C.J., Platt, D., Kotter, P., van Ham, R.C.H.J., Reinders, M.J.T., Pronk, J.T., de Ridder, D., and Daran, J.M.
- Abstract
Saccharomyces cerevisiae CEN.PK 113-7D is widely used for metabolic engineering and systems biology research in industry and academia. We sequenced, assembled, annotated and analyzed its genome. Single-nucleotide variations (SNV), insertions/deletions (indels) and differences in genome organization compared to the reference strain S. cerevisiae S288C were analyzed. In addition to a few large deletions and duplications, nearly 3000 indels were identified in the CEN.PK113-7D genome relative to S288C. These differences were overrepresented in genes whose functions are related to transcriptional regulation and chromatin remodelling. Some of these variations were caused by unstable tandem repeats, suggesting an innate evolvability of the corresponding genes. Besides a previously characterized mutation in adenylate cyclase, the CEN. PK113-7D genome sequence revealed a significant enrichment of non-synonymous mutations in genes encoding for components of the cAMP signalling pathway. Some phenotypic characteristics of the CEN. PK113-7D strains were explained by the presence of additional specific metabolic genes relative to S288C. In particular, the presence of the BIO1 and BIO6 genes correlated with a biotin prototrophy of CEN. PK113-7D. Furthermore, the copy number, chromosomal location and sequences of the MAL loci were resolved. The assembled sequence reveals that CEN. PK113-7D has a mosaic genome that combines characteristics of laboratory strains and wild-industrial strains.
- Published
- 2012
19. The Genomes of the Fungal Plant Pathogens Cladosporium fulvum and Dothistroma septosporum Reveal Adaptation to Different Hosts and Lifestyles But Also Signatures of Common Ancestry
- Author
-
de Wit, P.J.G.M., van der Burgt, I.A., Ökman, B., Stergiopoulos, I., Abd-Elsalam, K.A., Aerts, A.L., Bahkali, A.H., Beenen, H.G., Chettri, P., Cox, M.P., Datema, E., de Vries, R.P., Dhillon, B., Ganley, A.R., Griffiths, S.A., Guo, Y., Hamelin, R.C., Henrissat, B., Karimi Jashni, M., Kema, G.H.J., Klaubauf, S., Lapidus, A., Levasseur, A., Lindquist, E., Mehrabi, R., Ohm, R.A., Owen, T.J., Salamov, A., Schwelm, A., van den Burg, H.A., van Ham, R.C.H.J., Zhang, S., Goodwin, S.B., Collemare, J., de Wit, P.J.G.M., van der Burgt, I.A., Ökman, B., Stergiopoulos, I., Abd-Elsalam, K.A., Aerts, A.L., Bahkali, A.H., Beenen, H.G., Chettri, P., Cox, M.P., Datema, E., de Vries, R.P., Dhillon, B., Ganley, A.R., Griffiths, S.A., Guo, Y., Hamelin, R.C., Henrissat, B., Karimi Jashni, M., Kema, G.H.J., Klaubauf, S., Lapidus, A., Levasseur, A., Lindquist, E., Mehrabi, R., Ohm, R.A., Owen, T.J., Salamov, A., Schwelm, A., van den Burg, H.A., van Ham, R.C.H.J., Zhang, S., Goodwin, S.B., and Collemare, J.
- Abstract
We sequenced and compared the genomes of the Dothideomycete fungal plant pathogens Cladosporium fulvum (Cfu) (syn. Passalora fulva) and Dothistroma septosporum (Dse) that are closely related phylogenetically, but have different lifestyles and hosts. Although both fungi grow extracellularly in close contact with host mesophyll cells, Cfu is a biotroph infecting tomato, while Dse is a hemibiotroph infecting pine. The genomes of these fungi have a similar set of genes (70% of gene content in both genomes are homologs), but differ significantly in size (Cfu >61.1-Mb; Dse 31.2-Mb), which is mainly due to the difference in repeat content (47.2% in Cfu versus 3.2% in Dse). Recent adaptation to different lifestyles and hosts is suggested by diverged sets of genes. Cfu contains an a-tomatinase gene that we predict might be required for detoxification of tomatine, while this gene is absent in Dse. Many genes encoding secreted proteins are unique to each species and the repeat-rich areas in Cfu are enriched for these species-specific genes. In contrast, conserved genes suggest common host ancestry. Homologs of Cfu effector genes, including Ecp2 and Avr4, are present in Dse and induce a Cf-Ecp2- and Cf-4-mediated hypersensitive response, respectively. Strikingly, genes involved in production of the toxin dothistromin, a likely virulence factor for Dse, are conserved in Cfu, but their expression differs markedly with essentially no expression by Cfu in planta. Likewise, Cfu has a carbohydrate-degrading enzyme catalog that is more similar to that of necrotrophs or hemibiotrophs and a larger pectinolytic gene arsenal than Dse, but many of these genes are not expressed in planta or are pseudogenized. Overall, comparison of their genomes suggests that these closely related plant pathogens had a common ancestral host but since adapted to different hosts and lifestyles by a combination of differentiated gene content, pseudogenization, and gene regulation
- Published
- 2012
20. Genome bioinformatics of tomato and potato
- Author
-
Stiekema, W., van Ham, Roeland, Datema, E., Stiekema, W., van Ham, Roeland, and Datema, E.
- Abstract
In the past two decades genome sequencing has developed from a laborious and costly technology employed by large international consortia to a widely used, automated and affordable tool used worldwide by many individual research groups. Genome sequences of many food animals and crop plants have been deciphered and are being exploited for fundamental research and applied to improve their breeding programs. The developments in sequencing technologies have also impacted the associated bioinformatics strategies and tools, both those that are required for data processing, management, and quality control, and those used for interpretation of the data. This thesis focuses on the application of genome sequencing, assembly and annotation to two members of the Solanaceae family, tomato and potato. Potato is the economically most important species within the Solanaceae, and its tubers contribute to dietary intake of starch, protein, antioxidants, and vitamins. Tomato fruits are the second most consumed vegetable after potato, and are a globally important dietary source of lycopene, beta-carotene, vitamin C, and fiber. The chapters in this thesis document the generation, exploitation and interpretation of genomic sequence resources for these two species and shed light on the contents, structure and evolution of their genomes. Chapter 1introduces the concepts of genome sequencing, assembly and annotation, and explains the novel genome sequencing technologies that have been developed in the past decade. These so-called Next Generation Sequencing platforms display considerable variation in chemistry and workflow, and as a consequence the throughput and data quality differs by orders of magnitude between the platforms. The currently available sequencing platforms produce a vast variety of read lengths and facilitate the generation of paired sequences with an approximately fixed distance between them. The choice of sequencing chemistry and platform combined with the type of sequencin
- Published
- 2011
21. Genome sequence and analysis of the tuber crop potato
- Author
-
Xu, X., Pan, S.K., Cheng, S.F., Zhang, B., Bachem, C.W.B., de Boer, J.M., Borm, T.J.A., Kloosterman, B.A., van Eck, H.J., Datema, E., Goverse, A., van Ham, R.C.H.J., Visser, R.G.F., Xu, X., Pan, S.K., Cheng, S.F., Zhang, B., Bachem, C.W.B., de Boer, J.M., Borm, T.J.A., Kloosterman, B.A., van Eck, H.J., Datema, E., Goverse, A., van Ham, R.C.H.J., and Visser, R.G.F.
- Abstract
Potato (Solanum tuberosum L.) is the world’s most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression. Here we use a homozygous doubled-monoploid potato clone to sequence and assemble 86% of the 844-megabase genome. We predict 39,031 protein-coding genes and present evidence for at least two genome duplication events indicative of a palaeopolyploid origin. As the first genome sequence of an asterid, the potato genome reveals 2,642 genes specific to this large angiosperm clade. We also sequenced a heterozygous diploid clone and show that gene presence/absence variants and other potentially deleterious mutations occur frequently and are a likely cause of inbreeding depression. Gene family expansion, tissue-specific expression and recruitment of genes to new pathways contributed to the evolution of tuber development. The potato genome sequence provides a platform for genetic improvement of this vital crop
- Published
- 2011
22. A snapshot of the emerging tomato genome sequence
- Author
-
Mueller, L.A., Klein Lankhorst, R.M., Tanksley, S.D., Peters, R.M., van Staveren, M.J., Datema, E., Fiers, M.W.E.J., van Ham, R.C.H.J., Szinay, D., de Jong, J.H.S.G.M., Mueller, L.A., Klein Lankhorst, R.M., Tanksley, S.D., Peters, R.M., van Staveren, M.J., Datema, E., Fiers, M.W.E.J., van Ham, R.C.H.J., Szinay, D., and de Jong, J.H.S.G.M.
- Abstract
The genome of tomato (Solanum lycopersicum L.) is being sequenced by an international consortium of 10 countries (Korea, China, the United Kingdom, India, the Netherlands, France, Japan, Spain, Italy, and the United States) as part of the larger “International Solanaceae Genome Project (SOL): Systems Approach to Diversity and Adaptation” initiative. The tomato genome sequencing project uses an ordered bacterial artificial chromosome (BAC) approach to generate a high-quality tomato euchromatic genome sequence for use as a reference genome for the Solanaceae and euasterids. Sequence is deposited at GenBank and at the SOL Genomics Network (SGN). Currently, there are around 1000 BACs finished or in progress, representing more than a third of the projected euchromatic portion of the genome. An annotation effort is also underway by the International Tomato Annotation Group. The expected number of genes in the euchromatin is ~40,000, based on an estimate from a preliminary annotation of 11% of finished sequence. Here, we present this first snapshot of the emerging tomato genome and its annotation, a short comparison with potato (Solanum tuberosum L.) sequence data, and the tools available for the researchers to exploit this new resource are also presented. In the future, whole-genome shotgun techniques will be combined with the BAC-by-BAC approach to cover the entire tomato genome. The high-quality reference euchromatic tomato sequence is expected to be near completion by 2010
- Published
- 2009
23. Comparative BAC end sequence analysis of tomato and potato reveals overrepresentation of specific gene families in potato
- Author
-
Datema, E., Mueller, L.A., Buels, R., Giovannoni, J.J., Visser, R.G.F., Stiekema, W., van Ham, R.C.H.J., Datema, E., Mueller, L.A., Buels, R., Giovannoni, J.J., Visser, R.G.F., Stiekema, W., and van Ham, R.C.H.J.
- Abstract
Background: Tomato ( Solanum lycopersicon) and potato ( S. tuberosum) are two economically important crop species, the genomes of which are currently being sequenced. This study presents a first genome-wide analysis of these two species, based on two large collections of BAC end sequences representing approximately 19% of the tomato genome and 10% of the potato genome. Results: The tomato genome has a higher repeat content than the potato genome, primarily due to a higher number of retrotransposon insertions in the tomato genome. On the other hand, simple sequence repeats are more abundant in potato than in tomato. The two genomes also differ in the frequency distribution of SSR motifs. Based on EST and protein alignments, potato appears to contain up to 6,400 more putative coding regions than tomato. Major gene families such as cytochrome P450 mono-oxygenases and serine-threonine protein kinases are significantly overrepresented in potato, compared to tomato. Moreover, the P450 superfamily appears to have expanded spectacularly in both species compared to Arabidopsis thaliana, suggesting an expanded network of secondary metabolic pathways in the Solanaceae. Both tomato and potato appear to have a low level of microsynteny with A. thaliana. A higher degree of synteny was observed with Populus trichocarpa, specifically in the region between 15.2 and 19.4 Mb on P. trichocarpa chromosome 10. Conclusion: The findings in this paper present a first glimpse into the evolution of Solanaceous genomes, both within the family and relative to other plant species. When the complete genome sequences of these species become available, whole-genome comparisons and protein- or repeat-family specific studies may shed more light on the observations made here.
- Published
- 2008
24. High-throughput bioinformatics with the Cyrille2 pipeline system.
- Author
-
Fiers, M.W.E.J., van der Burgt, A., Datema, E., de Groot, J.C.W., van Ham, R.C.H.J., Fiers, M.W.E.J., van der Burgt, A., Datema, E., de Groot, J.C.W., and van Ham, R.C.H.J.
- Abstract
Background - Modern omics research involves the application of high-throughput technologies that generate vast volumes of data. These data need to be pre-processed, analyzed and integrated with existing knowledge through the use of diverse sets of software tools, models and databases. The analyses are often interdependent and chained together to form complex workflows or pipelines. Given the volume of the data used and the multitude of computational resources available, specialized pipeline software is required to make high-throughput analysis of large-scale omics datasets feasible. Results - We have developed a generic pipeline system called Cyrille2. The system is modular in design and consists of three functionally distinct parts: 1) a web based, graphical user interface (GUI) that enables a pipeline operator to manage the system; 2) the Scheduler, which forms the functional core of the system and which tracks what data enters the system and determines what jobs must be scheduled for execution, and; 3) the Executor, which searches for scheduled jobs and executes these on a compute cluster. Conclusion - The Cyrille2 system is an extensible, modular system, implementing the stated requirements. Cyrille2 enables easy creation and execution of high throughput, flexible bioinformatics pipelines.
- Published
- 2008
25. Correction: The Genomes of the Fungal Plant Pathogens Cladosporium fulvum and Dothistroma septosporum Reveal Adaptation to Different Hosts and Lifestyles But Also Signatures of Common Ancestry
- Author
-
Pj, Wit, van der Burgt A, Ökmen B, Stergiopoulos I, Ka, Abd-Elsalam, Al, Aerts, Ah, Bahkali, Hg, Beenen, Chettri P, Murray Cox, Datema E, Rp, Vries, Dhillon B, Ar, Ganley, Sa, Griffiths, Guo Y, Rc, Hamelin, Henrissat B, Ms, Kabir, and Mk, Jashni
26. De novo sequencing, assembly and analysis of the genome of the laboratory strain Saccharomyces cerevisiae CEN.PK113-7D, a model for modern industrial biotechnology
- Author
-
Nijkamp Jurgen F, van den Broek Marcel, Datema Erwin, de Kok Stefan, Bosman Lizanne, Luttik Marijke A, Daran-Lapujade Pascale, Vongsangnak Wanwipa, Nielsen Jens, Heijne Wilbert HM, Klaassen Paul, Paddon Chris J, Platt Darren, Kötter Peter, van Ham Roeland C, Reinders Marcel JT, Pronk Jack T, de Ridder Dick, and Daran Jean-Marc
- Subjects
Microbiology ,QR1-502 - Abstract
Abstract Saccharomyces cerevisiae CEN.PK 113-7D is widely used for metabolic engineering and systems biology research in industry and academia. We sequenced, assembled, annotated and analyzed its genome. Single-nucleotide variations (SNV), insertions/deletions (indels) and differences in genome organization compared to the reference strain S. cerevisiae S288C were analyzed. In addition to a few large deletions and duplications, nearly 3000 indels were identified in the CEN.PK113-7D genome relative to S288C. These differences were overrepresented in genes whose functions are related to transcriptional regulation and chromatin remodelling. Some of these variations were caused by unstable tandem repeats, suggesting an innate evolvability of the corresponding genes. Besides a previously characterized mutation in adenylate cyclase, the CEN.PK113-7D genome sequence revealed a significant enrichment of non-synonymous mutations in genes encoding for components of the cAMP signalling pathway. Some phenotypic characteristics of the CEN.PK113-7D strains were explained by the presence of additional specific metabolic genes relative to S288C. In particular, the presence of the BIO1 and BIO6 genes correlated with a biotin prototrophy of CEN.PK113-7D. Furthermore, the copy number, chromosomal location and sequences of the MAL loci were resolved. The assembled sequence reveals that CEN.PK113-7D has a mosaic genome that combines characteristics of laboratory strains and wild-industrial strains.
- Published
- 2012
- Full Text
- View/download PDF
27. Comparative BAC end sequence analysis of tomato and potato reveals overrepresentation of specific gene families in potato
- Author
-
Visser Richard GF, Giovannoni James J, Buels Robert, Mueller Lukas A, Datema Erwin, Stiekema Willem J, and van Ham Roeland CHJ
- Subjects
Botany ,QK1-989 - Abstract
Abstract Background Tomato (Solanum lycopersicon) and potato (S. tuberosum) are two economically important crop species, the genomes of which are currently being sequenced. This study presents a first genome-wide analysis of these two species, based on two large collections of BAC end sequences representing approximately 19% of the tomato genome and 10% of the potato genome. Results The tomato genome has a higher repeat content than the potato genome, primarily due to a higher number of retrotransposon insertions in the tomato genome. On the other hand, simple sequence repeats are more abundant in potato than in tomato. The two genomes also differ in the frequency distribution of SSR motifs. Based on EST and protein alignments, potato appears to contain up to 6,400 more putative coding regions than tomato. Major gene families such as cytochrome P450 mono-oxygenases and serine-threonine protein kinases are significantly overrepresented in potato, compared to tomato. Moreover, the P450 superfamily appears to have expanded spectacularly in both species compared to Arabidopsis thaliana, suggesting an expanded network of secondary metabolic pathways in the Solanaceae. Both tomato and potato appear to have a low level of microsynteny with A. thaliana. A higher degree of synteny was observed with Populus trichocarpa, specifically in the region between 15.2 and 19.4 Mb on P. trichocarpa chromosome 10. Conclusion The findings in this paper present a first glimpse into the evolution of Solanaceous genomes, both within the family and relative to other plant species. When the complete genome sequences of these species become available, whole-genome comparisons and protein- or repeat-family specific studies may shed more light on the observations made here.
- Published
- 2008
- Full Text
- View/download PDF
28. High-throughput bioinformatics with the Cyrille2 pipeline system
- Author
-
de Groot Joost CW, Datema Erwin, van der Burgt Ate, Fiers Mark WEJ, and van Ham Roeland CHJ
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Modern omics research involves the application of high-throughput technologies that generate vast volumes of data. These data need to be pre-processed, analyzed and integrated with existing knowledge through the use of diverse sets of software tools, models and databases. The analyses are often interdependent and chained together to form complex workflows or pipelines. Given the volume of the data used and the multitude of computational resources available, specialized pipeline software is required to make high-throughput analysis of large-scale omics datasets feasible. Results We have developed a generic pipeline system called Cyrille2. The system is modular in design and consists of three functionally distinct parts: 1) a web based, graphical user interface (GUI) that enables a pipeline operator to manage the system; 2) the Scheduler, which forms the functional core of the system and which tracks what data enters the system and determines what jobs must be scheduled for execution, and; 3) the Executor, which searches for scheduled jobs and executes these on a compute cluster. Conclusion The Cyrille2 system is an extensible, modular system, implementing the stated requirements. Cyrille2 enables easy creation and execution of high throughput, flexible bioinformatics pipelines.
- Published
- 2008
- Full Text
- View/download PDF
29. A parthenogenesis allele from apomictic dandelion can induce egg cell division without fertilization in lettuce
- Author
-
Underwood, CJ, Vijverberg, K, Rigola, D, Okamoto, S, Oplaat, C, Op den Camp, RHM, Radoeva, T, Schauer, SE, Fierens, J, Jansen, K, Mansveld, S, Busscher, M, Xiong, W, Datema, E, Nijbroek, K, Blom, E-J, Bicknell, R, Catanach, A, Erasmuson, S, Winefield, Christopher, van Tunen, AJ, Prins, M, Schranz, ME, and van Dijk, PJ
- Published
- 2022
- Full Text
- View/download PDF
30. A snapshot of the emerging tomato genome sequence
- Author
-
Sandra Knapp, Ying Wang, Antonio Granell, Dongyu Qu, Erika Asamizu, Pierre Frasse, Hongling Jiang, Mohamed Zouine, Pradeep Kumar Singh, Vivek Dalal, Luigi Frusciante, Robert M. Buels, Hans de Jong, Dongyuan Liu, James J. Giovannoni, Sander Peters, Sarita, Satoshi Tabata, Isaak Y. Tecle, Mara Ercolano, Jun Wang, Longfei Liu, Rekha Dixit, Heiko Schoof, Yongbiao Xue, Kishor Gaikwad, Julia Vrebalov, Alessandra Traini, Nunzio D’Agostino, Ruth White, Zhibiao Ye, Amparo Mico, Cheol-Goo Hur, Jitendra P. Khurana, Roderic Guigó, Arun Sharma, Paramjit Khurana, Jiuhai Zhao, Hiroyuki Fukuoka, Byung-Dong Kim, Smriti Shridhar, René Klein Lankhorst, Yuanyuan Dai, Dani Zamir, Sumera Praveen, Helen Beasley, Manuel Spannagl, Erwin Datema, Klaus X.F. Mayer, Yves Van de Peer, Akhilesh K. Tyagi, Aureliano Bombarely, P. Lindhout, Mark Fiers, Silvana Grandillo, Jane Rogers, Zhangjun Fei, Changbao Li, Giorgio Valle, Karen McLaren, Alok Singh, Sung-Hwan Jo, Sarah Butcher, Willem J. Stiekema, Murielle Philippot, Huajie Fan, Glenn J. Bryan, Fei Lu, Doil Choi, Jun He, Daniel W. A. Buchan, Stephane Rombauts, Jinfeng Chen, Yongchen Du, Xiao-Hua Yang, Shailendra Vyas, Daisuke Shibata, Maria Luisa Chiusano, Rajesh Kumar, Song Bin Chang, Marjo J. van Staveren, Gerard J. Bishop, Victoria Fernandez-Pedrosa, Hong-Qing Ling, Graham B. Seymour, Lukas A. Mueller, Mondher Bouzayen, Stephen M. Stack, Rémy Bruggmann, Ajay Kumar, Zhonghua Zhang, Christine Nicholson, Guoping Wang, Saloni Mathur, Sean Humphray, Vikrant Gupta, Jinfeng Shi, Roeland C. H. J. van Ham, Debasis Chattopadhyay, Amolkumar U. Solanke, Mingsheng Chen, Shusei Sato, Sanwen Huang, Sonia Osorio, Chen Lu, Zhukuan Cheng, Tilak Raj Sharma, Dóra Szinay, James Abbott, Awadhesh Pandit, Yu Geng, Mahavir Yadav, Sara Todesco, Manuel Pérez-Alonso, Giovanni Giuliano, Amalia Barone, Trilochan Mohapatra, Irfan Ahmad Ghazi, Wencai Yang, Francisco Camara, Giulia Falcone, Anika Jöcker, Clare Riddle, Alessandro Vezzi, Jianjun Chen, Shouhong Sun, Marco Pietrella, Joyce Van Eck, Lindsay A. Shearer, Adri A. Mills, Steven D. Tanksley, Miguel A. Botella, Chuanyou Li, Sarah Sims, Farid Regad, Jeffrey A. Fawcett, Parul Chowdhury, Naama Menda, Suzanne M. Royer, Nagendra K. Singh, Mueller, L. A., Lankhorst, R. K., Tanksley, S. D., Giovannoni, J. J., White, R., Vrebalov, J., Fei, Z., van Eck, J., Buels, R., Mills, A. A., Menda, N., Tecle, I. Y., Bombarely, A., Stack, S., Royer, S. M., Chang, S. B., Shearer, L. A., Kim, B. D., Jo, S. H., Hur, C. G., Choi, D., Li, C. B., Zhao, J., Jiang, H., Geng, Y., Dai, Y., Fan, H., Chen, J., Lu, F., Shi, J., Sun, S., Yang, X., Lu, C., Chen, M., Cheng, Z., Li, C., Ling, H., Xue, Y., Wang, Y., Seymour, G. B., Bishop, G. J., Bryan, G., Rogers, J., Sims, S., Butcher, S., Buchan, D., Abbott, J., Beasley, H., Nicholson, C., Riddle, C., Humphray, S., Mclaren, K., Mathur, S., Vyas, S., Solanke, A. U., Kumar, R., Gupta, V., Sharma, A. K., Khurana, P., Khurana, J. P., Tyagi, A., Sarita, Chowdhury, P., Shridhar, S., Chattopadhyay, D., Pandit, A., Singh, P., Kumar, A., Dixit, R., Singh, A., Praveen, S., Dalal, V., Yadav, M., Ghazi, I. A., Gaikwad, K., Sharma, T. R., Mohapatra, T., Singh, N. K., Szinay, D., de Jong, H., Peters, S., van Staveren, M., Datema, E., Fiers, M. W. E. J., van Ham, R. C. H. J., Lindhout, P., Philippot, M., Frasse, P., Regad, F., Zouine, M., Bouzayen, M., Asamizu, E., Sato, S., Fukuoka, H., Tabata, S., Shibata, D., Botella, M. A., Perez Alonso, M., Fernandez Pedrosa, V., Osorio, S., Mico, A., Granell, A., Zhang, Z., He, J., Huang, S., Du, Y., Qu, D., Liu, L., Liu, D., Wang, J., Ye, Z., Yang, W., Wang, G., Vezzi, A., Todesco, S., Valle, G., Falcone, G., Pietrella, M., Giuliano, G., Grandillo, S., Traini, A., D'Agostino, Nunzio, Chiusano, MARIA LUISA, Ercolano, MARIA RAFFAELLA, Barone, Amalia, Frusciante, Luigi, Schoof, H., Jöcker, A., Bruggmann, R., Spannagl, M., Mayer, K. X. F., Guigó, R., Camara, F., Rombauts, S., Fawcett, J. A., Van de Peer, Y., Knapp, S., Zamir, D., and Stiekema, W.
- Subjects
0106 biological sciences ,lcsh:QH426-470 ,Bioinformatics ,Genomics ,Plant Science ,Computational biology ,lcsh:Plant culture ,Biology ,Laboratorium voor Erfelijkheidsleer ,ENCODE ,01 natural sciences ,Genome ,03 medical and health sciences ,Laboratorium voor Plantenveredeling ,Bioinformatica ,Genetics ,Life Science ,lcsh:SB1-1110 ,030304 developmental biology ,Whole genome sequencing ,0303 health sciences ,Bacterial artificial chromosome ,EPS-4 ,fungi ,food and beverages ,Biology and Life Sciences ,Genome project ,PRI Bioscience ,lcsh:Genetics ,Plant Breeding ,GenBank ,Laboratory of Genetics ,Agronomy and Crop Science ,010606 plant biology & botany ,Reference genome - Abstract
The genome of tomato (Solanum lycopersicum L.) is being sequenced by an international consortium of 10 countries (Korea, China, the United Kingdom, India, the Netherlands, France, Japan, Spain, Italy, and the United States) as part of the larger “International Solanaceae Genome Project (SOL): Systems Approach to Diversity and Adaptation” initiative. The tomato genome sequencing project uses an ordered bacterial artificial chromosome (BAC) approach to generate a high-quality tomato euchromatic genome sequence for use as a reference genome for the Solanaceae and euasterids. Sequence is deposited at GenBank and at the SOL Genomics Network (SGN). Currently, there are around 1000 BACs finished or in progress, representing more than a third of the projected euchromatic portion of the genome. An annotation effort is also underway by the International Tomato Annotation Group. The expected number of genes in the euchromatin is ∼40,000, based on an estimate from a preliminary annotation of 11% of finished sequence. Here, we present this first snapshot of the emerging tomato genome and its annotation, a short comparison with potato (Solanum tuberosum L.) sequence data, and the tools available for the researchers to exploit this new resource are also presented. In the future, whole-genome shotgun techniques will be combined with the BAC-by-BAC approach to cover the entire tomato genome. The high-quality reference euchromatic tomato sequence is expected to be near completion by 2010.
- Published
- 2009
31. A PARTHENOGENESIS allele from apomictic dandelion can induce egg cell division without fertilization in lettuce.
- Author
-
Underwood CJ, Vijverberg K, Rigola D, Okamoto S, Oplaat C, Camp RHMOD, Radoeva T, Schauer SE, Fierens J, Jansen K, Mansveld S, Busscher M, Xiong W, Datema E, Nijbroek K, Blom EJ, Bicknell R, Catanach A, Erasmuson S, Winefield C, van Tunen AJ, Prins M, Schranz ME, and van Dijk PJ
- Subjects
- Alleles, Clustered Regularly Interspaced Short Palindromic Repeats, Lactuca growth & development, Ovum cytology, Transcriptome, Zinc Fingers genetics, Apomixis genetics, Cell Division genetics, Genes, Plant, Lactuca genetics, Taraxacum genetics
- Abstract
Apomixis, the clonal formation of seeds, is a rare yet widely distributed trait in flowering plants. We have isolated the PARTHENOGENESIS (PAR) gene from apomictic dandelion that triggers embryo development in unfertilized egg cells. PAR encodes a K2-2 zinc finger, EAR-domain protein. Unlike the recessive sexual alleles, the dominant PAR allele is expressed in egg cells and has a miniature inverted-repeat transposable element (MITE) transposon insertion in the promoter. The MITE-containing promoter can invoke a homologous gene from sexual lettuce to complement dandelion LOSS OF PARTHENOGENESIS mutants. A similar MITE is also present in the promoter of the PAR gene in apomictic forms of hawkweed, suggesting a case of parallel evolution. Heterologous expression of dandelion PAR in lettuce egg cells induced haploid embryo-like structures in the absence of fertilization. Sexual PAR alleles are expressed in pollen, suggesting that the gene product releases a block on embryogenesis after fertilization in sexual species while in apomictic species PAR expression triggers embryogenesis in the absence of fertilization., (© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2022
- Full Text
- View/download PDF
32. Population Genomic Analysis Reveals Differential Evolutionary Histories and Patterns of Diversity across Subgenomes and Subpopulations of Brassica napus L.
- Author
-
Gazave E, Tassone EE, Ilut DC, Wingerson M, Datema E, Witsenboer HM, Davis JB, Grant D, Dyer JM, Jenks MA, Brown J, and Gore MA
- Abstract
The allotetraploid species Brassica napus L. is a global crop of major economic importance, providing canola oil (seed) and vegetables for human consumption and fodder and meal for livestock feed. Characterizing the genetic diversity present in the extant germplasm pool of B. napus is fundamental to better conserve, manage and utilize the genetic resources of this species. We used sequence-based genotyping to identify and genotype 30,881 SNPs in a diversity panel of 782 B. napus accessions, representing samples of winter and spring growth habits originating from 33 countries across Europe, Asia, and America. We detected strong population structure broadly concordant with growth habit and geography, and identified three major genetic groups: spring (SP), winter Europe (WE), and winter Asia (WA). Subpopulation-specific polymorphism patterns suggest enriched genetic diversity within the WA group and a smaller effective breeding population for the SP group compared to WE. Interestingly, the two subgenomes of B. napus appear to have different geographic origins, with phylogenetic analysis placing WE and WA as basal clades for the other subpopulations in the C and A subgenomes, respectively. Finally, we identified 16 genomic regions where the patterns of diversity differed markedly from the genome-wide average, several of which are suggestive of genomic inversions. The results obtained in this study constitute a valuable resource for worldwide breeding efforts and the genetic dissection and prediction of complex B. napus traits.
- Published
- 2016
- Full Text
- View/download PDF
33. Correction: The Genomes of the Fungal Plant Pathogens Cladosporium fulvum and Dothistroma septosporum Reveal Adaptation to Different Hosts and Lifestyles But Also Signatures of Common Ancestry.
- Author
-
de Wit PJ, van der Burgt A, Ökmen B, Stergiopoulos I, Abd-Elsalam KA, Aerts AL, Bahkali AH, Beenen HG, Chettri P, Cox MP, Datema E, de Vries RP, Dhillon B, Ganley AR, Griffiths SA, Guo Y, Hamelin RC, Henrissat B, Kabir MS, Jashni MK, Kema G, Klaubauf S, Lapidus A, Levasseur A, Lindquist E, Mehrabi R, Ohm RA, Owen TJ, Salamov A, Schwelm A, Schijlen E, Sun H, van den Burg HA, van Ham RC, Zhang S, Goodwin SB, Grigoriev IV, Collemare J, and Bradshaw RE
- Published
- 2015
- Full Text
- View/download PDF
34. Single-Molecule Real-Time Sequencing Combined with Optical Mapping Yields Completely Finished Fungal Genome.
- Author
-
Faino L, Seidl MF, Datema E, van den Berg GC, Janssen A, Wittenberg AH, and Thomma BP
- Subjects
- DNA Transposable Elements genetics, Genomics methods, Sequence Analysis, DNA methods, Telomere genetics, Chromosome Mapping methods, Genome, Fungal, High-Throughput Nucleotide Sequencing methods, Optical Restriction Mapping methods, Verticillium genetics
- Abstract
Unlabelled: Next-generation sequencing (NGS) technologies have increased the scalability, speed, and resolution of genomic sequencing and, thus, have revolutionized genomic studies. However, eukaryotic genome sequencing initiatives typically yield considerably fragmented genome assemblies. Here, we assessed various state-of-the-art sequencing and assembly strategies in order to produce a contiguous and complete eukaryotic genome assembly, focusing on the filamentous fungus Verticillium dahliae. Compared with Illumina-based assemblies of the V. dahliae genome, hybrid assemblies that also include PacBio-generated long reads establish superior contiguity. Intriguingly, provided that sufficient sequence depth is reached, assemblies solely based on PacBio reads outperform hybrid assemblies and even result in fully assembled chromosomes. Furthermore, the addition of optical map data allowed us to produce a gapless and complete V. dahliae genome assembly of the expected eight chromosomes from telomere to telomere. Consequently, we can now study genomic regions that were previously not assembled or poorly assembled, including regions that are populated by repetitive sequences, such as transposons, allowing us to fully appreciate an organism's biological complexity. Our data show that a combination of PacBio-generated long reads and optical mapping can be used to generate complete and gapless assemblies of fungal genomes., Importance: Studying whole-genome sequences has become an important aspect of biological research. The advent of next-generation sequencing (NGS) technologies has nowadays brought genomic science within reach of most research laboratories, including those that study nonmodel organisms. However, most genome sequencing initiatives typically yield (highly) fragmented genome assemblies. Nevertheless, considerable relevant information related to genome structure and evolution is likely hidden in those nonassembled regions. Here, we investigated a diverse set of strategies to obtain gapless genome assemblies, using the genome of a typical ascomycete fungus as the template. Eventually, we were able to show that a combination of PacBio-generated long reads and optical mapping yields a gapless telomere-to-telomere genome assembly, allowing in-depth genome analyses to facilitate functional studies into an organism's biology., (Copyright © 2015 Faino et al.)
- Published
- 2015
- Full Text
- View/download PDF
35. Homologues of potato chromosome 5 show variable collinearity in the euchromatin, but dramatic absence of sequence similarity in the pericentromeric heterochromatin.
- Author
-
de Boer JM, Datema E, Tang X, Borm TJ, Bakker EH, van Eck HJ, van Ham RC, de Jong H, Visser RG, and Bachem CW
- Subjects
- Chromosome Mapping, Chromosomes, Artificial, Bacterial, Euchromatin metabolism, Genetic Linkage, Genotype, Haplotypes, Heterochromatin metabolism, In Situ Hybridization, Fluorescence, Polymorphism, Genetic, Chromosomes, Plant, Euchromatin genetics, Heterochromatin genetics, Solanum tuberosum genetics
- Abstract
Background: In flowering plants it has been shown that de novo genome assemblies of different species and genera show a significant drop in the proportion of alignable sequence. Within a plant species, however, it is assumed that different haplotypes of the same chromosome align well. In this paper we have compared three de novo assemblies of potato chromosome 5 and report on the sequence variation and the proportion of sequence that can be aligned., Results: For the diploid potato clone RH89-039-16 (RH) we produced two linkage phase controlled and haplotype-specific assemblies of chromosome 5 based on BAC-by-BAC sequencing, which were aligned to each other and compared to the 52 Mb chromosome 5 reference sequence of the doubled monoploid clone DM 1-3 516 R44 (DM). We identified 17.0 Mb of non-redundant sequence scaffolds derived from euchromatic regions of RH and 38.4 Mb from the pericentromeric heterochromatin. For 32.7 Mb of the RH sequences the correct position and order on chromosome 5 was determined, using genetic markers, fluorescence in situ hybridisation and alignment to the DM reference genome. This ordered fraction of the RH sequences is situated in the euchromatic arms and in the heterochromatin borders. In the euchromatic regions, the sequence collinearity between the three chromosomal homologs is good, but interruption of collinearity occurs at nine gene clusters. Towards and into the heterochromatin borders, absence of collinearity due to structural variation was more extensive and was caused by hemizygous and poorly aligning regions of up to 450 kb in length. In the most central heterochromatin, a total of 22.7 Mb sequence from both RH haplotypes remained unordered. These RH sequences have very few syntenic regions and represent a non-alignable region between the RH and DM heterochromatin haplotypes of chromosome 5., Conclusions: Our results show that among homologous potato chromosomes large regions are present with dramatic loss of sequence collinearity. This stresses the need for more de novo reference assemblies in order to capture genome diversity in this crop. The discovery of three highly diverged pericentric heterochromatin haplotypes within one species is a novelty in plant genome analysis. The possible origin and cytogenetic implication of this heterochromatin haplotype diversity are discussed.
- Published
- 2015
- Full Text
- View/download PDF
36. Chromosomal organizations of major repeat families on potato (Solanum tuberosum) and further exploring in its sequenced genome.
- Author
-
Tang X, Datema E, Guzman MO, de Boer JM, van Eck HJ, Bachem CW, Visser RG, and de Jong H
- Subjects
- Base Sequence, Consensus Sequence, Physical Chromosome Mapping, Repetitive Sequences, Nucleic Acid, Retroelements, Tandem Repeat Sequences, Chromosomes, Plant, DNA, Plant chemistry, Genome, Plant, Solanum tuberosum genetics
- Abstract
One of the most powerful technologies in unraveling the organization of a eukaryotic plant genome is high-resolution Fluorescent in situ hybridization of repeats and single copy DNA sequences on pachytene chromosomes. This technology allows the integration of physical mapping information with chromosomal positions, including centromeres, telomeres, nucleolar-organizing region, and euchromatin and heterochromatin. In this report, we established chromosomal positions of different repeat fractions of the potato genomic DNA (Cot100, Cot500 and Cot1000) on the chromosomes. We also analysed various repeat elements that are unique to potato including the moderately repetitive P5 and REP2 elements, where the REP2 is part of a larger Gypsy-type LTR retrotransposon and cover most chromosome regions, with some brighter fluorescing spots in the heterochromatin. The most abundant tandem repeat is the potato genomic repeat 1 that covers subtelomeric regions of most chromosome arms. Extensive multiple alignments of these repetitive sequences in the assembled RH89-039-16 potato BACs and the draft assembly of the DM1-3 516 R44 genome shed light on the conservation of these repeats within the potato genome. The consensus sequences thus obtained revealed the native complete transposable elements from which they were derived.
- Published
- 2014
- Full Text
- View/download PDF
37. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing.
- Author
-
Aflitos S, Schijlen E, de Jong H, de Ridder D, Smit S, Finkers R, Wang J, Zhang G, Li N, Mao L, Bakker F, Dirks R, Breit T, Gravendeel B, Huits H, Struss D, Swanson-Wagner R, van Leeuwen H, van Ham RC, Fito L, Guignier L, Sevilla M, Ellul P, Ganko E, Kapur A, Reclus E, de Geus B, van de Geest H, Te Lintel Hekkert B, van Haarst J, Smits L, Koops A, Sanchez-Perez G, van Heusden AW, Visser R, Quan Z, Min J, Liao L, Wang X, Wang G, Yue Z, Yang X, Xu N, Schranz E, Smets E, Vos R, Rauwerda J, Ursem R, Schuit C, Kerns M, van den Berg J, Vriezen W, Janssen A, Datema E, Jahrman T, Moquet F, Bonnet J, and Peters S
- Subjects
- Breeding, Chromosome Mapping, DNA, Plant chemistry, DNA, Plant genetics, Fruit genetics, High-Throughput Nucleotide Sequencing, Molecular Sequence Data, Phenotype, Phylogeny, Polymorphism, Single Nucleotide, Sequence Alignment, Sequence Analysis, DNA, Species Specificity, Genetic Variation, Genome, Plant genetics, Solanum lycopersicum genetics
- Abstract
We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars. Using gene models from the annotated Heinz 1706 reference genome, we observed differences in the ratio between non-synonymous and synonymous SNPs (dN/dS) in fruit diversification and plant growth genes compared to a random set of genes, indicating positive selection and differences in selection pressure between crop accessions and wild species. In wild species, the number of single-nucleotide polymorphisms (SNPs) exceeds 10 million, i.e. 20-fold higher than found in most of the crop accessions, indicating dramatic genetic erosion of crop and heirloom tomatoes. In addition, the highest levels of heterozygosity were found for allogamous self-incompatible wild species, while facultative and autogamous self-compatible species display a lower heterozygosity level. Using whole-genome SNP information for maximum-likelihood analysis, we achieved complete tree resolution, whereas maximum-likelihood trees based on SNPs from ten fruit and growth genes show incomplete resolution for the crop accessions, partly due to the effect of heterozygous SNPs. Finally, results suggest that phylogenetic relationships are correlated with habitat, indicating the occurrence of geographical races within these groups, which is of practical importance for Solanum genome evolution studies., (© 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.)
- Published
- 2014
- Full Text
- View/download PDF
38. The genomes of the fungal plant pathogens Cladosporium fulvum and Dothistroma septosporum reveal adaptation to different hosts and lifestyles but also signatures of common ancestry.
- Author
-
de Wit PJ, van der Burgt A, Ökmen B, Stergiopoulos I, Abd-Elsalam KA, Aerts AL, Bahkali AH, Beenen HG, Chettri P, Cox MP, Datema E, de Vries RP, Dhillon B, Ganley AR, Griffiths SA, Guo Y, Hamelin RC, Henrissat B, Kabir MS, Jashni MK, Kema G, Klaubauf S, Lapidus A, Levasseur A, Lindquist E, Mehrabi R, Ohm RA, Owen TJ, Salamov A, Schwelm A, Schijlen E, Sun H, van den Burg HA, van Ham RC, Zhang S, Goodwin SB, Grigoriev IV, Collemare J, and Bradshaw RE
- Subjects
- Base Sequence, Fungal Proteins genetics, Gene Expression Regulation, Fungal, Solanum lycopersicum genetics, Solanum lycopersicum parasitology, Phylogeny, Pinus genetics, Pinus parasitology, Plant Diseases genetics, Adaptation, Physiological genetics, Cladosporium genetics, Genome, Host-Pathogen Interactions
- Abstract
We sequenced and compared the genomes of the Dothideomycete fungal plant pathogens Cladosporium fulvum (Cfu) (syn. Passalora fulva) and Dothistroma septosporum (Dse) that are closely related phylogenetically, but have different lifestyles and hosts. Although both fungi grow extracellularly in close contact with host mesophyll cells, Cfu is a biotroph infecting tomato, while Dse is a hemibiotroph infecting pine. The genomes of these fungi have a similar set of genes (70% of gene content in both genomes are homologs), but differ significantly in size (Cfu >61.1-Mb; Dse 31.2-Mb), which is mainly due to the difference in repeat content (47.2% in Cfu versus 3.2% in Dse). Recent adaptation to different lifestyles and hosts is suggested by diverged sets of genes. Cfu contains an α-tomatinase gene that we predict might be required for detoxification of tomatine, while this gene is absent in Dse. Many genes encoding secreted proteins are unique to each species and the repeat-rich areas in Cfu are enriched for these species-specific genes. In contrast, conserved genes suggest common host ancestry. Homologs of Cfu effector genes, including Ecp2 and Avr4, are present in Dse and induce a Cf-Ecp2- and Cf-4-mediated hypersensitive response, respectively. Strikingly, genes involved in production of the toxin dothistromin, a likely virulence factor for Dse, are conserved in Cfu, but their expression differs markedly with essentially no expression by Cfu in planta. Likewise, Cfu has a carbohydrate-degrading enzyme catalog that is more similar to that of necrotrophs or hemibiotrophs and a larger pectinolytic gene arsenal than Dse, but many of these genes are not expressed in planta or are pseudogenized. Overall, comparison of their genomes suggests that these closely related plant pathogens had a common ancestral host but since adapted to different hosts and lifestyles by a combination of differentiated gene content, pseudogenization, and gene regulation., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2012
- Full Text
- View/download PDF
39. Genome sequence and analysis of the tuber crop potato.
- Author
-
Xu X, Pan S, Cheng S, Zhang B, Mu D, Ni P, Zhang G, Yang S, Li R, Wang J, Orjeda G, Guzman F, Torres M, Lozano R, Ponce O, Martinez D, De la Cruz G, Chakrabarti SK, Patil VU, Skryabin KG, Kuznetsov BB, Ravin NV, Kolganova TV, Beletsky AV, Mardanov AV, Di Genova A, Bolser DM, Martin DM, Li G, Yang Y, Kuang H, Hu Q, Xiong X, Bishop GJ, Sagredo B, Mejía N, Zagorski W, Gromadka R, Gawor J, Szczesny P, Huang S, Zhang Z, Liang C, He J, Li Y, He Y, Xu J, Zhang Y, Xie B, Du Y, Qu D, Bonierbale M, Ghislain M, Herrera Mdel R, Giuliano G, Pietrella M, Perrotta G, Facella P, O'Brien K, Feingold SE, Barreiro LE, Massa GA, Diambra L, Whitty BR, Vaillancourt B, Lin H, Massa AN, Geoffroy M, Lundback S, DellaPenna D, Buell CR, Sharma SK, Marshall DF, Waugh R, Bryan GJ, Destefanis M, Nagy I, Milbourne D, Thomson SJ, Fiers M, Jacobs JM, Nielsen KL, Sønderkær M, Iovene M, Torres GA, Jiang J, Veilleux RE, Bachem CW, de Boer J, Borm T, Kloosterman B, van Eck H, Datema E, Hekkert Bt, Goverse A, van Ham RC, and Visser RG
- Subjects
- Evolution, Molecular, Gene Duplication, Gene Expression Regulation, Plant, Genes, Plant genetics, Genetic Variation, Haplotypes genetics, Heterozygote, Homozygote, Immunity, Innate, Inbreeding, Molecular Sequence Annotation, Molecular Sequence Data, Plant Diseases genetics, Ploidies, Solanum tuberosum physiology, Genome, Plant genetics, Genomics, Solanum tuberosum genetics
- Abstract
Potato (Solanum tuberosum L.) is the world's most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression. Here we use a homozygous doubled-monoploid potato clone to sequence and assemble 86% of the 844-megabase genome. We predict 39,031 protein-coding genes and present evidence for at least two genome duplication events indicative of a palaeopolyploid origin. As the first genome sequence of an asterid, the potato genome reveals 2,642 genes specific to this large angiosperm clade. We also sequenced a heterozygous diploid clone and show that gene presence/absence variants and other potentially deleterious mutations occur frequently and are a likely cause of inbreeding depression. Gene family expansion, tissue-specific expression and recruitment of genes to new pathways contributed to the evolution of tuber development. The potato genome sequence provides a platform for genetic improvement of this vital crop., (©2011 Macmillan Publishers Limited. All rights reserved)
- Published
- 2011
- Full Text
- View/download PDF
40. Solanum lycopersicum cv. Heinz 1706 chromosome 6: distribution and abundance of genes and retrotransposable elements.
- Author
-
Peters SA, Datema E, Szinay D, van Staveren MJ, Schijlen EG, van Haarst JC, Hesselink T, Abma-Henkens MH, Bai Y, de Jong H, Stiekema WJ, Klein Lankhorst RM, and van Ham RC
- Subjects
- Chromosome Walking, Chromosomes, Artificial, Bacterial, Contig Mapping, DNA Fingerprinting, DNA, Plant genetics, Euchromatin, Heterochromatin, In Situ Hybridization, Fluorescence, Sequence Analysis, DNA, Chromosomes, Plant genetics, Genes, Plant, Solanum lycopersicum genetics, Retroelements
- Abstract
We studied the physical and genetic organization of chromosome 6 of tomato (Solanum lycopersicum) cv. Heinz 1706 by combining bacterial artificial chromosome (BAC) sequence analysis, high-information-content fingerprinting, genetic analysis, and BAC-fluorescent in situ hybridization (FISH) mapping data. The chromosome positions of 81 anchored seed and extension BACs corresponded in most cases with the linear marker order on the high-density EXPEN 2000 linkage map. We assembled 25 BAC contigs and eight singleton BACs spanning 2.0 Mb of the short-arm euchromatin, 1.8 Mb of the pericentromeric heterochromatin and 6.9 Mb of the long-arm euchromatin. Sequence data were combined with their corresponding genetic and pachytene chromosome positions into an integrated map that covers approximately a third of the chromosome 6 euchromatin and a small part of the pericentromeric heterochromatin. We then compared physical length (Mb), genetic (cM) and chromosome distances (microm) for determining gap sizes between contigs, revealing relative hot and cold spots of recombination. Through sequence annotation we identified several clusters of functionally related genes and an uneven distribution of both gene and repeat sequences between heterochromatin and euchromatin domains. Although a greater number of the non-transposon genes were located in the euchromatin, the highly repetitive (22.4%) pericentromeric heterochromatin displayed an unexpectedly high gene content of one gene per 36.7 kb. Surprisingly, the short-arm euchromatin was relatively rich in repeats as well, with a repeat content of 13.4%, yet the ratio of Ty3/Gypsy and Ty1/Copia retrotransposable elements across the chromosome clearly distinguished euchromatin (2:3) from heterochromatin (3:2).
- Published
- 2009
- Full Text
- View/download PDF
41. Cross-species bacterial artificial chromosome-fluorescence in situ hybridization painting of the tomato and potato chromosome 6 reveals undescribed chromosomal rearrangements.
- Author
-
Tang X, Szinay D, Lang C, Ramanna MS, van der Vossen EA, Datema E, Lankhorst RK, de Boer J, Peters SA, Bachem C, Stiekema W, Visser RG, de Jong H, and Bai Y
- Subjects
- DNA, Plant genetics, Genetic Markers, Genome, Plant, Species Specificity, Chromosome Aberrations, Chromosomes, Artificial, Bacterial, Chromosomes, Plant genetics, In Situ Hybridization, Fluorescence methods, Solanum lycopersicum genetics, Physical Chromosome Mapping methods, Solanum tuberosum genetics
- Abstract
Ongoing genomics projects of tomato (Solanum lycopersicum) and potato (S. tuberosum) are providing unique tools for comparative mapping studies in Solanaceae. At the chromosomal level, bacterial artificial chromosomes (BACs) can be positioned on pachytene complements by fluorescence in situ hybridization (FISH) on homeologous chromosomes of related species. Here we present results of such a cross-species multicolor cytogenetic mapping of tomato BACs on potato chromosomes 6 and vice versa. The experiments were performed under low hybridization stringency, while blocking with Cot-100 was essential in suppressing excessive hybridization of repeat signals in both within-species FISH and cross-species FISH of tomato BACs. In the short arm we detected a large paracentric inversion that covers the whole euchromatin part with breakpoints close to the telomeric heterochromatin and at the border of the short arm pericentromere. The long arm BACs revealed no deviation in the colinearity between tomato and potato. Further comparison between tomato cultivars Cherry VFNT and Heinz 1706 revealed colinearity of the tested tomato BACs, whereas one of the six potato clones (RH98-856-18) showed minor putative rearrangements within the inversion. Our results present cross-species multicolor BAC-FISH as a unique tool for comparative genetic studies across Solanum species.
- Published
- 2008
- Full Text
- View/download PDF
42. Comparative BAC end sequence analysis of tomato and potato reveals overrepresentation of specific gene families in potato.
- Author
-
Datema E, Mueller LA, Buels R, Giovannoni JJ, Visser RG, Stiekema WJ, and van Ham RC
- Subjects
- Chromosome Mapping, Expressed Sequence Tags, Genes, Plant, Plant Proteins genetics, Genome, Plant, Solanum lycopersicum genetics, Sequence Analysis, DNA methods, Solanum tuberosum genetics
- Abstract
Background: Tomato (Solanum lycopersicon) and potato (S. tuberosum) are two economically important crop species, the genomes of which are currently being sequenced. This study presents a first genome-wide analysis of these two species, based on two large collections of BAC end sequences representing approximately 19% of the tomato genome and 10% of the potato genome., Results: The tomato genome has a higher repeat content than the potato genome, primarily due to a higher number of retrotransposon insertions in the tomato genome. On the other hand, simple sequence repeats are more abundant in potato than in tomato. The two genomes also differ in the frequency distribution of SSR motifs. Based on EST and protein alignments, potato appears to contain up to 6,400 more putative coding regions than tomato. Major gene families such as cytochrome P450 mono-oxygenases and serine-threonine protein kinases are significantly overrepresented in potato, compared to tomato. Moreover, the P450 superfamily appears to have expanded spectacularly in both species compared to Arabidopsis thaliana, suggesting an expanded network of secondary metabolic pathways in the Solanaceae. Both tomato and potato appear to have a low level of microsynteny with A. thaliana. A higher degree of synteny was observed with Populus trichocarpa, specifically in the region between 15.2 and 19.4 Mb on P. trichocarpa chromosome 10., Conclusion: The findings in this paper present a first glimpse into the evolution of Solanaceous genomes, both within the family and relative to other plant species. When the complete genome sequences of these species become available, whole-genome comparisons and protein- or repeat-family specific studies may shed more light on the observations made here.
- Published
- 2008
- Full Text
- View/download PDF
43. High-throughput bioinformatics with the Cyrille2 pipeline system.
- Author
-
Fiers MW, van der Burgt A, Datema E, de Groot JC, and van Ham RC
- Subjects
- Algorithms, Computational Biology methods, Database Management Systems, Databases, Genetic, Information Storage and Retrieval methods, Software, User-Computer Interface
- Abstract
Background: Modern omics research involves the application of high-throughput technologies that generate vast volumes of data. These data need to be pre-processed, analyzed and integrated with existing knowledge through the use of diverse sets of software tools, models and databases. The analyses are often interdependent and chained together to form complex workflows or pipelines. Given the volume of the data used and the multitude of computational resources available, specialized pipeline software is required to make high-throughput analysis of large-scale omics datasets feasible., Results: We have developed a generic pipeline system called Cyrille2. The system is modular in design and consists of three functionally distinct parts: 1) a web based, graphical user interface (GUI) that enables a pipeline operator to manage the system; 2) the Scheduler, which forms the functional core of the system and which tracks what data enters the system and determines what jobs must be scheduled for execution, and; 3) the Executor, which searches for scheduled jobs and executes these on a compute cluster., Conclusion: The Cyrille2 system is an extensible, modular system, implementing the stated requirements. Cyrille2 enables easy creation and execution of high throughput, flexible bioinformatics pipelines.
- Published
- 2008
- Full Text
- View/download PDF
44. FISH mapping and molecular organization of the major repetitive sequences of tomato.
- Author
-
Chang SB, Yang TJ, Datema E, van Vugt J, Vosman B, Kuipers A, Meznikova M, Szinay D, Lankhorst RK, Jacobsen E, and de Jong H
- Subjects
- Genome, Plant, Heterochromatin, Chromosome Mapping methods, DNA, Plant genetics, In Situ Hybridization, Fluorescence methods, Solanum lycopersicum genetics, Repetitive Sequences, Nucleic Acid genetics
- Abstract
This paper presents a bird's-eye view of the major repeats and chromatin types of tomato. Using fluorescence in-situ hybridization (FISH) with Cot-1, Cot-10 and Cot-100 DNA as probes we mapped repetitive sequences of different complexity on pachytene complements. Cot-100 was found to cover all heterochromatin regions, and could be used to identify repeat-rich clones in BAC filter hybridization. Next we established the chromosomal locations of the tandem and dispersed repeats with respect to euchromatin, nucleolar organizer regions (NORs), heterochromatin, and centromeres. The tomato genomic repeats TGRII and TGRIII appeared to be major components of the pericentromeres, whereas the newly discovered TGRIV repeat was found mainly in the structural centromeres. The highly methylated NOR of chromosome 2 is rich in [GACA](4), a microsatellite that also forms part of the pericentromeres, together with [GA](8), [GATA](4) and Ty1-copia. Based on the morphology of pachytene chromosomes and the distribution of repeats studied so far, we now propose six different chromatin classes for tomato: (1) euchromatin, (2) chromomeres, (3) distal heterochromatin and interstitial heterochromatic knobs, (4) pericentromere heterochromatin, (5) functional centromere heterochromatin and (6) nucleolar organizer region.
- Published
- 2008
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.