102 results on '"Overmars, L."'
Search Results
2. Characteristics of peripheral blood cells are independently related to major adverse cardiovascular events after carotid endarterectomy
- Author
-
Overmars, L. Malin, Mekke, Joost M., van Solinge, Wouter W., De Jager, Saskia C.A., Hulsbergen-Veelken, Cornelia A.R., Hoefer, Imo E., de Kleijn, Dominique P.V., de Borst, Gert J., van der Laan, Sander W., and Haitjema, Saskia
- Published
- 2023
- Full Text
- View/download PDF
3. A cluster of blood-based protein biomarkers associated with decreased cerebral blood flow relates to future cardiovascular events in patients with cardiovascular disease
- Author
-
Overmars, L Malin, Kuipers, Sanne, van Es, Bram, Bresser, Jeroen de, Bron, Esther E, Hoefer, Imo E, Van Solinge, Wouter W, Kappelle, L Jaap, van Osch, Matthias JP, Teunissen, Charlotte E, Biessels, Geert Jan, and Haitjema, Saskia
- Published
- 2023
- Full Text
- View/download PDF
4. Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon
- Author
-
Messelink, Marianne A., Roodenrijs, Nadia M. T., van Es, Bram, Hulsbergen-Veelken, Cornelia A. R., Jong, Sebastiaan, Overmars, L. Malin, Reteig, Leon C., Tan, Sander C., Tauber, Tjebbe, van Laar, Jacob M., Welsing, Paco M. J., and Haitjema, Saskia
- Published
- 2021
- Full Text
- View/download PDF
5. A CLUSTER OF BLOOD-BASED BIOMARKERS REFLECTING COAGULATION RELATES TO THE BURDEN OF CEREBRAL SMALL VESSEL DISEASE
- Author
-
Kuipers, Sanne, Overmars, L Malin, van Es, Bram, de Bresser, Jeroen, Bron, Esther, Hoefer, Imo, Kappelle, L Jaap, Teunissen, Charlotte, Biessels, Geert Jan, and Haitjema, Saskia
- Published
- 2024
- Full Text
- View/download PDF
6. A CLUSTER OF BLOOD-BASED BIOMARKERS REFLECTING EXTRACELLULAR MATRIX ORGANIZATION, INFLAMMATION AND SIGNAL TRANSDUCTION RELATES TO CEREBRAL BLOOD FLOW IN PATIENTS WITH CARDIOVASCULAR DISEASE
- Author
-
Overmars, L Malin, Kuipers, Sanne, van Es, Bram, Bron, Esther, Hoefer, Imo, Kappelle, L Jaap, van Osch, Matthias, Teunissen, Charlotte, Haitjema, Saskia, and Biessels, Geert Jan
- Published
- 2024
- Full Text
- View/download PDF
7. Sexual dimorphism in peripheral blood cell characteristics linked to recanalization success of endovascular thrombectomy in acute ischemic stroke
- Author
-
CDL Upod, Centraal Diagnostisch Laboratorium, Onderzoek Vrouw Hart & Vaatziekten, Circulatory Health, Neurologen, Brain, Stroke, CDL Cluster Speciële Diagnostiek, Overmars, L Malin, van Solinge, Wouter W, Ruijter, Hester M den, van der Worp, H Bart, Van Es, Bram, Hulsbergen-Veelken, Cornelia A R, Biessels, Geert Jan, Exalto, Lieza G, Haitjema, Saskia, CDL Upod, Centraal Diagnostisch Laboratorium, Onderzoek Vrouw Hart & Vaatziekten, Circulatory Health, Neurologen, Brain, Stroke, CDL Cluster Speciële Diagnostiek, Overmars, L Malin, van Solinge, Wouter W, Ruijter, Hester M den, van der Worp, H Bart, Van Es, Bram, Hulsbergen-Veelken, Cornelia A R, Biessels, Geert Jan, Exalto, Lieza G, and Haitjema, Saskia
- Published
- 2023
8. Characteristics of peripheral blood cells are independently related to major adverse cardiovascular events after carotid endarterectomy
- Author
-
CDL Upod, Zorgeenheid Vaatchirurgie Medisch, Centraal Diagnostisch Laboratorium, Experimental Cardiology Laboratory, Experimentele Afd. Cardiologie 2, Circulatory Health, Infection & Immunity, Regenerative Medicine and Stem Cells, CDL Onderzoek Pasterkamp, CDL Cluster Speciële Diagnostiek, Overmars, L. Malin, Mekke, Joost M., van Solinge, Wouter W., De Jager, Saskia C.A., Hulsbergen-Veelken, Cornelia A.R., Hoefer, Imo E., de Kleijn, Dominique P.V., de Borst, Gert J., van der Laan, Sander W., Haitjema, Saskia, CDL Upod, Zorgeenheid Vaatchirurgie Medisch, Centraal Diagnostisch Laboratorium, Experimental Cardiology Laboratory, Experimentele Afd. Cardiologie 2, Circulatory Health, Infection & Immunity, Regenerative Medicine and Stem Cells, CDL Onderzoek Pasterkamp, CDL Cluster Speciële Diagnostiek, Overmars, L. Malin, Mekke, Joost M., van Solinge, Wouter W., De Jager, Saskia C.A., Hulsbergen-Veelken, Cornelia A.R., Hoefer, Imo E., de Kleijn, Dominique P.V., de Borst, Gert J., van der Laan, Sander W., and Haitjema, Saskia
- Published
- 2023
9. A Wolf in Sheep’s Clothing: Reuse of Routinely Obtained Laboratory Data in Research
- Author
-
Overmars, L Malin, primary, Niemantsverdriet, Michael S A, additional, Groenhof, T Katrien J, additional, De Groot, Mark C H, additional, Hulsbergen-Veelken, Cornelia A R, additional, Van Solinge, Wouter W, additional, Musson, Ruben E A, additional, Ten Berg, Maarten J, additional, Hoefer, Imo E, additional, and Haitjema, Saskia, additional
- Published
- 2022
- Full Text
- View/download PDF
10. A Wolf in Sheep's Clothing: Reuse of Routinely Obtained Laboratory Data in Research
- Author
-
CDL Upod, Cardiovasculaire Epi Team 5, Other research (not in main researchprogram), Centraal Diagnostisch Laboratorium, CDL Cluster 24/7 centraal laboratorium, Circulatory Health, CDL Cluster Front office & ext. business, Infection & Immunity, CDL Cluster Speciële Diagnostiek, Overmars, L Malin, Niemantsverdriet, Michael S A, Groenhof, T Katrien J, De Groot, Mark C H, Hulsbergen-Veelken, Cornelia A R, Van Solinge, Wouter W, Musson, Ruben E A, Ten Berg, Maarten J, Hoefer, Imo E, Haitjema, Saskia, CDL Upod, Cardiovasculaire Epi Team 5, Other research (not in main researchprogram), Centraal Diagnostisch Laboratorium, CDL Cluster 24/7 centraal laboratorium, Circulatory Health, CDL Cluster Front office & ext. business, Infection & Immunity, CDL Cluster Speciële Diagnostiek, Overmars, L Malin, Niemantsverdriet, Michael S A, Groenhof, T Katrien J, De Groot, Mark C H, Hulsbergen-Veelken, Cornelia A R, Van Solinge, Wouter W, Musson, Ruben E A, Ten Berg, Maarten J, Hoefer, Imo E, and Haitjema, Saskia
- Published
- 2022
11. A cluster of blood-based protein biomarkers reflecting coagulation relates to the burden of cerebral small vessel disease
- Author
-
Opleiding Neurologie, CDL Upod, Arts-assistenten Radiologie, Centraal Diagnostisch Laboratorium, Circulatory Health, CDL Arcadia, Neurologen, Brain, CDL Cluster Speciële Diagnostiek, Kuipers, Sanne, Overmars, L Malin, van Es, Bram, de Bresser, Jeroen, Bron, Esther E, Hoefer, Imo E, Kappelle, L Jaap, Teunissen, Charlotte E, Biessels, Geert Jan, Haitjema, Saskia, Opleiding Neurologie, CDL Upod, Arts-assistenten Radiologie, Centraal Diagnostisch Laboratorium, Circulatory Health, CDL Arcadia, Neurologen, Brain, CDL Cluster Speciële Diagnostiek, Kuipers, Sanne, Overmars, L Malin, van Es, Bram, de Bresser, Jeroen, Bron, Esther E, Hoefer, Imo E, Kappelle, L Jaap, Teunissen, Charlotte E, Biessels, Geert Jan, and Haitjema, Saskia
- Published
- 2022
12. A wolf in sheep’s clothing: The re-use of routinely obtained laboratory data in research (Preprint)
- Author
-
Overmars, L. Malin, primary, Niemantsverdriet, Michael S. A., additional, Groenhof, T. Katrien J., additional, De Groot, Mark C. H., additional, Hulsbergen-Veelken, Cornelia A. R., additional, Van Solinge, Wouter W., additional, Musson, Ruben E. A., additional, Ten Berg, Maarten J., additional, Hoefer, Imo E., additional, and Haitjema, Saskia, additional
- Published
- 2022
- Full Text
- View/download PDF
13. sj-pdf-1-jcb-10.1177_0271678X221077339 - Supplemental material for A cluster of blood-based protein biomarkers reflecting coagulation relates to the burden of cerebral small vessel disease
- Author
-
Kuipers, Sanne, Overmars, L Malin, van Es, Bram, de Bresser, Jeroen, Bron, Esther E, Hoefer, Imo E, Kappelle, L Jaap, Teunissen, Charlotte E, Biessels, Geert Jan, and Haitjema, Saskia
- Subjects
110320 Radiology and Organ Imaging ,FOS: Clinical medicine ,FOS: Biological sciences ,Medicine ,Cell Biology ,110305 Emergency Medicine ,110306 Endocrinology ,Biochemistry ,69999 Biological Sciences not elsewhere classified ,110904 Neurology and Neuromuscular Diseases ,Neuroscience - Abstract
Supplemental material, sj-pdf-1-jcb-10.1177_0271678X221077339 for A cluster of blood-based protein biomarkers reflecting coagulation relates to the burden of cerebral small vessel disease by Sanne Kuipers, L Malin Overmars, Bram van Es, Jeroen de Bresser, Esther E Bron, Imo E Hoefer, L Jaap Kappelle, Charlotte E Teunissen, Geert Jan Biessels and Saskia Haitjema in Journal of Cerebral Blood Flow & Metabolism
- Published
- 2022
- Full Text
- View/download PDF
14. A cluster of blood-based protein biomarkers reflecting coagulation relates to the burden of cerebral small vessel disease
- Author
-
Kuipers, Sanne, primary, Overmars, L Malin, additional, van Es, Bram, additional, de Bresser, Jeroen, additional, Bron, Esther E, additional, Hoefer, Imo E, additional, Kappelle, L Jaap, additional, Teunissen, Charlotte E, additional, Biessels, Geert Jan, additional, and Haitjema, Saskia, additional
- Published
- 2022
- Full Text
- View/download PDF
15. Preventing unnecessary imaging in patients suspect of coronary artery disease through machine learning of electronic health records
- Author
-
Overmars, L Malin, primary, van Es, Bram, additional, Groepenhoff, Floor, additional, De Groot, Mark C H, additional, Pasterkamp, Gerard, additional, den Ruijter, Hester M, additional, van Solinge, Wouter W, additional, Hoefer, Imo E, additional, and Haitjema, Saskia, additional
- Published
- 2021
- Full Text
- View/download PDF
16. Additional file 1 of Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon
- Author
-
Messelink, Marianne A., Roodenrijs, Nadia M. T., van Es, Bram, Hulsbergen-Veelken, Cornelia A. R., Jong, Sebastiaan, Overmars, L. Malin, Reteig, Leon C., Tan, Sander C., Tauber, Tjebbe, van Laar, Jacob M., Welsing, Paco M. J., and Haitjema, Saskia
- Abstract
Additional file 1: Supplemental table 1. EULAR definition of D2T R A[8]. Supplemental table 2. Selected medication and ATC codes for extraction from the Utrecht Patient Oriented Database (UPOD). Supplemental table 3. Selected laboratory measurements for extraction from the Utrecht Patient Oriented Database (UPOD). Supplemental table 4. Patient characteristics of clinically classified D2T and non-D2T patients. Supplemental table 5. Most important features of machine learning model to predict the DAS28-ESR*. Supplemental table 6. Most important features of the machine learning model to predict the development of D2T RA before the start of the first b/tsDMARD.
- Published
- 2021
- Full Text
- View/download PDF
17. Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon
- Author
-
Lab Reumatologie/Klinische Immunologie, Unit Opleiding Aios, Infection & Immunity, CDL Upod, CTI Otten, MS Reumatologie/Immunologie/Infectie, Regenerative Medicine and Stem Cells, JC onderzoeksprogramma Methodology, CDL Arcadia, Circulatory Health, Messelink, Marianne A., Roodenrijs, Nadia M.T., van Es, Bram, Hulsbergen-Veelken, Cornelia A.R., Jong, Sebastiaan, Overmars, L. Malin, Reteig, Leon C., Tan, Sander C., Tauber, Tjebbe, van Laar, Jacob M., Welsing, Paco M.J., Haitjema, Saskia, Lab Reumatologie/Klinische Immunologie, Unit Opleiding Aios, Infection & Immunity, CDL Upod, CTI Otten, MS Reumatologie/Immunologie/Infectie, Regenerative Medicine and Stem Cells, JC onderzoeksprogramma Methodology, CDL Arcadia, Circulatory Health, Messelink, Marianne A., Roodenrijs, Nadia M.T., van Es, Bram, Hulsbergen-Veelken, Cornelia A.R., Jong, Sebastiaan, Overmars, L. Malin, Reteig, Leon C., Tan, Sander C., Tauber, Tjebbe, van Laar, Jacob M., Welsing, Paco M.J., and Haitjema, Saskia
- Published
- 2021
18. Cardiovascular imaging of women and men visiting the outpatient clinic with chest pain or discomfort: design and rationale of the ARGUS Study
- Author
-
CDL Onderzoek Pasterkamp, Onderzoek Vrouw Hart & Vaatziekten, Circulatory Health, Experimentele Afd. Cardiologie 1, CDL Upod, Cardiovasculaire Epi Team 3, JC onderzoeksprogramma Cardiovasculaire Epidemiologie, CDL Arcadia, Researchgr. Cardiovasculaire Radiologie, Groepenhoff, Floor, Eikendal, Anouk L M, Bots, Sophie Heleen, van Ommen, Anne-Mar, Overmars, L M, Kapteijn, Daniek, Pasterkamp, Gerard, Reiber, Johan H C, Hautemann, David, Menken, Roxana, Wittekoek, Marianne E, Hofstra, Leonard, Onland-Moret, N Charlotte, Haitjema, Saskia, Hoefer, Imo, Leiner, Tim, den Ruijter, Hester M, CDL Onderzoek Pasterkamp, Onderzoek Vrouw Hart & Vaatziekten, Circulatory Health, Experimentele Afd. Cardiologie 1, CDL Upod, Cardiovasculaire Epi Team 3, JC onderzoeksprogramma Cardiovasculaire Epidemiologie, CDL Arcadia, Researchgr. Cardiovasculaire Radiologie, Groepenhoff, Floor, Eikendal, Anouk L M, Bots, Sophie Heleen, van Ommen, Anne-Mar, Overmars, L M, Kapteijn, Daniek, Pasterkamp, Gerard, Reiber, Johan H C, Hautemann, David, Menken, Roxana, Wittekoek, Marianne E, Hofstra, Leonard, Onland-Moret, N Charlotte, Haitjema, Saskia, Hoefer, Imo, Leiner, Tim, and den Ruijter, Hester M
- Published
- 2020
19. Cardiovascular imaging of women and men visiting the outpatient clinic with chest pain or discomfort: design and rationale of the ARGUS Study
- Author
-
Groepenhoff, Floor, primary, Eikendal, Anouk L M, additional, Bots, Sophie Heleen, additional, van Ommen, Anne-Mar, additional, Overmars, L M, additional, Kapteijn, Daniek, additional, Pasterkamp, Gerard, additional, Reiber, Johan H C, additional, Hautemann, David, additional, Menken, Roxana, additional, Wittekoek, Marianne E, additional, Hofstra, Leonard, additional, Onland-Moret, N Charlotte, additional, Haitjema, Saskia, additional, Hoefer, Imo, additional, Leiner, Tim, additional, and den Ruijter, Hester M, additional
- Published
- 2020
- Full Text
- View/download PDF
20. Draft genome sequence of Dethiobacter alkaliphilus strain AHT1T, a gram-positive sulfidogenic polyextremophile
- Author
-
Melton, ED, Sorokin, DY, Overmars, L, Lapidus, AL, Pillay, M, Ivanova, N, del Rio, TG, Kyrpides, NC, Woyke, T, and Muyzer, G
- Abstract
© 2017 The Author(s). Dethiobacter alkaliphilus strain AHT1Tis an anaerobic, sulfidogenic, moderately salt-tolerant alkaliphilic chemolithotroph isolated from hypersaline soda lake sediments in northeastern Mongolia. It is a Gram-positive bacterium with low GC content, within the phylum Firmicutes. Here we report its draft genome sequence, which consists of 34 contigs with a total sequence length of 3.12 Mbp. D. alkaliphilus strain AHT1Twas sequenced by the Joint Genome Institute (JGI) as part of the Community Science Program due to its relevance to bioremediation and biotechnological applications.
- Published
- 2017
- Full Text
- View/download PDF
21. The plasticity of prokaryotic regulatory elements
- Author
-
Overmars, L., Siezen, R.J., Francke, C., Hijum, S.A.F.T. van, and Radboud University Nijmegen
- Subjects
Radboud Institute for Molecular Life Sciences ,Nanomedicine [Radboudumc 19] ,GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries) ,Nanomedicine Radboud Institute for Molecular Life Sciences [Radboudumc 19] - Abstract
Contains fulltext : 169268.pdf (Publisher’s version ) (Open Access) Radboud University, 31 mei 2017 Promotor : Siezen, R.J. Co-promotores : Francke, C., Hijum, S.A.F.T. van
- Published
- 2017
22. The plasticity of prokaryotic regulatory elements
- Author
-
Siezen, R.J., Francke, C., Hijum, S.A.F.T. van, Overmars, L., Siezen, R.J., Francke, C., Hijum, S.A.F.T. van, and Overmars, L.
- Abstract
Radboud University, 31 mei 2017, Promotor : Siezen, R.J. Co-promotores : Francke, C., Hijum, S.A.F.T. van, Contains fulltext : 169268.pdf (publisher's version ) (Open Access)
- Published
- 2017
23. A Novel Quality Measure and Correction Procedure for the Annotation of Microbial Translation Initiation Sites
- Author
-
Overmars, L., Siezen, R.J., Francke, C., Overmars, L., Siezen, R.J., and Francke, C.
- Abstract
Contains fulltext : 154712.PDF (publisher's version ) (Open Access), The identification of translation initiation sites (TISs) constitutes an important aspect of sequence-based genome analysis. An erroneous TIS annotation can impair the identification of regulatory elements and N-terminal signal peptides, and also may flaw the determination of descent, for any particular gene. We have formulated a reference-free method to score the TIS annotation quality. The method is based on a comparison of the observed and expected distribution of all TISs in a particular genome given prior gene-calling. We have assessed the TIS annotations for all available NCBI RefSeq microbial genomes and found that approximately 87% is of appropriate quality, whereas 13% needs substantial improvement. We have analyzed a number of factors that could affect TIS annotation quality such as GC-content, taxonomy, the fraction of genes with a Shine-Dalgarno sequence and the year of publication. The analysis showed that only the first factor has a clear effect. We have then formulated a straightforward Principle Component Analysis-based TIS identification strategy to self-organize and score potential TISs. The strategy is independent of reference data and a priori calculations. A representative set of 277 genomes was subjected to the analysis and we found a clear increase in TIS annotation quality for the genomes with a low quality score. The PCA-based annotation was also compared with annotation with the current tool of reference, Prodigal. The comparison for the model genome of Escherichia coli K12 showed that both methods supplement each other and that prediction agreement can be used as an indicator of a correct TIS annotation. Importantly, the data suggest that the addition of a PCA-based strategy to a Prodigal prediction can be used to 'flag' TIS annotations for re-evaluation and in addition can be used to evaluate a given annotation in case a Prodigal annotation is lacking.
- Published
- 2015
24. CiVi: circular genome visualization with unique features to analyze sequence elements
- Author
-
Overmars, L., Hijum, S.A. van, Siezen, R.J., Francke, C., Overmars, L., Hijum, S.A. van, Siezen, R.J., and Francke, C.
- Abstract
Item does not contain fulltext, We have developed CiVi, a user-friendly web-based tool to create custom circular maps to aid the analysis of microbial genomes and sequence elements. Sequence related data such as gene-name, COG class, PFAM domain, GC%, and subcellular location can be comprehensively viewed. Quantitative gene-related data (e.g. expression ratios or read counts) as well as predicted sequence elements (e.g. regulatory sequences) can be uploaded and visualized. CiVi accommodates the analysis of genomic elements by allowing a visual interpretation in the context of: (i) their genome-wide distribution, (ii) provided experimental data and (iii) the local orientation and location with respect to neighboring genes. CiVi thus enables both experts and non-experts to conveniently integrate public genome data with the results of genome analyses in circular genome maps suitable for publication. CONTACT: L.Overmars@gmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. AVAILABILITY AND IMPLEMENTATION: CiVi is freely available at http://civi.cmbi.ru.nl.
- Published
- 2015
25. Reconstruction of the regulatory network of Lactobacillus plantarum WCFS1 on basis of correlated gene expression and conserved regulatory motifs
- Author
-
Wels, M., Overmars, L., Francke, C., Kleerebezem, M., and Siezen, R.J.
- Subjects
Chemical and physical biology [NCMLS 7] ,comparative genomics ,Microbiology ,Microbiologie ,regulon ,escherichia-coli ,identification ,ctsr ,transcriptional regulation ,Host-Microbe Interactomics ,bacteria ,bacillus-subtilis ,negative regulator ,discovery ,VLAG - Abstract
Contains fulltext : 119522.pdf (Publisher’s version ) (Open Access) Gene regulatory networks can be reconstructed by combining transcriptome data from many different experiments to elucidate relations between the activity of certain transcription factors and the genes they control. To obtain insight in the regulatory network of Lactobacillus plantarum, microarray transcriptome data from more than 70 different experimental conditions were combined and the expression profiles of the transcriptional units (TUs) were compared. The TUs that displayed correlated expression were used to identify putative cis-regulatory elements by searching the upstream regions of the TUs for conserved motifs. Predicted motifs were extended and refined by searching for motifs in the upstream regions of additional TUs with correlated expression. In this way, cis-acting elements were identified for 41 regulons consisting of at least four TUs (correlation > 0.7). This set of regulons included the known regulons of CtsR and LexA, but also several novel ones encompassing genes with coherent biological functions. Visualization of the regulons and their connections revealed a highly interconnected regulatory network. This network contains several subnetworks that encompass genes of correlated biological function, such as sugar and energy metabolism, nitrogen metabolism and stress response. 01 mei 2011
- Published
- 2011
26. MGcV: the microbial genomic context viewer for comparative genome analysis
- Author
-
Overmars, L., Kerkhoven, R., Siezen, R.J., Francke, C., Overmars, L., Kerkhoven, R., Siezen, R.J., and Francke, C.
- Abstract
Contains fulltext : 117085.pdf (publisher's version ) (Open Access), BACKGROUND: Conserved gene context is used in many types of comparative genome analyses. It is used to provide leads on gene function, to guide the discovery of regulatory sequences, but also to aid in the reconstruction of metabolic networks. We present the Microbial Genomic context Viewer (MGcV), an interactive, web-based application tailored to strengthen the practice of manual comparative genome context analysis for bacteria. RESULTS: MGcV is a versatile, easy-to-use tool that renders a visualization of the genomic context of any set of selected genes, genes within a phylogenetic tree, genomic segments, or regulatory elements. It is tailored to facilitate laborious tasks such as the interactive annotation of gene function, the discovery of regulatory elements, or the sequence-based reconstruction of gene regulatory networks. We illustrate that MGcV can be used in gene function annotation by visually integrating information on prokaryotic genes, like their annotation as available from NCBI with other annotation data such as Pfam domains, sub-cellular location predictions and gene-sequence characteristics such as GC content. We also illustrate the usefulness of the interactive features that allow the graphical selection of genes to facilitate data gathering (e.g. upstream regions, ID's or annotation), in the analysis and reconstruction of transcription regulation. Moreover, putative regulatory elements and their corresponding scores or data from RNA-seq and microarray experiments can be uploaded, visualized and interpreted in (ranked-) comparative context maps. The ranked maps allow the interpretation of predicted regulatory elements and experimental data in light of each other. CONCLUSION: MGcV advances the manual comparative analysis of genes and regulatory elements by providing fast and flexible integration of gene related data combined with straightforward data retrieval. MGcV is available at http://mgcv.cmbi.ru.nl.
- Published
- 2013
27. Classification of the adenylation and acyl-transferase activity of NRPS and PKS systems using ensembles of substrate specific hidden Markov models
- Author
-
Khayatt, B., Overmars, L., Siezen, R.J., Francke, C., Khayatt, B., Overmars, L., Siezen, R.J., and Francke, C.
- Abstract
Contains fulltext : 118146.pdf (publisher's version ) (Open Access), There is a growing interest in the Non-ribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs) of microbes, fungi and plants because they can produce bioactive peptides such as antibiotics. The ability to identify the substrate specificity of the enzyme's adenylation (A) and acyl-transferase (AT) domains is essential to rationally deduce or engineer new products. We here report on a Hidden Markov Model (HMM)-based ensemble method to predict the substrate specificity at high quality. We collected a new reference set of experimentally validated sequences. An initial classification based on alignment and Neighbor Joining was performed in line with most of the previously published prediction methods. We then created and tested single substrate specific HMMs and found that their use improved the correct identification significantly for A as well as for AT domains. A major advantage of the use of HMMs is that it abolishes the dependency on multiple sequence alignment and residue selection that is hampering the alignment-based clustering methods. Using our models we obtained a high prediction quality for the substrate specificity of the A domains similar to two recently published tools that make use of HMMs or Support Vector Machines (NRPSsp and NRPS predictor2, respectively). Moreover, replacement of the single substrate specific HMMs by ensembles of models caused a clear increase in prediction quality. We argue that the superiority of the ensemble over the single model is caused by the way substrate specificity evolves for the studied systems. It is likely that this also holds true for other protein domains. The ensemble predictor has been implemented in a simple web-based tool that is available at http://www.cmbi.ru.nl/NRPS-PKS-substrate-predictor/.
- Published
- 2013
28. Reduce manual curation by combining gene predictions from multiple annotation engines, a case study of start codon prediction
- Author
-
Ederveen, T.H.A., Overmars, L., Hijum, S.A.F.T. van, Ederveen, T.H.A., Overmars, L., and Hijum, S.A.F.T. van
- Abstract
Contains fulltext : 117193.pdf (publisher's version ) (Open Access), Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotations. Automated genome annotation engines provide users a straight-forward and complete solution for predicting ORF coordinates and function. For many labs, the use of AGEs is therefore essential to decrease the time necessary for annotating a given prokaryotic genome. However, it is not uncommon for AGEs to provide different and sometimes conflicting predictions. Combining multiple AGEs might allow for more accurate predictions. Here we analyzed the ab initio open reading frame (ORF) calling performance of different AGEs based on curated genome annotations of eight strains from different bacterial species with GC% ranging from 35-52%. We present a case study which demonstrates a novel way of comparative genome annotation, using combinations of AGEs in a pre-defined order (or path) to predict ORF start codons. The order of AGE combinations is from high to low specificity, where the specificity is based on the eight genome annotations. For each AGE combination we are able to derive a so-called projected confidence value, which is the average specificity of ORF start codon prediction based on the eight genomes. The projected confidence enables estimating likeliness of a correct prediction for a particular ORF start codon by a particular AGE combination, pinpointing ORFs notoriously difficult to predict start codons. We correctly predict start codons for 90.5+/-4.8% of the genes in a genome (based on the eight genomes) with an accuracy of 81.1+/-7.6%. Our consensus-path methodology allows a marked improvement over majority voting (9.7+/-4.4%) and with an optimal path ORF start prediction sensitivity is gained while maintaining a high specificity.
- Published
- 2013
29. Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?
- Author
-
Touw, W.G., Bayjanov, J., Overmars, L., Backus, L., Boekhorst, J., Wels, M.W., Hijum, S.A.F.T. van, Touw, W.G., Bayjanov, J., Overmars, L., Backus, L., Boekhorst, J., Wels, M.W., and Hijum, S.A.F.T. van
- Abstract
Item does not contain fulltext, In the Life Sciences 'omics' data is increasingly generated by different high-throughput technologies. Often only the integration of these data allows uncovering biological insights that can be experimentally validated or mechanistically modelled, i.e. sophisticated computational approaches are required to extract the complex non-linear trends present in omics data. Classification techniques allow training a model based on variables (e.g. SNPs in genetic association studies) to separate different classes (e.g. healthy subjects versus patients). Random Forest (RF) is a versatile classification algorithm suited for the analysis of these large data sets. In the Life Sciences, RF is popular because RF classification models have a high-prediction accuracy and provide information on importance of variables for classification. For omics data, variables or conditional relations between variables are typically important for a subset of samples of the same class. For example: within a class of cancer patients certain SNP combinations may be important for a subset of patients that have a specific subtype of cancer, but not important for a different subset of patients. These conditional relationships can in principle be uncovered from the data with RF as these are implicitly taken into account by the algorithm during the creation of the classification model. This review details some of the to the best of our knowledge rarely or never used RF properties that allow maximizing the biological insights that can be extracted from complex omics data sets using RF.
- Published
- 2013
30. Comparative genome analysis of central nitrogen metabolism and its control by GlnR in the class Bacilli.
- Author
-
Groot Kormelink, T., Koenders, E., Hagemeijer, Y., Overmars, L., Siezen, R.J., Vos, W.M. de, Francke, C., Groot Kormelink, T., Koenders, E., Hagemeijer, Y., Overmars, L., Siezen, R.J., Vos, W.M. de, and Francke, C.
- Abstract
Contains fulltext : 110622.pdf (publisher's version ) (Open Access), BACKGROUND: The assimilation of nitrogen in bacteria is achieved through only a few metabolic conversions between alpha-ketoglutarate, glutamate and glutamine. The enzymes that catalyze these conversions are glutamine synthetase, glutaminase, glutamate dehydrogenase and glutamine alpha-ketoglutarate aminotransferase. In low-GC Gram-positive bacteria the transcriptional control over the levels of the related enzymes is mediated by four regulators: GlnR, TnrA, GltC and CodY. We have analyzed the genomes of all species belonging to the taxonomic families Bacillaceae, Listeriaceae, Staphylococcaceae, Lactobacillaceae, Leuconostocaceae and Streptococcaceae to determine the diversity in central nitrogen metabolism and reconstructed the regulation by GlnR. RESULTS: Although we observed a substantial difference in the extent of central nitrogen metabolism in the various species, the basic GlnR regulon was remarkably constant and appeared not affected by the presence or absence of the other three main regulators. We found a conserved regulatory association of GlnR with glutamine synthetase (glnRA operon), and the transport of ammonium (amtB-glnK) and glutamine/glutamate (i.e. via glnQHMP, glnPHQ, gltT, alsT). In addition less-conserved associations were found with, for instance, glutamate dehydrogenase in Streptococcaceae, purine catabolism and the reduction of nitrite in Bacillaceae, and aspartate/asparagine deamination in Lactobacillaceae. CONCLUSIONS: Our analyses imply GlnR-mediated regulation in constraining the import of ammonia/amino-containing compounds and the production of intracellular ammonia under conditions of high nitrogen availability. Such a role fits with the intrinsic need for tight control of ammonia levels to limit futile cycling.
- Published
- 2012
31. Transcriptomes Reveal Genetic Signatures Underlying Physiological Variations Imposed by Different Fermentation Conditions in Lactobacillus plantarum.
- Author
-
Bron, P.A., Wels, M.W., Bongers, R.S., Bokhorst-van de Veen, H. van, Wiersma, A., Overmars, L., Marco, M.L., Kleerebezem, M., Bron, P.A., Wels, M.W., Bongers, R.S., Bokhorst-van de Veen, H. van, Wiersma, A., Overmars, L., Marco, M.L., and Kleerebezem, M.
- Abstract
Contains fulltext : 110027.pdf (publisher's version ) (Open Access), Lactic acid bacteria (LAB) are utilized widely for the fermentation of foods. In the current post-genomic era, tools have been developed that explore genetic diversity among LAB strains aiming to link these variations to differential phenotypes observed in the strains investigated. However, these genotype-phenotype matching approaches fail to assess the role of conserved genes in the determination of physiological characteristics of cultures by environmental conditions. This manuscript describes a complementary approach in which Lactobacillus plantarum WCFS1 was fermented under a variety of conditions that differ in temperature, pH, as well as NaCl, amino acid, and O(2) levels. Samples derived from these fermentations were analyzed by full-genome transcriptomics, paralleled by the assessment of physiological characteristics, e.g., maximum growth rate, yield, and organic acid profiles. A data-storage and -mining suite designated FermDB was constructed and exploited to identify correlations between fermentation conditions and industrially relevant physiological characteristics of L. plantarum, as well as the associated transcriptome signatures. Finally, integration of the specific fermentation variables with the transcriptomes enabled the reconstruction of the gene-regulatory networks involved. The fermentation-genomics platform presented here is a valuable complementary approach to earlier described genotype-phenotype matching strategies which allows the identification of transcriptome signatures underlying physiological variations imposed by different fermentation conditions.
- Published
- 2012
32. Comparative genome analysis of central nitrogen metabolism and its control by GlnR in the class Bacilli
- Author
-
Groot Kormelink, T. (author), Koenders, E. (author), Hagemeijer, Y. (author), Overmars, L. (author), Siezen, R.J. (author), De Vos, W.M. (author), Francke, C. (author), Groot Kormelink, T. (author), Koenders, E. (author), Hagemeijer, Y. (author), Overmars, L. (author), Siezen, R.J. (author), De Vos, W.M. (author), and Francke, C. (author)
- Abstract
Background: The assimilation of nitrogen in bacteria is achieved through only a few metabolic conversions between alpha-ketoglutarate, glutamate and glutamine. The enzymes that catalyze these conversions are glutamine synthetase, glutaminase, glutamate dehydrogenase and glutamine alpha-ketoglutarate aminotransferase. In low-GC Gram-positive bacteria the transcriptional control over the levels of the related enzymes is mediated by four regulators: GlnR, TnrA, GltC and CodY. We have analyzed the genomes of all species belonging to the taxonomic families Bacillaceae, Listeriaceae, Staphylococcaceae, Lactobacillaceae, Leuconostocaceae and Streptococcaceae to determine the diversity in central nitrogen metabolism and reconstructed the regulation by GlnR. Results: Although we observed a substantial difference in the extent of central nitrogen metabolism in the various species, the basic GlnR regulon was remarkably constant and appeared not affected by the presence or absence of the other three main regulators. We found a conserved regulatory association of GlnR with glutamine synthetase (glnRA operon), and the transport of ammonium (amtB-glnK) and glutamine/glutamate (i.e. via glnQHMP, glnPHQ, gltT, alsT). In addition less-conserved associations were found with, for instance, glutamate dehydrogenase in Streptococcaceae, purine catabolism and the reduction of nitrite in Bacillaceae, and aspartate/asparagine deamination in Lactobacillaceae. Conclusions: Our analyses imply GlnR-mediated regulation in constraining the import of ammonia/amino-containing compounds and the production of intracellular ammonia under conditions of high nitrogen availability. Such a role fits with the intrinsic need for tight control of ammonia levels to limit futile cycling., Applied Sciences
- Published
- 2012
- Full Text
- View/download PDF
33. Comparative genome analysis of central nitrogen metabolism and its control by GlnR in the class Bacilli
- Author
-
Kormelink, T.G., Koenders, E., Hagemeijer, Y., Overmars, L., Siezen, R.J., de Vos, W.M., Francke, C., Kormelink, T.G., Koenders, E., Hagemeijer, Y., Overmars, L., Siezen, R.J., de Vos, W.M., and Francke, C.
- Abstract
BACKGROUND: The assimilation of nitrogen in bacteria is achieved through only a few metabolic conversions between alpha-ketoglutarate, glutamate and glutamine. The enzymes that catalyze these conversions are glutamine synthetase, glutaminase, glutamate dehydrogenase and glutamine alpha-ketoglutarate aminotransferase. In low-GC Gram-positive bacteria the transcriptional control over the levels of the related enzymes is mediated by four regulators: GlnR, TnrA, GltC and CodY. We have analyzed the genomes of all species belonging to the taxonomic families Bacillaceae, Listeriaceae, Staphylococcaceae, Lactobacillaceae, Leuconostocaceae and Streptococcaceae to determine the diversity in central nitrogen metabolism and reconstructed the regulation by GlnR. RESULTS: Although we observed a substantial difference in the extent of central nitrogen metabolism in the various species, the basic GlnR regulon was remarkably constant and appeared not affected by the presence or absence of the other three main regulators. We found a conserved regulatory association of GlnR with glutamine synthetase (glnRA operon), and the transport of ammonium (amtB-glnK) and glutamine/glutamate (i.e. via glnQHMP, glnPHQ, gltT, alsT). In addition less-conserved associations were found with, for instance, glutamate dehydrogenase in Streptococcaceae, purine catabolism and the reduction of nitrite in Bacillaceae, and aspartate/asparagine deamination in Lactobacillaceae. CONCLUSIONS: Our analyses imply GlnR-mediated regulation in constraining the import of ammonia/amino-containing compounds and the production of intracellular ammonia under conditions of high nitrogen availability. Such a role fits with the intrinsic need for tight control of ammonia levels to limit futile cycling
- Published
- 2012
34. Reconstruction of the regulatory network of Lactobacillus plantarum WCFS1 on basis of correlated gene expression and conserved regulatory motifs.
- Author
-
Wels, M.W., Overmars, L., Francke, C., Kleerebezem, M., Siezen, R.J., Wels, M.W., Overmars, L., Francke, C., Kleerebezem, M., and Siezen, R.J.
- Abstract
01 mei 2011, Contains fulltext : 119522.pdf (publisher's version ) (Open Access), Gene regulatory networks can be reconstructed by combining transcriptome data from many different experiments to elucidate relations between the activity of certain transcription factors and the genes they control. To obtain insight in the regulatory network of Lactobacillus plantarum, microarray transcriptome data from more than 70 different experimental conditions were combined and the expression profiles of the transcriptional units (TUs) were compared. The TUs that displayed correlated expression were used to identify putative cis-regulatory elements by searching the upstream regions of the TUs for conserved motifs. Predicted motifs were extended and refined by searching for motifs in the upstream regions of additional TUs with correlated expression. In this way, cis-acting elements were identified for 41 regulons consisting of at least four TUs (correlation > 0.7). This set of regulons included the known regulons of CtsR and LexA, but also several novel ones encompassing genes with coherent biological functions. Visualization of the regulons and their connections revealed a highly interconnected regulatory network. This network contains several subnetworks that encompass genes of correlated biological function, such as sugar and energy metabolism, nitrogen metabolism and stress response.
- Published
- 2011
35. Comparative analyses imply that the enigmatic Sigma factor 54 is a central controller of the bacterial exterior.
- Author
-
Francke, C., Groot Kormelink, T., Hagemeijer, Y., Overmars, L., Sluijter, V., Moezelaar, R., Siezen, R.J., Francke, C., Groot Kormelink, T., Hagemeijer, Y., Overmars, L., Sluijter, V., Moezelaar, R., and Siezen, R.J.
- Abstract
Contains fulltext : 95738.pdf (publisher's version ) (Open Access), BACKGROUND: Sigma-54 is a central regulator in many pathogenic bacteria and has been linked to a multitude of cellular processes like nitrogen assimilation and important functional traits such as motility, virulence, and biofilm formation. Until now it has remained obscure whether these phenomena and the control by Sigma-54 share an underlying theme. RESULTS: We have uncovered the commonality by performing a range of comparative genome analyses. A) The presence of Sigma-54 and its associated activators was determined for all sequenced prokaryotes. We observed a phylum-dependent distribution that is suggestive of an evolutionary relationship between Sigma-54 and lipopolysaccharide and flagellar biosynthesis. B) All Sigma-54 activators were identified and annotated. The relation with phosphotransfer-mediated signaling (TCS and PTS) and the transport and assimilation of carboxylates and nitrogen containing metabolites was substantiated. C) The function annotations, that were represented within the genomic context of all genes encoding Sigma-54, its activators and its promoters, were analyzed for intra-phylum representation and inter-phylum conservation. Promoters were localized using a straightforward scoring strategy that was formulated to identify similar motifs. We found clear highly-represented and conserved genetic associations with genes that concern the transport and biosynthesis of the metabolic intermediates of exopolysaccharides, flagella, lipids, lipopolysaccharides, lipoproteins and peptidoglycan. CONCLUSION: Our analyses directly implicate Sigma-54 as a central player in the control over the processes that involve the physical interaction of an organism with its environment like in the colonization of a host (virulence) or the formation of biofilm.
- Published
- 2011
36. Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?
- Author
-
Touw, W. G., primary, Bayjanov, J. R., additional, Overmars, L., additional, Backus, L., additional, Boekhorst, J., additional, Wels, M., additional, and van Hijum, S. A. F. T., additional
- Published
- 2012
- Full Text
- View/download PDF
37. Preventing unnecessary imaging in patients suspect of coronary artery disease through machine learning of electronic health records.
- Author
-
Overmars LM, van Es B, Groepenhoff F, De Groot MCH, Pasterkamp G, den Ruijter HM, van Solinge WW, Hoefer IE, and Haitjema S
- Abstract
Aims: With the ageing European population, the incidence of coronary artery disease (CAD) is expected to rise. This will likely result in an increased imaging use. Symptom recognition can be complicated, as symptoms caused by CAD can be atypical, particularly in women. Early CAD exclusion may help to optimize use of diagnostic resources and thus improve the sustainability of the healthcare system. To develop sex-stratified algorithms, trained on routinely available electronic health records (EHRs), raw electrocardiograms, and haematology data to exclude CAD in patients upfront., Methods and Results: We trained XGBoost algorithms on data from patients from the Utrecht Patient-Oriented Database, who underwent coronary computed tomography angiography (CCTA), and/or stress cardiac magnetic resonance (CMR) imaging, or stress single-photon emission computerized tomography (SPECT) in the UMC Utrecht. Outcomes were extracted from radiology reports. We aimed to maximize negative predictive value (NPV) to minimize the false negative risk with acceptable specificity. Of 6808 CCTA patients (31% female), 1029 females (48%) and 1908 males (45%) had no diagnosis of CAD. Of 3053 CMR/SPECT patients (45% female), 650 females (47%) and 881 males (48%) had no diagnosis of CAD. On the train and test set, the CCTA models achieved NPVs and specificities of 0.95 and 0.19 (females) and 0.96 and 0.09 (males). The CMR/SPECT models achieved NPVs and specificities of 0.75 and 0.041 (females) and 0.92 and 0.026 (males)., Conclusion: Coronary artery disease can be excluded from EHRs with high NPV. Our study demonstrates new possibilities to reduce unnecessary imaging in women and men suspected of CAD., (© The Author(s) 2021. Published by Oxford University Press on behalf of the European Society of Cardiology.)
- Published
- 2021
- Full Text
- View/download PDF
38. Diversity and Distribution of Sulfur Oxidation-Related Genes in Thioalkalivibrio , a Genus of Chemolithoautotrophic and Haloalkaliphilic Sulfur-Oxidizing Bacteria.
- Author
-
Berben T, Overmars L, Sorokin DY, and Muyzer G
- Abstract
Soda lakes are saline alkaline lakes characterized by high concentrations of sodium carbonate/bicarbonate which lead to a stable elevated pH (>9), and moderate to extremely high salinity. Despite this combination of extreme conditions, biodiversity in soda lakes is high, and the presence of diverse microbial communities provides a driving force for highly active biogeochemical cycles. The sulfur cycle is one of the most important of these and bacterial sulfur oxidation is dominated by members of the obligately chemolithoautotrophic genus Thioalkalivibrio . Currently, 10 species have been described in this genus, but over one hundred isolates have been obtained from soda lake samples. The genomes of 75 strains were sequenced and annotated previously, and used in this study to provide a comprehensive picture of the diversity and distribution of genes related to dissimilatory sulfur metabolism in Thioalkalivibrio . Initially, all annotated genes in 75 Thioalkalivibrio genomes were placed in ortholog groups and filtered by bi-directional best BLAST analysis. Investigation of the ortholog groups containing genes related to sulfur oxidation showed that flavocytochrome c (fcc) , the truncated sox system, and sulfite:quinone oxidoreductase ( soe ) are present in all strains, whereas dissimilatory sulfite reductase ( dsr ; which catalyzes the oxidation of elemental sulfur) was found in only six strains. The heterodisulfide reductase system ( hdr ), which is proposed to oxidize sulfur to sulfite in strains lacking both dsr and soxCD , was detected in 73 genomes. Hierarchical clustering of strains based on sulfur gene repertoire correlated closely with previous phylogenomic analysis. The phylogenetic analysis of several sulfur oxidation genes showed a complex evolutionary history. All in all, this study presents a comprehensive investigation of sulfur metabolism-related genes in cultivated Thioalkalivibrio strains and provides several avenues for future research.
- Published
- 2019
- Full Text
- View/download PDF
39. Draft genome sequence of Dethiobacter alkaliphilus strain AHT1 T , a gram-positive sulfidogenic polyextremophile.
- Author
-
Melton ED, Sorokin DY, Overmars L, Lapidus AL, Pillay M, Ivanova N, Del Rio TG, Kyrpides NC, Woyke T, and Muyzer G
- Abstract
Dethiobacter alkaliphilus strain AHT1
T is an anaerobic, sulfidogenic, moderately salt-tolerant alkaliphilic chemolithotroph isolated from hypersaline soda lake sediments in northeastern Mongolia. It is a Gram-positive bacterium with low GC content, within the phylum Firmicutes . Here we report its draft genome sequence, which consists of 34 contigs with a total sequence length of 3.12 Mbp. D. alkaliphilus strain AHT1T was sequenced by the Joint Genome Institute (JGI) as part of the Community Science Program due to its relevance to bioremediation and biotechnological applications.- Published
- 2017
- Full Text
- View/download PDF
40. Genomic diversity within the haloalkaliphilic genus Thioalkalivibrio.
- Author
-
Ahn AC, Meier-Kolthoff JP, Overmars L, Richter M, Woyke T, Sorokin DY, and Muyzer G
- Subjects
- Gammaproteobacteria classification, Gammaproteobacteria genetics, Genetic Variation, Genome, Bacterial, RNA, Bacterial genetics, RNA, Ribosomal, 16S genetics
- Abstract
Thioalkalivibrio is a genus of obligate chemolithoautotrophic haloalkaliphilic sulfur-oxidizing bacteria. Their habitat are soda lakes which are dual extreme environments with a pH range from 9.5 to 11 and salt concentrations up to saturation. More than 100 strains of this genus have been isolated from various soda lakes all over the world, but only ten species have been effectively described yet. Therefore, the assignment of the remaining strains to either existing or novel species is important and will further elucidate their genomic diversity as well as give a better general understanding of this genus. Recently, the genomes of 76 Thioalkalivibrio strains were sequenced. On these, we applied different methods including (i) 16S rRNA gene sequence analysis, (ii) Multilocus Sequence Analysis (MLSA) based on eight housekeeping genes, (iii) Average Nucleotide Identity based on BLAST (ANIb) and MUMmer (ANIm), (iv) Tetranucleotide frequency correlation coefficients (TETRA), (v) digital DNA:DNA hybridization (dDDH) as well as (vi) nucleotide- and amino acid-based Genome BLAST Distance Phylogeny (GBDP) analyses. We detected a high genomic diversity by revealing 15 new "genomic" species and 16 new "genomic" subspecies in addition to the ten already described species. Phylogenetic and phylogenomic analyses showed that the genus is not monophyletic, because four strains were clearly separated from the other Thioalkalivibrio by type strains from other genera. Therefore, it is recommended to classify the latter group as a novel genus. The biogeographic distribution of Thioalkalivibrio suggested that the different "genomic" species can be classified as candidate disjunct or candidate endemic species. This study is a detailed genome-based classification and identification of members within the genus Thioalkalivibrio. However, future phenotypical and chemotaxonomical studies will be needed for a full species description of this genus.
- Published
- 2017
- Full Text
- View/download PDF
41. Comparative Genome Analysis of Three Thiocyanate Oxidizing Thioalkalivibrio Species Isolated from Soda Lakes.
- Author
-
Berben T, Overmars L, Sorokin DY, and Muyzer G
- Abstract
Thiocyanate is a C1 compound containing carbon, nitrogen, and sulfur. It is a (by)product in a number of natural and industrial processes. Because thiocyanate is toxic to many organisms, including humans, its removal from industrial waste streams is an important problem. Although a number of bacteria can use thiocyanate as a nitrogen source, only a few can use it as an electron donor. There are two distinct pathways to use thiocyanate: (i) the "carbonyl sulfide pathway," which has been extensively studied, and (ii) the "cyanate pathway," whose key enzyme, thiocyanate dehydrogenase, was recently purified and studied. Three species of Thioalkalivibrio , a group of haloalkaliphilic sulfur-oxidizing bacteria isolated from soda lakes, have been described as thiocyanate oxidizers: (i) Thioalkalivibrio paradoxus ("cyanate pathway"), (ii) Thioalkalivibrio thiocyanoxidans ("cyanate pathway") and (iii) Thioalkalivibrio thiocyanodenitrificans ("carbonyl sulfide pathway"). In this study we provide a comparative genome analysis of these described thiocyanate oxidizers, with genomes ranging in size from 2.5 to 3.8 million base pairs. While focusing on thiocyanate degradation, we also analyzed the differences in sulfur, carbon, and nitrogen metabolism. We found that the thiocyanate dehydrogenase gene is present in 10 different Thioalkalivibrio strains, in two distinct genomic contexts/genotypes. The first genotype is defined by having genes for flavocytochrome c sulfide dehydrogenase upstream from the thiocyanate dehydrogenase operon (present in two strains including the type strain of Tv. paradoxus ), whereas in the second genotype these genes are located downstream, together with two additional genes of unknown function (present in eight strains, including the type strains of Tv. thiocyanoxidans ). Additionally, we found differences in the presence/absence of genes for various sulfur oxidation pathways, such as sulfide:quinone oxidoreductase, dissimilatory sulfite reductase, and sulfite dehydrogenase. One strain ( Tv. thiocyanodenitrificans ) lacks genes encoding a carbon concentrating mechanism and none of the investigated genomes were shown to contain known bicarbonate transporters. This study gives insight into the genomic variation of thiocyanate oxidizing bacteria and may lead to improvements in the application of these organisms in the bioremediation of industrial waste streams.
- Published
- 2017
- Full Text
- View/download PDF
42. Complete genome sequence of Desulfurivibrio alkaliphilus strain AHT2(T), a haloalkaliphilic sulfidogen from Egyptian hypersaline alkaline lakes.
- Author
-
Melton ED, Sorokin DY, Overmars L, Chertkov O, Clum A, Pillay M, Ivanova N, Shapiro N, Kyrpides NC, Woyke T, Lapidus AL, and Muyzer G
- Abstract
Desulfurivibrio alkaliphilus strain AHT2(T) is a strictly anaerobic sulfidogenic haloalkaliphile isolated from a composite sediment sample of eight hypersaline alkaline lakes in the Wadi al Natrun valley in the Egyptian Libyan Desert. D. alkaliphilus AHT2(T) is Gram-negative and belongs to the family Desulfobulbaceae within the Deltaproteobacteria. Here we report its genome sequence, which contains a 3.10 Mbp chromosome. D. alkaliphilus AHT2(T) is adapted to survive under highly alkaline and moderately saline conditions and therefore, is relevant to the biotechnology industry and life under extreme conditions. For these reasons, D. alkaliphilus AHT2(T) was sequenced by the DOE Joint Genome Institute as part of the Community Science Program.
- Published
- 2016
- Full Text
- View/download PDF
43. CiVi: circular genome visualization with unique features to analyze sequence elements.
- Author
-
Overmars L, van Hijum SA, Siezen RJ, and Francke C
- Subjects
- Databases, Genetic, Genomics methods, Information Storage and Retrieval, User-Computer Interface, Computational Biology methods, Computer Graphics, Genes, Bacterial genetics, Genome, Bacterial, Molecular Sequence Annotation methods, Regulatory Sequences, Nucleic Acid genetics, Software
- Abstract
Unlabelled: We have developed CiVi, a user-friendly web-based tool to create custom circular maps to aid the analysis of microbial genomes and sequence elements. Sequence related data such as gene-name, COG class, PFAM domain, GC%, and subcellular location can be comprehensively viewed. Quantitative gene-related data (e.g. expression ratios or read counts) as well as predicted sequence elements (e.g. regulatory sequences) can be uploaded and visualized. CiVi accommodates the analysis of genomic elements by allowing a visual interpretation in the context of: (i) their genome-wide distribution, (ii) provided experimental data and (iii) the local orientation and location with respect to neighboring genes. CiVi thus enables both experts and non-experts to conveniently integrate public genome data with the results of genome analyses in circular genome maps suitable for publication., Contact: L.Overmars@gmail.com, Supplementary Information: Supplementary data are available at Bioinformatics online., Availability and Implementation: CiVi is freely available at http://civi.cmbi.ru.nl., (© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.)
- Published
- 2015
- Full Text
- View/download PDF
44. A Novel Quality Measure and Correction Procedure for the Annotation of Microbial Translation Initiation Sites.
- Author
-
Overmars L, Siezen RJ, and Francke C
- Subjects
- Base Composition, Escherichia coli genetics, Open Reading Frames, RNA, Archaeal genetics, RNA, Bacterial genetics, RNA, Fungal genetics, RNA, Messenger genetics, Saccharomyces cerevisiae genetics, Archaea genetics, Bacteria genetics, Peptide Chain Initiation, Translational genetics, Principal Component Analysis, Regulatory Sequences, Ribonucleic Acid
- Abstract
The identification of translation initiation sites (TISs) constitutes an important aspect of sequence-based genome analysis. An erroneous TIS annotation can impair the identification of regulatory elements and N-terminal signal peptides, and also may flaw the determination of descent, for any particular gene. We have formulated a reference-free method to score the TIS annotation quality. The method is based on a comparison of the observed and expected distribution of all TISs in a particular genome given prior gene-calling. We have assessed the TIS annotations for all available NCBI RefSeq microbial genomes and found that approximately 87% is of appropriate quality, whereas 13% needs substantial improvement. We have analyzed a number of factors that could affect TIS annotation quality such as GC-content, taxonomy, the fraction of genes with a Shine-Dalgarno sequence and the year of publication. The analysis showed that only the first factor has a clear effect. We have then formulated a straightforward Principle Component Analysis-based TIS identification strategy to self-organize and score potential TISs. The strategy is independent of reference data and a priori calculations. A representative set of 277 genomes was subjected to the analysis and we found a clear increase in TIS annotation quality for the genomes with a low quality score. The PCA-based annotation was also compared with annotation with the current tool of reference, Prodigal. The comparison for the model genome of Escherichia coli K12 showed that both methods supplement each other and that prediction agreement can be used as an indicator of a correct TIS annotation. Importantly, the data suggest that the addition of a PCA-based strategy to a Prodigal prediction can be used to 'flag' TIS annotations for re-evaluation and in addition can be used to evaluate a given annotation in case a Prodigal annotation is lacking.
- Published
- 2015
- Full Text
- View/download PDF
45. Microbial diversity and biogeochemical cycling in soda lakes.
- Author
-
Sorokin DY, Berben T, Melton ED, Overmars L, Vavourakis CD, and Muyzer G
- Subjects
- Lakes chemistry, Methane chemistry, Methane metabolism, Nitrogen Cycle, Sulfur chemistry, Sulfur metabolism, Carbon Cycle, Lakes microbiology, Microbiota, Salt Tolerance
- Abstract
Soda lakes contain high concentrations of sodium carbonates resulting in a stable elevated pH, which provide a unique habitat to a rich diversity of haloalkaliphilic bacteria and archaea. Both cultivation-dependent and -independent methods have aided the identification of key processes and genes in the microbially mediated carbon, nitrogen, and sulfur biogeochemical cycles in soda lakes. In order to survive in this extreme environment, haloalkaliphiles have developed various bioenergetic and structural adaptations to maintain pH homeostasis and intracellular osmotic pressure. The cultivation of a handful of strains has led to the isolation of a number of extremozymes, which allow the cell to perform enzymatic reactions at these extreme conditions. These enzymes potentially contribute to biotechnological applications. In addition, microbial species active in the sulfur cycle can be used for sulfur remediation purposes. Future research should combine both innovative culture methods and state-of-the-art 'meta-omic' techniques to gain a comprehensive understanding of the microbes that flourish in these extreme environments and the processes they mediate. Coupling the biogeochemical C, N, and S cycles and identifying where each process takes place on a spatial and temporal scale could unravel the interspecies relationships and thereby reveal more about the ecosystem dynamics of these enigmatic extreme environments.
- Published
- 2014
- Full Text
- View/download PDF
46. Reduce manual curation by combining gene predictions from multiple annotation engines, a case study of start codon prediction.
- Author
-
Ederveen TH, Overmars L, and van Hijum SA
- Subjects
- Base Composition, Consensus Sequence, Genome, Bacterial, Genomics methods, Open Reading Frames, Reproducibility of Results, Codon, Initiator, Computational Biology methods, Molecular Sequence Annotation methods
- Abstract
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotations. Automated genome annotation engines provide users a straight-forward and complete solution for predicting ORF coordinates and function. For many labs, the use of AGEs is therefore essential to decrease the time necessary for annotating a given prokaryotic genome. However, it is not uncommon for AGEs to provide different and sometimes conflicting predictions. Combining multiple AGEs might allow for more accurate predictions. Here we analyzed the ab initio open reading frame (ORF) calling performance of different AGEs based on curated genome annotations of eight strains from different bacterial species with GC% ranging from 35-52%. We present a case study which demonstrates a novel way of comparative genome annotation, using combinations of AGEs in a pre-defined order (or path) to predict ORF start codons. The order of AGE combinations is from high to low specificity, where the specificity is based on the eight genome annotations. For each AGE combination we are able to derive a so-called projected confidence value, which is the average specificity of ORF start codon prediction based on the eight genomes. The projected confidence enables estimating likeliness of a correct prediction for a particular ORF start codon by a particular AGE combination, pinpointing ORFs notoriously difficult to predict start codons. We correctly predict start codons for 90.5±4.8% of the genes in a genome (based on the eight genomes) with an accuracy of 81.1±7.6%. Our consensus-path methodology allows a marked improvement over majority voting (9.7±4.4%) and with an optimal path ORF start prediction sensitivity is gained while maintaining a high specificity.
- Published
- 2013
- Full Text
- View/download PDF
47. Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?
- Author
-
Touw WG, Bayjanov JR, Overmars L, Backus L, Boekhorst J, Wels M, and van Hijum SA
- Subjects
- Humans, Neoplasms genetics, Polymorphism, Single Nucleotide, Algorithms, Biological Science Disciplines, Data Mining
- Abstract
In the Life Sciences 'omics' data is increasingly generated by different high-throughput technologies. Often only the integration of these data allows uncovering biological insights that can be experimentally validated or mechanistically modelled, i.e. sophisticated computational approaches are required to extract the complex non-linear trends present in omics data. Classification techniques allow training a model based on variables (e.g. SNPs in genetic association studies) to separate different classes (e.g. healthy subjects versus patients). Random Forest (RF) is a versatile classification algorithm suited for the analysis of these large data sets. In the Life Sciences, RF is popular because RF classification models have a high-prediction accuracy and provide information on importance of variables for classification. For omics data, variables or conditional relations between variables are typically important for a subset of samples of the same class. For example: within a class of cancer patients certain SNP combinations may be important for a subset of patients that have a specific subtype of cancer, but not important for a different subset of patients. These conditional relationships can in principle be uncovered from the data with RF as these are implicitly taken into account by the algorithm during the creation of the classification model. This review details some of the to the best of our knowledge rarely or never used RF properties that allow maximizing the biological insights that can be extracted from complex omics data sets using RF.
- Published
- 2013
- Full Text
- View/download PDF
48. Classification of the adenylation and acyl-transferase activity of NRPS and PKS systems using ensembles of substrate specific hidden Markov models.
- Author
-
Khayatt BI, Overmars L, Siezen RJ, and Francke C
- Subjects
- Adenosine Monophosphate metabolism, Catalytic Domain, Markov Chains, Polyketide Synthases metabolism, Protein Structure, Tertiary, Sequence Alignment, Acyltransferases metabolism, Nucleotidyltransferases metabolism, Peptide Biosynthesis, Nucleic Acid-Independent physiology, Polyketide Synthases chemistry, Substrate Specificity, Support Vector Machine
- Abstract
There is a growing interest in the Non-ribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs) of microbes, fungi and plants because they can produce bioactive peptides such as antibiotics. The ability to identify the substrate specificity of the enzyme's adenylation (A) and acyl-transferase (AT) domains is essential to rationally deduce or engineer new products. We here report on a Hidden Markov Model (HMM)-based ensemble method to predict the substrate specificity at high quality. We collected a new reference set of experimentally validated sequences. An initial classification based on alignment and Neighbor Joining was performed in line with most of the previously published prediction methods. We then created and tested single substrate specific HMMs and found that their use improved the correct identification significantly for A as well as for AT domains. A major advantage of the use of HMMs is that it abolishes the dependency on multiple sequence alignment and residue selection that is hampering the alignment-based clustering methods. Using our models we obtained a high prediction quality for the substrate specificity of the A domains similar to two recently published tools that make use of HMMs or Support Vector Machines (NRPSsp and NRPS predictor2, respectively). Moreover, replacement of the single substrate specific HMMs by ensembles of models caused a clear increase in prediction quality. We argue that the superiority of the ensemble over the single model is caused by the way substrate specificity evolves for the studied systems. It is likely that this also holds true for other protein domains. The ensemble predictor has been implemented in a simple web-based tool that is available at http://www.cmbi.ru.nl/NRPS-PKS-substrate-predictor/.
- Published
- 2013
- Full Text
- View/download PDF
49. MGcV: the microbial genomic context viewer for comparative genome analysis.
- Author
-
Overmars L, Kerkhoven R, Siezen RJ, and Francke C
- Subjects
- Databases, Genetic, Information Storage and Retrieval, Lactobacillus genetics, National Library of Medicine (U.S.), Regulatory Sequences, Nucleic Acid, Software, Streptococcus mutans genetics, United States, User-Computer Interface, Computational Biology methods, Genome, Bacterial, Genomics methods, Molecular Sequence Annotation methods
- Abstract
Background: Conserved gene context is used in many types of comparative genome analyses. It is used to provide leads on gene function, to guide the discovery of regulatory sequences, but also to aid in the reconstruction of metabolic networks. We present the Microbial Genomic context Viewer (MGcV), an interactive, web-based application tailored to strengthen the practice of manual comparative genome context analysis for bacteria., Results: MGcV is a versatile, easy-to-use tool that renders a visualization of the genomic context of any set of selected genes, genes within a phylogenetic tree, genomic segments, or regulatory elements. It is tailored to facilitate laborious tasks such as the interactive annotation of gene function, the discovery of regulatory elements, or the sequence-based reconstruction of gene regulatory networks. We illustrate that MGcV can be used in gene function annotation by visually integrating information on prokaryotic genes, like their annotation as available from NCBI with other annotation data such as Pfam domains, sub-cellular location predictions and gene-sequence characteristics such as GC content. We also illustrate the usefulness of the interactive features that allow the graphical selection of genes to facilitate data gathering (e.g. upstream regions, ID's or annotation), in the analysis and reconstruction of transcription regulation. Moreover, putative regulatory elements and their corresponding scores or data from RNA-seq and microarray experiments can be uploaded, visualized and interpreted in (ranked-) comparative context maps. The ranked maps allow the interpretation of predicted regulatory elements and experimental data in light of each other., Conclusion: MGcV advances the manual comparative analysis of genes and regulatory elements by providing fast and flexible integration of gene related data combined with straightforward data retrieval. MGcV is available at http://mgcv.cmbi.ru.nl.
- Published
- 2013
- Full Text
- View/download PDF
50. Comparative genome analysis of central nitrogen metabolism and its control by GlnR in the class Bacilli.
- Author
-
Groot Kormelink T, Koenders E, Hagemeijer Y, Overmars L, Siezen RJ, de Vos WM, and Francke C
- Subjects
- Amino Acid Sequence, Ammonia metabolism, Bacillaceae classification, Bacillaceae enzymology, Bacterial Proteins genetics, Binding Sites, DNA metabolism, Gene Expression Regulation, Bacterial, Glutamate-Ammonia Ligase genetics, Lactobacillaceae enzymology, Lactobacillaceae genetics, Leuconostocaceae enzymology, Leuconostocaceae genetics, Listeria enzymology, Listeria genetics, Molecular Sequence Data, Repressor Proteins genetics, Repressor Proteins metabolism, Staphylococcaceae enzymology, Staphylococcaceae genetics, Streptococcaceae enzymology, Streptococcaceae genetics, Bacillaceae genetics, Bacterial Proteins metabolism, Genome, Bacterial, Glutamate-Ammonia Ligase metabolism, Nitrogen metabolism
- Abstract
Background: The assimilation of nitrogen in bacteria is achieved through only a few metabolic conversions between alpha-ketoglutarate, glutamate and glutamine. The enzymes that catalyze these conversions are glutamine synthetase, glutaminase, glutamate dehydrogenase and glutamine alpha-ketoglutarate aminotransferase. In low-GC Gram-positive bacteria the transcriptional control over the levels of the related enzymes is mediated by four regulators: GlnR, TnrA, GltC and CodY. We have analyzed the genomes of all species belonging to the taxonomic families Bacillaceae, Listeriaceae, Staphylococcaceae, Lactobacillaceae, Leuconostocaceae and Streptococcaceae to determine the diversity in central nitrogen metabolism and reconstructed the regulation by GlnR., Results: Although we observed a substantial difference in the extent of central nitrogen metabolism in the various species, the basic GlnR regulon was remarkably constant and appeared not affected by the presence or absence of the other three main regulators. We found a conserved regulatory association of GlnR with glutamine synthetase (glnRA operon), and the transport of ammonium (amtB-glnK) and glutamine/glutamate (i.e. via glnQHMP, glnPHQ, gltT, alsT). In addition less-conserved associations were found with, for instance, glutamate dehydrogenase in Streptococcaceae, purine catabolism and the reduction of nitrite in Bacillaceae, and aspartate/asparagine deamination in Lactobacillaceae., Conclusions: Our analyses imply GlnR-mediated regulation in constraining the import of ammonia/amino-containing compounds and the production of intracellular ammonia under conditions of high nitrogen availability. Such a role fits with the intrinsic need for tight control of ammonia levels to limit futile cycling.
- Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.