106 results on '"Kottmann, Renzo"'
Search Results
2. Megx.net: integrated database resource for marine ecological genomics
- Author
-
Kottmann, Renzo, Kostadinov, Ivalyo, Duhaime, Melissa Beth, Buttigieg, Pier Luigi, Yilmaz, Pelin, Hankeln, Wolfgang, Waldmann, Jost, and Glöckner, Frank Oliver
- Published
- 2010
- Full Text
- View/download PDF
3. Working together to put molecules on the map
- Author
-
Field, Dawn, Morrison, Norman, Glöckner, Frank Oliver, Kottmann, Renzo, Cochrane, Guy, Vaughan, Robert, Garrity, George, Cole, Jim, Hirschman, Lynette, Schriml, Lynn, Mizrachi, Ilene, Federhen, Scott, Schindel, David, Miller, Scott, Hebert, Paul, Ratnasingham, Sujeevan, Hanner, Robert, Amaral-Zettler, Linda, Sogin, Mitchell, Ashburner, Michael, Lewis, Suzanna, and Smith, Barry
- Published
- 2008
- Full Text
- View/download PDF
4. Megx.net—database resources for marine ecological genomics
- Author
-
Lombardot, Thierry, Kottmann, Renzo, Pfeffer, Hauke, Richter, Michael, Teeling, Hanno, Quast, Christian, and Glöckner, Frank Oliver
- Published
- 2006
5. MetaBar - a tool for consistent contextual data acquisition and standards compliant submission
- Author
-
Kottmann Renzo, Fink Dennis, Buttigieg Pier, Hankeln Wolfgang, Yilmaz Pelin, and Glöckner Frank
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Environmental sequence datasets are increasing at an exponential rate; however, the vast majority of them lack appropriate descriptors like sampling location, time and depth/altitude: generally referred to as metadata or contextual data. The consistent capture and structured submission of these data is crucial for integrated data analysis and ecosystems modeling. The application MetaBar has been developed, to support consistent contextual data acquisition. Results MetaBar is a spreadsheet and web-based software tool designed to assist users in the consistent acquisition, electronic storage, and submission of contextual data associated to their samples. A preconfigured Microsoft® Excel® spreadsheet is used to initiate structured contextual data storage in the field or laboratory. Each sample is given a unique identifier and at any stage the sheets can be uploaded to the MetaBar database server. To label samples, identifiers can be printed as barcodes. An intuitive web interface provides quick access to the contextual data in the MetaBar database as well as user and project management capabilities. Export functions facilitate contextual and sequence data submission to the International Nucleotide Sequence Database Collaboration (INSDC), comprising of the DNA DataBase of Japan (DDBJ), the European Molecular Biology Laboratory database (EMBL) and GenBank. MetaBar requests and stores contextual data in compliance to the Genomic Standards Consortium specifications. The MetaBar open source code base for local installation is available under the GNU General Public License version 3 (GNU GPL3). Conclusion The MetaBar software supports the typical workflow from data acquisition and field-sampling to contextual data enriched sequence submission to an INSDC database. The integration with the megx.net marine Ecological Genomics database and portal facilitates georeferenced data integration and metadata-based comparisons of sampling sites as well as interactive data visualization. The ample export functionalities and the INSDC submission support enable exchange of data across disciplines and safeguarding contextual data.
- Published
- 2010
- Full Text
- View/download PDF
6. MetaMine – A tool to detect and analyse gene patterns in their environmental context
- Author
-
Kottmann Renzo, Lombardot Thierry, Bohnebeck Uta, and Glöckner Frank O
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Modern sequencing technologies allow rapid sequencing and bioinformatic analysis of genomes and metagenomes. With every new sequencing project a vast number of new proteins become available with many genes remaining functionally unclassified based on evidences from sequence similarities alone. Extending similarity searches with gene pattern approaches, defined as genes sharing a distinct genomic neighbourhood, have shown to significantly improve the number of functional assignments. Further functional evidences can be gained by correlating these gene patterns with prevailing environmental parameters. MetaMine was developed to approach the large pool of unclassified proteins by searching for recurrent gene patterns across habitats based on key genes. Results MetaMine is an interactive data mining tool which enables the detection of gene patterns in an environmental context. The gene pattern search starts with a user defined environmentally interesting key gene. With this gene a BLAST search is carried out against the Microbial Ecological Genomics DataBase (MEGDB) containing marine genomic and metagenomic sequences. This is followed by the determination of all neighbouring genes within a given distance and a search for functionally equivalent genes. In the final step a set of common genes present in a defined number of distinct genomes is determined. The gene patterns found are associated with their individual pattern instances describing gene order and directions. They are presented together with information about the sample and the habitat. MetaMine is implemented in Java and provided as a client/server application with a user-friendly graphical user interface. The system was evaluated with environmentally relevant genes related to the methane-cycle and carbon monoxide oxidation. Conclusion MetaMine offers a targeted, semi-automatic search for gene patterns based on expert input. The graphical user interface of MetaMine provides a user-friendly overview of the computed gene patterns for further inspection in an ecological context. Prevailing biological processes associated with a key gene can be used to infer new annotations and shape hypotheses to guide further analyses. The use-cases demonstrate that meaningful gene patterns can be quickly detected using MetaMine. MetaMine is freely available for academic use from http://www.megx.net/metamine.
- Published
- 2008
- Full Text
- View/download PDF
7. JCoast – A biologist-centric software tool for data mining and comparison of prokaryotic (meta)genomes
- Author
-
Kottmann Renzo, Kostadinov Ivaylo, Lombardot Thierry, Richter Michael, Duhaime Melissa, Peplies Jörg, and Glöckner Frank
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Current sequencing technologies give access to sequence information for genomes and metagenomes at a tremendous speed. Subsequent data processing is mainly performed by automatic pipelines provided by the sequencing centers. Although, standardised workflows are desirable and useful in many respects, rational data mining, comparative genomics, and especially the interpretation of the sequence information in the biological context, demands for intuitive, flexible, and extendable solutions. Results The JCoast software tool was primarily designed to analyse and compare (meta)genome sequences of prokaryotes. Based on a pre-computed GenDB database project, JCoast offers a flexible graphical user interface (GUI), as well as an application programming interface (API) that facilitates back-end data access. JCoast offers individual, cross genome-, and metagenome analysis, and assists the biologist in exploration of large and complex datasets. Conclusion JCoast combines all functions required for the mining, annotation, and interpretation of (meta)genomic data. The lightweight software solution allows the user to easily take advantage of advanced back-end database structures by providing a programming and graphical user interface to answer biological questions. JCoast is available at the project homepage.
- Published
- 2008
- Full Text
- View/download PDF
8. MetaLook: a 3D visualisation software for marine ecological genomics
- Author
-
Addor Nans, de Bono Andrea, Giuliani Gregory, Kottmann Renzo, Lombardot Thierry, and Glöckner Frank
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Marine ecological genomics can be defined as the application of genomic sciences to understand the structure and function of marine ecosystems. In this field of research, the analysis of genomes and metagenomes of environmental relevance must take into account the corresponding habitat (contextual) data, e.g. water depth, physical and chemical parameters. The creation of specialised software tools and databases is requisite to allow this new kind of integrated analysis. Results We implemented the MetaLook software for visualisation and analysis of marine ecological genomic and metagenomic data with respect to habitat parameters. MetaLook offers a three-dimensional user interface to interactively visualise DNA sequences on a world map, based on a centralised georeferenced database. The user can define environmental containers to organise the sequences according to different habitat criteria. To find similar sequences, the containers can be queried with either genes from the georeferenced database or user-imported sequences, using the BLAST algorithm. This allows an interactive assessment of the distribution of gene functions in the environment. Conclusion MetaLook allows scientists to investigate sequence data in their environmental context and to explore correlations between genes and habitat parameters. This software is a step towards the creation of specialised tools to study constrained distributions and habitat specificity of genes correlated with specific processes. MetaLook is available at: http://www.megx.net/metalook
- Published
- 2007
- Full Text
- View/download PDF
9. The “Minimum Information about an ENvironmental Sequence” (MIENS) specification
- Author
-
Sterk, Peter, Gordon, Jeffrey, White, Owen, Johnston, Anjanette, Nakielny, Sara, Weinstock, George, Arumugam, Mani, Guralnick, Robert, Neufeld, Josh, Wilke, Andreas, Cochrane, Guy, Nelson, Karen, Wendel, Doug, Baumgartner, Laura, Hugenholtz, Philip, Hankeln, Wolfgang, Pace, Norman, Wortmann, Jennifer, Nemergut, Diana, Bonazzi, Vivien, Birren, Bruce, Jansson, Janet, Highlander, Sarah, Palanisamy, Giriprakash, Glöckner, Frank Oliver, Bork, Peer, Blaser, Martin, Kennedy, Jerry, Peplies, Jörg, Knights, Dan, Peterson, Jane, Petrosino, Joseph, Buttigieg, Pier Luigi, Chain, Patrick, Koren, Omry, Costello, Elizabeth, Kuczynski, Justin, Proctor, Lita, Huot-Creasy, Heather, Kyrpides, Nikos, Raes, Jeroen, Dawyndt, Peter, Relman, David, Larsen, Robert, Ratnasingham, Sujeevan, Yilmaz, Pelin, DeSantis, Todd, Ley, Ruth, Assunta-Sansone, Susanna, Lauber, Christian, Ravel, Jacques, Gallery, Rachel, Kottmann, Renzo, Fierer, Noah, Lozupone, Catherine, Schriml, Lynn, Legg, Teresa, Vaughan, Robert, Gibbs, Richard, Field, Dawn, Fuhrman, Jed, Ludwig, Wolfgang, Sodergren, Erica, Knight, Rob, Giglio, Michelle Gwinn, Lyons, Donna, Spor, Aymé, Hunter, Christopher, Maguire, Eamonn, Cole, James, Stombaugh, Jesse, San Gil, Inigo, Park, Joonhong, Amaral-Zettler, Linda, Tiedje, James, Methé, Barbara, Gilbert, Jack, Glass, Elizabeth, Morrison, Norman, Meyer, Folker, Karsch-Mizrachi, Ilene, Gonzalez, Antonio, Rocca-Serra, Phillipe, and Ward, Doyle
- Published
- 2010
- Full Text
- View/download PDF
10. The “Minimum Information about an ENvironmental Sequence” (MIENS) specification
- Author
-
Yilmaz, Pelin, Kottmann, Renzo, Field, Dawn, Knight, Rob, Cole, James, Amaral-Zettler, Linda, Gilbert, Jack, Karsch-Mizrachi, Ilene, Johnston, Anjanette, Cochrane, Guy, Vaughan, Robert, Hunter, Christopher, Park, Joonhong, Morrison, Norman, Rocca-Serra, Phillipe, Sterk, Peter, Arumugam, Mani, Baumgartner, Laura, Birren, Bruce, Blaser, Martin, Bonazzi, Vivien, Bork, Peer, Buttigieg, Pier Luigi, Chain, Patrick, Costello, Elizabeth, Huot-Creasy, Heather, Dawyndt, Peter, DeSantis, Todd, Fierer, Noah, Fuhrman, Jed, Gallery, Rachel, Gibbs, Richard, Giglio, Michelle Gwinn, San Gil, Inigo, Glass, Elizabeth, Gonzalez, Antonio, Gordon, Jeffrey, Guralnick, Robert, Hankeln, Wolfgang, Highlander, Sarah, Hugenholtz, Philip, Jansson, Janet, Kennedy, Jerry, Knights, Dan, Koren, Omry, Kuczynski, Justin, Kyrpides, Nikos, Larsen, Robert, Lauber, Christian, Legg, Teresa, Ley, Ruth, Lozupone, Catherine, Ludwig, Wolfgang, Lyons, Donna, Maguire, Eamonn, Methé, Barbara, Meyer, Folker, Nakielny, Sara, Nelson, Karen, Nemergut, Diana, Neufeld, Josh, Pace, Norman, Palanisamy, Giriprakash, Peplies, Jörg, Peterson, Jane, Petrosino, Joseph, Proctor, Lita, Raes, Jeroen, Ratnasingham, Sujeevan, Ravel, Jacques, Relman, David, Assunta-Sansone, Susanna, Schriml, Lynn, Sodergren, Erica, Spor, Aymé, Stombaugh, Jesse, Tiedje, James, Ward, Doyle, Weinstock, George, Wendel, Doug, White, Owen, Wilke, Andreas, Wortmann, Jennifer, and Glöckner, Frank Oliver
- Published
- 2010
- Full Text
- View/download PDF
11. Minimum Information about a Biosynthetic Gene cluster: commentary
- Author
-
Medema, Marnix H, Kottmann, Renzo, Yilmaz, Pelin, Cummings, Matthew, Biggins, John B, Blin, Kai, de Bruijn, Irene, Chooi, Yit Heng, Claesen, Jan, Coates, R Cameron, Cruz-Morales, Pablo, Duddela, Srikanth, Dusterhus, Stephanie, Edwards, Daniel J, Fewer, David P, Garg, Neha, Geiger, Christoph, Gomez-Escribano, Juan Pablo, Greule, Anja, Hadjithomas, Michalis, Haines, Anthony S, Helfrich, Eric J N, Hillwig, Matthew L, Ishida, Keishi, Jones, Adam C, Jones, Carla S, Jungmann, Katrin, Kegler, Carsten, Kim, Hyun Uk, Kotter, Peter, Krug, Daniel, Masschelein, Joleen, Melnik, Alexey V, Mantovani, Simone M, Monroe, Emily A, Moore, Marcus, Moss, Nathan, Nutzmann, Hans-Wilhelm, Pan, Guohui, Pati, Amrita, Petras, Daniel, Reen, F Jerry, Rosconi, Federico, Rui, Zhe, Tian, Zhenhua, Tobias, Nicholas J, Tsunematsu, Yuta, Wiemann, Philipp, Wyckoff, Elizabeth, Yan, Xiaohui, Yim, Grace, Yu, Fengan, Xie, Yunchang, Aigle, Bertrand, Apel, Alexander K, Balibar, Carl J, Balskus, Emily P, Barona-Gomez, Francisco, Bechthold, Andreas, Bode, Helge B, Borriss, Rainer, Brady, Sean F, Brakhage, Axel A, Caffrey, Patrick, Cheng, Yi-Qiang, Clardy, Jon, Cox, Russell J, De Mot, Rene, Donadio, Stefano, Donia, Mohamed S, van der Donk, Wilfred A, Dorrestein, Pieter C, Doyle, Sean, Driessen, Arnold J M, Ehling-Schulz, Monika, Entian, Karl-Dieter, Fischbach, Michael A, Gerwick, Lena, Gerwick, William H, Gross, Harald, Gust, Bertolt, Hertweck, Christian, Hofte, Monica, Jensen, Susan E, Ju, Jianhua, Katz, Leonard, Kaysser, Leonard, Klassen, Jonathan L, Keller, Nancy P, Kormanec, Jan, Kuipers, Oscar P, Kuzuyama, Tomohisa, Kyrpides, Nikos C, Kwon, Hyung-Jin, Lautru, Sylvie, Lavigne, Rob, Lee, Chia Y, Linquan, Bai, Liu, Xinyu, Liu, Wen, Luzhetskyy, Andriy, Mahmud, Taifo, Mast, Yvonne, Mendez, Carmen, Metsa-Ketela, Mikko, Micklefield, Jason, Mitchell, Douglas A, Moore, Bradley S, Moreira, Leonilde M, Muller, Rolf, Neilan, Brett A, Nett, Markus, Nielsen, Jens, O'Gara, Fergal, Oikawa, Hideaki, Osbourn, Anne, Osburne, Marcia S, Ostash, Bohdan, Payne, Shelley M, Pernodet, Jean-Luc, Petricek, Miroslav, Piel, Jorn, Ploux, Olivier, Raaijmakers, Jos M, Salas, Jose A, Schmitt, Esther K, Scott, Barry, Seipke, Ryan F, Shen, Ben, Sherman, David H, Sivonen, Kaarina, Smanski, Michael J, Sosio, Margherita, Stegmann, Evi, Sussmuth, Roderich D, Tahlan, Kapil, Thomas, Christopher M, Tang, Yi, Truman, Andrew W, Viaud, Muriel, Walton, Jonathan D, Walsh, Christopher T, Weber, Tilmann, van Wezel, Gilles P, Wilkinson, Barrie, Willey, Joanne M, Wohlleben, Wolfgang, Wright, Gerard D, Ziemert, Nadine, Zhang, Changsheng, Zotchev, Sergey B, Breitling, Rainer, Takano, Eriko, Glockner, Frank Oliver, and Microbial Ecology (ME)
- Subjects
international - Abstract
A wide variety of enzymatic pathways that produce specialized metabolites in bacteria, fungi and plants are known to be encoded in biosynthetic gene clusters. Information about these clusters, pathways and metabolites is currently dispersed throughout the literature, making it difficult to exploit. To facilitate consistent and systematic deposition and retrieval of data on biosynthetic gene clusters, we propose the Minimum Information about a Biosynthetic Gene cluster (MIBiG) data standard.
- Published
- 2015
12. Minimum Information about a Biosynthetic Gene cluster
- Author
-
Medema, Marnix H., Kottmann, Renzo, Yilmaz, Pelin, Cummings, Matthew, Biggins, John B., and Cox, Russell J.
- Subjects
protein synthesis ,International Cooperation ,plant ,Review ,gene cluster ,information ,polyketide ,data base ,genetic database ,acyltransferase ,Databases, Genetic ,genetics ,terpene ,biology ,fungus ,nonribosomal peptide synthesis ,Plants ,bacterium ,peptide ,priority journal ,Multigene Family ,ddc:540 ,standards ,Peptide Biosynthesis, Nucleic Acid-Independent ,nomenclature ,genetic marker ,alkaloid derivative ,Genetic Markers ,biological activity ,metagenome ,peptide derivative ,Alkaloids ,Polysaccharides ,ddc:570 ,Terminology as Topic ,enzyme specificity ,standardization ,nonhuman ,Bacteria ,Terpenes ,Fungi ,Computational Biology ,nucleotide sequence ,alkaloid ,operon ,gene function ,polysaccharide ,Polyketides ,Protein Biosynthesis ,chemical structure ,synthetic biology ,biosynthesis ,Peptides ,metabolism - Abstract
A wide variety of enzymatic pathways that produce specialized metabolites in bacteria, fungi and plants are known to be encoded in biosynthetic gene clusters. Information about these clusters, pathways and metabolites is currently dispersed throughout the literature, making it difficult to exploit. To facilitate consistent and systematic deposition and retrieval of data on biosynthetic gene clusters, we propose the Minimum Information about a Biosynthetic Gene cluster (MIBiG) data standard. Netherlands Organization for Scientific Research (NWO)/Rubicon/825.13.001 EU/FP7/Joint Call OCEAN Biotechnology and Biological Sciences Research Council (BBSRC) Natural Environment Research Council (UK) National Institute for Energy Ethics and Society (NIEeS; UK) Gordon and Betty Moore Foundation National Science Foundation (NSF; US) US Department of Energy Engineering and Physical Sciences Research Council (EPSRC)
- Published
- 2015
13. Bacterial communities in the South Adriatic dominated by SAR11
- Author
-
Korlević, Marino, Pop Ristova, Petra, Kottmann, Renzo, Amann, Rudolf, and Orlić, Sandi
- Subjects
bacteria ,south adriatic pit ,pyrosequencing - Abstract
The Adriatic Sea is a semi enclosed basin of the Mediterranean with a southern part characterized by a circular, 1243 m deep South Adriatic Pit. Adriatic ecosystems are influenced by a regular water exchange with the Ionian Sea through the strait of Otranto. The purpose of this study was to characterize the temporal and spatial distribution of bacterial communities in the South Adriatic using 454 pyrosequencing of the 16S rRNA gene. Bacterial communities were characterized at two stations, one in the middle of the South Adraitic Pit and one closer to the shore, from water samples collected in October 2011 and February, March and September 2012. Taxonomic analysis of various phylogenetic groups showed a dominant depth related distribution except in February 2012 when similar communities where found at different depths. Alphaproteobacteria SAR11 clade was dominating the bacterial communities during the year. SAR11 Surface 1 clade was more abundant in the upper part while SAR11 Deep 1 clade was more specific for deeper parts of the water column in the SAP and closer to the shore. This pattern was not observed only in February 2012 when SAR11 Surface 1 clade was strongly dominating at all depths. Only in February at 800 m and 1000 m and in March at 600 m Sphingobium was dominating Alphaproteobacterial communities instead of SAR11. Cyanobacteria where characteristic for the photic zone with the highest relative abundance in October and September in both waters and with a very low abundance in February. In October 2012 Cyanobacterial communities above 10 m were dominated by Synechococcus while below 10 m Prochlorococcus was more abundant. Chloroflexi and Deferribacteres were characteristic for higher depths and their abundance was highest in October below 200 m in the SAP. In conclusion, depth related bacterial community distribution was observed in the SAP and close to the shore, with a change in structure during a year and no major spacial differences
- Published
- 2013
14. MetaMine – A tool to detect and analyse gene patterns in their environmental context
- Author
-
Bohnebeck, Uta, Lombardot, Thierry, Kottmann, Renzo, and Glockner, Frank O
- Subjects
Computational Biology ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,Computer Science Applications ,Pattern Recognition, Automated ,lcsh:Biology (General) ,Multigene Family ,Databases, Genetic ,lcsh:R858-859.7 ,Molecular Biology ,lcsh:QH301-705.5 ,Software ,Algorithms ,Ecosystem - Abstract
Background Modern sequencing technologies allow rapid sequencing and bioinformatic analysis of genomes and metagenomes. With every new sequencing project a vast number of new proteins become available with many genes remaining functionally unclassified based on evidences from sequence similarities alone. Extending similarity searches with gene pattern approaches, defined as genes sharing a distinct genomic neighbourhood, have shown to significantly improve the number of functional assignments. Further functional evidences can be gained by correlating these gene patterns with prevailing environmental parameters. MetaMine was developed to approach the large pool of unclassified proteins by searching for recurrent gene patterns across habitats based on key genes. Results MetaMine is an interactive data mining tool which enables the detection of gene patterns in an environmental context. The gene pattern search starts with a user defined environmentally interesting key gene. With this gene a BLAST search is carried out against the Microbial Ecological Genomics DataBase (MEGDB) containing marine genomic and metagenomic sequences. This is followed by the determination of all neighbouring genes within a given distance and a search for functionally equivalent genes. In the final step a set of common genes present in a defined number of distinct genomes is determined. The gene patterns found are associated with their individual pattern instances describing gene order and directions. They are presented together with information about the sample and the habitat. MetaMine is implemented in Java and provided as a client/server application with a user-friendly graphical user interface. The system was evaluated with environmentally relevant genes related to the methane-cycle and carbon monoxide oxidation. Conclusion MetaMine offers a targeted, semi-automatic search for gene patterns based on expert input. The graphical user interface of MetaMine provides a user-friendly overview of the computed gene patterns for further inspection in an ecological context. Prevailing biological processes associated with a key gene can be used to infer new annotations and shape hypotheses to guide further analyses. The use-cases demonstrate that meaningful gene patterns can be quickly detected using MetaMine. MetaMine is freely available for academic use from http://www.megx.net/metamine.
- Published
- 2008
15. JCoast - A biologist-centric software tool for data mining and comparison of prokaryotic (meta) genomes
- Author
-
Kottmann Renzo, Kostadinov Ivaylo, Lombardot Thierry, Richter Michael, Duhaime Melissa, Peplies Jörg, and Glöckner Frank
- Subjects
Chromosome Mapping ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,Computer Science Applications ,User-Computer Interface ,lcsh:Biology (General) ,Prokaryotic Cells ,Databases, Genetic ,lcsh:R858-859.7 ,Database Management Systems ,Programming Languages ,lcsh:QH301-705.5 ,Molecular Biology ,Software ,Algorithms - Abstract
Background Current sequencing technologies give access to sequence information for genomes and metagenomes at a tremendous speed. Subsequent data processing is mainly performed by automatic pipelines provided by the sequencing centers. Although, standardised workflows are desirable and useful in many respects, rational data mining, comparative genomics, and especially the interpretation of the sequence information in the biological context, demands for intuitive, flexible, and extendable solutions. Results The JCoast software tool was primarily designed to analyse and compare (meta)genome sequences of prokaryotes. Based on a pre-computed GenDB database project, JCoast offers a flexible graphical user interface (GUI), as well as an application programming interface (API) that facilitates back-end data access. JCoast offers individual, cross genome-, and metagenome analysis, and assists the biologist in exploration of large and complex datasets. Conclusion JCoast combines all functions required for the mining, annotation, and interpretation of (meta)genomic data. The lightweight software solution allows the user to easily take advantage of advanced back-end database structures by providing a programming and graphical user interface to answer biological questions. JCoast is available at the project homepage.
- Published
- 2008
16. The antiSMASH database, a comprehensive database of microbial secondary metabolite biosynthetic gene clusters.
- Author
-
Kai Blin, Medema, Marnix H., Kottmann, Renzo, Sang Yup Lee, and Weber, Tilmann
- Published
- 2017
- Full Text
- View/download PDF
17. A participative tool for sharing, annotating and archiving submarine video data.
- Author
-
Marcon, Yann, Ratmeyer, Volker, Kottmann, Renzo, and Boetius, Antje
- Published
- 2015
- Full Text
- View/download PDF
18. The ocean sampling day consortium.
- Author
-
Kopf, Anna, Bicak, Mesude, Kottmann, Renzo, Schnetzer, Julia, Kostadinov, Ivaylo, Lehmann, Katja, Fernandez-Guerra, Antonio, Jeanthon, Christian, Rahav, Eyal, Ullrich, Matthias, Wichels, Antje, Gerdts, Gunnar, Polymenakou, Paraskevi, Kotoulas, Giorgos, Siam, Rania, Abdallah, Rehab Z., Sonnenschein, Eva C., Cariou, Thierry, O'Gara, Fergal, and Jackson, Stephen
- Subjects
MARINE microbiology ,MICROBIAL diversity ,BIOINFORMATICS ,MARINE biotechnology ,METAGENOMICS ,OCEANOGRAPHY - Abstract
Ocean Sampling Day was initiated by the EU-funded Micro B3 (Marine Microbial Biodiversity, Bioinformatics, Biotechnology) project to obtain a snapshot of the marine microbial biodiversity and function of the world's oceans. It is a simultaneous global mega-sequencing campaign aiming to generate the largest standardized microbial data set in a single day. This will be achievable only through the coordinated efforts of an Ocean Sampling Day Consortium, supportive partnerships and networks between sites. This commentary outlines the establishment, function and aims of the Consortium and describes our vision for a sustainable study of marine microbial communities and their embedded functional traits. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
19. Report of the 14 Genomic Standards Consortium Meeting, Oxford, UK, September 17-21, 2012.
- Author
-
Davies, Neil, Field, Dawn, Amaral-Zettler, Linda, Barker, Katharine, Bicak, Mesude, Bourlat, Sarah, Coddington, Jonathan, Deck, John, Drummond, Alexei, Gilbert, Jack, Glöckner, Frank, Kottmann, Renzo, Meyer, Chris, Morrison, Norman, Obst, Matthias, Robbins, Robert, Schriml, Lynn, Sterk, Peter, and Stones-Havas, Steven
- Subjects
OBSERVATORIES ,GENOMICS ,MOLECULAR genetics ,COMPARATIVE genomics ,FUNCTIONAL genomics - Abstract
This report summarizes the proceedings of the 14 workshop of the Genomic Standards Consortium (GSC) held at the University of Oxford in September 2012. The primary goal of the workshop was to work towards the launch of the Genomic Observatories (GOs) Network under the GSC. For the first time, it brought together potential GOs sites, GSC members, and a range of interested partner organizations. It thus represented the first meeting of the GOs Network (GOs1). Key outcomes include the formation of a core group of 'champions' ready to take the GOs Network forward, as well as the formation of working groups. The workshop also served as the first meeting of a wide range of participants in the Ocean Sampling Day (OSD) initiative, a first GOs action. Three projects with complementary interests - COST Action ES1103, MG4U and Micro B3 - organized joint sessions at the workshop. A two-day GSC Hackathon followed the main three days of meetings. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
20. Ecogenomic Perspectives on Domains of Unknown Function: Correlation-Based Exploration of Marine Metagenomes.
- Author
-
Buttigieg, Pier Luigi, Hankeln, Wolfgang, Kostadinov, Ivaylo, Kottmann, Renzo, Yilmaz, Pelin, Duhaime, Melissa Beth, and Glöckner, Frank Oliver
- Subjects
METAGENOMICS ,BIOINFORMATICS ,MARINE microorganisms ,NUCLEOTIDE sequence ,PROTEIN structure ,BIOTIC communities - Abstract
Background: The proportion of conserved DNA sequences with no clear function is steadily growing in bioinformatics databases. Studies of sequence and structural homology have indicated that many uncharacterized protein domain sequences are variants of functionally described domains. If these variants promote an organism's ecological fitness, they are likely to be conserved in the genome of its progeny and the population at large. The genetic composition of microbial communities in their native ecosystems is accessible through metagenomics. We hypothesize the co-variation of protein domain sequences across metagenomes from similar ecosystems will provide insights into their potential roles and aid further investigation. Methodology/Principal findings: We calculated the correlation of Pfam protein domain sequences across the Global Ocean Sampling metagenome collection, employing conservative detection and correlation thresholds to limit results to well-supported hits and associations. We then examined intercorrelations between domains of unknown function (DUFs) and domains involved in known metabolic pathways using network visualization and cluster-detection tools. We used a cautious “guilty-by-association” approach, referencing knowledge-level resources to identify and discuss associations that offer insight into DUF function. We observed numerous DUFs associated to photobiologically active domains and prevalent in the Cyanobacteria. Other clusters included DUFs associated with DNA maintenance and repair, inorganic nutrient metabolism, and sodium-translocating transport domains. We also observed a number of clusters reflecting known metabolic associations and cases that predicted functional reclassification of DUFs. Conclusion/Significance: Critically examining domain covariation across metagenomic datasets can grant new perspectives on the roles and associations of DUFs in an ecological setting. Targeted attempts at DUF characterization in the laboratory or in silico may draw from these insights and opportunities to discover new associations and corroborate existing ones will arise as more large-scale metagenomic datasets emerge. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
21. Poseidon: An information retrieval and extraction system for metagenomic marine science.
- Author
-
Radom, Marcin, Rybarczyk, Agnieszka, Kottmann, Renzo, Formanowicz, Piotr, Szachniuk, Marta, Glöckner, Frank Oliver, Rebholz-Schuhmann, Dietrich, and Błażewicz, Jacek
- Subjects
METAGENOMICS ,INFORMATION retrieval ,EXTRACTION techniques ,MARINE sciences ,SCIENTISTS ,MARINE microbiology - Abstract
Abstract: Gathering data has always presented a considerable challenge for scientists, whether it is done in laboratories or through reading specialized journals. It becomes particularly difficult in fields that generate a lot of publications, such as marine microbiology. Professionals interested in environments of seas and oceans, and organisms inhabiting them are faced with a flood of information on a daily basis. It is so because new papers on genetic samples and metagenomes of marine origin are being issued, and that is why the participants of the MetaFunctions project found it necessary to develop a new efficient way of extracting, processing, and storing data. As a result a new system, called Poseidon, was created, offering a convenient alternative to manual extraction of marine microbiology-related information. [Copyright &y& Elsevier]
- Published
- 2012
- Full Text
- View/download PDF
22. Ecological structuring of bacterial and archaeal taxa in surface ocean waters.
- Author
-
Yilmaz, Pelin, Iversen, Morten H., Hankeln, Wolfgang, Kottmann, Renzo, Quast, Christian, and Glöckner, Frank O.
- Subjects
MARINE habitats ,BACTERIAL ecology ,OCEAN bottom ,ONTOLOGY ,METAGENOMICS ,BACTERIOPLANKTON ,RIBOSOMAL RNA - Abstract
The Global Ocean Sampling ( GOS) expedition is currently the largest and geographically most comprehensive metagenomic dataset, including samples from the Atlantic, Pacific, and Indian Oceans. This study makes use of the wide range of environmental conditions and habitats encompassed within the GOS sites in order to investigate the ecological structuring of bacterial and archaeal taxon ranks. Community structures based on taxonomically classified 16 S ribosomal RNA (r RNA) gene fragments at phylum, class, order, family, and genus rank levels were examined using multivariate statistical analysis, and the results were inspected in the context of oceanographic environmental variables and structured habitat classifications. At all taxon rank levels, community structures of neritic, oceanic, estuarine biomes, as well as other exotic biomes (salt marsh, lake, mangrove), were readily distinguishable from each other. A strong structuring of the communities with chlorophyll a concentration and a weaker yet significant structuring with temperature and salinity were observed. Furthermore, there were significant correlations between community structures and habitat classification. These results were used for further investigation of one-to-one relationships between taxa and environment and provided indications for ecological preferences shaped by primary production for both cultured and uncultured bacterial and archaeal clades. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
23. RCN4GSC Meeting Report: Initiating a Testbed for Managing Data at the Interface of Biodiversity and Genomics/Metagenomics, May 2011.
- Author
-
Robbins, Robert J., Beach, James, Blum, Stan, Dawyndt, Peter, Deck, John, Kottmann, Renzo, Morrison, Norman, Tuama, Éamonn Ó., Gil, Inigo San, Vieglas, David, Wieczorek, John, and Wooley, John
- Subjects
BIODIVERSITY conferences ,GENOMICS ,METAGENOMICS ,MEETINGS ,DISCUSSION ,SOCIAL interaction ,CONFERENCES & conventions - Abstract
Following up on efforts from two earlier workshops, a meeting was convened in San Diego to (a) establish working connections between experts in the use of the Darwin Core and the GSC MIxS standards, (b) conduct mutual briefings to promote knowledge exchange and to increase the understanding of the two communities' approaches, constraints, community goals, subtleties, etc., (c) perform an element-by-element comparison of the two standards, assessing the compatibility and complementarity of the two approaches, (d) propose and consider possible use cases and test beds in which a joint annotation approach might be tried, to useful scientific effect, and (e) propose additional action items necessary to continue the development of this joint effort. Several focused working teams were identified to continue the work after the meeting ended. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
24. Meeting Report: Hackathon-Workshop on Darwin Core and MIxS Standards Alignment (February 2012).
- Author
-
Tuama, Éamonn Ó., Deck, John, Dröge, Gabriel, Döring, Markus, Field, Dawn, Kottmann, Renzo, Juncai Ma, Hiroshi Mori, Morrison, Norman, Sterk, Peter, Hideaki Sugawara, Wieczorek, John, Linhuan Wu, and Pelin Yilmaz
- Subjects
GENOMES ,SMALL group research ,BIODIVERSITY ,INFORMATION storage & retrieval systems standards ,INSTITUTIONAL repositories ,CONFERENCES & conventions - Abstract
The Global Biodiversity Information Facility and the Genomic Standards Consortium convened a joint workshop at the University of Oxford, 27-29 February 2012, with a small group of experts from Europe, USA, China and Japan, to continue the alignment of the Darwin Core with the MIxS and related genomics standards. Several reference mappings were produced as well as test expressions of MIxS in RDF. The use and management of controlled vocabulary terms was considered in relation to both GBIF and the GSC, and tools for working with terms were reviewed. Extensions for publishing genomic biodiversity data to the GBIF network via a Darwin Core Archive were prototyped and work begun on preparing translations of the Dar-win Core to Japanese and Chinese. Five genomic repositories were identified for engagement to begin the process of testing the publishing of genomic data to the GBIF network commencing with the SILVA rRNA database. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
25. RCN4GSC Workshop Report: Managing Data at the Interface of Biodiversity and (Meta)Genomics, March 2011.
- Author
-
Robbins, Robert J., Amaral-Zettler, Linda, Bik, Holly, Blum, Stan, Edwards, James, Field, Dawn, Garrity, George, Gilbert, Jack A., Kottmann, Renzo, Krishtalka, Leonard, Lapp, Hilmar, Lawrence, Carolyn, Morrison, Norman, Tuama, Éamonn Ó., Parr, Cynthia, Gil, Inigo San, Schindel, David, Schriml, Lynn, Vieglas, David, and Wooley, John
- Subjects
ADULT education workshops ,BIOINFORMATICS ,GENOMICS ,BIOMASS & the environment ,INTERNETWORKING ,BIOTIC communities ,CONFERENCES & conventions - Abstract
Building on the planning efforts of the RCN4GSC project, a workshop was convened in San Diego to bring together experts from genomics and metagenomics, biodiversity, ecology, and bioinformatics with the charge to identify potential for positive interactions and progress, especially building on successes at establishing data standards by the GSC and by the biodiversity and ecological communities. Until recently, the contribution of microbial life to the biomass and biodiversity of the biosphere was largely overlooked (because it was resistant to systematic study). Now, emerging genomic and metagenomic tools are making investigation possible. Initial research findings suggest that major advances are in the offing. Although different research communities share some overlapping concepts and traditions, they differ significantly in sampling approaches, vocabularies and workflows. Likewise, their definitions of 'fitness for use' for data differ significantly, as this concept stems from the specific research questions of most importance in the different fields. Nevertheless, there is little doubt that there is much to be gained from greater coordination and integration. As a first step toward interoperability of the information systems used by the different communities, participants agreed to conduct a case study on two of the leading data standards from the two formerly disparate fields: (a) GSC's standard check-lists for genomics and metagenomics and (b) TDWG's Darwin Core standard, used primarily in taxonomy and systematic biology. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
26. Analysis of 23S rRNA genes in metagenomes – A case study from the Global Ocean Sampling Expedition.
- Author
-
Yilmaz, Pelin, Kottmann, Renzo, Pruesse, Elmar, Quast, Christian, and Glöckner, Frank Oliver
- Subjects
RNA ,GENES ,GENOMES ,MICROORGANISMS ,POLYMERASE chain reaction ,DNA primers ,FLUORESCENCE ,IN situ hybridization ,PHYLOGENY - Abstract
Abstract: As an evolutionary marker, 23S ribosomal RNA (rRNA) offers more diagnostic sequence stretches and greater sequence variation than 16S rRNA. However, 23S rRNA is still not as widely used. Based on 80 metagenome samples from the Global Ocean Sampling (GOS) Expedition, the usefulness and taxonomic resolution of 23S rRNA were compared to those of 16S rRNA. Since 23S rRNA is approximately twice as large as 16S rRNA, twice as many 23S rRNA gene fragments were retrieved from the GOS reads than 16S rRNA gene fragments, with 23S rRNA gene fragments being generally about 100bp longer. Datasets for 16S and 23S rRNA sequences revealed similar relative abundances for major marine bacterial and archaeal taxa. However, 16S rRNA sequences had a better taxonomic resolution due to their significantly larger reference database. Reevaluation of the specificity of previously published PCR amplification primers and group specific fluorescence in situ hybridization probes on this metagenomic set of non-amplified 23S rRNA sequences revealed that out of 16 primers investigated, only two had more than 90% target group coverage. Evaluations of two probes, BET42a and GAM42a, were in accordance with previous evaluations, with a discrepancy in the target group coverage of the GAM42a probe when evaluated against the GOS metagenomic dataset. [Copyright &y& Elsevier]
- Published
- 2011
- Full Text
- View/download PDF
27. CDinFusion - Submission-Ready, On-Line Integration of Sequence and Contextual Data.
- Author
-
Hankeln, Wolfgang, Wendel, Norma Johanna, Gerken, Jan, Waldmann, Jost, Buttigieg, Pier Luigi, Kostadinov, Ivaylo, Kottmann, Renzo, Yilmaz, Pelin, and Glöckner, Frank Oliver
- Subjects
NUCLEOTIDE sequence ,DNA data banks ,COMPARATIVE studies ,DATA analysis ,ACQUISITION of data ,COMPUTER software - Abstract
State of the art (DNA) sequencing methods applied in "Omics" studies grant insight into the 'blueprints' of organisms from all domains of life. Sequencing is carried out around the globe and the data is submitted to the public repositories of the International Nucleotide Sequence Database Collaboration. However, the context in which these studies are conducted often gets lost, because experimental data, as well as information about the environment are rarely submitted along with the sequence data. If these contextual or metadata are missing, key opportunities of comparison and analysis across studies and habitats are hampered or even impossible. To address this problem, the Genomic Standards Consortium (GSC) promotes checklists and standards to better describe our sequence data collection and to promote the capturing, exchange and integration of sequence data with contextual data. In a recent community effort the GSC has developed a series of recommendations for contextual data that should be submitted along with sequence data. To support the scientific community to significantly enhance the quality and quantity of contextual data in the public sequence data repositories, specialized software tools are needed. In this work we present CDinFusion, a web-based tool to integrate contextual and sequence data in (Multi)FASTA format prior to submission. The tool is open source and available under the Lesser GNU Public License 3. A public installation is hosted and maintained at the Max Planck Institute for Marine Microbiology at http:// www.megx.net/cdinfusion. The tool may also be installed locally using the open source code available at http://code. google.com/p/cdinfusion. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
28. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications.
- Author
-
Yilmaz, Pelin, Kottmann, Renzo, Field, Dawn, Knight, Rob, Cole, James R., Amaral-Zettler, Linda, Gilbert, Jack A., Karsch-Mizrachi, Ilene, Johnston, Anjanette, Cochrane, Guy, Vaughan, Robert, Hunter, Christopher, Park, Joonhong, Morrison, Norman, Rocca-Serra, Philippe, Sterk, Peter, Arumugam, Manimozhiyan, Bailey, Mark, Baumgartner, Laura, and Birren, Bruce W.
- Subjects
- *
GENOMICS , *GENOMES , *NUCLEOTIDE sequence , *GENES , *DNA - Abstract
Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences-the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The 'environmental packages' apply to any genome sequence of known origin and can be used in combination with MIMARKS and other GSC checklists. Finally, to establish a unified standard for describing sequence data and to provide a single point of entry for the scientific community to access and learn about GSC checklists, we present the minimum information about any (x) sequence (MIxS). Adoption of MIxS will enhance our ability to analyze natural genetic diversity documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
29. MetaBar -- a tool for consistent contextual dataacquisition and standards compliant submission.
- Author
-
Hankeln, Wolfgang, Buttigieg, Pier Luigi, Fink, Dennis, Kottmann, Renzo, Yilmaz, Pelin, and Glöckner, Frank Oliver
- Subjects
DATA analysis ,METADATA ,GENETICS ,NUCLEIC acids ,MOLECULAR biology - Abstract
Background: Environmental sequence datasets are increasing at an exponential rate; however, the vast majority of them lack appropriate descriptors like sampling location, time and depth/altitude: generally referred to as metadata or contextual data. The consistent capture and structured submission of these data is crucial for integrated data analysis and ecosystems modeling. The application MetaBar has been developed, to support consistent contextual data acquisition. Results: MetaBar is a spreadsheet and web-based software tool designed to assist users in the consistent acquisition, electronic storage, and submission of contextual data associated to their samples. A preconfigured Microsoft® Excel® spreadsheet is used to initiate structured contextual data storage in the field or laboratory. Each sample is given a unique identifier and at any stage the sheets can be uploaded to the MetaBar database server. To label samples, identifiers can be printed as barcodes. An intuitive web interface provides quick access to the contextual data in the MetaBar database as well as user and project management capabilities. Export functions facilitate contextual and sequence data submission to the International Nucleotide Sequence Database Collaboration (INSDC), comprising of the DNA DataBase of Japan (DDBJ), the European Molecular Biology Laboratory database (EMBL) and GenBank. MetaBar requests and stores contextual data in compliance to the Genomic Standards Consortium specifications. The MetaBar open source code base for local installation is available under the GNU General Public License version 3 (GNU GPL3). Conclusion: The MetaBar software supports the typical workflow from data acquisition and field-sampling to contextual data enriched sequence submission to an INSDC database. The integration with the megx.net marine Ecological Genomics database and portal facilitates georeferenced data integration and metadata-based comparisons of sampling sites as well as interactive data visualization. The ample export functionalities and the INSDC submission support enable exchange of data across disciplines and safeguarding contextual data. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
30. A standard operating procedure for phylogenetic inference (SOPPI) using (rRNA) marker genes.
- Author
-
Peplies, Jörg, Kottmann, Renzo, Ludwig, Wolfgang, and Glöckner, Frank Oliver
- Subjects
CLASSIFICATION of microorganisms ,STANDARD operating procedure ,PHYLOGENY ,MICROORGANISMS ,RIBOSOMES ,MICROBIAL genetics - Abstract
Abstract: Phylogenetic analysis is currently used worldwide for taxonomic classification and identification of microorganisms. However, despite the countless trees that have been reconstructed and published in recent decades, so far, no user-friendly compilation of recommendations to standardize the data analysis and tree reconstruction process has been published. Consequently, this standard operating procedure for phylogenetic inference (SOPPI) offers a helping hand for working through the process from sampling in the field to phylogenetic tree reconstruction and publication. It is not meant to be authoritative or comprehensive, but should help to make phylogenetic inference and diversity analysis more reliable and comparable between different laboratories. It is mainly focused on using the ribosomal RNA as a universal phylogenetic marker, but the principles and recommendations can be applied to any valid marker gene. Feedback and suggestions from the scientific community are welcome in order to improve these guidelines further. Any updates will be made available on the SILVA webpage at http://www.arb-silva.de/projects/soppi. [Copyright &y& Elsevier]
- Published
- 2008
- Full Text
- View/download PDF
31. JCoast -- A biologist-centric software tool for data mining and comparison of prokaryotic (meta)genomes.
- Author
-
Richter, Michael, Lombardot, Thierry, Kostadinov, Ivaylo, Kottmann, Renzo, Duhaime, Melissa Beth, Peplies, Jörg, and Glöckner, Frank Oliver
- Subjects
COMPUTER software ,APPLICATION program interfaces - Abstract
Background: Current sequencing technologies give access to sequence information for genomes and metagenomes at a tremendous speed. Subsequent data processing is mainly performed by automatic pipelines provided by the sequencing centers. Although, standardised workflows are desirable and useful in many respects, rational data mining, comparative genomics, and especially the interpretation of the sequence information in the biological context, demands for intuitive, flexible, and extendable solutions. Results: The JCoast software tool was primarily designed to analyse and compare (meta)genome sequences of prokaryotes. Based on a pre-computed GenDB database project, JCoast offers a flexible graphical user interface (GUI), as well as an application programming interface (API) that facilitates back-end data access. JCoast offers individual, cross genome-, and metagenome analysis, and assists the biologist in exploration of large and complex datasets. Conclusion: JCoast combines all functions required for the mining, annotation, and interpretation of (meta)genomic data. The lightweight software solution allows the user to easily take advantage of advanced back-end database structures by providing a programming and graphical user interface to answer biological questions. JCoast is available at the project homepage. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
32. eGenomics: Cataloguing Our Complete Genome Collection III.
- Author
-
Field, Dawn, Garrity, George, Gray, Tanya, Selengut, Jeremy, Sterk, Peter, Thomson, Nick, Tatusova, Tatiana, Cochrane, Guy, Glöckner, Frank Oliver, Kottmann, Renzo, Lister, Allyson L., Tateno, Yoshio, and Vaughan, Robert
- Subjects
CONFERENCES & conventions ,CATALOGING ,GENOMES ,HUMAN genetic variation ,GENOMICS - Abstract
This meeting report summarizes the proceedings of the "eGenomics: Cataloguing our Complete Genome Collection III" workshop held September 11-13, 2006, at the National Institute for Environmental eScience (NIEeS), Cambridge, United Kingdom. This 3rd workshop of the Genomic Standards Consortium was divided into two parts. The first half of the three-day workshop was dedicated to reviewing the genomic diversity of our current and future genome and metagenome collection, and exploring linkages to a series of existing projects through formal presentations. The second half was dedicated to strategic discussions. Outcomes of the workshop include a revised "Minimum Information about a Genome Sequence" (MIGS) specification (v1.1), consensus on a variety of features to be added to the Genome Catalogue (GCat), agreement by several researchers to adopt MIGS for imminent genome publications, and an agreement by the EBI and NCBI to input their genome collections into GCat for the purpose of quantifying the amount of optional data already available (e.g., for geographic location coordinates) and working towards a single, global list of all public genomes and metagenomes. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
33. Applying graph theoretic approaches to microbial metagenomes.
- Author
-
Hankeln, Wolfgang, Buttigieg, Pier Luigi, Kostadinov, Ivaylo, Kottmann, Renzo, Yilmaz, Pelin, Duhaime, Melissa Beth, and Glöckner, Frank Oliver
- Published
- 2010
- Full Text
- View/download PDF
34. Genomic Standards Consortium Projects.
- Author
-
Field, Dawn, Sterk, Peter, Kottmann, Renzo, Smet, J., Amaral-Zettler, Linda, Cochrane, Guy, Cole, James, Davies, Neil, Dawyndt, Peter, Garrity, George, Gilbert, Jack, Glöckner, Frank, Hirschman, Lynette, Klenk, Hans-Peter, Knight, Rob, Kyrpides, Nikos, Meyer, Folker, Karsch-Mizrachi, Ilene, Morrison, Norman, and Robbins, Robert
- Subjects
GENOMICS ,MEMBERSHIP ,BEST practices ,STANDARDIZATION ,STANDARD-setting organizations - Abstract
The Genomic Standards Consortium (GSC) is an open-membership community that was founded in 2005 to work towards the development, implementation and harmonization of standards in the field of genomics. Starting with the defined task of establishing a minimal set of descriptions the GSC has evolved into an active standards-setting body that currently has 18 ongoing projects, with additional projects regularly proposed from within and outside the GSC. Here we describe our recently enacted policy for proposing new activities that are intended to be taken on by the GSC, along with the template for proposing such new activities. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
35. The Genomic Standards Consortium.
- Author
-
Field, Dawn, Amaral-Zettler, Linda, Cochrane, Guy, Cole, James R., Dawyndt, Peter, Garrity, George M., Gilbert, Jack, Glöckner, Frank Oliver, Hirschman, Lynette, Karsch-Mizrachi, Ilene, Klenk, Hans-Peter, Knight, Rob, Kottmann, Renzo, Kyrpides, Nikos, Meyer, Folker, Gil, Inigo San, Sansone, Susanna-Assunta, Schriml, Lynn M., Sterk, Peter, and Tatusova, Tatiana
- Subjects
CONSORTIA ,GENOMICS ,ONLINE databases ,INFORMATION resources management ,NUCLEOTIDE sequence ,SOCIETIES - Abstract
A vast and rich body of information has grown up as a result of the world's enthusiasm for 'omics technologies. Finding ways to describe and make available this information that maximise its usefulness has become a major effort across the 'omics world. At the heart of this effort is the Genomic Standards Consortium (GSC), an open-membership organization that drives communitybased standardization activities, Here we provide a short history of the GSC, provide an overview of its range of current activities, and make a call for the scientific community to join forces to improve the quality and quantity of contextual information about our public collections of genomes, metagenomes, and marker gene sequences. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
36. Minimum Information about a Biosynthetic Gene cluster
- Author
-
Medema, Marnix H., Kottmann, Renzo, Yilmaz, Pelin, Cummings, Matthew, Biggins, John B., and Cox, Russell J.
- Subjects
Dewey Decimal Classification::500 | Naturwissenschaften::540 | Chemie ,protein synthesis ,Dewey Decimal Classification::500 | Naturwissenschaften::570 | Biowissenschaften, Biologie ,International Cooperation ,plant ,Review ,gene cluster ,information ,polyketide ,data base ,genetic database ,acyltransferase ,Databases, Genetic ,genetics ,terpene ,biology ,fungus ,nonribosomal peptide synthesis ,Plants ,bacterium ,peptide ,priority journal ,Multigene Family ,standards ,Peptide Biosynthesis, Nucleic Acid-Independent ,nomenclature ,genetic marker ,alkaloid derivative ,Genetic Markers ,biological activity ,metagenome ,peptide derivative ,Alkaloids ,Polysaccharides ,Terminology as Topic ,14. Life underwater ,enzyme specificity ,standardization ,nonhuman ,Bacteria ,Terpenes ,Fungi ,Computational Biology ,nucleotide sequence ,alkaloid ,operon ,gene function ,13. Climate action ,polysaccharide ,Polyketides ,Protein Biosynthesis ,chemical structure ,synthetic biology ,biosynthesis ,Peptides ,metabolism - Abstract
A wide variety of enzymatic pathways that produce specialized metabolites in bacteria, fungi and plants are known to be encoded in biosynthetic gene clusters. Information about these clusters, pathways and metabolites is currently dispersed throughout the literature, making it difficult to exploit. To facilitate consistent and systematic deposition and retrieval of data on biosynthetic gene clusters, we propose the Minimum Information about a Biosynthetic Gene cluster (MIBiG) data standard.
37. Quantifying the effect of environment stability on the transcription factor repertoire of marine microbes
- Author
-
Kostadinov, Ivaylo, Kottmann, Renzo, Ramette, Alban Nicolas, Waldmann, Jost, Buttigieg, Pier Luigi, and Glöckner, Frank
- Subjects
interpolated environmental data ,multivariate statistics ,360 Social problems & social services ,Research ,transcription factors ,ecological metagenomics ,fungi ,570 Life sciences ,biology ,610 Medicine & health - Abstract
Background: DNA-binding transcription factors (TFs) regulate cellular functions in prokaryotes, often in response to environmental stimuli. Thus, the environment exerts constant selective pressure on the TF gene content of microbial communities. Recently a study on marine Synechococcus strains detected differences in their genomic TF content related to environmental adaptation, but so far the effect of environmental parameters on the content of TFs in bacterial communities has not been systematically investigated. Results: We quantified the effect of environment stability on the transcription factor repertoire of marine pelagic microbes from the Global Ocean Sampling (GOS) metagenome using interpolated physico-chemical parameters and multivariate statistics. Thirty-five percent of the difference in relative TF abundances between samples could be explained by environment stability. Six percent was attributable to spatial distance but none to a combination of both spatial distance and stability. Some individual TFs showed a stronger relationship to environment stability and space than the total TF pool. Conclusions: Environmental stability appears to have a clearly detectable effect on TF gene content in bacterioplanktonic communities described by the GOS metagenome. Interpolated environmental parameters were shown to compare well to in situ measurements and were essential for quantifying the effect of the environment on the TF content. It is demonstrated that comprehensive and well-structured contextual data will strongly enhance our ability to interpret the functional potential of microbes from metagenomic data.
- Full Text
- View/download PDF
38. Unifying the known and unknown microbial coding sequence space.
- Author
-
Vanni C, Schechter MS, Acinas SG, Barberán A, Buttigieg PL, Casamayor EO, Delmont TO, Duarte CM, Eren AM, Finn RD, Kottmann R, Mitchell A, Sánchez P, Siren K, Steinegger M, Gloeckner FO, and Fernàndez-Guerra A
- Subjects
- Metagenome, Open Reading Frames, Bacteria genetics, Genome, Archaeal
- Abstract
Genes of unknown function are among the biggest challenges in molecular biology, especially in microbial systems, where 40-60% of the predicted genes are unknown. Despite previous attempts, systematic approaches to include the unknown fraction into analytical workflows are still lacking. Here, we present a conceptual framework, its translation into the computational workflow AGNOSTOS and a demonstration on how we can bridge the known-unknown gap in genomes and metagenomes. By analyzing 415,971,742 genes predicted from 1749 metagenomes and 28,941 bacterial and archaeal genomes, we quantify the extent of the unknown fraction, its diversity, and its relevance across multiple organisms and environments. The unknown sequence space is exceptionally diverse, phylogenetically more conserved than the known fraction and predominantly taxonomically restricted at the species level. From the 71 M genes identified to be of unknown function, we compiled a collection of 283,874 lineage-specific genes of unknown function for Cand . Patescibacteria (also known as Candidate Phyla Radiation, CPR), which provides a significant resource to expand our understanding of their unusual biology. Finally, by identifying a target gene of unknown function for antibiotic resistance, we demonstrate how we can enable the generation of hypotheses that can be used to augment experimental data., Competing Interests: CV, MS, SA, AB, PB, EC, TD, CD, AE, RF, RK, AM, PS, KS, MS, FG, AF No competing interests declared, (© 2022, Vanni et al.)
- Published
- 2022
- Full Text
- View/download PDF
39. The antiSMASH database, a comprehensive database of microbial secondary metabolite biosynthetic gene clusters.
- Author
-
Blin K, Medema MH, Kottmann R, Lee SY, and Weber T
- Subjects
- Computational Biology methods, Gene Expression Regulation, Protein Processing, Post-Translational, Web Browser, Biosynthetic Pathways genetics, Databases, Factual, Microbiology, Secondary Metabolism genetics
- Abstract
Secondary metabolites produced by microorganisms are the main source of bioactive compounds that are in use as antimicrobial and anticancer drugs, fungicides, herbicides and pesticides. In the last decade, the increasing availability of microbial genomes has established genome mining as a very important method for the identification of their biosynthetic gene clusters (BGCs). One of the most popular tools for this task is antiSMASH. However, so far, antiSMASH is limited to de novo computing results for user-submitted genomes and only partially connects these with BGCs from other organisms. Therefore, we developed the antiSMASH database, a simple but highly useful new resource to browse antiSMASH-annotated BGCs in the currently 3907 bacterial genomes in the database and perform advanced search queries combining multiple search criteria. antiSMASH-DB is available at http://antismash-db.secondarymetabolites.org/., (© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2017
- Full Text
- View/download PDF
40. BioVeL: a virtual laboratory for data analysis and modelling in biodiversity science and ecology.
- Author
-
Hardisty AR, Bacall F, Beard N, Balcázar-Vargas MP, Balech B, Barcza Z, Bourlat SJ, De Giovanni R, de Jong Y, De Leo F, Dobor L, Donvito G, Fellows D, Guerra AF, Ferreira N, Fetyukova Y, Fosso B, Giddy J, Goble C, Güntsch A, Haines R, Ernst VH, Hettling H, Hidy D, Horváth F, Ittzés D, Ittzés P, Jones A, Kottmann R, Kulawik R, Leidenberger S, Lyytikäinen-Saarenmaa P, Mathew C, Morrison N, Nenadic A, de la Hidalga AN, Obst M, Oostermeijer G, Paymal E, Pesole G, Pinto S, Poigné A, Fernandez FQ, Santamaria M, Saarenmaa H, Sipos G, Sylla KH, Tähtinen M, Vicario S, Vos RA, Williams AR, and Yilmaz P
- Subjects
- Ecology instrumentation, Internet, Models, Biological, Software, Workflow, Biodiversity, Ecology methods
- Abstract
Background: Making forecasts about biodiversity and giving support to policy relies increasingly on large collections of data held electronically, and on substantial computational capability and capacity to analyse, model, simulate and predict using such data. However, the physically distributed nature of data resources and of expertise in advanced analytical tools creates many challenges for the modern scientist. Across the wider biological sciences, presenting such capabilities on the Internet (as "Web services") and using scientific workflow systems to compose them for particular tasks is a practical way to carry out robust "in silico" science. However, use of this approach in biodiversity science and ecology has thus far been quite limited., Results: BioVeL is a virtual laboratory for data analysis and modelling in biodiversity science and ecology, freely accessible via the Internet. BioVeL includes functions for accessing and analysing data through curated Web services; for performing complex in silico analysis through exposure of R programs, workflows, and batch processing functions; for on-line collaboration through sharing of workflows and workflow runs; for experiment documentation through reproducibility and repeatability; and for computational support via seamless connections to supporting computing infrastructures. We developed and improved more than 60 Web services with significant potential in many different kinds of data analysis and modelling tasks. We composed reusable workflows using these Web services, also incorporating R programs. Deploying these tools into an easy-to-use and accessible 'virtual laboratory', free via the Internet, we applied the workflows in several diverse case studies. We opened the virtual laboratory for public use and through a programme of external engagement we actively encouraged scientists and third party application and tool developers to try out the services and contribute to the activity., Conclusions: Our work shows we can deliver an operational, scalable and flexible Internet-based virtual laboratory to meet new demands for data processing and analysis in biodiversity science and ecology. In particular, we have successfully integrated existing and popular tools and practices from different scientific disciplines to be used in biodiversity and ecological research.
- Published
- 2016
- Full Text
- View/download PDF
41. MyOSD 2014: Evaluating Oceanographic Measurements Contributed by Citizen Scientists in Support of Ocean Sampling Day.
- Author
-
Schnetzer J, Kopf A, Bietz MJ, Buttigieg PL, Fernandez-Guerra A, Ristov AP, Glöckner FO, and Kottmann R
- Abstract
The first Ocean Sampling Day (OSD) took place on June 21, 2014. In a coordinated effort, an internationally distributed group of scientists collected samples from marine surface waters in order to study microbial diversity on a single day with global granularity. Concurrently, citizen scientists enriched the OSD initiative through the MyOSD project, providing additional oceanographic measurements crucial to the contextualization of microbial diversity. Clear protocols, a user-friendly smartphone application, and an online web-form guided citizens in accurate data acquisition, promoting quality submissions to the project's information system. To evaluate the coverage and quality of MyOSD data submissions, we compared the sea surface temperature measurements acquired through OSD, MyOSD, and automatic in situ systems and satellite measurements. Our results show that the quality of citizen-science measurements was comparable to that of scientific measurements. As 79% of MyOSD measurements were conducted in geographic areas not covered by automatic in situ or satellite measurement, citizen scientists contributed significantly to worldwide oceanographic data gathering. Furthermore, survey results indicate that participation in MyOSD made citizens feel more engaged in ocean issues and may have increased their environmental awareness and ocean literacy.
- Published
- 2016
- Full Text
- View/download PDF
42. Minimum Information about a Biosynthetic Gene cluster.
- Author
-
Medema MH, Kottmann R, Yilmaz P, Cummings M, Biggins JB, Blin K, de Bruijn I, Chooi YH, Claesen J, Coates RC, Cruz-Morales P, Duddela S, Düsterhus S, Edwards DJ, Fewer DP, Garg N, Geiger C, Gomez-Escribano JP, Greule A, Hadjithomas M, Haines AS, Helfrich EJ, Hillwig ML, Ishida K, Jones AC, Jones CS, Jungmann K, Kegler C, Kim HU, Kötter P, Krug D, Masschelein J, Melnik AV, Mantovani SM, Monroe EA, Moore M, Moss N, Nützmann HW, Pan G, Pati A, Petras D, Reen FJ, Rosconi F, Rui Z, Tian Z, Tobias NJ, Tsunematsu Y, Wiemann P, Wyckoff E, Yan X, Yim G, Yu F, Xie Y, Aigle B, Apel AK, Balibar CJ, Balskus EP, Barona-Gómez F, Bechthold A, Bode HB, Borriss R, Brady SF, Brakhage AA, Caffrey P, Cheng YQ, Clardy J, Cox RJ, De Mot R, Donadio S, Donia MS, van der Donk WA, Dorrestein PC, Doyle S, Driessen AJ, Ehling-Schulz M, Entian KD, Fischbach MA, Gerwick L, Gerwick WH, Gross H, Gust B, Hertweck C, Höfte M, Jensen SE, Ju J, Katz L, Kaysser L, Klassen JL, Keller NP, Kormanec J, Kuipers OP, Kuzuyama T, Kyrpides NC, Kwon HJ, Lautru S, Lavigne R, Lee CY, Linquan B, Liu X, Liu W, Luzhetskyy A, Mahmud T, Mast Y, Méndez C, Metsä-Ketelä M, Micklefield J, Mitchell DA, Moore BS, Moreira LM, Müller R, Neilan BA, Nett M, Nielsen J, O'Gara F, Oikawa H, Osbourn A, Osburne MS, Ostash B, Payne SM, Pernodet JL, Petricek M, Piel J, Ploux O, Raaijmakers JM, Salas JA, Schmitt EK, Scott B, Seipke RF, Shen B, Sherman DH, Sivonen K, Smanski MJ, Sosio M, Stegmann E, Süssmuth RD, Tahlan K, Thomas CM, Tang Y, Truman AW, Viaud M, Walton JD, Walsh CT, Weber T, van Wezel GP, Wilkinson B, Willey JM, Wohlleben W, Wright GD, Ziemert N, Zhang C, Zotchev SB, Breitling R, Takano E, and Glöckner FO
- Subjects
- Alkaloids biosynthesis, Bacteria metabolism, Databases, Genetic, Fungi metabolism, Genetic Markers, International Cooperation, Metagenome, Peptide Biosynthesis, Nucleic Acid-Independent, Peptides metabolism, Plants metabolism, Polyketides metabolism, Polysaccharides biosynthesis, Terminology as Topic, Terpenes metabolism, Bacteria genetics, Computational Biology standards, Fungi genetics, Multigene Family, Plants genetics, Protein Biosynthesis
- Published
- 2015
- Full Text
- View/download PDF
43. Marine microbial biodiversity, bioinformatics and biotechnology (M2B3) data reporting and service standards.
- Author
-
Ten Hoopen P, Pesant S, Kottmann R, Kopf A, Bicak M, Claus S, Deneudt K, Borremans C, Thijsse P, Dekeyzer S, Schaap DM, Bowler C, Glöckner FO, and Cochrane G
- Abstract
Contextual data collected concurrently with molecular samples are critical to the use of metagenomics in the fields of marine biodiversity, bioinformatics and biotechnology. We present here Marine Microbial Biodiversity, Bioinformatics and Biotechnology (M2B3) standards for "Reporting" and "Serving" data. The M2B3 Reporting Standard (1) describes minimal mandatory and recommended contextual information for a marine microbial sample obtained in the epipelagic zone, (2) includes meaningful information for researchers in the oceanographic, biodiversity and molecular disciplines, and (3) can easily be adopted by any marine laboratory with minimum sampling resources. The M2B3 Service Standard defines a software interface through which these data can be discovered and explored in data repositories. The M2B3 Standards were developed by the European project Micro B3, funded under 7(th) Framework Programme "Ocean of Tomorrow", and were first used with the Ocean Sampling Day initiative. We believe that these standards have value in broader marine science.
- Published
- 2015
- Full Text
- View/download PDF
44. The founding charter of the Genomic Observatories Network.
- Author
-
Davies N, Field D, Amaral-Zettler L, Clark MS, Deck J, Drummond A, Faith DP, Geller J, Gilbert J, Glöckner FO, Hirsch PR, Leong JA, Meyer C, Obst M, Planes S, Scholin C, Vogler AP, Gates RD, Toonen R, Berteaux-Lecellier V, Barbier M, Barker K, Bertilsson S, Bicak M, Bietz MJ, Bobe J, Bodrossy L, Borja A, Coddington J, Fuhrman J, Gerdts G, Gillespie R, Goodwin K, Hanson PC, Hero JM, Hoekman D, Jansson J, Jeanthon C, Kao R, Klindworth A, Knight R, Kottmann R, Koo MS, Kotoulas G, Lowe AJ, Marteinsson VT, Meyer F, Morrison N, Myrold DD, Pafilis E, Parker S, Parnell JJ, Polymenakou PN, Ratnasingham S, Roderick GK, Rodriguez-Ezpeleta N, Schonrogge K, Simon N, Valette-Silver NJ, Springer YP, Stone GN, Stones-Havas S, Sansone SA, Thibault KM, Wecker P, Wichels A, Wooley JC, Yahara T, and Zingone A
- Abstract
The co-authors of this paper hereby state their intention to work together to launch the Genomic Observatories Network (GOs Network) for which this document will serve as its Founding Charter. We define a Genomic Observatory as an ecosystem and/or site subject to long-term scientific research, including (but not limited to) the sustained study of genomic biodiversity from single-celled microbes to multicellular organisms.An international group of 64 scientists first published the call for a global network of Genomic Observatories in January 2012. The vision for such a network was expanded in a subsequent paper and developed over a series of meetings in Bremen (Germany), Shenzhen (China), Moorea (French Polynesia), Oxford (UK), Pacific Grove (California, USA), Washington (DC, USA), and London (UK). While this community-building process continues, here we express our mutual intent to establish the GOs Network formally, and to describe our shared vision for its future. The views expressed here are ours alone as individual scientists, and do not necessarily represent those of the institutions with which we are affiliated.
- Published
- 2014
- Full Text
- View/download PDF
45. A metagenomics portal for a democratized sequencing world.
- Author
-
Wilke A, Glass EM, Bartels D, Bischof J, Braithwaite D, D'Souza M, Gerlach W, Harrison T, Keegan K, Matthews H, Kottmann R, Paczian T, Tang W, Trimble WL, Yilmaz P, Wilkening J, Desai N, and Meyer F
- Subjects
- Bacteria classification, Bacteria genetics, Genome, Bacterial, High-Throughput Nucleotide Sequencing, Internet, Computational Biology methods, Metagenomics, Software
- Abstract
The democratized world of sequencing is leading to numerous data analysis challenges; MG-RAST addresses many of these challenges for diverse datasets, including amplicon datasets, shotgun metagenomes, and metatranscriptomes. The changes from version 2 to version 3 include the addition of a dedicated gene calling stage using FragGenescan, clustering of predicted proteins at 90% identity, and the use of BLAT for the computation of similarities. Together with changes in the underlying software infrastructure, this has enabled the dramatic scaling up of pipeline throughput while remaining on a limited hardware budget. The Web-based service allows upload, fully automated analysis, and visualization of results. As a result of the plummeting cost of sequencing and the readily available analytical power of MG-RAST, over 78,000 metagenomic datasets have been analyzed, with over 12,000 of them publicly available in MG-RAST., (© 2013 Elsevier Inc. All rights reserved.)
- Published
- 2013
- Full Text
- View/download PDF
46. RCN4GSC Workshop Report: Modeling a Testbed for Managing Data at the Interface of Biodiversity and (Meta)Genomics, April 2011.
- Author
-
Robbins RJ, Cochrane G, Davies N, Dawyndt P, Kottmann R, Krishtalka LK, Morrison N, Tuama EÓ, San Gil I, and Wooley J
- Abstract
At the GSC11 meeting (4-6 April 2011, Hinxton, England, the GSC's genomic biodiversity working group (GBWG) developed an initial model for a data management testbed at the interface of biodiversity with genomics and metagenomics. With representatives of the Global Biodiversity Information Facility (GBIF) participating, it was agreed that the most useful course of action would be for GBIF to collaborate with the GSC in its ongoing GBWG workshops to achieve common goals around interoperability/data integration across (meta)-genomic and species level data. It was determined that a quick comparison should be made of the contents of the Darwin Core (DwC) and the GSC data checklists, with a goal of determining their degree of overlap and compatibility. An ad-hoc task group lead by Renzo Kottman and Peter Dawyndt undertook an initial comparison between the Darwin Core (DwC) standard used by the Global Biodiversity Information Facility (GBIF) and the MIxS checklists put forward by the Genomic Standards Consortium (GSC). A term-by-term comparison showed that DwC and GSC concepts complement each other far more than they compete with each other. Because the preliminary analysis done at this meeting was based on expertise with GSC standards, but not with DwC standards, the group recommended that a joint meeting of DwC and GSC experts be convened as soon as possible to continue this joint assessment and to propose additional work going forward.
- Published
- 2012
- Full Text
- View/download PDF
47. A second special issue of Standards In Genomic Sciences from the Genomic Standards Consortium.
- Author
-
Field D, Kottmann R, Morrison N, and Sterk P
- Published
- 2011
- Full Text
- View/download PDF
48. Quantifying the effect of environment stability on the transcription factor repertoire of marine microbes.
- Author
-
Kostadinov I, Kottmann R, Ramette A, Waldmann J, Buttigieg PL, and Glöckner FO
- Abstract
Background: DNA-binding transcription factors (TFs) regulate cellular functions in prokaryotes, often in response to environmental stimuli. Thus, the environment exerts constant selective pressure on the TF gene content of microbial communities. Recently a study on marine Synechococcus strains detected differences in their genomic TF content related to environmental adaptation, but so far the effect of environmental parameters on the content of TFs in bacterial communities has not been systematically investigated., Results: We quantified the effect of environment stability on the transcription factor repertoire of marine pelagic microbes from the Global Ocean Sampling (GOS) metagenome using interpolated physico-chemical parameters and multivariate statistics. Thirty-five percent of the difference in relative TF abundances between samples could be explained by environment stability. Six percent was attributable to spatial distance but none to a combination of both spatial distance and stability. Some individual TFs showed a stronger relationship to environment stability and space than the total TF pool., Conclusions: Environmental stability appears to have a clearly detectable effect on TF gene content in bacterioplanktonic communities described by the GOS metagenome. Interpolated environmental parameters were shown to compare well to in situ measurements and were essential for quantifying the effect of the environment on the TF content. It is demonstrated that comprehensive and well-structured contextual data will strongly enhance our ability to interpret the functional potential of microbes from metagenomic data.
- Published
- 2011
- Full Text
- View/download PDF
49. Enriching public descriptions of marine phages using the Genomic Standards Consortium MIGS standard.
- Author
-
Duhaime MB, Kottmann R, Field D, and Glöckner FO
- Abstract
In any sequencing project, the possible depth of comparative analysis is determined largely by the amount and quality of the accompanying contextual data. The structure, content, and storage of this contextual data should be standardized to ensure consistent coverage of all sequenced entities and facilitate comparisons. The Genomic Standards Consortium (GSC) has developed the "Minimum Information about Genome/Metagenome Sequences (MIGS/MIMS)" checklist for the description of genomes and here we annotate all 30 publicly available marine bacteriophage sequences to the MIGS standard. These annotations build on existing International Nucleotide Sequence Database Collaboration (INSDC) records, and confirm, as expected that current submissions lack most MIGS fields. MIGS fields were manually curated from the literature and placed in XML format as specified by the Genomic Contextual Data Markup Language (GCDML). These "machine-readable" reports were then analyzed to highlight patterns describing this collection of genomes. Completed reports are provided in GCDML. This work represents one step towards the annotation of our complete collection of genome sequences and shows the utility of capturing richer metadata along with raw sequences.
- Published
- 2011
- Full Text
- View/download PDF
50. Data shopping in an open marketplace: Introducing the Ontogrator web application for marking up data using ontologies and browsing using facets.
- Author
-
Morrison N, Hancock D, Hirschman L, Dawyndt P, Verslyppe B, Kyrpides N, Kottmann R, Yilmaz P, Glöckner FO, Grethe J, Booth T, Sterk P, Nenadic G, and Field D
- Abstract
In the future, we hope to see an open and thriving data market in which users can find and select data from a wide range of data providers. In such an open access market, data are products that must be packaged accordingly. Increasingly, eCommerce sellers present heterogeneous product lines to buyers using faceted browsing. Using this approach we have developed the Ontogrator platform, which allows for rapid retrieval of data in a way that would be familiar to any online shopper. Using Knowledge Organization Systems (KOS), especially ontologies, Ontogrator uses text mining to mark up data and faceted browsing to help users navigate, query and retrieve data. Ontogrator offers the potential to impact scientific research in two major ways: 1) by significantly improving the retrieval of relevant information; and 2) by significantly reducing the time required to compose standard database queries and assemble information for further research. Here we present a pilot implementation developed in collaboration with the Genomic Standards Consortium (GSC) that includes content from the StrainInfo, GOLD, CAMERA, Silva and Pubmed databases. This implementation demonstrates the power of ontogration and highlights that the usefulness of this approach is fully dependent on both the quality of data and the KOS (ontologies) used. Ideally, the use and further expansion of this collaborative system will help to surface issues associated with the underlying quality of annotation and could lead to a systematic means for accessing integrated data resources.
- Published
- 2011
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.