Author: "J A, Cherry" / Journal: database - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"J A, Cherry"' showing total 9 results

1. Prevention of data duplication for high throughput sequencing repositories

Author: J. Seth Strattan, Carrie A. Davis, Forrest Y. Tanaka, Benjamin C. Hitz, J. Michael Cherry, Keenan Graham, Jean M. Davidson, Jason A. Hilton, Idan Gabdank, Kathrina C. Onate, Stuart R. Miyasato, Otto Jolanki, Timothy R. Dreszer, Esther T. Chan, Aditi K. Narayanan, Ulugbek K. Baymuradov, and Cricket A. Sloan
Subjects: 0301 basic medicine, Computer science, business.industry, Extramural, MEDLINE, Computational biology, General Biochemistry, Genetics and Molecular Biology, DNA sequencing, 03 medical and health sciences, 030104 developmental biology, 0302 clinical medicine, Text mining, Data deduplication, Original Article, Databases, Nucleic Acid, General Agricultural and Biological Sciences, business, Data Curation, 030217 neurology & neurosurgery, Information Systems
Abstract: Prevention of unintended duplication is one of the ongoing challenges many databases have to address. Working with high-throughput sequencing data, the complexity of that challenge increases with the complexity of the definition of a duplicate. In a computational data model, a data object represents a real entity like a reagent or a biosample. This representation is similar to how a card represents a book in a paper library catalog. Duplicated data objects not only waste storage, they can mislead users into assuming the model represents more than the single entity. Even if it is clear that two objects represent a single entity, data duplication opens the door to potential inconsistencies between the objects since the content of the duplicated objects can be updated independently, allowing divergence of the metadata associated with the objects. This is analogous to a situation in which a paper library catalog mistakenly contains two cards for a single copy of a book. If these cards simultaneously list two different individuals as the current borrower, it would be difficult to determine which of the two actually has the book. Unfortunately, in a large database with multiple submitters, unintended duplication is to be expected. In this article, we present three principal guidelines the Encyclopedia of DNA Elements (ENCODE) Portal follows in order to prevent unintended duplication of both actual files and data objects: definition of identifiable data objects (I), object uniqueness validation (II) and de-duplication mechanism (III). In addition to explaining our modus operandi, we elaborate on the methods used for identification of sequencing data files. Comparison of the approach taken by the ENCODE Portal vs other widely used biological data repositories is provided. Database URL: https://www.encodeproject.org/
Published: 2018
Full Text: View/download PDF

2. Curated protein information in the Saccharomyces genome database

Author: Kelley Paskov, Edith D. Wong, Sage T. Hellerstedt, Kalpana Karra, J. Michael Cherry, Robert S. Nash, Stacia R. Engel, and Shuai Weng
Subjects: 0301 basic medicine, Protein function, Saccharomyces cerevisiae Proteins, Proteome, 030102 biochemistry & molecular biology, Saccharomyces genome database, Molecular Sequence Annotation, Saccharomyces cerevisiae, Computational biology, Biology, Bioinformatics, Genome, General Biochemistry, Genetics and Molecular Biology, 03 medical and health sciences, ComputingMethodologies_PATTERNRECOGNITION, 030104 developmental biology, Original Article, Genome, Fungal, Databases, Protein, General Agricultural and Biological Sciences, Information Systems
Abstract: Due to recent advancements in the production of experimental proteomic data, the Saccharomyces genome database (SGD; www.yeastgenome.org) has been expanding our protein curation activities to make new data types available to our users. Because of broad interest in post-translational modifications (PTM) and their importance to protein function and regulation, we have recently started incorporating expertly curated PTM information on individual protein pages. Here we also present the inclusion of new abundance and protein half-life data obtained from high-throughput proteome studies. These new data types have been included with the aim to facilitate cellular biology research. Database URL: www.yeastgenome.org
Published: 2017
Full Text: View/download PDF

3. Outreach and online training services at the Saccharomyces Genome Database

Author: Robert S. Nash, Marek S. Skrzypek, Stacia R. Engel, Kevin A. MacPherson, Olivia W. Lang, Kyla S. Dalusag, Sage T. Hellerstedt, Barry Starr, J. Michael Cherry, and Edith D. Wong
Subjects: 0301 basic medicine, Biomedical Research, Resource (biology), Blogging, Computer science, Genomics, Saccharomyces cerevisiae, Bioinformatics, Genome, General Biochemistry, Genetics and Molecular Biology, Digital media, World Wide Web, Public access, 03 medical and health sciences, Databases, Genetic, 030102 biochemistry & molecular biology, Saccharomyces genome database, business.industry, Congresses as Topic, Variety (cybernetics), Outreach, ComputingMethodologies_PATTERNRECOGNITION, 030104 developmental biology, Original Article, Genome, Fungal, General Agricultural and Biological Sciences, business, Information Systems
Abstract: The Saccharomyces Genome Database (SGD; www.yeastgenome.org), the primary genetics and genomics resource for the budding yeast S. cerevisiae, provides free public access to expertly curated information about the yeast genome and its gene products. As the central hub for the yeast research community, SGD engages in a variety of social outreach efforts to inform our users about new developments, promote collaboration, increase public awareness of the importance of yeast to biomedical research, and facilitate scientific discovery. Here we describe these various outreach methods, from networking at scientific conferences to the use of online media such as blog posts and webinars, and include our perspectives on the benefits provided by outreach activities for model organism databases. Database URL: http://www.yeastgenome.org
Published: 2017
Full Text: View/download PDF

4. Ontology application and use at the ENCODE DCC

Author: Marcus Ho, Stuart R. Miyasato, W. James Kent, J. Seth Strattan, Jean M. Davidson, Nikhil R. Podduturi, Cricket A. Sloan, Greg Roe, Eurie L. Hong, Laurence D. Rowe, Brian T. Lee, Esther T. Chan, J. Michael Cherry, Drew T. Erickson, Forrest Y. Tanaka, Benjamin C. Hitz, Venkat S. Malladi, and Matt Simison
Subjects: Information retrieval, Transcription, Genetic, Standardization, Computer science, Experimental data, Molecular Sequence Annotation, Ontology (information science), ENCODE, General Biochemistry, Genetics and Molecular Biology, Set (abstract data type), World Wide Web, Metadata, Mice, Gene Ontology, Databases, Genetic, Encyclopedia, Animals, Humans, Original Article, Gene Regulatory Networks, General Agricultural and Biological Sciences, Data Curation, Information Systems
Abstract: The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a catalog of genomic annotations. To date, the project has generated over 4000 experiments across more than 350 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory network and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All ENCODE experimental data, metadata and associated computational analyses are submitted to the ENCODE Data Coordination Center (DCC) for validation, tracking, storage and distribution to community resources and the scientific community. As the volume of data increases, the organization of experimental details becomes increasingly complicated and demands careful curation to identify related experiments. Here, we describe the ENCODE DCC’s use of ontologies to standardize experimental metadata. We discuss how ontologies, when used to annotate metadata, provide improved searching capabilities and facilitate the ability to find connections within a set of experiments. Additionally, we provide examples of how ontologies are used to annotate ENCODE metadata and how the annotations can be identified via ontology-driven searches at the ENCODE portal. As genomic datasets grow larger and more interconnected, standardization of metadata becomes increasingly vital to allow for exploration and comparison of data between different scientific projects. Database URL: https://www.encodeproject.org/
Published: 2015
Full Text: View/download PDF

5. YeastMine—an integrated data warehouse for Saccharomyces cerevisiae data as a multipurpose tool-kit

Author: Eurie L. Hong, Benjamin C. Hitz, Julie Park, Rama Balakrishnan, Kalpana Karra, Gail Binkley, J. Michael Cherry, Gos Micklem, and Julie Sullivan
Subjects: Computer science, Interface (computing), Saccharomyces cerevisiae, Data type, General Biochemistry, Genetics and Molecular Biology, World Wide Web, User-Computer Interface, 03 medical and health sciences, 0302 clinical medicine, Databases, Genetic, GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries), 030304 developmental biology, Internet, 0303 health sciences, Information retrieval, biology, business.industry, Original Articles, biology.organism_classification, File format, Budding yeast, Data warehouse, Template, Database Management Systems, The Internet, Genome, Fungal, General Agricultural and Biological Sciences, business, 030217 neurology & neurosurgery, Information Systems
Abstract: The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) provides high-quality curated genomic, genetic, and molecular information on the genes and their products of the budding yeast Saccharomyces cerevisiae. To accommodate the increasingly complex, diverse needs of researchers for searching and comparing data, SGD has implemented InterMine (http://www.InterMine.org), an open source data warehouse system with a sophisticated querying interface, to create YeastMine (http://yeastmine.yeastgenome.org). YeastMine is a multifaceted search and retrieval environment that provides access to diverse data types. Searches can be initiated with a list of genes, a list of Gene Ontology terms, or lists of many other data types. The results from queries can be combined for further analysis and saved or downloaded in customizable file formats. Queries themselves can be customized by modifying predefined templates or by creating a new template to access a combination of specific data types. YeastMine offers multiple scenarios in which it can be used such as a powerful search interface, a discovery tool, a curation aid and also a complex database presentation format. Database URL: http://yeastmine.yeastgenome.org
Published: 2012
Full Text: View/download PDF

6. Integration of new alternative reference strain genome sequences into the Saccharomyces genome database

Author: Rama Balakrishnan, Sage T. Hellerstedt, J. Michael Cherry, Janos Demeter, Edith D. Wong, Stacia R. Engel, Gail Binkley, Marek S. Skrzypek, Travis K. Sheppard, Maria C. Costanzo, Robert S. Nash, Kelley Paskov, Kalpana Karra, Shuai Weng, Giltae Song, Kyla S. Dalusag, and Benjamin C. Hitz
Subjects: 0301 basic medicine, Saccharomyces cerevisiae, Locus (genetics), Biology, ENCODE, Genome, General Biochemistry, Genetics and Molecular Biology, Saccharomyces, User-Computer Interface, 03 medical and health sciences, Protein sequencing, Databases, Genetic, natural sciences, Gene, Genetics, Reproducibility of Results, Molecular Sequence Annotation, Genomics, Genome project, biology.organism_classification, 030104 developmental biology, Database Update, Genome, Fungal, General Agricultural and Biological Sciences, Information Systems, Reference genome
Abstract: The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. To provide a wider scope of genetic and phenotypic variation in yeast, the genome sequences and their corresponding annotations from 11 alternative S. cerevisiae reference strains have been integrated into SGD. Genomic and protein sequence information for genes from these strains are now available on the Sequence and Protein tab of the corresponding Locus Summary pages. We illustrate how these genome sequences can be utilized to aid our understanding of strain-specific functional and phenotypic differences. Database URL: www.yeastgenome.org
Published: 2016
Full Text: View/download PDF

7. From one to many: expanding the Saccharomyces cerevisiae reference genome panel

Author: Stacia R. Engel, Shuai Weng, Gail Binkley, Giltae Song, J. Michael Cherry, and Kelley Paskov
Subjects: 0301 basic medicine, Saccharomyces cerevisiae, One-to-many, Computational biology, Biology, Genome, General Biochemistry, Genetics and Molecular Biology, Automation, Open Reading Frames, 03 medical and health sciences, Annotation, Databases, Genetic, Data Mining, Base sequence, Genetics, Base Sequence, Saccharomyces genome database, Reference genome sequence, Reference Standards, biology.organism_classification, 030104 developmental biology, Original Article, Genome, Fungal, General Agricultural and Biological Sciences, Information Systems, Reference genome
Abstract: In recent years, thousands of Saccharomyces cerevisiae genomes have been sequenced to varying degrees of completion. The Saccharomyces Genome Database (SGD) has long been the keeper of the original eukaryotic reference genome sequence, which was derived primarily from S. cerevisiae strain S288C. Because new technologies are pushing S. cerevisiae annotation past the limits of any system based exclusively on a single reference sequence, SGD is actively working to expand the original S. cerevisiae systematic reference sequence from a single genome to a multi-genome reference panel. We first commissioned the sequencing of additional genomes and their automated analysis using the AGAPE pipeline. Here we describe our curation strategy to produce manually reviewed high-quality genome annotations in order to elevate 11 of these additional genomes to Reference status. Database URL: http://www.yeastgenome.org/
Published: 2016
Full Text: View/download PDF

8. DATABASE, The Journal of Biological Databases and Curation, is now the official journal of the International Society for Biocuration

Author: Teresa K. Attwood, Alex Bateman, Monica C. Munoz-Torres, Claire O'Donovan, J. Michael Cherry, P Gaudet, Renate Kania, Chisato Yamasaki, and Marc Robinson-Rechavi
Subjects: Societies, Scientific, Internationality, 030309 nutrition & dietetics, Computer science, MEDLINE, Biological database, External Data Representation, computer.software_genre, General Biochemistry, Genetics and Molecular Biology, World Wide Web, 03 medical and health sciences, Data Mining, Product (category theory), Cooperative Behavior, Letter to the Editor, Pace, 0303 health sciences, Database, Scope (project management), Data science, Variety (cybernetics), Databases as Topic, Work (electrical), Periodicals as Topic, General Agricultural and Biological Sciences, computer, Information Systems
Abstract: The International Society for Biocuration (ISB) was created in 2009 specifically to promote biocuration, the product of multidisciplinary teams of database curators, software developers and bioinformaticians. Biocurators, whose work facilitates research and education across the life sciences, create and maintain a wide variety of online tools and databases, covering topics as diverse as biochemical structures, chromosomal features and phenotypes of mutant genes. These resources are now so essential to the biological community that databases have become integral to the daily work of most researchers. But biological databases are far from static: in addition to the need to constantly capture new knowledge (from the literature, from other databases, from analysis tools, etc.), data representation must also keep pace with current research—new data types must be modeled—and improved methods of data storage, representation and analysis must be continually developed. Nevertheless, such important efforts have not always been recognized, and have often not been published in full, owing to the lack of a suitable journal. DATABASE, The Journal of Biological Databases and Curation, was launched in 2010 to support the growing need of the research community to discuss a range of issues related to the creation, development and maintenance of biological databases, and to strengthen communication between database developers, curators and users. As this resonates strongly with the mission of the ISB, we are delighted to announce that DATABASE has now become the Society’s official journal. The scope of DATABASE includes many areas relevant to the endeavors of the biocuration community. Moreover, DATABASE is an open-access journal, which is critical for biocuration worldwide and one of the core values that the ISB promotes. 
Since its creation, DATABASE has published more than 250 articles, 50 of which have appeared in the Biocuration Virtual Issue, a special collection of articles describing work presented at the annual International Biocuration Conference. Scholarly exchanges among scientists are invaluable for helping a discipline to realize its full potential. To this end, DATABASE and the ISB are excited to be able to work together more closely, a collaboration that we expect will enhance the visibility and impact of biocurators’ work, and hence to increase the value of the Journal to members both of the ISB and of the wider scientific community.
Published: 2013
Full Text: View/download PDF

9. Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study

Author: J. Michael Cherry, Maria C. Costanzo, Julie Park, Rama Balakrishnan, and Eurie L. Hong
Subjects: InterPro, Literature based, Saccharomyces cerevisiae, Biology, Ontology (information science), Genome, General Biochemistry, Genetics and Molecular Biology, 03 medical and health sciences, Annotation, Databases, Genetic, 030304 developmental biology, 0303 health sciences, Information retrieval, Gene ontology, 030302 biochemistry & molecular biology, Computational Biology, Molecular Sequence Annotation, Feasibility Studies, Original Article, Bibliographies as Topic, Genome, Fungal, General Agricultural and Biological Sciences, Software, Scope (computer science), Information Systems
Abstract: Annotation using Gene Ontology (GO) terms is one of the most important ways in which biological information about specific gene products can be expressed in a searchable, computable form that may be compared across genomes and organisms. Because literature-based GO annotations are often used to propagate functional predictions between related proteins, their accuracy is critically important. We present a strategy that employs a comparison of literature-based annotations with computational predictions to identify and prioritize genes whose annotations need review. Using this method, we show that comparison of manually assigned 'unknown' annotations in the Saccharomyces Genome Database (SGD) with InterPro-based predictions can identify annotations that need to be updated. A survey of literature-based annotations and computational predictions made by the Gene Ontology Annotation (GOA) project at the European Bioinformatics Institute (EBI) across several other databases shows that this comparison strategy could be used to maintain and improve the quality of GO annotations for other organisms besides yeast. The survey also shows that although GOA-assigned predictions are the most comprehensive source of functional information for many genomes, a large proportion of genes in a variety of different organisms entirely lack these predictions but do have manual annotations. This underscores the critical need for manually performed, literature-based curation to provide functional information about genes that are outside the scope of widely used computational methods. Thus, the combination of manual and computational methods is essential to provide the most accurate and complete functional annotation of a genome. Database URL: http://www.yeastgenome.org.
Published: 2011
Full Text: View/download PDF
