1. Tripal v3: an ontology-based toolkit for construction of FAIR biological community databases
- Author
Ming Chen, F. Alex Feltus, Jill L. Wegrzyn, Margaret Staton, Helena Rasche, Abdullah Almsaeed, Lacey-Anne Sanderson, Shawna Spoor, Chun-Huai Cheng, Kirstin E. Bett, Stephen P. Ficklin, Sook Jung, Anthony Bretaudeau, Bradford Condon, Dorrie Main
Affiliations and funding: Washington State University (WSU), University of Saskatchewan, University of Tennessee, Institut de Génétique, Environnement et Protection des Plantes (IGEPP), Institut National de la Recherche Agronomique (INRA)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-AGROCAMPUS OUEST, University of Freiburg [Freiburg], University of Connecticut (UCONN), Clemson University, BRE060, Saskatchewan Pulse Growers, 8302, Genome Canada, USDA-ARS, U.S. Dry Pea and Lentil Council, Northern Pulse Growers, Cotton Incorporated, Washington Tree Fruit Research Commission, 1443040, NSF DIBBs, 1444573, NSF PGRP, USDA NIFA NRSP10, 2014-51181-2237, USDA NIFA SCRI, University of Saskatchewan [Saskatoon] (U of S), Institut National de la Recherche Agronomique (INRA)-Université de Rennes (UR)-AGROCAMPUS OUEST, Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)-Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)
- Subjects
Computer science; Biological database; Ontology (information science); computer.software_genre; Data type; General Biochemistry, Genetics and Molecular Biology; 03 medical and health sciences; Resource (project management); open science; Databases, Genetic; ontology; database; ComputingMilieux_MISCELLANEOUS; 030304 developmental biology; 0303 health sciences; Internet; [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]; Database; Community engagement; Application programming interface; Information Dissemination; 030302 biochemistry & molecular biology; Genomics; Online community; Biota; Data sharing; Original Article; [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]; General Agricultural and Biological Sciences; Transcriptome; computer; Software; Information Systems
- Abstract
Community biological databases provide an important online resource for both public and private data, analysis tools and community engagement. These sites house genomic, transcriptomic, genetic, breeding and ancillary data for specific species, families or clades. Due to the complexity and increasing quantities of these data, construction of online resources is increasingly difficult, especially with limited funding and limited access to technical expertise. Furthermore, online repositories are expected to promote the FAIR data principles (findable, accessible, interoperable and reusable), which presents additional challenges. The open-source Tripal database toolkit seeks to mitigate these challenges by providing both the software and an interactive community of developers for the construction of online community databases. Additionally, through coordinated, distributed co-development, Tripal sites encourage community-wide sustainability. Here, we report the release of Tripal version 3, which improves data accessibility and data sharing through systematic use of controlled vocabularies (CVs). Tripal uses the community-developed Chado database as its default data store, but now provides tools to support other data stores while ensuring that CVs remain the central organizational structure for the data. A new site developer can use Tripal to develop a basic site with little to no programming, and can integrate other data types using extension modules and the Tripal application programming interface. A thorough online User’s Guide and Developer’s Handbook are available at http://tripal.info, providing download, installation and step-by-step setup instructions.
- Published
- 2019