Back to Search
Start Over
Revision and annotation of DNA barcode records for marine invertebrates: Report of the 8th iBOL conference hackathon
- Source :
- Metabarcoding and Metagenomics 5: e67862, Metabarcoding and Metagenomics, Vol 5, Iss, Pp 207-217 (2021)
- Publication Year :
- 2021
- Publisher :
- Pensoft Publishers, 2021.
-
Abstract
- The accuracy of specimen identification through DNA barcoding and metabarcoding relies on reference libraries containing records with reliable taxonomy and sequence quality. The considerable growth in barcode data requires stringent data curation, especially in taxonomically difficult groups such as marine invertebrates. A major effort in curating marine barcode data in the Barcode of Life Data Systems (BOLD) was undertaken during the 8th International Barcode of Life Conference (Trondheim, Norway, 2019). Major taxonomic groups (crustaceans, echinoderms, molluscs, and polychaetes) were reviewed to identify those which had disagreement between Linnaean names and Barcode Index Numbers (BINs). The records with disagreement were annotated with four tags: A) MIS-ID (misidentified, mislabeled, or contaminated records), b) AMBIG (ambiguous records unresolved with the existing data), c) COMPLEX (species names occurring in multiple BINs), and d) SHARE (barcodes shared between species). A total of 83,712 specimen records corresponding to 7,576 species were reviewed and 39% of the species were tagged (7% MIS-ID, 17% AMBIG, 14% COMPLEX, and 1% SHARE). High percentages (>50%) of AMBIG tags were recorded in gastropods, whereas COMPLEX tags dominated in crustaceans and polychaetes. The high proportion of tagged species reflects either flaws in the barcoding workflow (e.g., misidentification, cross-contamination) or taxonomic difficulties (e.g., synonyms, undescribed species). Although data curation is essential for barcode applications, such manual attempts to examine large datasets are unsustainable and automated solutions are extremely desirable.<br />The hackathon was organized with financial support from the European Union COST Action DNAqua-Net (CA 15219 https://dnaqua.net/) in the scope of the 8th International Barcode of Life Conference in Trondheim, Norway on 16 June 2019. DNAqua-Net is acknowledged for the funding provided and the local conference organizers for all the logistical support that ensured a successful event. Tyler Elliot and the rest of the BOLD team are acknowledged for their help with data queries and analytics. The authors also thank the hackathon participants for vibrant discussions during and after the event: Berry van der Hoorn, Katrine Konsghavn, Guy Paz, Mouna Rifi, Malin Strand, Anne Helene Tandberg, Adam Wall, and Endre Willassen. Marcos A. L. Teixeira was supported by a PhD grant from the Portuguese Foundation for Science and Technology (FCT I.P.) co-financed by ESF (SFRH/BD/131527/2017). Financial support granted by FCT to Sofia Duarte (CEECIND/00667/2017) and to Pedro E. Vieira (project NIS-DNA, PTDC/BIA-BMA/29754/2017) is also acknowledged. Sanna Majaneva was financially supported by the Norwegian Taxonomy Initiative (project no. 70184235). The authors thank the five reviewers who provided valuable input into the earlier version of the manuscript.
- Subjects :
- 0106 biological sciences
0303 health sciences
Ecology
Data curation
Marine invertebrates
Annotation
010603 evolutionary biology
01 natural sciences
03 medical and health sciences
Genetics
Metabarcoding
Reference libraries
Animal Science and Zoology
DNA barcoding
14. Life underwater
Molecular Biology
QH540-549.5
030304 developmental biology
Nature and Landscape Conservation
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Metabarcoding and Metagenomics 5: e67862, Metabarcoding and Metagenomics, Vol 5, Iss, Pp 207-217 (2021)
- Accession number :
- edsair.doi.dedup.....fb66ae5eacf9f75fb9df949b5fad84e7