Back to Search Start Over

Towards a Taxonomy Machine: A Training Set of 5.6 Million Arthropod Images

Authors :
Dirk Steinke
Sujeevan Ratnasingham
Jireh Agda
Hamzah Ait Boutou
Isaiah C. H. Box
Mary Boyle
Dean Chan
Corey Feng
Scott C. Lowe
Jaclyn T. A. McKeown
Joschka McLeod
Alan Sanchez
Ian Smith
Spencer Walker
Catherine Y.-Y. Wei
Paul D. N. Hebert
Source :
Data, Vol 9, Iss 11, p 122 (2024)
Publication Year :
2024
Publisher :
MDPI AG, 2024.

Abstract

The taxonomic identification of organisms from images is an active research area within the machine learning community. Current algorithms are very effective for object recognition and discrimination, but they require extensive training datasets to generate reliable assignments. This study releases 5.6 million images with representatives from 10 arthropod classes and 26 insect orders. All images were taken using a Keyence VHX-7000 Digital Microscope system with an automatic stage to permit high-resolution (4K) microphotography. Providing phenotypic data for 324,000 species derived from 48 countries, this release represents, by far, the largest dataset of standardized arthropod images. As such, this dataset is well suited for testing the efficacy of machine learning algorithms for identifying specimens into higher taxonomic categories.

Details

Language :
English
ISSN :
23065729
Volume :
9
Issue :
11
Database :
Directory of Open Access Journals
Journal :
Data
Publication Type :
Academic Journal
Accession number :
edsdoj.1b005456467499cba8b8f165eb3543b
Document Type :
article
Full Text :
https://doi.org/10.3390/data9110122