1. Two Influential Primate Classifications Logically Aligned
- Author
-
Shawn Bowers, Shizhuo Yu, Parisa Kianmajd, Bertram Ludäscher, Nico M. Franz, Mingmin Chen, DeeAnn M. Reeder, and Naomi M. Pier
- Subjects
Primates ,0106 biological sciences ,0301 basic medicine ,Systematics ,computer.software_genre ,010603 evolutionary biology ,01 natural sciences ,03 medical and health sciences ,Answer set programming ,Congruence (geometry) ,Species level ,Genetics ,Animals ,ontology ,Region Connection Calculus ,Phylogeny ,Ecology, Evolution, Behavior and Systematics ,Alignment ,Mathematics ,logic ,Phylogenetic tree ,Region connection calculus ,business.industry ,Biodiversity ,Classification ,030104 developmental biology ,Scalability ,reasoning ,Artificial intelligence ,Data mining ,business ,computer ,Natural language processing ,Regular Articles ,concept taxonomy ,Information integration - Abstract
Classifications and phylogenies of perceived natural entities change in the light of new evidence. Taxonomic changes, translated into Code-compliant names, frequently lead to name:meaning dissociations across succeeding treatments. Classification standards such as the Mammal Species of the World (MSW) may experience significant levels of taxonomic change from one edition to the next, with potential costs to long-term, large-scale information integration. This circumstance challenges the biodiversity and phylogenetic data communities to express taxonomic congruence and incongruence in ways that both humans and machines can process, that is, to logically represent taxonomic alignments across multiple classifications. We demonstrate that such alignments are feasible for two classifications of primates corresponding to the second and third MSW editions. Our approach has three main components: (i) use of taxonomic concept labels, that is name sec. author (where sec. means according to), to assemble each concept hierarchy separately via parent/child relationships; (ii) articulation of select concepts across the two hierarchies with user-provided Region Connection Calculus (RCC-5) relationships; and (iii) the use of an Answer Set Programming toolkit to infer and visualize logically consistent alignments of these input constraints. Our use case entails the Primates sec. Groves (1993; MSW2-317 taxonomic concepts; 233 at the species level) and Primates sec. Groves (2005; MSW3-483 taxonomic concepts; 376 at the species level). Using 402 RCC-5 input articulations, the reasoning process yields a single, consistent alignment and 153,111 Maximally Informative Relations that constitute a comprehensive meaning resolution map for every concept pair in the Primates sec. MSW2/MSW3. The complete alignment, and various partitions thereof, facilitate quantitative analyses of name:meaning dissociation, revealing that nearly one in three taxonomic names are not reliable across treatments-in the sense of the same name identifying congruent taxonomic meanings. The RCC-5 alignment approach is potentially widely applicable in systematics and can achieve scalable, precise resolution of semantically evolving name usages in synthetic, next-generation biodiversity, and phylogeny data platforms.
- Published
- 2016