1. Names are not good enough: Reasoning over taxonomic change in the Andropogon complex1
- Author
-
Bertram Ludäscher, Mingmin Chen, Shizhuo Yu, Shawn Bowers, Parisa Kianmajd, Nico M. Franz, and Alan S. Weakley
- Subjects
0106 biological sciences ,0301 basic medicine ,Knowledge representation and reasoning ,Computer Networks and Communications ,business.industry ,Computer science ,Region connection calculus ,computer.software_genre ,010603 evolutionary biology ,01 natural sciences ,Computer Science Applications ,Identifier ,03 medical and health sciences ,Answer set programming ,030104 developmental biology ,Congruence (geometry) ,Pairwise comparison ,Artificial intelligence ,business ,computer ,Ontology alignment ,Natural language processing ,Information Systems - Abstract
We present a novel, logic-based solution to the challenge of reconciling the meanings of taxonomic names across multiple biological taxonomies. The challenge arises due to limitations inherent in using type-anchored taxonomic names as identifiers of granular semantic similarities and differences being expressed in original and revised taxonomic classifications. We address this challenge through: (1) the use of taxonomic concept labels - thereby individuating name usages according to particular sources and allowing each taxonomy to be recognized separately; (2) sets of user-provided Region Connection Calculus articulations among concepts (RCC-5: congruence, proper inclusion, inverse proper inclusion, overlap, exclusion); and (3) the use of an Answer Set Programming-based reasoning toolkit that ingests these constraints to infer and visualize consistent multi-taxonomy alignments. The feasibility of this approach is demonstrated with a use case involving pairwise alignments of 11 non-congruent classifications of Eastern United States grass entities variously assigned to the Andropogon glomeratus- virginicus 'complex' over an interval of 126 years. Analyses of name:meaning identity reveal that, on average, taxonomic names are reliable identifiers of taxonomic non-/congruence for approximately 60% of the 127 merge regions obtained in 12 pairwise alignments. The name:meaning cardinality over the entire time interval ranges from 1:6 to 4:1, with only 1:36 names attaining the semantically ideal 1:1 ratio. We discuss the applicability of the RCC-5 alignment approach in the context of achieving logic-based integration of non-/congruent taxonomic concept hierarchies in dynamic biodiversity data environments.
- Published
- 2016