Back to Search
Start Over
A New Phylogenetic Inference Based on Genetic Attribute Reduction for Morphological Data
- Source :
- Entropy, Volume 21, Issue 3, Entropy, Vol 21, Iss 3, p 313 (2019)
- Publication Year :
- 2019
- Publisher :
- Multidisciplinary Digital Publishing Institute, 2019.
-
Abstract
- To address the instability of phylogenetic trees in morphological datasets caused by missing values, we present a phylogenetic inference method based on a concept decision tree (CDT) in conjunction with attribute reduction. First, a reliable initial phylogenetic seed tree is created using a few species with relatively complete morphological information by using biologists&rsquo<br />prior knowledge or by applying existing tools such as MrBayes. Second, using a top-down data processing approach, we construct concept-sample templates by performing attribute reduction at each node in the initial phylogenetic seed tree. In this way, each node is turned into a decision point with multiple concept-sample templates, providing decision-making functions for grafting. Third, we apply a novel matching algorithm to evaluate the degree of similarity between the species&rsquo<br />attributes and their concept-sample templates and to determine the location of the species in the initial phylogenetic seed tree. In this manner, the phylogenetic tree is established step by step. We apply our algorithm to several datasets and compare it with the maximum parsimony, maximum likelihood, and Bayesian inference methods using the two evaluation criteria of accuracy and stability. The experimental results indicate that as the proportion of missing data increases, the accuracy of the CDT method remains at 86.5%, outperforming all other methods and producing a reliable phylogenetic tree.
- Subjects :
- 0106 biological sciences
Computer science
Grafting (decision trees)
Decision tree
General Physics and Astronomy
lcsh:Astrophysics
Bayesian inference
010603 evolutionary biology
01 natural sciences
Article
010104 statistics & probability
lcsh:QB460-466
morphological analysis
phylogenetic tree
0101 mathematics
lcsh:Science
Blossom algorithm
Seed tree
Phylogenetic tree
business.industry
information entropy
Pattern recognition
Missing data
lcsh:QC1-999
Maximum parsimony
ComputingMethodologies_PATTERNRECOGNITION
attribute reduction
lcsh:Q
Artificial intelligence
business
lcsh:Physics
Subjects
Details
- Language :
- English
- ISSN :
- 10994300
- Database :
- OpenAIRE
- Journal :
- Entropy
- Accession number :
- edsair.doi.dedup.....ab28080abf03a24bf9df9f2d150cad7d
- Full Text :
- https://doi.org/10.3390/e21030313