A k nearest neighbour ensemble via extended neighbourhood rule and feature subsets.

Authors :
Ali, Amjad
Hamraz, Muhammad
Gul, Naz
Khan, Dost Muhammad
Aldahmani, Saeed
Khan, Zardad
Source :
Pattern Recognition. Oct2023, Vol. 142, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

• Introducing a novel nearest neighbourhood rule for kNN ensemble.
• Improving classification performance in data with diverse patterns.
• Building a kNN ensemble to achieve high classification performance by making the base models diversified.
• Availability of the proposed method as an R package implementation on CRAN (https://cran.r-project.org/web/packages/ExNRuleEnsemble/index.html).

kNN based ensemble methods minimise the effect of outliers by identifying a set of data points in the given feature space that are nearest to an unseen observation and predicting its response by majority voting. Ordinary kNN based ensembles find the k nearest observations in a region (bounded by a sphere) for a predefined value of k. This approach, however, might not work when the test observation follows the pattern of the closest data points of the same class that lie on a path not contained in the given sphere. This paper proposes a k nearest neighbour ensemble where the neighbours are determined in k steps. Starting from the first nearest observation of the test point, the algorithm identifies a single observation that is closest to the observation at the previous step. At each base learner in the ensemble, this search is extended to k steps on a random bootstrap sample with a random subset of features selected from the feature space. The final predicted class of the test point is determined by a majority vote over the predicted classes given by all base models. This new ensemble method is applied to 20 benchmark datasets and compared with other classical methods, including kNN based models, using classification accuracy, kappa and Brier score as performance metrics. Boxplots are also used to illustrate the difference in the results given by the proposed and other state-of-the-art methods. The proposed method outperformed the considered classical methods in the majority of cases. The proposed method is further assessed through a detailed simulation study. [ABSTRACT FROM AUTHOR]
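The stepwise neighbour search and the ensemble vote described in the abstract can be illustrated with a minimal Python sketch. It is based only on the abstract and is not the authors' implementation or the API of the ExNRuleEnsemble R package. The function names (exnrule_neighbours, exnrule_ensemble_predict), the Euclidean distance, the square-root feature subset size, the exclusion of already selected points from later steps, and the tie-breaking rule are all assumptions made for illustration.

```python
import numpy as np
from collections import Counter


def exnrule_neighbours(X, x_test, k):
    """Indices of k training points chosen by a chained nearest-neighbour search.

    Step 1 picks the training point closest to x_test. Each later step picks
    the not-yet-chosen point closest to the point chosen in the previous step.
    (Excluding already chosen points is an assumption of this sketch.)
    """
    remaining = list(range(len(X)))
    chosen = []
    current = x_test
    for _ in range(min(k, len(X))):
        # Euclidean distance is an assumption; the abstract names no metric.
        dists = np.linalg.norm(X[remaining] - current, axis=1)
        idx = remaining.pop(int(np.argmin(dists)))
        chosen.append(idx)
        current = X[idx]  # the next step searches from the point just chosen
    return chosen


def majority(labels):
    """Most common label; ties go to the label encountered first (assumption)."""
    return Counter(labels).most_common(1)[0][0]


def exnrule_ensemble_predict(X, y, x_test, k=3, n_models=51,
                             n_features=None, seed=0):
    """Majority vote over base models, each built on a bootstrap sample
    and a random feature subset, as outlined in the abstract."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    if n_features is None:
        n_features = max(1, int(np.sqrt(p)))  # assumed default subset size
    votes = []
    for _ in range(n_models):
        rows = rng.integers(0, n, size=n)                     # bootstrap rows
        cols = rng.choice(p, size=n_features, replace=False)  # feature subset
        Xb, yb = X[np.ix_(rows, cols)], y[rows]
        nbrs = exnrule_neighbours(Xb, x_test[cols], k)
        votes.append(majority(yb[nbrs].tolist()))  # base model prediction
    return majority(votes)                          # ensemble prediction
```

As a small illustration of the difference from the ordinary rule, take the one-dimensional training points 1, 2, 3, 4 and -1.5 with a test point at 0 and k = 3. The chained search selects 1, then 2, then 3, following the path away from the test point, whereas the ordinary 3 nearest neighbours of 0 would be 1, -1.5 and 2.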

Details

Language :
English
ISSN :
00313203
Volume :
142
Database :
Academic Search Index
Journal :
Pattern Recognition
Publication Type :
Academic Journal
Accession number :
164259609
Full Text :
https://doi.org/10.1016/j.patcog.2023.109641