1. k-Nearest Neighbour Classifiers - A Tutorial
- Author
-
Pádraig Cunningham, Sarah Jane Delany, and SFI
- Subjects
Artificial Intelligence and Robotics ,Speedup ,General Computer Science ,Computer science ,Dimension (graph theory) ,02 engineering and technology ,Machine learning ,computer.software_genre ,Theoretical Computer Science ,Machine Learning ,0504 sociology ,Similarity (network science) ,Classifier (linguistics) ,0202 electrical engineering, electronic engineering, information engineering ,Code (cryptography) ,𝑘-Nearest Neighbour Classifiers ,computer.programming_language ,business.industry ,Data Science ,05 social sciences ,k-NN ,050401 social sciences methods ,Python (programming language) ,Class (biology) ,ComputingMethodologies_PATTERNRECOGNITION ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Curse of dimensionality - Abstract
Perhaps the most straightforward classifier in the arsenal or Machine Learning techniques is the Nearest Neighbour Classifier—classification is achieved by identifying the nearest neighbours to a query example and using those neighbours to determine the class of the query. This approach to classification is of particular importance, because issues of poor runtime performance is not such a problem these days with the computational power that is available. This article presents an overview of techniques for Nearest Neighbour classification focusing on: mechanisms for assessing similarity (distance), computational issues in identifying nearest neighbours, and mechanisms for reducing the dimension of the data. This article is the second edition of a paper previously published as a technical report [16]. Sections on similarity measures for time-series, retrieval speedup, and intrinsic dimensionality have been added. An Appendix is included, providing access to Python code for the key methods.
- Published
- 2021