Back to Search
Start Over
K -MLIO: Enabling K -Means for Large Data-Sets and Memory Constrained Embedded Systems
- Source :
- MASCOTS, 2019 IEEE 27th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS), 2019 IEEE 27th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS), Oct 2019, Rennes, France. pp.262-268, ⟨10.1109/MASCOTS.2019.00037⟩
- Publication Year :
- 2019
- Publisher :
- IEEE, 2019.
-
Abstract
- Machine Learning (ML) algorithms are increasingly used in embedded systems to perform different tasks such as clustering and pattern recognition. These algorithms are both compute and memory intensive whilst embedded devices offer lower hardware capabilities as compared to traditional ML platforms. K-means clustering is one of the widely used ML algorithms. In the case of large data-sets, our analysis showed that on average, more than 70% of the execution time is spent on I/Os. In this paper, we present a version of K-means that drastically reduces the number of I/Os by spanning the data-set only once as compared to the traditional version that reads it several times according to the number of iterations performed. Our evaluation showed that the proposed strategy reduces the overall execution time on large data-sets by 60% on average while lowering the number I/Os operations by 90% with a comparable precision to the traditional K-means implementation.
- Subjects :
- Random access memory
Computer science
business.industry
0211 other engineering and technologies
k-means clustering
02 engineering and technology
Execution time
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
020204 information systems
Embedded system
Pattern recognition (psychology)
0202 electrical engineering, electronic engineering, information engineering
[INFO.INFO-ES]Computer Science [cs]/Embedded Systems
[INFO.INFO-OS]Computer Science [cs]/Operating Systems [cs.OS]
Cluster analysis
business
ComputingMilieux_MISCELLANEOUS
021106 design practice & management
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2019 IEEE 27th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)
- Accession number :
- edsair.doi.dedup.....5a5213e87ada53e9fb1faa2cd9c93861