Back to Search Start Over

Online learning from capricious data streams via shared and new feature spaces.

Authors :
Zhou, Peng
Zhang, Shuai
Mu, Lin
Yan, Yuanting
Source :
Applied Intelligence; Oct2024, Vol. 54 Issue 19, p9429-9445, 17p
Publication Year :
2024

Abstract

Data streams refer to data sequences generated at a high rate over a continuous period, such as social media analysis, financial transaction monitoring, and sensor data processing. Most existing data stream mining methods make assumptions about the feature space, assuming it is either fixed or undergoes regular changes, such as trapezoidal or evolving data streams. However, these restrictions do not hold for real-world applications where data streams may exhibit arbitrary missing features. To address the issue of arbitrary missing features in the feature space, we propose the Online Learning from Capricious Data Streams (OLCDS) algorithm and its variant, OLCDS-I. Specifically, OLCDS first identifies the higher uncertainty features that can provide more information for the optimization model. Then, based on the shared and new feature space, we formulate the constrained optimization problem using the soft margin technique. We deduce the update rules and use model sparsity to retain the essential features for classifier learning. Compared to existing online learning approaches, our new method eliminates the need for feature space assumptions and avoids generating missing features. Extensive experiments compared with five state-of-the-art methods on ten real-world datasets demonstrate the effectiveness and efficiency of our new algorithms [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0924669X
Volume :
54
Issue :
19
Database :
Complementary Index
Journal :
Applied Intelligence
Publication Type :
Academic Journal
Accession number :
179041553
Full Text :
https://doi.org/10.1007/s10489-024-05681-x