Back to Search
Start Over
Batched Online Contextual Sparse Bandits with Sequential Inclusion of Features
- Publication Year :
- 2024
-
Abstract
- Multi-armed Bandits (MABs) are increasingly employed in online platforms and e-commerce to optimize decision making for personalized user experiences. In this work, we focus on the Contextual Bandit problem with linear rewards, under conditions of sparsity and batched data. We address the challenge of fairness by excluding irrelevant features from decision-making processes using a novel algorithm, Online Batched Sequential Inclusion (OBSI), which sequentially includes features as confidence in their impact on the reward increases. Our experiments on synthetic data show the superior performance of OBSI compared to other algorithms in terms of regret, relevance of features used, and compute.<br />Comment: 4 pages, 4 figures, Accepted at the CONSEQUENCES 24 workshop, co-located with ACM RecSys 24
- Subjects :
- Computer Science - Machine Learning
Statistics - Machine Learning
Subjects
Details
- Database :
- arXiv
- Publication Type :
- Report
- Accession number :
- edsarx.2409.09199
- Document Type :
- Working Paper