Back to Search
Start Over
Best Arm Identification in Generalized Linear Bandits
- Publication Year :
- 2019
-
Abstract
- Motivated by drug design, we consider the best-arm identification problem in generalized linear bandits. More specifically, we assume each arm has a vector of covariates, there is an unknown vector of parameters that is common across the arms, and a generalized linear model captures the dependence of rewards on the covariate and parameter vectors. The problem is to minimize the number of arm pulls required to identify an arm that is sufficiently close to optimal with a sufficiently high probability. Building on recent progress in best-arm identification for linear bandits (Xu et al. 2018), we propose the first algorithm for best-arm identification for generalized linear bandits, provide theoretical guarantees on its accuracy and sampling efficiency, and evaluate its performance in various scenarios via simulation.
- Subjects :
- FOS: Computer and information sciences
Generalized linear model
High probability
Computer Science - Machine Learning
021103 operations research
Computer science
Sampling efficiency
Applied Mathematics
0211 other engineering and technologies
Sampling (statistics)
Machine Learning (stat.ML)
02 engineering and technology
Management Science and Operations Research
01 natural sciences
Industrial and Manufacturing Engineering
Machine Learning (cs.LG)
Parameter identification problem
010104 statistics & probability
Identification (information)
Statistics - Machine Learning
Covariate
0101 mathematics
Algorithm
Software
Astrophysics::Galaxy Astrophysics
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....7e4aea98d4020d83ed023252d5a76871