Back to Search Start Over

On the Conjecture of Berry Regarding a Bernoulli Two-Armed Bandit.

Authors :
Zhang, Jichen
Wu, Panyu
Source :
Mathematics (2227-7390); Feb2023, Vol. 11 Issue 3, p733, 19p
Publication Year :
2023

Abstract

In this paper, we study an independent Bernoulli two-armed bandit with unknown parameters ρ and λ , where ρ and λ have a pair of priori distributions such that d R (ρ) = C R ρ r 0 (1 − ρ) r 0 ′ d μ (ρ) , d L (λ) = C L λ l 0 (1 − λ) l 0 ′ d μ (λ) and μ is an arbitrary positive measure on [ 0 , 1 ] . Berry proposed the conjecture that, given a pair of priori distributions (R , L) of parameters ρ and λ , the arm with R is the current optimal choice if r 0 + r 0 ′ < l 0 + l 0 ′ and the expectation of ρ is not less than that of λ. We give an easily verifiable equivalent form of Berry's conjecture and use it to prove that Berry's conjecture holds when R and L are two-point distributions as well as when R and L are beta distributions and the number of trials N ≤ r 0 r 0 ′ + 1 . [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
22277390
Volume :
11
Issue :
3
Database :
Complementary Index
Journal :
Mathematics (2227-7390)
Publication Type :
Academic Journal
Accession number :
161857497
Full Text :
https://doi.org/10.3390/math11030733