Search

Your search keyword '"multi-armed bandits"' showing total 644 results

Search Constraints

Start Over You searched for: Descriptor "multi-armed bandits" Remove constraint Descriptor: "multi-armed bandits"
644 results on '"multi-armed bandits"'

Search Results

1. Rough Set Theoretic Approach for Solving the Multi-Armed Bandit Problems

2. Online Automated Imbalanced Learning via Adaptive Thompson Sampling

3. Posterior Tracking Algorithm for Multi-objective Classification Bandits

4. Introduction to the Bandit Problems

5. Reinforcement learning for sequential decision making in population research.

6. Sequential query prediction based on multi-armed bandits with ensemble of transformer experts and immediate feedback.

7. Learning Equilibria in Matching Markets with Bandit Feedback.

8. Multi‐armed bandit based online model selection for concept‐drift adaptation.

9. Online Learning in Budget-Constrained Dynamic Colonel Blotto Games.

10. Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions.

11. Fast Weakness Identification for Adaptive Feedback

12. QuizMaster: An Adaptive Formative Assessment System

13. Multi-armed Bandits with Generalized Temporally-Partitioned Rewards

14. Learning Action Embeddings for Off-Policy Evaluation

16. Starlet: Network defense resource allocation with multi-armed bandits for cloud-edge crowd sensing in IoT

17. Multi-class boosting for the analysis of multiple incomplete views on microbiome data

18. Adversarial Bandits with Knapsacks.

19. Multi-class boosting for the analysis of multiple incomplete views on microbiome data.

20. Dynamic Grouping within Minimax Optimal Strategy for Stochastic Multi-ArmedBandits in Reinforcement Learning Recommendation.

21. Multinomial Thompson sampling for rating scales and prior considerations for calibrating uncertainty.

22. Smart Testing with Vaccination: A Bandit Algorithm for Active Sampling for Managing COVID-19.

23. Fast Model Selection and Hyperparameter Tuning for Generative Models.

24. Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits.

25. Image adaptive sampling using reinforcement learning.

26. Multi-armed bandits with dependent arms.

27. Learning to Give Useful Hints: Assistance Action Evaluation and Policy Improvements

28. Case-Based Sample Generation Using Multi-Armed Bandits

29. MockSAS: Facilitating the Evaluation of Bandit Algorithms in Self-adaptive Systems

30. Reinforcement Learning in Education: A Multi-armed Bandit Approach

32. Reinforcement Learning for Protocol Synthesis in Resource-Constrained Wireless Sensor and IoT Networks

33. Multi-domain Active Learning for Semi-supervised Anomaly Detection

34. On the Complexity of All -Best Arms Identification

35. Hypothesis Transfer in Bandits by Weighted Models

36. Adaptive designs for best treatment identification with top‐two Thompson sampling and acceleration.

37. Hedging using reinforcement learning: Contextual k-armed bandit versus Q-learning.

39. Information Asymmetries in Data-Driven and Sustainable Operations: Stochastic Models and Adaptive Algorithms for Strategic Agents

40. Applications of Bayesian recommender systems in online environments

41. Adversarial thresholding semi-bandits

42. Dynamic Grouping within Minimax Optimal Strategy for Stochastic Multi-ArmedBandits in Reinforcement Learning Recommendation

43. Hedging using reinforcement learning: Contextual k-armed bandit versus Q-learning

44. Cutting to the chase with warm-start contextual bandits.

45. Block pruning residual networks using Multi-Armed Bandits.

46. $\mathsf{HyHooVer}$: Verification and Parameter Synthesis in Stochastic Systems With Hybrid State Space Using Optimistic Optimization

49. Network Defense Resource Allocation Scheme with Multi-armed Bandits

Catalog

Books, media, physical & digital resources