Content area

Abstract

With the widespread adoption of the internet, online advertising has grown exponentially. To enhance ad recommendation efficiency, various Multi-Armed Bandit (MAB) algorithms have been deployed. Among these, the Thompson ε-Greedy algorithm integrates the ε-Greedy policy with Thompson Sampling. To optimize the algorithm, specifically, to reduce cumulative regret and improve arm selection accuracy, this paper analyzes the parameter ε in the ε-Greedy framework. This paper argues that fixing ε wastes environmental information learned over time. As the number of rounds increases, environmental understanding deepens, and ε should decay with both the rounds and the selection count of the current best arm T_(t,arm_max), since a higher selection count implies greater confidence in its optimality. Two primary decay modes are considered: linear and nonlinear decay. The study analyzes both modes and optimizes their parameters using genetic algorithms. Results demonstrate that after introducing parameter T_(t,arm_max), nonlinear ε-decay achieves lower cumulative regret under optimal parameter settings, whereas linear decay shows no such improvement.

Details

1009240
Title
Optimizing the ε Parameter in ε-Greedy Strategy for Multi-Armed Bandits
Publication title
Volume
78
Source details
International Conference on Computer Science and Electronic Information Technology (CSEIT 2025)
Number of pages
10
Publication year
2025
Publication date
2025
Section
Deep Learning and Reinforcement Learning – Theories and Applications
Publisher
EDP Sciences
Place of publication
Les Ulis
Country of publication
France
ISSN
24317578
e-ISSN
22712097
Source type
Conference Paper
Language of publication
English
Document type
Conference Proceedings
Publication history
 
 
Online publication date
2025-09-08
Publication history
 
 
   First posting date
08 Sep 2025
ProQuest document ID
3252537145
Document URL
https://www.proquest.com/conference-papers-proceedings/optimizing-ε-parameter-greedy-strategy-multi/docview/3252537145/se-2?accountid=208611
Copyright
© 2025. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-09-20
Database
2 databases
  • ProQuest One Academic
  • ProQuest One Academic