
Near-Optimal Pure Exploration in Matrix Games: A Generalization of Stochastic Bandits & Dueling Bandits

Maiti, Arnab; Boczar, Ross; Jamieson, Kevin; Ratliff, Lillian J. arXiv.org, Nov 27, 2023.
