Document Preview Unavailable

Off-Policy Evaluation with Policy-Dependent Optimization Response

Guo, Wenshuo; Jordan, Michael I; Zhou, Angela.  arXiv.org, Nov 6, 2022.

You might have access to this document