Document Preview Unavailable

Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint

Vijayan, Nithia; Prashanth, L A.  arXiv.org, Jun 23, 2024.

You might have access to this document