Upcoming maintenance: ProQuest will be unavailable 10:00 PM ET Saturday, Aug 9 - 6:00 AM ET Sunday, Aug 10. ReadMore

Document Preview Unavailable

Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint

Vijayan, Nithia; Prashanth, L A.  arXiv.org, Jun 23, 2024.

You might have access to this document