Routine maintenance underway until 3:00 pm, ET. ProQuest remains fully available. Questions or issues? Contact Technical Support.

Document Preview Unavailable

On learning history based policies for controlling Markov decision processes

Patil, Gandharv; Mahajan, Aditya; Precup, Doina.  arXiv.org, Nov 6, 2022.

You might have access to this document