Document Preview Unavailable

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Bai, Yuntao; Jones, Andy; Ndousse, Kamal; Askell, Amanda; Chen, Anna; et al.  arXiv.org, Apr 12, 2022.

You might have access to this document