Comment by cratermoon

1 year ago

Is there not a survey paper on RLHF equivalent to the "A Survey on Large Language Model based Autonomous Agents" paper? Someone should get on that.

1 comment

cratermoon

_giorgio_ 1 year ago

1 point by _giorgio_ 0 minutes ago | next | edit | delete [–]

https://arxiv.org/abs/2412.05265

Reinforcement Learning: An Overview Kevin Murphy

    This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics (including a very brief discussion of RL+LLMs).

From: Kevin Murphy [view email] [v1] Fri, 6 Dec 2024 18:53:49 UTC (6,099 KB)