Comment by arrowsmith
3 days ago
I don't think it takes a psychologist. Maybe the LLMs are sycophantic because that's what the humans in the RLHF loop respond best to.
3 days ago
I don't think it takes a psychologist. Maybe the LLMs are sycophantic because that's what the humans in the RLHF loop respond best to.
No comments yet
Contribute on Hacker News ↗