← Back to context Comment by minimaxir 3 months ago Safety is both the system prompt and the RLHF posttraining to refuse to answer adversarial inputs. 0 comments minimaxir Reply No comments yet Contribute on Hacker News ↗
No comments yet
Contribute on Hacker News ↗