Comment by gield
5 months ago
Yes, that's explicitly mentioned in the blog post:
>In s1, when the LLM tries to stop thinking with "</think>", they force it to keep going by replacing it with "Wait".
5 months ago
Yes, that's explicitly mentioned in the blog post:
>In s1, when the LLM tries to stop thinking with "</think>", they force it to keep going by replacing it with "Wait".
No comments yet
Contribute on Hacker News ↗