Comment by gield
2 months ago
Yes, that's explicitly mentioned in the blog post:
>In s1, when the LLM tries to stop thinking with "</think>", they force it to keep going by replacing it with "Wait".
2 months ago
Yes, that's explicitly mentioned in the blog post:
>In s1, when the LLM tries to stop thinking with "</think>", they force it to keep going by replacing it with "Wait".
No comments yet
Contribute on Hacker News ↗