Comment by samus
14 hours ago
There have been papers about introducing thinking tokens in intermediary layers that get stripped from the output.
14 hours ago
There have been papers about introducing thinking tokens in intermediary layers that get stripped from the output.
No comments yet
Contribute on Hacker News ↗