Comment by singularity2001
3 months ago
I wonder whether even the models that emit thinking tokens actually do most of their work in latent space, so that the difference is only superficial.