Comment by idonotknowwhy
1 year ago
Yeah, I've had similar experiences. I still hesitate if it's a field I don't know too well, of course (never trust an LLM), but R1 has been able to solve things I've been stuck on. And watching its <think></think> process has been insightful. The only issue is that it ties up all my GPUs while I run it.
Hopefully Mistral can copy their technique and give us a 123b reasoning model.