Comment by idonotknowwhy

1 year ago

Yeah, I've had similar experiences. I still hesitate if it's a field I don't know too well, of course (never trust an LLM), but R1 has been able to solve things I've been stuck on. And watching its <think></think> process has been insightful. The only issue is that it ties up all my GPUs while I run it.

Hopefully Mistral can copy their technique and give us a 123b reasoning model.