Comment by Aperocky
3 months ago
They hid it and deepseek came up with R1 anyway, with RL on only results and not even needing any of the thinking tokens that OpenAI hid.
3 months ago
They hid it and deepseek came up with R1 anyway, with RL on only results and not even needing any of the thinking tokens that OpenAI hid.
Which is still the funniest and most interesting result in AI so far IMO. Fascinating, but sort of makes intuitive sense too!