Comment by Aperocky

1 year ago

They hid it, and DeepSeek came up with R1 anyway, using RL on final results only and without needing any of the thinking tokens that OpenAI hid.

Aperocky

Which is still the funniest and most interesting result in AI so far IMO. Fascinating, but sort of makes intuitive sense too!