Comment by torginus

5 days ago

I remember there was a conversation between two super-duper VCs (dont remember who but famous ones), about how DeepSeek was a super-genius level model because it solved an intro-level (like week 1-2) electrodynamics problem stated in a very convoluted way.

While cool and impressive for an LLM, I think they oversold the feat by quite a bit.

I don't want to belittle the performance of this model, but I would like for someone with domain expertise (and no dog in the AI race, like a random math PhD) to come forward, and explain exactly what the problem exactly was, and how did the model contribute to the solution.

0 comments

torginus

No comments yet

Contribute on Hacker News ↗