Comment by throwa356262
12 hours ago
1024 Huawei Ascend superpods = 50K 910C chips.
That is a tiny, tiny system. OpenAI uses _millions_ of GPUs for training.
On the other hand, this probably reuses the existing DeepSeek V4 architecture and weights, so maybe it didn't need that much compute.
I'm sure it also takes more compute to be at the frontier than to distill and poach ideas from the frontier. It's no accident that the same handful of labs keep taking turns at or near the frontier.
I don't understand why people keep repeating this nonsense.
Anthropic claims DeepSeek made 150K requests to their servers. Even if this number is correct, it takes far more requests than that to distill a 3.2T model into a 1.6T model. 150K is closer to running a few benchmarks.
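To put a rough number on that, here is a back-of-envelope sketch. Only the 150K request count comes from the claim above. The response lengths and the corpus size are made-up placeholders, so plug in whatever you think is realistic.

```python
# Back-of-envelope: how much text could 150K requests yield?
# Only REQUESTS comes from the comment above; every other number is an
# illustrative assumption, not a measured or reported figure.

REQUESTS = 150_000  # figure cited in the comment


def total_tokens(requests: int, tokens_per_response: int) -> int:
    """Total output tokens collected if every request returns a full response."""
    return requests * tokens_per_response


def fraction_of_corpus(collected_tokens: int, corpus_tokens: int) -> float:
    """Share of a target training corpus that the collected tokens would cover."""
    return collected_tokens / corpus_tokens


# Hypothetical response lengths (in tokens) to bracket the estimate.
ASSUMED_RESPONSE_LENGTHS = (500, 2_000, 8_000)
# Hypothetical distillation corpus size; real pipelines may differ widely.
ASSUMED_CORPUS_TOKENS = 100_000_000_000  # 100B tokens, an assumption

for length in ASSUMED_RESPONSE_LENGTHS:
    collected = total_tokens(REQUESTS, length)
    share = fraction_of_corpus(collected, ASSUMED_CORPUS_TOKENS)
    print(f"{length:>5} tokens/response -> {collected:,} tokens "
          f"({share:.4%} of assumed corpus)")
```

Under these placeholder numbers the total stays a small fraction of the assumed corpus even at the longest response length. The conclusion depends entirely on the assumptions, so treat it as a sanity check and not as evidence either way.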
If anything, DeepSeek and Google's DeepMind are the ones innovating, while Anthropic and OpenAI are spending money and time on politics to try to hinder or ban competition.