Comment by walterbell
20 days ago
There are alternatives to NVIDIAmaxing with brute force. See the Chinese paper on DeepSeek V3, a model comparable to recent GPT and Claude models that was reportedly trained with about 90% fewer resources. Research on efficient inference continues.
https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSee...