Comment by moralestapia
2 days ago
>ignoring the costs of facilities, salaries, non-cloud hardware, etc.
If you lease, those costs are amortized. It was definitely more than $5M, but I don't think it was as high as $100M. All things considered, I still believe Deepseek was trained at one (perhaps two) orders of magnitude lower cost than other competing models.
Perhaps. Do you think DeepSeek made use of those competing models at all in order to train theirs?
I believe so, but have no proof obviously.