Comment by Tiberium 3 months ago

A bit interesting that they used Deepseek 3's architecture for their Large model :)