← Back to context

Comment by quicklywilliam

1 month ago

I think the big idea here is that you can get a lot more performance if you take an integrated approach. This specific model made to work with this specific inference engine made to work with this specific harness/agent. When everything is done separately, developers of a given pieces have no idea what they are targeting for all the other pieces.

This is currently a huge advantage that Anthropic has over open weights models – they control the whole stack. Indeed, they train new models against Claude Code!

It's early days on this project, but just imagine it gets enough traction that future models start training against ds4. Indeed, in the post Antirez even seems to be hinting at some sort of collaboration with DeepSeek?