Comment by Npovview
2 hours ago
Hi simonw, can you write up your thoughts on an idea the hivemind has yet to catch up with: distilling a large model like Opus 4.7 into a pure reasoning core with a plugin-like architecture (a programming submodel, a literature submodel, a history submodel, a geography submodel)? Why is Russian and Chinese data part of my model's training process? It costs more to train and to run inference. I want a core model plus specialized models that the core reasoning model can talk to. This kind of innovation is what the Mistral team should be doing. Is it fundamentally impossible to do?
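To make the shape of the idea concrete, here is a toy Python sketch of a core that dispatches to pluggable domain submodels. Everything in it is hypothetical: `CoreReasoner`, `register`, `route`, and the lambda "submodels" are stand-ins invented for illustration, and the keyword-based routing is only a placeholder for whatever learned routing a real system would need. It shows the plugin layout only, not how distillation would work or whether it is feasible.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict

# Hypothetical stand-in: a "submodel" is just a function from prompt to answer.
Submodel = Callable[[str], str]


@dataclass
class CoreReasoner:
    """Toy core model that forwards prompts to registered domain plugins."""

    plugins: Dict[str, Submodel] = field(default_factory=dict)

    def register(self, domain: str, submodel: Submodel) -> None:
        # Plug in a specialized submodel under a domain name.
        self.plugins[domain] = submodel

    def route(self, prompt: str) -> str:
        # Assumption: naive keyword match; a real router would be learned.
        text = prompt.lower()
        for domain in self.plugins:
            if domain in text:
                return domain
        return "core"

    def answer(self, prompt: str) -> str:
        domain = self.route(prompt)
        if domain == "core":
            # No plugin matched, so the core handles the prompt itself.
            return f"[core] {prompt}"
        return self.plugins[domain](prompt)


core = CoreReasoner()
core.register("programming", lambda p: f"[programming] {p}")
core.register("history", lambda p: f"[history] {p}")
```

Calling `core.answer("A programming question about sorting")` would go to the programming plugin, while a prompt that matches no registered domain stays with the core.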