Comment by Npovview

2 hours ago

Hi simonw, can you write up your thoughts on this idea (the hivemind has yet to catch up with it): distilling a large Opus 4.7 model into a pure reasoning core with a plugin-like architecture (a programming sub-model, a literature sub-model, a history sub-model, a geography sub-model)? Why is Russian and Chinese data part of my model's training process? It costs more to train and to run inference. I want a core model plus specialized models that the core reasoning model can talk to. This is the kind of innovation the Mistral team should be doing. Is it fundamentally impossible to do?