Basically, if you combine a bunch of near-frontier models (like GPT 5.5, etc) you can get performance that sometimes surpasses top line models like Claude's Fable.
Sakana seems to have a separate approach using a domain specific model to perform the model routing step.
This is a charitable read, but I think that being able to pick from a panoply of models will actually yield much better results in the long run.
The same model that has been post-trained to operate for hours as a Linux admin will be incapable of writing a heartfelt email, but with something like Fugu, you'd get both the Linux admin for driving the browser harness and the smaller writing specialist model for drafting the email itself.
It's similar to this: https://openrouter.ai/blog/announcements/fusion-beats-fronti...
Basically, if you combine a bunch of near-frontier models (like GPT 5.5, etc) you can get performance that sometimes surpasses top line models like Claude's Fable.
Sakana seems to have a separate approach using a domain specific model to perform the model routing step.
But it's priced the same as frontier models. Why do I not directly pay for frontier models?
This is a charitable read, but I think that being able to pick from a panoply of models will actually yield much better results in the long run.
The same model that has been post-trained to operate for hours as a Linux admin will be incapable of writing a heartfelt email, but with something like Fugu, you'd get both the Linux admin for driving the browser harness and the smaller writing specialist model for drafting the email itself.
u get a pool of them + sakana?
So plain old ensemble technique in classical ML.