Comment by Lerc
1 month ago
Mixture of Experts already have routing models,
I'm just suggesting eliminate (or weaken) the distinction between layers and expert and have just the one, then iterate that one until its 'gpod enough' score plus (iterationcount*spontaneity) is greater than some threshold.
No comments yet
Contribute on Hacker News ↗