Comment by vessenes
1 year ago
For sure: in fact MoE models train such a router directly, and the routers are not super large. But it would also be easy to run Phi-3 against a request.
I almost think you could do a “check my work” style response: ‘I’m pretty sure xx... wait, actually y.’ Or, if you were right, ‘Yep, that’s correct. I just checked.’
There’s time in there to do the check and to get the large model to bridge the first sentence with the final response.
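A rough sketch of that flow, in case it helps make the idea concrete. Everything here is hypothetical: `small_model`, `large_model`, and `same_answer` are stand-in callables, not any real library's API, and the "bridge" is just a fixed string template rather than text generated by the large model.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable


def _default_same_answer(a: str, b: str) -> bool:
    # Naive comparison (an assumption); a real system would need something smarter.
    return a.strip().lower() == b.strip().lower()


def check_my_work(
    prompt: str,
    small_model: Callable[[str], str],   # hypothetical fast model
    large_model: Callable[[str], str],   # hypothetical slow, stronger model
    same_answer: Callable[[str, str], bool] = _default_same_answer,
) -> str:
    """Open with the small model's guess, then confirm or correct it."""
    with ThreadPoolExecutor(max_workers=1) as pool:
        # Start the slow check immediately so it overlaps with the fast draft.
        checked_future = pool.submit(large_model, prompt)
        draft = small_model(prompt)
        opening = f"I'm pretty sure {draft}."
        checked = checked_future.result()

    if same_answer(draft, checked):
        return f"{opening} Yep, that's correct. I just checked."
    return f"{opening} Wait, actually {checked}."
```

In a streaming setup the opening sentence would be sent to the user as soon as the draft is ready, and the second sentence appended once the check returns; this sketch only returns the joined string.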