Comment by salter2
2 days ago
Perhaps something similar to speculative decoding.
Speculating Experts Accelerates Inference for Mixture-of-Experts: https://arxiv.org/abs/2603.19289