Comment by endymi0n
14 hours ago
I don’t exactly know where MTP inference fits within the inference stack, but does someone know whether it’s possible to implement it for the MLX universe?
14 hours ago
I don’t exactly know where MTP inference fits within the inference stack, but does someone know whether it’s possible to implement it for the MLX universe?
No comments yet
Contribute on Hacker News ↗