Comment by rahimnathwani
6 days ago
Looking forward to next time, hoping you mention speculative decoding and MTP :)
It would support your point about the performance of 20GB local models.
6 days ago
Looking forward to next time, hoping you mention speculative decoding and MTP :)
It would support your point about the performance of 20GB local models.
No comments yet
Contribute on Hacker News ↗