← Back to context Comment by solarkraft 3 days ago My bad, I took this as something Multi-head Latent Attention (MLA) related. 0 comments solarkraft Reply No comments yet Contribute on Hacker News ↗
No comments yet
Contribute on Hacker News ↗