Comment by porridgeraisin
1 day ago
Yeah, I think it's a super neat way to do MTP. Conceptually much more pleasing and simple than existing methods. Especially since this way scaling `k` as models get better will be easier. Wish it had been presented as such.
No comments yet
Contribute on Hacker News ↗