Comment by coder543
6 days ago
I meant large MoE models are more socially accepted now. They were not when Llama 4 launched, and I believe that worked against the Llama 4 models.
The Llama 4 models are MoE models, in case you are unaware, since it feels like your comment was implying they were dense models.