Comment by senko

1 year ago

With 2T params (!!), it better outperform everything else.

2 comments

senko

Given that the comparison doesn't include o3 or Gemini 2.5 Pro, I'd say it doesn't. Looking at the comparison tables available for both Llama 4 Behemoth and Gemini 2.5 Pro, it seems like at least a few of the comparable items might be won by Gemini.

https://blog.google/technology/google-deepmind/gemini-model-...

wmf 1 year ago

We don't know how many params GPT-4, Claude, and Gemini are using, so they could be in the same ballpark.