← Back to context Comment by sidibe 4 days ago You are making the mistake of taking one of Elon's presentations at face value. 2 comments sidibe Reply tibbar 4 days ago I mean, either they cheated on evals ala Llama4, or they have a paradigm that's currently best in class in at least a few standard evals. Both alternatives are possible, I suppose. gitfan86 4 days ago [flagged]
tibbar 4 days ago I mean, either they cheated on evals ala Llama4, or they have a paradigm that's currently best in class in at least a few standard evals. Both alternatives are possible, I suppose.
I mean, either they cheated on evals ala Llama4, or they have a paradigm that's currently best in class in at least a few standard evals. Both alternatives are possible, I suppose.
[flagged]