Comment by mbowcut2

8 months ago

I'm not surprised. People really thought the models just kept getting better and better?

6 comments

mbowcut2

segmondy 8 months ago

The models are getting better and better.

giveita 8 months ago
That's expected. No one will release a worse model.
- sodality2 8 months ago
  
  Not a cheaper one, or better in some ways, or lower latency, etc?
  
  1 reply →

guerrilla 8 months ago

Maybe. How would I know?

jMyles 8 months ago

...even if the agent did "cheat", I think that having the capacity to figure out that it was being evaluated, find the repo containing the logic of that evaluation, and find the expected solution to the problem it faced... is "better" than anything that the models were able to do a couple years ago.