← Back to context

Comment by conception

21 hours ago

Probably explains why Opus was trash for the last week - https://marginlab.ai/trackers/claude-code/. Curious if the new baseline will rise now in-line with the new benchmarks.

Nice. Can you release that for older models too? I've been using a mixture of releases recently, and cannot tell the difference between any of them.