Comment by varispeed
3 days ago
You can't measure effectiveness, because you never know what kind of model will process your prompt. One request you might get full e.g. Opus and another they'll downgrade it to Sonnet or something more basic. I have this with "Opus 4.8" all the time.
No comments yet
Contribute on Hacker News ↗