Slacker News

Comment by whimsicalism

2 years ago

> GPT 3.5 is extremely good

Maybe I just use GPT4 too much, but I disagree: most benchmarks show Claude neck-and-neck with 3.5, especially the lmsys benchmarks, which I think are the highest quality. [0] MMLU is basically broken (although even that puts Claude higher).

[0]: https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboar...
