top
new
show
ask
jobs

Comment by djfergus

5 days ago

We need a benchmark that tests a models ability to do LLM research.

0 comments

djfergus

Reply

No comments yet

Contribute on Hacker News ↗

Slacker News

Product

API Reference
Hacker News RSS
Source on GitHub

Community

Support Ukraine
Equal Justice Initiative
GiveWell Charities