Comment by jimmyl02
15 days ago
The needle-in-a-haystack benchmark looks good, but at this point I think we need new benchmarks that test actual understanding of the content in such a large window.