Comment by jimmyl02
5 months ago
the needle in a haystack benchmark looks good but at this point I think we need new benchmarks to test actual understanding of content in such a large window.
5 months ago
the needle in a haystack benchmark looks good but at this point I think we need new benchmarks to test actual understanding of content in such a large window.
No comments yet
Contribute on Hacker News ↗