Comment by jimmyl02
1 year ago
the needle in a haystack benchmark looks good but at this point I think we need new benchmarks to test actual understanding of content in such a large window.
1 year ago
the needle in a haystack benchmark looks good but at this point I think we need new benchmarks to test actual understanding of content in such a large window.
No comments yet
Contribute on Hacker News ↗