Comment by petesergeant
13 hours ago
Is there any chance that this is because training corpus largely consists of documents shorter than the advertised context windows?
13 hours ago
Is there any chance that this is because training corpus largely consists of documents shorter than the advertised context windows?
No comments yet
Contribute on Hacker News ↗