Comment by zozbot234
3 hours ago
There's already models with capped long context but if you make that the whole model it makes needle-in-haystack search impossible and that's actually a very common operation. Which is why Qwen 3.5 only makes a portion of it capped, and AIUI the new Nemotron models are broadly similar.
No comments yet
Contribute on Hacker News ↗