Comment by mountainriver 20 hours ago
It’s a problem specific to autoregressive LLMs: the early tokens bias the output.
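To make the claim concrete, here is a rough toy sketch, not a real LLM. It assumes a hypothetical hand-written table `NEXT` of next-token probabilities that conditions only on the previous token (a real model conditions on the whole prefix). The helper names `sample_next`, `generate`, and `ending_rate` are made up for illustration. The point is only that once the first token is fixed, everything sampled after it is drawn from distributions conditioned on that choice.

```python
import random

# Hypothetical toy "model": next-token probabilities given the previous token.
# A real autoregressive LLM conditions on the entire prefix, not just one token.
NEXT = {
    "<s>": {"good": 0.5, "bad": 0.5},
    "good": {"great": 0.9, "awful": 0.1},
    "bad": {"great": 0.1, "awful": 0.9},
    "great": {"</s>": 1.0},
    "awful": {"</s>": 1.0},
}


def sample_next(prev, rng):
    """Sample one token from the distribution conditioned on `prev`."""
    tokens, weights = zip(*NEXT[prev].items())
    return rng.choices(tokens, weights=weights, k=1)[0]


def generate(prefix, rng, max_len=10):
    """Extend `prefix` one token at a time until the end marker or max_len."""
    out = list(prefix)
    while out[-1] != "</s>" and len(out) < max_len:
        out.append(sample_next(out[-1], rng))
    return out


def ending_rate(first_token, ending, trials=10_000, seed=0):
    """Fraction of samples containing `ending` when forced to start with `first_token`."""
    rng = random.Random(seed)
    hits = sum(ending in generate(["<s>", first_token], rng) for _ in range(trials))
    return hits / trials


if __name__ == "__main__":
    print("P(great | starts with good) ~", ending_rate("good", "great"))
    print("P(great | starts with bad)  ~", ending_rate("bad", "great"))
```

With the probabilities assumed above, forcing the first token to "good" makes "great" the likely continuation, while forcing "bad" makes it unlikely, even though the sampling procedure is otherwise identical.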