Comment by throwawaymaths
1 month ago
> RL that incentivizes more concise thought chains
this seems backwards. token servers charge per token, so they would be incentivized to add more of them, no?