Comment by puttycat

25 days ago

Interesting angle, didn't think of this. How do you think/find that current tools are optimized for being addictive?

I think there's a few things, but its a little subjective and its more about the style the ai uses when doing these than the actual specific behavior:

- Nuggesting improvements to the code after finishing the task you gave it, very irritating when the improvements were obvious and the ai didn't implement them on its own

- Not trying very hard when implementing something, leading to bugs, which leads to more tokens used (this behavior can be incentivized and learned with RL)

Since its a known fact if a user continues a session after the LLM says something, its not hard to train against this. The least efficient way to do this would be to GPRO directly against the user base and try to get as many people talking to the AI, and with OAI having a billion monthly active users the least efficient method would work really well for them.