Comment by dr_dshiv
1 year ago
So, this is just an RL trained method of having multiple GPT4o agents think through options and select the best before responding?
1 year ago
So, this is just an RL trained method of having multiple GPT4o agents think through options and select the best before responding?
No comments yet
Contribute on Hacker News ↗