Comment by cs702
3 years ago
> Note that Maciej a.k.a idlewords says...
That's why I added: "The obvious next step is to close the feedback loop with LLM-based agents instead of AI researchers/developers." We have early evidence that doing some like that might work, but no one knows for sure.
Yes, you did, and that's why I elaborated on why "closing the feedback loop" is barely enough to reach anything close to self-improvement. That is because self- requires intention to work in the direction of a particular goal, and -improvement requires an ability to evaluate whether the results are in line with it.
Going down to specifics, without the human intention of "getting good responses to human language prompts", and without the human ability to decide "this response was good" there is not much for an LLM to work on by itself.