Comment by visarga

3 years ago

> It is clear that firing ChatGPT at its own source code is not going to produce a better ChatGPT.

I think it could do that in software. Assuming compute is no issue we have:

- LLMs writing code, explaining code, changing code, and observing code execution

- LLMs that understand ML concepts and can explain their own workings

- LLMs can generate the training set all from inside (see TinyStories)

- LLMs can make "RLHF" data for the fine-tuning (see Alpaca, tuned with GPT3.5 and GPT4 data from LLaMA)

If we take a look, it seems LLMs can self replicate in software with nothing else but compute and a neural net framework. Of course making the chips is a whole other story.