Comment by visarga
3 years ago
> It is clear that firing ChatGPT at its own source code is not going to produce a better ChatGPT.
I think it could do that in software. Assuming compute is no issue we have:
- LLMs writing code, explaining code, changing code, and observing code execution
- LLMs that understand ML concepts and can explain their own workings
- LLMs can generate the training set all from inside (see TinyStories)
- LLMs can make "RLHF" data for the fine-tuning (see Alpaca, tuned with GPT3.5 and GPT4 data from LLaMA)
If we take a look, it seems LLMs can self replicate in software with nothing else but compute and a neural net framework. Of course making the chips is a whole other story.
No comments yet
Contribute on Hacker News ↗