Comment by AstroBen

4 hours ago

Our DNA does contain our pre-training, though. It's not true that we're an entirely blank slate.

"Pre-training" is not a good term if you are trying to compare it to LLM pre-training. A closer analogy would be the model's architecture and learning algorithms, which have been designed through decades of PhD research. Even then, my point is that the differences are still much greater than the similarities.