Comment by Workaccount2

6 days ago

People have this misguided belief that LLMs just do look-ups of data present in their "model corpus", fed in during "training". Which isn't even training at that point its just copying + compressing. Like putting books into a .zip file.

This belief leads to the thinking that LLMs can only give correct output if they can match it to data in their "model corpus".