Comment by gwern
1 month ago
It is definitely not the first codebase an extensively RL-trained Claude has ever analyzed. How do you think it got so good?
1 month ago
It is definitely not the first codebase an extensively RL-trained Claude has ever analyzed. How do you think it got so good?
Meaning it has no episodic memory of any of those analyses that it has done.
You didn't say anything about 'episodic' and that's irrelevant to the point even if its long-term memory from training didn't count.