bevekspldnw (2 days ago): How much of this is RL'ing a good coding model on every CVE ever?

sometimelurker (2 days ago): most of this comes from the pretrain imo. just scale + some RL = mythos