Comment by zurfer

1 year ago

It is already said that gpt4 was trained on all high quality internet data. So it should have been included already. It seems to me that o1 has the same/similar pretraining corpus.

So we have 3 options:

- t3 was now included in the corpus

- t3 was used for RL

- o1 generalizes better