Comment by int_19h

8 months ago

Thing is, we keep finding out again and again that having a very broad training mix in the baseline model makes it better across the board, including in those specialized tasks when you fine-tune it.

As I understand it, the general ability to reason is what the models get out of "being trained on the tax policies of the Chang Dynasty", and we haven't really figured out a better way to get it than to throw most everything at them. And even if all you do is make toast, you still need some intelligence.

fnord123 7 months ago

> And even if all you do is make toast, you still need some intelligence.

No you don't. That was the point of the example.