Comment by imtringued

12 days ago

Yeah, that's basically it.

In robotics there is no free-lunch dataset. You have to gather the data yourself, and once you do, you run into an obvious problem: labeling.

With SO, you have the best possible scenario, because the data is already structured and split into a prompt (the question) and an answer (aka the label).

Your objective function is literally "Say what he said".
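
To make "say what he said" a bit more concrete, here is a minimal sketch, assuming the usual setup where the model is scored on how much probability it gives to each token of the human-written answer. The function name `imitation_loss` and the probabilities are made up for illustration; they are not from any real model or library.

```python
import math

def imitation_loss(token_probs):
    """Average negative log-likelihood of the reference answer tokens.

    token_probs[i] is the probability the model assigned to the i-th token
    of the human-written answer, given the prompt and the earlier tokens.
    Lower is better; 0 means the model reproduced the answer with certainty.
    """
    return -sum(math.log(p) for p in token_probs) / len(token_probs)

# Hypothetical per-token probabilities (assumed values, illustration only).
confident = [0.9, 0.8, 0.95]   # model mostly "says what he said"
unsure = [0.2, 0.1, 0.3]       # model mostly says something else

print(imitation_loss(confident) < imitation_loss(unsure))
```

The point is that the label comes for free with the data: the answer text itself is the target, so nobody has to annotate anything by hand.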