Comment by tancop
16 hours ago
how hard is it to find a manager or ops team member at one of the enterprise companies and buy, let's say, 100 GB of logs? the Chinese lab can promise to anonymize the data before training, not release it raw, and pay a good price.
honestly you might just need data from a couple of long sessions, then feed it to another model as examples for generating synthetic reasoning chains. if the emulator model is good enough it should work.
I would expect that to be very hard.