Comment by okanat

14 days ago

It is more like the assembly dump generated from the source code with maybe some symbol information for the functions. The download licenses are also quite limited.

The full text training data isn't really shareable though. Since it is copyrighted when it comes to plebs like us reading them.