Comment by thw09j9m

7 months ago

It's Google. Assume the training set contains, as a subset, the entirety of all public digitized information. How would you like to them to share it?

If they wanna do research where they claim they did something novel, without showing that they didn't just "look it up" in their massive training set, then yes, they should share what is and what isn't contained within.