Comment by PaulRobinson
13 hours ago
It's been alleged that a major source of training data for many LLMs was libgen and SciHub - hardly casual.
13 hours ago
It's been alleged that a major source of training data for many LLMs was libgen and SciHub - hardly casual.
Even if that were comparable in size to the conversational Internet, how many novels and academic papers have you read that used multiple "not just A, but B" constructions in a single chapter/paper (that were not written by/about AI)?