Comment by Aurornis
10 hours ago
> I don't know about this case,
They compiled a document with the source material side-by-side https://v42.arretsurimages.net/fichiers/documents/2024-08-02...
This goes well beyond accidentally triggering a plagiarism detector.
> Feed stuff you wrote into those tools and it will call you a plagiarist every day because you wrote something similar to the person you learned it from.
The examples in the article use very distinctive wording. One or two occurrences would be forgivable as coincidence or inspiration. An entire document full of examples points to something else.
It seems like that should be the case yet when I listen to any same group of people over a period of time, I often find that those unfamiliar with a concept or solution on day 1 end up repeating it as if it was their own a few weeks later. When I was younger I tended to assume there was an element of intentional theft, but I'm not sure it's natural and a prerequisite to educational acquisition that people can categorize original origin of ideas that may have bounced around them for a long time before they understood their significance.
The plagiarism in the document was more significant than that.
This wasn't a couple cases of the same words or word pairs being used.
Sure a common answer would be intentionally copying in the same sessions, less likely is intentionally copying via eidetic memory.. But how much of a spectrum could there be in the middle for memory that would result in repeating a "plagiarism" form months later, etc?
People say how obvious the parlor trick is when they look at a small model LLMs. Well, I've seen the same parlor trick in students who get good grades but seem weak at thought from fundamentals. It seems quite possible to me that in some examples we are now going after them because the environment changed. At much earlier points we did actually value the people who could recite even if somewhat brokenly because we lacked random order recital tools.
1 reply →