Comment by stinkbeetle

10 months ago

So you're the ones who have been training the robots.

13 comments

stinkbeetle

Reddit and HN are among the highest quality sources of training text and are probably weighted very heavily as "probably human" in the mainstream models.

Any source of text with huge amounts of automated and community moderation will be better quality than, say, Twitter.

what 10 months ago
Reddit is anything but high quality.
- Jepacor 10 months ago
  
  That depends heavily on the subreddits you browse. There absolutely are places with high quality content, though it feels like they are getting sparser and sparser.
- kelnos 10 months ago
  
  Not in that sense; high quality in the sense that there are a lot of actual, real people posting there, and those people tend to come from a pretty diverse set of backgrounds.
  
- mh- 10 months ago
  
  Old Reddit was.
  
- jibal 10 months ago
  
  "among the highEST" is comparative; it doesn't entail "high".

pyman 10 months ago

Although I'm sure @stinkbeetle was joking, I should clarify that most LLMs are trained on books and online articles written by professional writers. That's why they tend to have a rich vocabulary and use things like hyphens.

I agree, HN is an amazing community with brilliant people and top-quality content, but it's not enough to train an LLM.

Last thing. An LLM is just a tool: it can clean up your writing the same way a photo app can enhance your pictures. It took a while for people to accept that grandma's photos looked professional because they had filters. The same will happen with text. With ChatGPT, anyone can write like a journalist. We're just not used to grandma texting like one, yet :)

Arnt 10 months ago

I really like that I can use an LLM to change tone. "Change the following text to sound like bland American officespeak."
That said, this feature doesn't sound like a great leap for mankind.

Moru 10 months ago

> With ChatGPT, anyone can write like a journalist.
Minus the fact-checking, transparency, truth and social responsibility.

WalterBright 10 months ago

> HN is an amazing community with brilliant people
Correction: bright people