Comment by zuzululu

20 hours ago

You mean after they scrape American LLMs?

I think they poison outputs now if they detect distillation attempts. So a model trained on distilled outputs will be stupider.

I don’t mind if they scrape the scrapers.

  • Training models on scraped content and scraping output from trained models are completely different things. The output is not the original scraped content; it is synthesized.

    • >>> completely different

      Why? Because it costs more money? Tell that to the content creators whose content is scraped / distilled by these entitled scrapers.
