Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library
← Back to context

Comment by jppope

2 months ago

What about for their LLM products? We know that OpenAi does not respect the robots.txt file

2 comments

jppope

Reply

xnx  2 months ago

Google uses the same crawler and robots.txt file for training data.

  • inkysigma  2 months ago

    It's actually a different crawler for training data: Googlebot-extended so you can exclude yourself from the training data though not the search summaries.

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities