← Back to context

Comment by rickette

12 hours ago

LLMs.txt is also nonsense since it isn't adopted by any of the major AI players.

Google has recently added `llms.txt` to Chrome's Lighthouse check for agentic browsing (https://searchengineland.com/google-llms-txt-chrome-lighthou...), so adoption may be coming. Admittedly, I put more faith in

  <link rel="alternate" type="text/markdown" href="https://example.com/foo.md" title="Markdown version of the &lt;Foo&gt; page">

that I copied from Gwern.net. This convention is discoverable (just read the HTML) and naturally adapts to any website size and structure.

I have created an `llms.txt` for my website anyhow. I use a fixed LLM prompt to generate it from the internal links in `index.md`.

The same could be said of robots.txt

And anything else that might tell them not to access something.

  • robots.txt predates the modern web though

    • My point was that llms.txt not working is no different from them ignoring everything else that came before and probably everything that is yet to come.

      If they want it, they will take it, polite directives in text files will have no effect.

To be fair, "not adopted by any major AI player" is probably the most web-standard-compliant phase of a new web standard.