llms.txt: What It Is and How to Create One

A plain-English guide to the llms.txt file: what goes in it, how to generate and validate one, and the honest answer to whether it actually moves AI search visibility.

Key takeaways
  • llms.txt is a Markdown file at your site root that gives AI models a curated map of your most important content.
  • Critics called it useless because crawlers rarely fetch it, yet Google's experimental Lighthouse agentic browsing audit now checks for it, a sign it is becoming table stakes for the agent era.
  • It is cheap to add and future-friendly, but real AI citations come from clear, well-structured content, not the file alone.

llms.txt is a plain-text Markdown file you place at the root of your website (at /llms.txt) that gives large language models a clean, curated map of your most important pages. Think of it as a robots.txt for the AI era: instead of telling crawlers what to avoid, it tells AI models what matters and where to find it.

The format was proposed in 2024 by Jeremy Howard of Answer.AI and is documented at llmstxt.org. The pitch is simple: web pages are full of navigation, scripts and markup that waste an AI model's limited context window, so a curated Markdown index helps models ingest your canonical content efficiently.

What is llms.txt?

llms.txt is a single Markdown file that lists your site's key pages as titled links, often with one-line descriptions, so an AI system can find and read your best content without crawling the whole site. It lives at a fixed location, https://yourdomain.com/llms.txt, the same way robots.txt and sitemap.xml do.

The standard (llmstxt.org)

There is no official, search-engine-backed standard for llms.txt. It is a community proposal published at llmstxt.org and adopted voluntarily by some documentation platforms and tools. Treat it as a useful convention, not a rule Google or OpenAI has endorsed.

llms.txt vs robots.txt vs sitemap.xml

FileAudienceJob
robots.txtAll crawlersSays which URLs crawlers may or may not access.
sitemap.xmlSearch enginesLists every indexable URL so nothing is missed.
llms.txtAI models / LLMsCurates and describes your most important content in Markdown.

What an llms.txt file looks like

An llms.txt file starts with an H1 (your site name), an optional blockquote summary, then H2 sections containing Markdown link lists. Here is a real, annotated example:

llms.txt
# AI Ranking

> Community and course teaching business owners AI search
> engine optimization (SEO and GEO).

## Core guides
- [Learn GEO](https://airankingskool.com/learn/geo/): Get cited by AI search
- [Learn AI SEO](https://airankingskool.com/learn/ai-seo/): Automate SEO with AI
- [llms.txt guide](https://airankingskool.com/learn/geo/llms-txt/): This page

## Optional
- [About](https://airankingskool.com/about/): Founder story and authority

File structure and syntax

  • H1: the site or project name (required, one only).
  • Blockquote: a short summary of what the site is.
  • H2 sections: grouped Markdown link lists (for example "Core guides", "Docs").
  • Optional section: links an AI can skip if context is tight.

llms.txt vs llms-full.txt

Some sites also publish llms-full.txt, which inlines the full Markdown content of every page rather than just linking to it. It is heavier but lets a model ingest everything in one request. Use llms.txt as the index and llms-full.txt only if your content is small enough to fit.

How to create an llms.txt file

  1. List the 10 to 30 pages you most want AI models to read (pillars, key guides, product pages).
  2. Write a one-line description for each, focused on what the page answers.
  3. Format them as Markdown: an H1 title, a blockquote summary, then H2 sections with link lists.
  4. Save the file as llms.txt and upload it to your site root so it resolves at /llms.txt.
  5. Validate it, then keep it updated as you publish important new pages.

Generate it (free generators)

You do not have to write it by hand. Free generators like Firecrawl (llmstxt.firecrawl.dev) and Mintlify's auto-generation will crawl your site and produce a draft. Inside the community we use DataWise to generate and maintain it, then prune it down to your money pages.

Platform notes (WordPress, Next.js, Wix)

  • WordPress: the easiest route. Both Yoast SEO and Rank Math now include a free llms.txt option: flip the toggle on (Yoast generates and refreshes it automatically and lets you preview it; in Rank Math it is the "LLMS Txt" module) and the file is served at /llms.txt with no code. A plugin-generated file will not be as curated as one you write by hand, but it is a solid one-click start.
  • Next.js: serve it from /public/llms.txt or a route handler.
  • Wix / Squarespace: use a file or redirect that exposes /llms.txt at the root.

Validate it

Run your file through an llms.txt validator (low effort, easy win) to confirm the Markdown structure is correct and every link resolves. Broken links in your llms.txt defeat the purpose.

Does llms.txt actually work?

The honest answer

As of 2026 there is little public evidence that the major AI crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended) routinely fetch llms.txt. Adoption is real but modest, and several SEO teams report no llms.txt requests in their server logs.

Is anyone actually using it?

Yes, on the publishing side. Semrush counted roughly 951 domains with an llms.txt file by mid-2025, and directories like directory.llmstxt.cloud track adoption. Many of those are documentation sites (Mintlify, Anthropic docs and similar) where a clean Markdown index is a natural fit.

The reality check: do AI crawlers fetch it?

This is where most guides hedge and we will not. If you check your bot analytics or raw server logs, you will usually see plenty of hits to your normal HTML pages from AI crawlers and almost none to /llms.txt. The file is a forward-looking convention, not a switch that turns on AI traffic.

Why "it's useless" is now the outdated take

For most of 2024 and 2025 the loudest verdict on llms.txt was that it is useless, and the proof offered was always the same: check your server logs and you will see almost no AI crawlers requesting the file. That was a fair observation. It also measured the wrong thing.

In its experimental agentic browsing audit, Google's Lighthouse (the site-quality tool built into Chrome DevTools) now grades how ready your site is for AI agents. One of its checks sits under a section called Stability and Discoverability, is named simply llms.txt, and "checks for the presence of a machine-readable summary at the domain root." In plain English: Google's own tooling now looks for an llms.txt file and counts its presence in your favor.

What actually changed

The value of llms.txt is shifting from "do today's crawlers fetch it" to "is your site legible to the AI agents Google is now testing for." When Google starts scoring something in Lighthouse, it stops being a fringe convention and starts becoming table stakes. You can run this audit yourself: see our SEO audit guide for how to test agentic browsing in Chrome.

What actually wins AI citations

AI search cites pages it can read and trust. That comes from a direct answer in your first two sentences, clean heading structure, real expertise and being mentioned across the web, not from a text file alone. That is the core of generative engine optimization, and it is exactly what we teach. Add llms.txt because it is cheap and future-friendly, but put your real effort into answering questions directly and getting cited by ChatGPT.

Put it into practice

Learn Generative Engine Optimization hands-on inside the community

Courses, live calls and DataWise to track your AI citations and AI Overview presence.

Free for members

Do this faster with DataWise

DataWise helps you track your AI citations and AI Overview presence, free with every paid membership. Stop stitching together five different tools.

See DataWise
DataWise SEO tool dashboard
FAQ

llms.txt: common questions

What is llms.txt?

llms.txt is a Markdown file at your site root (/llms.txt) that gives AI models a curated list of your most important pages with short descriptions, so they can find and read your best content efficiently.

How do you create an llms.txt file?

List your key pages, write a one-line description for each, format them as Markdown with an H1 title, a blockquote summary and H2 link sections, then upload the file to your site root so it resolves at /llms.txt. Free generators like Firecrawl and Mintlify can draft it for you.

Is there an official llms.txt standard?

No. llms.txt is a community proposal documented at llmstxt.org, introduced by Jeremy Howard of Answer.AI. No search engine or AI company has adopted it as an official standard, so treat it as a useful convention.

What is the difference between llms.txt and robots.txt?

robots.txt tells crawlers which URLs they may access. llms.txt is for AI models and curates and describes your most important content in Markdown. They serve different audiences and jobs.

Do AI crawlers actually use llms.txt?

There is little evidence that major AI crawlers routinely fetch llms.txt today. Adoption is growing on the publishing side, but most server logs show AI crawlers hitting normal HTML pages, not /llms.txt. Add it as a low-cost, future-friendly step, but do not expect it to drive AI traffic on its own.

Is llms.txt useless?

No, and that view is now outdated. Critics called it useless because AI crawlers rarely requested the file. But Google's experimental Lighthouse agentic browsing audit now checks for an llms.txt file at your domain root, under its Stability and Discoverability section, which signals the file is becoming part of being AI-ready rather than a pointless extra.

Does Google check for llms.txt?

Yes, in its experimental agentic browsing audit in Lighthouse. That audit includes an llms.txt check that looks for a machine-readable summary at your domain root and ties it to discoverability and stability for AI agents. It is experimental and not a confirmed ranking factor, but it shows Google now treats llms.txt as a real signal.

Does llms.txt help SEO or AI search visibility?

Indirectly at best right now. Real AI visibility comes from clear, well-structured, trustworthy content that directly answers questions, which is the heart of GEO. llms.txt is a small complement to that work, not a replacement.

How do I create an llms.txt file in WordPress?

The fastest way is an SEO plugin. Both Yoast SEO and Rank Math now offer a free llms.txt feature: turn on the toggle and the plugin generates and serves the file at /llms.txt automatically, no code needed. Yoast refreshes it for you and lets you preview it; Rank Math exposes it as its 'LLMS Txt' module. A plugin file is less curated than one you write by hand, but it is a good one-click start.

Where should the llms.txt file live?

At your site root, so it is reachable at https://yourdomain.com/llms.txt, the same convention as robots.txt and sitemap.xml.

Stop guessing

Learn AI search with a community that has your back

Join 7,400+ business owners, agencies and freelancers, and get the tools, skills and live coaching to win in AI search.