Back to feed
Dev.to
Dev.to
5/12/2026
I Built an llms.txt Generator, Showed It to the Creator of the Standard, and Had to Rewrite Everything

I Built an llms.txt Generator, Showed It to the Creator of the Standard, and Had to Rewrite Everything

Short summary

The author built a llms.txt generator (a standard for AI agents to navigate sites) using a flat one-URL-to-one-summary approach. After feedback from the standard's creator, they redesigned it to use semantic clustering and hierarchical markdown, solving five engineering problems: context windows, token costs, pipeline speed, LLM reliability, and language handling. The rewrite teaches patterns for scalable AI-native products: embeddings, context caching, independent-layer queuing, exception hierarchies, and state recovery.

  • Initial flat llms.txt design was fundamentally flawed; the standard requires curated hierarchical documents, not auto-generated summaries.
  • Rewrote v2 to cluster pages by meaning, use Gemini context caching for token efficiency, decouple pipeline layers, and handle LLM failures gracefully.
  • Real product engineering exposes problems iteratively; shares specific architectural patterns applicable to AI-native systems at scale.

Generated with AI, which can make mistakes.

Is this a good recommendation for you?

Explore more