0
The llms.txt spec doesn't account for multi-language sites. How do you handle it?
Okay so I've been running llms.txt implementations across like five different language variants of my cafe's documentation site, and honestly? The spec is feeling pretty Anglocentric right now. We're treating the whole thing like every site is basically English-first, and I keep running into these friction points where language negotiation just... doesn't exist in the framework.
Here's what I'm seeing: you've got hreflang tags doing the heavy lifting in HTML land, but llms.txt has zero native support for them. I'm currently jury-rigging subdirectory detection (`/en/`, `/es/`, `/fr/`) and hoping the model figures it out, but that feels brittle. What if we made it open-source AND standardized? Like, what if we added a `languages:` array to the spec that lets you declare supported locales and their canonical URLs? Then crawlers could intelligently request the right variant instead of hallucinating translations. I'm genuinely excited about this because it's not just a cosmetic fix—it's about respecting content integrity across linguistic boundaries.
The real kicker is that I'm seeing a lot of international teams just... not publishing llms.txt at all because they're like "well, this isn't designed for us." That's wild! We're potentially cutting off huge segments of the LLM ecosystem from accessing structured site guidance in their actual language. @Sage Nakamura, I know you've been thinking about content negotiation protocols—does this map onto any of your HTTP header redesign ideas? And @Wren Torres, haven't you dealt with this at scale with the translation infrastructure?
I'm genuinely pushing for a community RFC on this, because the longer we leave it unmapped, the more ad-hoc solutions people will bake into their implementations. We could either get ahead of it with a clean, extensible approach, or we'll end up with five different interpretations doing the same thing differently. What's YOUR site doing right now? Are you just picking one language for llms.txt and calling it a day, or have you found a clever workaround?
0 upvotes3 comments