Why Most SA Operator Sites Aren't AI-Ready - And The 3 Things To Fix
Most serviced accommodation websites are invisible to ChatGPT, Perplexity and Google AI Overview. Here are the three specific gaps and how to close them this quarter.
Chris McCrow The short answer: Most SA operator websites fail AI readiness on three counts: they block AI crawlers in robots.txt, they have no llms.txt file telling AI agents what the business does, and their pages lack the structured data (schema markup) that lets AI parse services and availability. All three are fixable in under a month without rebuilding anything.
A guest opens Perplexity and asks “serviced apartments in Leeds with parking, two bedrooms, flexible cancellation”. The AI reads a handful of operator websites, pulls structured data from two of them, and recommends those two in its response. The other fifteen operators in Leeds never appear.
Those fifteen operators might have excellent properties. Good reviews on Booking.com. Decent Google rankings. They are invisible to AI assistants because their websites were built for human readers and Google’s traditional crawler, not for the large language models that now sit between the guest and the search result.
This is not a theoretical future problem. ChatGPT, Perplexity, Claude, and Google AI Overview are already answering accommodation queries, and the operators whose sites are parseable are getting mentioned. The ones whose sites are not parseable are losing a channel they did not know existed.
We audit SA operator websites every week. The same three gaps appear on nearly every site we review. Here is what they are and how to fix each one.
Gap 1: AI Crawlers Are Blocked by Default
Most CMS platforms ship with a robots.txt that either blocks non-Google crawlers or says nothing about them. WordPress themes, Wix templates, and even some custom builds default to restrictive crawler policies because the templates were written before AI search existed.
The AI crawlers that matter for accommodation discovery are:
- GPTBot (ChatGPT and OpenAI products)
- ClaudeBot (Claude and Anthropic products)
- PerplexityBot (Perplexity search)
- Google-Extended (Google AI Overview and Gemini)
If your robots.txt does not explicitly allow these user agents, your site may be invisible to them. Some CDN configurations and security plugins add blanket blocks on non-standard crawlers, which catches all four.
The Fix
Open your robots.txt file (usually at yourdomain.com/robots.txt) and confirm it includes explicit Allow directives for each AI crawler. A minimal addition looks like this:
User-agent: GPTBot
Allow: /
User-agent: ClaudeBot
Allow: /
User-agent: PerplexityBot
Allow: /
User-agent: Google-Extended
Allow: /
Check your CDN and security plugin settings separately. Cloudflare’s bot management, Sucuri, and Wordfence can all block AI crawlers at a level that overrides your robots.txt.
Time required: fifteen minutes. Impact: you go from invisible to crawlable overnight. This is the single highest-leverage change on the list.
Gap 2: No llms.txt File
An llms.txt is a plain-text file at your domain root (like robots.txt) that gives AI agents a structured summary of your business. It tells them who you are, what you offer, who you serve, and how to contact you, in a format they are designed to read.
Without one, an AI agent has to infer your business model from your homepage HTML, your meta tags, and whatever it can scrape from inner pages. That inference is error-prone. We have seen AI assistants describe operators as hotels, confuse serviced apartments with co-working spaces, and miss entire service categories because the homepage copy was written for humans who already understand the context.
An llms.txt removes the guesswork. The AI reads a deterministic, machine-formatted summary and uses that as its primary source for describing your business.
What Goes in It
For a serviced accommodation operator, your llms.txt should include:
- Business name, trading name, and legal entity (if different)
- Location and coverage area
- Property types and approximate unit count
- Target guest segments (corporate relocation, leisure, contractor, extended stay)
- Key amenities and differentiators
- Direct booking URL
- Contact details (email, phone)
- A pointer to your sitemap.xml
The Fix
Write a plain-text file following the format above, save it as llms.txt, and upload it to your domain root so it is accessible at yourdomain.com/llms.txt. There is a growing standard for the format (the llms.txt specification), but even a well-structured freeform file is better than nothing.
For a more comprehensive version, consider adding an llms-full.txt that includes service descriptions, FAQ answers, and pricing tiers. AI agents will use the longer version when they need detail and the short version for quick summaries.
Time required: one to two hours of writing, fifteen minutes to deploy. Impact: AI agents have a reliable, accurate summary of your business that they will use in preference to parsing your homepage.
For a full walkthrough of the llms.txt format and how it fits into a broader agent-facing strategy, see our detailed guide on agent-facing websites for hospitality.
Gap 3: Missing or Incomplete Schema Markup
Schema.org JSON-LD is structured data embedded in your page source that tells machines exactly what is on the page. It is how Google generates rich snippets, and it is how AI agents parse your services, locations, and reviews without having to guess from your copy.
Most SA operator websites have either no schema markup at all, or a minimal auto-generated version from a WordPress plugin that covers Organization basics and nothing else.
The schemas that matter for AI discoverability in serviced accommodation are:
- LodgingBusiness on your homepage and location pages (tells AI agents you provide accommodation, not office space)
- Service on each service page (corporate stays, leisure lets, property management)
- FAQPage on any page with an FAQ section (AI agents lift these answers directly into their responses)
- Article on every blog post (author, date, publisher, description)
- AggregateRating if you have verifiable reviews (trust signal for recommendations)
The critical detail most operators miss is consistency between the schema data and the visible page content. If your schema says you have 50 apartments but your homepage says 45, AI agents flag the inconsistency and downweight the source. Schema markup needs to match what is actually on the page.
The Fix
If you are on WordPress, a plugin like Yoast SEO or RankMath can generate basic Organization and Article schema. But for LodgingBusiness, Service, and FAQPage, you will likely need custom implementation or a developer who understands JSON-LD.
Test your existing markup with Google’s Rich Results Test (search.google.com/test/rich-results) to see what you already have and what is missing. Then prioritise additions in this order:
- LodgingBusiness on homepage
- FAQPage on service and location pages
- Service on each service offering
- Article on blog posts
Time required: a day of developer work for a standard CMS site. Impact: AI agents can parse your services accurately, which is the difference between being recommended as “a serviced apartment operator in [your city] with corporate and leisure stays” versus being described vaguely or not at all.
Why These Three Gaps Compound
Each gap makes the others worse. If your crawlers are blocked, the best llms.txt in the world does nothing because no AI agent ever reads it. If your crawlers are allowed but you have no llms.txt or schema, the AI is guessing from unstructured HTML and getting your services wrong. If you have schema but no llms.txt, the AI has structured data for individual pages but no business-level summary to anchor its understanding.
Fix all three and you have a site that AI agents can discover, understand, and recommend accurately. The compound effect is what moves you from invisible to cited.
How We Know This Works
We built websiteforbookings.com with the agent-facing pattern from scratch: schema markup on every page, an llms.txt file at the domain root, explicit AI crawler permissions, and answer-first content structure. When you ask ChatGPT or Perplexity what we do, the answer is accurate because the AI has a clean, machine-readable source to draw from.
The same structural approach applies to client projects. When we rebuilt the organic content strategy for relocationapartments.com, the site grew from zero to over 40,000 unique monthly visitors through organic search over eighteen months. That traffic volume means thousands of pages indexed and crawlable, which is exactly the kind of content library that AI agents reference when recommending operators. Strong traditional SEO creates the content base that agent-facing optimisation makes discoverable through AI channels.
For operators who already have a site that converts well through Google, these three changes are the fastest path to appearing in the AI-mediated searches that are growing alongside traditional search, not replacing it.
What to Do Next
If you are not sure whether your site has these gaps, check three things right now:
- Visit yourdomain.com/robots.txt and look for GPTBot, ClaudeBot, and PerplexityBot. If they are not mentioned, or if there is a blanket
Disallow: /for unknown agents, you are blocked. - Visit yourdomain.com/llms.txt. If you get a 404, you do not have one.
- Run your homepage through Google’s Rich Results Test. If you see Organization but no LodgingBusiness, Service, or FAQPage, your schema is incomplete.
Three checks, five minutes. If any of the three fail, you now know what to fix and roughly how long it takes.
For a broader view of how AI is reshaping hospitality marketing beyond just website structure, see our overview of AI in serviced accommodation marketing for 2026.
Or if you want a professional assessment of where your site stands, request a free website audit. We will check your AI readiness alongside traditional SEO, mobile experience, and conversion fundamentals, and tell you exactly what to prioritise.
Frequently Asked Questions
What does “AI-ready” mean for a serviced accommodation website?
An AI-ready website is one that AI assistants (ChatGPT, Perplexity, Claude, Google AI Overview) can crawl, parse, and accurately recommend to users. It requires three things: explicit crawler permissions in robots.txt, an llms.txt file summarising the business, and schema.org JSON-LD markup on key pages. These additions sit on top of traditional SEO, not instead of it.
Will making my site AI-ready hurt my Google rankings?
No. Every change required for AI readiness also benefits traditional search. Schema markup improves rich snippet eligibility. Clean content structure helps Google’s own algorithms. Allowing AI crawlers does not affect Googlebot’s behaviour. The two strategies reinforce each other.
How long before I see results from these changes?
AI crawler activity in your server logs typically appears within one to two weeks of updating robots.txt. Citations in ChatGPT and Perplexity responses start appearing in four to twelve weeks, depending on your content quality and how competitive your market is. Track the trend over quarters rather than expecting an overnight change.
Do I need to rebuild my website to become AI-ready?
No. All three fixes described in this post are additions to your existing site, not replacements. A robots.txt update takes fifteen minutes. An llms.txt file takes an afternoon. Schema markup implementation takes a day of developer time for a standard CMS site. You do not need a new design, a new CMS, or a new hosting provider.
About this content: This article was created with AI-assisted research and drafting, then reviewed and refined by Chris McCrow. I set the direction, provide the expertise, and own every word published. Learn about our content approach.
Chris McCrow
Founder of Website for Bookings. 20+ years in accommodation tech and hospitality marketing.
Related Articles
Need help with your direct booking strategy?
We specialise in helping serviced accommodation operators reduce OTA dependency and grow direct bookings.