AI Search Optimisation (AEO): a 2026 guide for Malaysian businesses
Answer Engine Optimisation (AEO) is the practice of structuring web content so AI assistants like ChatGPT, Claude, Perplexity, and Google AI Overviews extract and cite your website when answering user questions. For Malaysian SMEs in 2026, the full AEO stack is: llms.txt, explicit bot allows (GPTBot, ClaudeBot, PerplexityBot, Google-Extended), answer-first content, speakable schema, entity sameAs arrays, author Person schema on blog posts, and FAQPage schema site-wide. Most Malaysian SME sites have none of these — which is the opportunity.
Why this matters now
Half of high-intent buyer research — especially in B2B and professional services — now starts in an AI assistant. A buyer asks ChatGPT "best tax consultant in KL" before they google it. ChatGPT answers, cites three firms, and the buyer calls those three. The other twenty tax consultants in the area aren't considered, regardless of how good their Google ranking is.
That's the shift. Ranking in Google still matters. But being cited in AI answers is the new page-one.
How LLMs decide what to cite
AI assistants don't use PageRank. They use a mix of training data (baked in), live web fetches (via their crawlers), schema signals (extractable structure), entity signals (disambiguation), and content quality signals (is the page self-contained and clear?). A site with strong signals in all five becomes a default citation in its category. Absence of any one cripples the others.
The AEO stack, step by step
1. Make sure AI crawlers can reach you
First, the basics. AI crawlers respect robots.txt. If yours blocks them (default shared-hosting templates often do), you're invisible. Check yours for these user-agents and explicitly Allow them:
GPTBot— OpenAI (ChatGPT, GPT-powered apps)ClaudeBotandanthropic-ai— Anthropic (Claude)PerplexityBot— PerplexityGoogle-Extended— Google AI Overviews, Gemini trainingCCBot— Common Crawl (upstream training data)Applebot-Extended,Bytespider,cohere-ai,Meta-ExternalAgent
This alone puts you ahead of most Malaysian SME sites.
2. Publish an llms.txt
Proposed in 2024, llms.txt is the emerging AI-crawler analog to sitemap.xml: a Markdown file at the site root summarising your key pages, positioning, and content bundle links. Not all crawlers respect it yet, but leading ones do. Implementation cost is effectively zero; benefit grows as the standard matures. Also publish llms-full.txt — a larger plain-text bundle of your key page content. See our llms.txt and llms-full.txt as working examples.
3. Structure content answer-first
LLMs extract the first self-contained paragraph. If yours is a marketing tagline ("We're passionate about delivering excellence…"), that's what gets extracted. If yours is a clear 40–60 word answer to the page's implicit question, that gets extracted. Every page on this site opens with such a block — wrapped in <div data-answer> and styled as a callout so humans read it too.
4. Speakable schema
Add SpeakableSpecification to your schema graph, with CSS selectors pointing to the answer blocks. Voice assistants extract these; increasingly, so do LLM-based search engines.
5. Entity sameAs arrays
LLMs struggle to disambiguate brands. "Atlas" could be a gym, a consultancy, a moving company. The sameAs array on your Organization schema solves this: list your LinkedIn company page, Google Business Profile, Facebook, Instagram, and Crunchbase. Each external URL is a disambiguation signal.
6. Author Person schema on blog posts
Unattributed content is low-trust. Attributed content — real author, real LinkedIn, real credentials — is high-trust and gets cited preferentially. Every Atlas-built site has author Person schema with sameAs LinkedIn on every blog post.
7. FAQPage schema, site-wide
FAQ blocks wrapped in FAQPage schema get extracted directly by LLMs when users ask matching questions. Put FAQ sections on your homepage, services pages, and pillar blog posts. Wire each to FAQPage schema.
8. Citation-worthy content
The single biggest AEO differentiator is original primary content. LLMs cite primary sources disproportionately. If your blog is rehashed generic advice, you're competing with 10 million other pages. If you publish original data — "we surveyed 50 Malaysian SMEs on…", "we tested X across our 20 client sites and found…" — you become the source AI engines quote.
What Malaysian SMEs should do next
- Audit your
robots.txt— are AI crawlers Allowed? - Publish a basic
llms.txt - Rewrite the opening paragraph of your homepage as an answer-first block
- Add
sameAsto your Organization schema - Wrap your FAQs in FAQPage schema
That's a day's work. It puts you ahead of 95% of Malaysian SME competition on AEO signals. If you want it done properly and integrated with web design + SEO, that's exactly what the Atlas package ships.
AEO in 2026 is where SEO was in 2005. The sites that got it right early still dominate two decades later.
Stay Updated
Get the latest web design tips and insights delivered to your inbox. No spam, just value.