.cursor/skills/ai-search-technical-auditor/SKILL.md
Audit front-end code for AI search readiness. Use when reviewing HTML structure, meta tags, schema markup, and technical elements that affect how AI crawlers understand and index web pages.
Audit HTML, meta tags, schema markup, and technical implementation to ensure pages are optimized for AI search engine crawlers.
AI search engines use different crawlers than traditional search engines. Key AI crawlers include:
| Crawler | Platform | User Agent Contains | Primary Purpose |
| ----------------- | ---------- | ------------------- | ----------------------------------- |
| GPTBot | OpenAI | GPTBot | Training data for GPT models |
| ChatGPT-User | OpenAI | ChatGPT-User | Real-time browsing for current info |
| OAI-SearchBot | OpenAI | OAI-SearchBot | Indexing for ChatGPT Search |
| ClaudeBot | Anthropic | ClaudeBot | Training Claude and live retrieval |
| PerplexityBot | Perplexity | PerplexityBot | Building independent search index |
| Google-Extended | Google | Google-Extended | Training data for Gemini AI |
| Applebot-Extended | Apple | Applebot-Extended | Training Apple Intelligence |
Check that AI crawlers are allowed:
# Good - Allow AI crawlers
User-agent: GPTBot
Allow: /
User-agent: ChatGPT-User
Allow: /
User-agent: Google-Extended
Allow: /
User-agent: ClaudeBot
Allow: /
User-agent: PerplexityBot
Allow: /
# Bad - Blocking AI crawlers (unless intentional)
User-agent: GPTBot
Disallow: /
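The robots.txt check above can be automated. The following is a minimal sketch using Python's standard `urllib.robotparser`; the helper name `blocked_ai_crawlers` and the sample rules are illustrative assumptions, and the crawler tokens come from the table above. Note that the standard-library parser applies the first matching rule within a group, which may differ from how a given crawler resolves conflicting rules.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens taken from the crawler table above.
AI_CRAWLERS = [
    "GPTBot", "ChatGPT-User", "OAI-SearchBot", "ClaudeBot",
    "PerplexityBot", "Google-Extended", "Applebot-Extended",
]

def blocked_ai_crawlers(robots_txt: str, path: str = "/") -> list[str]:
    """Return the AI crawler tokens that are not allowed to fetch `path`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [ua for ua in AI_CRAWLERS if not parser.can_fetch(ua, path)]

# Hypothetical robots.txt content used only for illustration.
sample = """\
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Allow: /
"""

print(blocked_ai_crawlers(sample))  # ['GPTBot']
```

Crawlers with no matching group and no `*` group are treated as allowed, so an empty robots.txt reports no blocked crawlers.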
Audit questions:
Check for llms.txt file at domain root (/llms.txt):
# Company Name
> Brief description of the company/product
Key terms: important concept 1, concept 2, product name
## Docs
- [Documentation](https://example.com/docs.md): API and integration guides
## Product
- [Features](https://example.com/features.md): Platform capabilities
## Optional
- [Blog](https://example.com/blog.md): Industry insights
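A quick structural check of an llms.txt file might look like the sketch below. It only tests for the elements shown in the example above (a `#` title, a `>` summary, `##` sections, and markdown links); the function name `check_llms_txt` and the returned keys are assumptions made for illustration, not part of any llms.txt tooling.

```python
import re

def check_llms_txt(text: str) -> dict:
    """Summarize the structure of an llms.txt document (heuristic only)."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    # Section names are the "## " headings, in document order.
    sections = [ln[3:].strip() for ln in lines if ln.startswith("## ")]
    # Links are list items of the form "- [Label](url)".
    links = re.findall(r"^- \[([^\]]+)\]\(([^)\s]+)\)", text, flags=re.MULTILINE)
    return {
        "has_title": bool(lines) and lines[0].startswith("# "),
        "has_summary": any(ln.startswith("> ") for ln in lines),
        "sections": sections,
        "link_count": len(links),
    }

example = """\
# Company Name
> Brief description of the company/product
## Docs
- [Documentation](https://example.com/docs.md): API and integration guides
## Optional
- [Blog](https://example.com/blog.md): Industry insights
"""

print(check_llms_txt(example))
```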
Audit questions:
Check for proper semantic structure:
<!-- Good -->
<article>
<header>
<h1>Page Title</h1>
<p class="summary">TL;DR content here</p>
</header>
<section>
<h2>Section Heading</h2>
<p>Content...</p>
</section>
</article>
<!-- Bad -->
<div class="article">
<div class="title">Page Title</div>
<div class="content">
<div class="heading">Section Heading</div>
<div>Content...</div>
</div>
</div>
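The difference between the two snippets above can be detected mechanically. This is a rough sketch built on the standard `html.parser` module; the class and function names are hypothetical, and the set of tags counted as "semantic" is an assumption drawn from the elements this section mentions.

```python
from collections import Counter
from html.parser import HTMLParser

# Elements treated as semantic for this check (an assumption, adjust as needed).
SEMANTIC_TAGS = {"article", "section", "nav", "aside", "main", "header"}

class StructureCounter(HTMLParser):
    """Count how often each start tag appears in the document."""

    def __init__(self):
        super().__init__()
        self.tags = Counter()

    def handle_starttag(self, tag, attrs):
        self.tags[tag] += 1

def audit_structure(html: str) -> dict:
    counter = StructureCounter()
    counter.feed(html)
    counter.close()
    return {
        "h1_count": counter.tags["h1"],
        "has_main": counter.tags["main"] > 0,
        "semantic_tags": sorted(t for t in SEMANTIC_TAGS if counter.tags[t]),
    }

good = "<article><header><h1>Page Title</h1></header><section><h2>Section</h2></section></article>"
bad = '<div class="article"><div class="title">Page Title</div></div>'

print(audit_structure(good))
print(audit_structure(bad))
```

A page built only from `<div>` elements yields an `h1_count` of 0 and an empty `semantic_tags` list, which is the pattern the "Bad" example illustrates.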
Audit questions:
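- Are semantic elements used (`<article>`, `<section>`, `<nav>`, `<aside>`)?
- Is there exactly one `<h1>` per page?
- Is the main content wrapped in a `<main>` element?
<!-- Good - Content in HTML -->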
<h2>Product Features</h2>
<ul>
<li>Feature one description</li>
<li>Feature two description</li>
</ul>
<!-- Bad - Content in JavaScript only -->
<div id="features"></div>
<script>
renderFeatures(); // Content not in initial HTML
</script>
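One way to approximate what a non-rendering crawler sees is to read the initial HTML without executing any JavaScript and confirm that key phrases are present. The sketch below assumes the crawler ignores `<script>` and `<style>` content; `missing_from_initial_html` and the sample phrases are hypothetical names chosen for illustration.

```python
from html.parser import HTMLParser

class VisibleText(HTMLParser):
    """Collect text outside <script> and <style>, without running JavaScript."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())

def missing_from_initial_html(html: str, phrases: list[str]) -> list[str]:
    """Return the phrases that do not appear in the server-delivered text."""
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    return [p for p in phrases if p.lower() not in text]

static_page = "<h2>Product Features</h2><ul><li>Feature one description</li></ul>"
js_only_page = '<div id="features"></div><script>render("Feature one description");</script>'

print(missing_from_initial_html(static_page, ["Feature one description"]))
print(missing_from_initial_html(js_only_page, ["Feature one description"]))
```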
Audit questions:
- Do tables use header cells (`<th>`)?
- Are lists marked up with `<ul>`/`<ol>` elements?
<head>
<!-- Page title - Clear and descriptive -->
<title>What is GEO? Generative Engine Optimization Guide | Brand</title>
<!-- Meta description - Summarizes page content -->
<meta name="description" content="GEO (Generative Engine Optimization)
is the practice of optimizing content for AI search engines. Learn
how to improve visibility in ChatGPT, Perplexity, and Google AI Mode.">
<!-- Canonical URL -->
<link rel="canonical" href="https://example.com/geo-guide">
<!-- Language: declared on the root element, e.g. <html lang="en">, not inside <head> -->
<!-- Viewport for mobile -->
<meta name="viewport" content="width=device-width, initial-scale=1">
</head>
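The essential tags above can be checked with a small parser. This sketch reports which of title, meta description, canonical URL, `lang`, and viewport are absent or empty; the names `HeadAudit` and `missing_head_fields` are assumptions for illustration, and the check does not judge the quality of the values it finds.

```python
from html.parser import HTMLParser

class HeadAudit(HTMLParser):
    """Record the metadata fields discussed above as they are encountered."""

    def __init__(self):
        super().__init__()
        self.found = {"title": None, "description": None, "canonical": None,
                      "lang": None, "viewport": None}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "html":
            self.found["lang"] = a.get("lang")
        elif tag == "title":
            self._in_title = True
        elif tag == "meta" and a.get("name") in ("description", "viewport"):
            self.found[a["name"]] = a.get("content")
        elif tag == "link" and a.get("rel") == "canonical":
            self.found["canonical"] = a.get("href")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.found["title"] = (self.found["title"] or "") + data

def missing_head_fields(html: str) -> list[str]:
    parser = HeadAudit()
    parser.feed(html)
    parser.close()
    return sorted(name for name, value in parser.found.items() if not value)

page = (
    '<html lang="en"><head><title>What is GEO?</title>'
    '<meta name="description" content="GEO guide">'
    '<link rel="canonical" href="https://example.com/geo-guide">'
    "</head><body></body></html>"
)

print(missing_head_fields(page))  # ['viewport']
```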
Audit questions:
<meta property="og:title" content="What is GEO?" />
<meta property="og:description" content="Guide to Generative Engine Optimization" />
<meta property="og:type" content="article" />
<meta property="og:url" content="https://example.com/geo-guide" />
<meta property="og:image" content="https://example.com/images/geo-guide.jpg" />
Audit questions:
| Content Type | Recommended Schema | Required Properties |
| ------------ | ------------------ | ----------------------------------- |
| Article/Blog | Article | headline, author, datePublished |
| FAQ | FAQPage | mainEntity (Questions with Answers) |
| Tutorial | HowTo | name, step (HowToSteps) |
| Product | Product | name, offers, brand |
| Company | Organization | name, url, logo |
| Navigation | BreadcrumbList | itemListElement |
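The table above can be turned into a simple lookup for checking a page's JSON-LD. The sketch below assumes a single JSON-LD object with a string `@type`; the mapping mirrors the properties listed in the table rather than any search engine's official requirements, and `missing_schema_properties` is a hypothetical helper name.

```python
import json

# Required properties per schema type, copied from the table above.
REQUIRED = {
    "Article": ["headline", "author", "datePublished"],
    "FAQPage": ["mainEntity"],
    "HowTo": ["name", "step"],
    "Product": ["name", "offers", "brand"],
    "Organization": ["name", "url", "logo"],
    "BreadcrumbList": ["itemListElement"],
}

def missing_schema_properties(json_ld: str) -> list[str]:
    """Return the listed properties that the JSON-LD object does not define."""
    data = json.loads(json_ld)
    required = REQUIRED.get(data.get("@type"), [])
    return [prop for prop in required if prop not in data]

# Hypothetical JSON-LD payload for illustration.
article = json.dumps({
    "@context": "https://schema.org",
    "@type": "Article",
    "headline": "What is GEO?",
    "author": {"@type": "Person", "name": "Jane Doe"},
})

print(missing_schema_properties(article))  # ['datePublished']
```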
Audit questions:
AI crawlers have tight time budgets. Target these performance metrics:
| Metric | Target | Critical for |
| ------------------------------ | ------- | ----------------------- |
| Time to First Byte (TTFB) | < 200ms | All crawlers |
| First Contentful Paint (FCP) | < 0.4s | AI citation eligibility |
| Largest Contentful Paint (LCP) | < 2.5s | Full page analysis |
| First Input Delay (FID) | < 100ms | Interactivity |
| Cumulative Layout Shift (CLS) | < 0.1 | Visual stability |
The 0.4s speed signal: pages that reach First Contentful Paint in under 0.4s are claimed to be about 3x more likely to be cited by ChatGPT. Real-time AI agents may abandon pages that take longer to render.
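Once metrics have been measured with whatever tooling is available, comparing them against the targets is mechanical. The sketch below encodes the thresholds from the table above; the key names (such as `TTFB_ms`) and the `failing_metrics` helper are assumptions made for illustration, and metrics that were not measured are simply skipped.

```python
# Targets from the table above. Each metric must be strictly below its limit.
TARGETS = {
    "TTFB_ms": 200,
    "FCP_s": 0.4,
    "LCP_s": 2.5,
    "FID_ms": 100,
    "CLS": 0.1,
}

def failing_metrics(measured: dict[str, float]) -> list[str]:
    """Return the measured metrics that miss their target."""
    return [
        name for name, limit in TARGETS.items()
        if name in measured and measured[name] >= limit
    ]

# Hypothetical measurements for illustration.
print(failing_metrics({"TTFB_ms": 180, "FCP_s": 0.9, "LCP_s": 2.1}))  # ['FCP_s']
```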
Audit questions:
<!-- Viewport meta tag -->
<meta name="viewport" content="width=device-width, initial-scale=1" />
Audit questions:
Good URLs:
/guides/geo-optimization
/products/analytics-platform
/docs/api-reference
Bad URLs:
/page.php?id=123&cat=5
/guides/geo-optimization-best-practices-guide-2024-updated-version-3
/p/12345
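The patterns in the good and bad lists above can be approximated with a few heuristics. In this sketch the 60-character path limit and the issue labels are assumptions chosen to separate the examples shown here, not established thresholds, and `url_issues` is a hypothetical helper name.

```python
from urllib.parse import urlsplit

def url_issues(url: str, max_path_length: int = 60) -> list[str]:
    """Flag URL patterns like those in the 'Bad URLs' list (heuristic only)."""
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    issues = []
    if parts.query:
        issues.append("query parameters")
    if len(parts.path) > max_path_length:
        issues.append("path too long")
    if segments and segments[-1].isdigit():
        issues.append("numeric identifier instead of descriptive slug")
    return issues

for candidate in [
    "/guides/geo-optimization",
    "/page.php?id=123&cat=5",
    "/guides/geo-optimization-best-practices-guide-2024-updated-version-3",
    "/p/12345",
]:
    print(candidate, url_issues(candidate))
```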
Audit questions:
Server-Side vs Client-Side:
Audit questions:
# AI Search Technical Audit Report
**URL:** [Page URL]
**Date:** [Audit Date]
**Overall Score:** [X/100]
## Summary
[Brief summary of findings]
## Critical Issues (Fix Immediately)
- [ ] Issue 1
- [ ] Issue 2
## Warnings (Fix Soon)
- [ ] Warning 1
- [ ] Warning 2
## Passed Checks
- [x] Check 1
- [x] Check 2
## Performance Metrics
| Metric | Current | Target | Status |
| ------ | ------- | ------ | ------ |
| TTFB | Xms | <200ms | ✓/✗ |
| FCP | Xs | <0.4s | ✓/✗ |
| LCP | Xs | <2.5s | ✓/✗ |
## Recommendations
1. [Priority 1 recommendation]
2. [Priority 2 recommendation]
3. [Priority 3 recommendation]
## Next Audit Date
[Date for follow-up audit]
For fast assessments, check these critical items:
After implementing fixes, validate using: