Technical SEO Audit Checklist 2026: 47 Checks for Every Site
A technical SEO audit finds the issues that prevent your content from ranking — regardless of how good that content is. This checklist covers 47 checks across six categories, updated for 2026 to include AI search readiness alongside classic crawlability and performance checks.
Google's crawlers, indexers, and ranking algorithms have grown significantly more sophisticated in the last two years. Core Web Vitals are now well-established ranking factors. AI-powered search features — Google's AI Overviews, Bing Copilot integration, and third-party AI answer engines — have added a new layer of technical requirements. And with mobile-first indexing fully rolled out for all sites, the baseline for what constitutes a technically healthy site has risen.
Use this checklist as your complete framework. Work through it section by section, fix issues in priority order, and re-audit every three to six months.
Quick Reference: Audit Categories
| Category | Checks | Primary Tool |
|---|---|---|
| 1. Crawlability & Indexation | 10 | Google Search Console, Screaming Frog |
| 2. Core Web Vitals & Performance | 8 | PageSpeed Insights, CrUX |
| 3. Site Architecture & Internal Links | 7 | Screaming Frog, Ahrefs |
| 4. On-Page Technical Signals | 8 | Screaming Frog, manual |
| 5. Structured Data & Schema | 7 | Rich Results Test, Schema Validator |
| 6. AI Search Readiness | 7 | Manual, Perplexity, ChatGPT |
1. Crawlability & Indexation
These checks ensure that search engines can find, access, and index your pages. Issues here are foundational — no amount of content quality or link building compensates for a site that can't be properly crawled.
- robots.txt is not blocking important pages. Fetch
yourdomain.com/robots.txtdirectly. Confirm that key sections (homepage, blog, product pages) aren't disallowed. A mistakenDisallow: /will de-index your entire site. - XML sitemap is submitted and has no errors. Check Google Search Console → Sitemaps. Confirm the sitemap is submitted, discoverable, and error-free. All listed URLs should return 200 status codes.
- No pages with
noindexthat should be indexed. Run a Screaming Frog crawl and filter by "noindex" directive. Check for mistaken noindex tags on important pages (a common dev-environment flag left on after launch). - Canonical tags are correctly implemented. Self-referencing canonicals on all pages. Canonical tags should match the preferred URL exactly, including protocol (https), subdomain, trailing slash consistency.
- HTTPS is properly configured sitewide. All pages served over HTTPS. No mixed content warnings. HTTP URLs redirect to HTTPS via 301, not 302.
- Crawl budget is not wasted on low-value URLs. Check for paginated URLs, faceted navigation, filter parameters, or session IDs that create duplicate or near-duplicate pages. Use
noindexor canonical tags to consolidate. - No orphan pages (pages with no internal links pointing to them). Screaming Frog or Ahrefs site audit will flag orphan pages. Important pages that aren't internally linked get crawled less frequently and rank poorly.
- Redirect chains are less than 2 hops. Redirect chains (A→B→C→D) waste crawl budget and dilute link equity. Screaming Frog's redirect map will show all chains — clean them to single-hop 301s.
- No 4xx errors on indexed pages. In Google Search Console → Pages, check the "Not found (404)" count. Cross-reference with any inbound links pointing to those 404 URLs using Ahrefs or GSC's Link report.
- Hreflang is correctly implemented for multilingual sites. If you have language variants, check that hreflang tags are reciprocal (each language page references all others), that x-default is set, and that all hreflang values use valid language codes.
2. Core Web Vitals & Performance
Core Web Vitals have been a confirmed Google ranking factor since 2021. Passing the thresholds doesn't guarantee higher rankings, but failing them creates a ceiling on how competitive your pages can be in close matchups.
| Metric | Good | Needs Improvement | Poor |
|---|---|---|---|
| LCP (Largest Contentful Paint) | ≤ 2.5s | 2.5s – 4.0s | > 4.0s |
| INP (Interaction to Next Paint) | ≤ 200ms | 200ms – 500ms | > 500ms |
| CLS (Cumulative Layout Shift) | ≤ 0.1 | 0.1 – 0.25 | > 0.25 |
- LCP score is "Good" (≤ 2.5s) on mobile. Check PageSpeed Insights for your key landing pages. LCP is usually the hardest to pass and most impactful. Common fixes: serve hero images via CDN, add
fetchpriority="high"to the LCP element, preconnect to image origins. - INP score is "Good" (≤ 200ms). INP replaced FID as a Core Web Vital in March 2024. It measures responsiveness to user interactions. Common culprits: heavy JavaScript execution blocking the main thread, long tasks during page load.
- CLS score is "Good" (≤ 0.1). Layout shifts caused by images without explicit width/height attributes, dynamic content injection (ads, embeds), or web fonts loading. Set dimensions on all images and iframes.
- TTFB (Time to First Byte) is below 800ms. TTFB reflects server response time. If TTFB is high, investigate server infrastructure, hosting tier, or caching configuration before optimising client-side performance.
- Images are served in modern formats (WebP or AVIF). Next-gen formats are 25–35% smaller than JPEG at equivalent quality. Check that your image pipeline converts to WebP/AVIF and that the server sends correct Content-Type headers.
- Render-blocking resources are eliminated or deferred. CSS in
<head>is necessary; JavaScript that isn't render-critical should usedeferorasyncattributes. Check PageSpeed Insights' "Eliminate render-blocking resources" opportunity. - Mobile performance is tested separately from desktop. Google uses mobile-first indexing. Your desktop PageSpeed score is less relevant than your mobile score. Run all performance checks on mobile emulation.
- Font loading strategy doesn't cause layout shift. Web fonts cause CLS if the fallback font is significantly different in size. Use
font-display: swaporoptional, size-adjust, and preload for critical fonts.
3. Site Architecture & Internal Links
Site architecture determines how PageRank (link equity) flows through your site and how clearly topical authority is communicated to search engines.
- Key pages are within 3 clicks of the homepage. Crawl depth matters for both crawl budget and PageRank dilution. Important pages buried 6+ clicks deep receive less equity and crawl less frequently.
- The site has a clear content hierarchy. Topic clusters with a pillar page linking to supporting pages, and those pages linking back to the pillar, signal topical authority clearly. Flat architectures with no clear grouping miss this signal entirely.
- Internal links use descriptive anchor text. Avoid generic anchor text ("click here", "read more"). Use keyword-relevant anchor text that matches the target page's topic. Don't over-optimise with exact-match anchors on every link.
- No broken internal links. 404s from internal links waste crawl budget and create poor user experience. Screaming Frog will list all internal 404s in a single export.
- High-value pages receive proportionally more internal links. Check the "Inlinks" count for your target pages in Screaming Frog or Ahrefs. If a key landing page receives fewer internal links than a sidebar blog post, rebalance.
- Breadcrumb navigation is implemented consistently. Breadcrumbs help users navigate and provide structured hierarchy signals for search engines. They also improve the appearance of rich results in SERPs when marked up with schema.
- Pagination is handled correctly. For paginated series, use
rel="next"andrel="prev"if relevant, or consolidate paginated content with a "view all" canonical URL. Avoid indexing thin paginated pages.
4. On-Page Technical Signals
These checks cover the technical implementation of on-page elements that directly affect how pages are understood, ranked, and displayed in search results.
- Each page has a unique, descriptive title tag (50–60 characters). Duplicate title tags dilute relevance signals. Check Screaming Frog's "Page Titles" tab for duplicates, missing titles, and titles over 60 characters that will be truncated in SERPs.
- Each page has a unique meta description (150–160 characters). Meta descriptions don't directly affect rankings but do affect click-through rate. Duplicates and missing descriptions are both missed opportunities.
- Each page has exactly one H1 tag. Multiple H1s confuse hierarchy signals. Missing H1s miss a key relevance signal. The H1 should contain the primary keyword for the page.
- Heading hierarchy is logical (H1 → H2 → H3). Skipping heading levels (H1 directly to H4) weakens document structure signals. Screen readers and structured data extractors also rely on logical hierarchy.
- Image alt text is descriptive on all significant images. Alt text helps search engines understand image content and improves accessibility compliance. Decorative images should use
alt=""; content images need descriptive text including keywords where natural. - URL structure is clean, short, and keyword-relevant. URLs should be lowercase, use hyphens (not underscores), avoid dynamic parameters where possible, and include the target keyword. Avoid deep URL structures like
/blog/2023/04/22/article-title/. - Thin content pages are consolidated, noindexed, or improved. Pages under 300 words that don't serve a clear purpose (contact pages, thank-you pages) should be noindexed. Thin category pages should be improved or consolidated.
- Open Graph and Twitter Card meta tags are set on key pages. Social sharing metadata doesn't affect organic ranking but improves click-through when content is shared on social platforms.
5. Structured Data & Schema Markup
Schema markup doesn't directly boost rankings, but it enables rich results in SERPs and provides clear semantic signals that AI search systems increasingly use to understand and cite content.
- No structured data errors in Google Search Console. Check Search Console → Enhancements for any structured data errors that are preventing rich result eligibility.
- Article schema is implemented on blog posts. Article or BlogPosting schema should include
headline,datePublished,dateModified,author, andpublisher. This is a prerequisite for Top Stories eligibility. - FAQ schema is implemented on relevant pages. FAQ schema enables the expanded Q&A display in SERPs and is one of the most reliable ways to increase SERP real estate for informational content.
- BreadcrumbList schema matches actual breadcrumb navigation. Breadcrumb schema should match the visible breadcrumbs on the page. Mismatches can trigger validation warnings in Search Console.
- Product schema is correctly implemented on e-commerce pages. Product schema with Price, Availability, and Review properties enables rich product listings. Check for required properties and that prices are kept current (stale prices can cause disapproval).
- Organization schema is on the homepage. Establishes entity information for the site — name, logo, URL, social profiles. Helps Google's Knowledge Graph understand who the site belongs to.
- LocalBusiness schema is implemented for local businesses. If the site represents a physical location, LocalBusiness schema with address, opening hours, and phone number improves local search visibility and enables map pack eligibility.
6. AI Search Readiness
AI-powered search features — Google AI Overviews, Bing Copilot, Perplexity, ChatGPT Browse — now account for a meaningful share of search-driven traffic and citations. Technical readiness for AI extraction is an emerging SEO category that deserves explicit attention in 2026.
- Content is structured with clear, extractable answer blocks. AI systems prefer content with concise answer paragraphs near the top of sections, followed by supporting detail. The pattern "Topic + Direct Answer + Supporting Context" outperforms long introductory paragraphs before the actual answer.
- robots.txt doesn't block major AI crawlers. Check that
GPTBot,ClaudeBot,PerplexityBot,Bytespider, andCCBotare not blocked unless intentional. Blocking AI crawlers removes your content from AI citation eligibility. - The site has clear authorship signals. Author pages, bylines, author schema, and links to author profiles on authoritative external sites (LinkedIn, Google Scholar, industry publications) improve E-E-A-T signals that AI systems use to assess citation-worthiness.
- Content contains verifiable, factual claims with clear sources. AI systems are more likely to cite content that contains specific data points, statistics, and named sources. Vague claims without supporting evidence are less likely to be extracted.
- The site is mentioned or cited by authoritative external sources. AI training data and live-search citation algorithms both favour content that is already referenced by trusted sources. This is a long-term investment, but it begins with producing content worth citing.
- Key pages load without requiring JavaScript execution. Many AI crawlers don't execute JavaScript. If your core content is rendered client-side (SPA architecture), ensure critical text content is available in the initial HTML response via SSR or static generation.
- The site appears in Perplexity and ChatGPT searches for your key topics. Manually test 5–10 core queries in AI answer engines. If competitors appear in citations and you don't, that signals a discoverability or authority gap worth investigating.
Running Your Audit: Recommended Tool Stack
| Task | Tool | Cost |
|---|---|---|
| Full site crawl | Screaming Frog SEO Spider | Free (up to 500 URLs) / £259/year |
| Index coverage, manual actions | Google Search Console | Free |
| Core Web Vitals (field data) | PageSpeed Insights / CrUX | Free |
| Backlinks, broken links | Ahrefs or Semrush | From $99/month |
| Structured data validation | Google Rich Results Test | Free |
| Full automated audit | AuditDepot | Free |
AuditDepot checks over 100 SEO signals in seconds and produces a prioritised report of exactly what to fix — no account required.
→ Run your free audit now
How Often Should You Run a Technical SEO Audit?
The right cadence depends on site size and how frequently your site changes:
- Large e-commerce / enterprise sites: Monthly crawls + GSC monitoring weekly
- Mid-size sites (1,000–50,000 pages): Quarterly full audit, GSC monitoring monthly
- Small sites / blogs: Twice yearly, plus any time a major platform or CMS update is deployed
The triggers that should prompt an immediate audit regardless of schedule:
- A significant traffic drop in Google Analytics or GSC
- A major CMS migration or site redesign
- A Google algorithm update confirmation (check Google Search Status Dashboard)
- Moving to a new domain or HTTPS migration
This checklist is updated for 2026. For a fully automated version of this audit — including INP testing, AI crawlability checks, and structured data validation — run a free audit on AuditDepot.