SEO Técnico
12 min read Maio 2025 By Edinaldo Xavier

Technical SEO for e-commerce: complete crawlability, indexing and structured data audit

How to audit the technical SEO of an online store from scratch — crawling, indexing, structured data and Core Web Vitals. Tools, methodology and the most common errors that cost rankings.

Why technical SEO decides before content

Publishing excellent content in a store that Googlebot cannot crawl is like putting a sign in the window of a closed store. Technical SEO is the set of conditions that allow Google to find, interpret and index each product, category and content page of your e-commerce.

A well-performed technical audit answers three questions: can the bot reach it? Can it read? Is what it reads sufficiently structured to generate rich results? This article maps each layer of this process with tools and checklists applicable to any platform.

⚠️

Warning: Crawlability and indexing errors usually appear weeks after publishing — when organic traffic simply does not grow. Preventive diagnosis avoids months of lost content.

Layer 1: Crawlability — can the bot reach it?

Crawlability is the ability of Googlebot to navigate your site. The most common problems in e-commerce:

  • robots.txt blocking product or category pages — classic error in migrated stores
  • Noindex on listing pages — common in themes with incorrect default settings
  • JavaScript blocking rendering — SPAs and modern frameworks that Googlebot does not execute correctly
  • Chain redirects — each additional hop consumes crawl budget
  • Broken internal links (404) — waste authority and create dead ends for the bot

How to audit crawlability

Use Google Search Console → URL Inspection to verify pages individually. For a systemic view, tools like Screaming Frog, Sitebulb or Ahrefs Site Audit map the entire site and identify problematic patterns.

Also check the server crawl log: it shows which URLs Googlebot visited, how often and which returned errors. This information is more precise than any third-party tool.

Crawlability checklist: robots.txt without improper blocks · Updated and submitted XML Sitemap · No noindex on strategic pages · Direct redirects (no chain) · Crawl budget preserved (avoid infinite pagination without rel=canonical)

Layer 2: Indexing — is what was crawled in the index?

Crawling does not mean indexing. Google can visit a page and decide not to include it in the index for various reasons: duplicate content, perceived low quality, canonical pointing to another URL, or simply by algorithmic decision.

Canonicalization and duplicate products

In e-commerce, duplicates are endemic: the same t-shirt appears in /tops/white-t-shirt/, /sale/white-t-shirt/ and /summer-collection/white-t-shirt/. Without a correct rel=canonical, Google chooses which version to index — often not the one you want.

URL parameters for filters, sorting and tracking also create silent duplicates. Configure Google Search Console → URL Parameters to tell Google how to handle each parameter.

Thin content and empty category pages

Category pages with only 4 or 5 products, no description, no editorial context, are candidates for "I will not index this". Add a paragraph of relevant text above the products, specify the value proposition of the category and use heading tags (H1, H2) hierarchically.

Layer 3: Structured Data — data that generates rich results

Structured data are JSON-LD markup that tells Google what each element means, not just how it looks. For e-commerce, the most relevant types are:

🛍️

Product + Offer

Activates price, availability and reviews in search results. Required for shopping listings.

Review + AggregateRating

Displays rating stars directly in the organic result — increases CTR by up to 35%.

🗺️

BreadcrumbList

Shows the navigation hierarchy in the snippet and improves Google's understanding of site structure.

FAQPage

Expands the result with a questions accordion — takes up more space and reduces competitor clicks.

How to validate structured data

Use Google's Rich Results Test to check if the markup is correct and eligible for rich snippets. The Schema Markup Validator detects syntax errors. Both are free and should be part of the routine after any deployment.

💡

Note Google's requirements: Review rich snippets require reviews to be from real users, not the company. Incorrect markup can result in a manual penalty.

Core Web Vitals as a ranking factor

Since the 2021 Page Experience Update, LCP, INP and CLS are ranking signals. A store with LCP above 4s can have all the correct content, perfect structured data and still lose to faster competitors.

Measure your Core Web Vitals with PageSpeed Insights and, more importantly, with CrUX field data in Search Console (under "Page Experience"). Lab data is useful for debugging, but field data is what Google uses for ranking.

Technical SEO audit checklist — e-commerce

AreaWhat to checkTool
Crawlabilityrobots.txt, noindex, redirectsSearch Console, Screaming Frog
IndexingCoverage report, canonicals, duplicatesSearch Console
SpeedLCP, INP, CLS — field dataPageSpeed Insights, CrUX
Structured DataProduct, Offer, Review, BreadcrumbRich Results Test
Internal links404s, orphan pages, click depthScreaming Frog, Ahrefs
HTTPSValid certificate, mixed contentSSL Labs, DevTools

Need a complete technical SEO audit?

Detailed diagnosis of crawlability, indexing and structured data of your store — with report prioritized by impact and technical correction plan.

Is your store invisible to Google?

Crawlability and indexing errors are silent — the content exists, but no one finds it. A technical diagnosis identifies exactly where the problem is.

🔍

Complete audit

Crawlability, indexing, structured data and speed — all mapped with quantified impact.

📊

Prioritized report

Problems ordered by ranking impact, not technical complexity.

Action plan

Specific fixes for each problem found, with effort estimate.