What Is Site Architecture and Why It Matters for SEO

TL;DR

Site architecture is how your website’s pages are organized, linked together, and accessible to both users and search engines. Good architecture means: every important page is reachable within 3-4 clicks from the homepage, related content is grouped logically, internal links distribute authority to priority pages, and URL structure reflects the hierarchy. Poor architecture creates orphan pages Google can’t find, buries important content too deep, and confuses both users and search engines about what your site is about. Architecture is foundational; fixing it often has larger impact than content or link building.


Do This Today (3 Quick Checks)

  1. Count clicks to important pages: How many clicks from your homepage to reach your most important product or service page? More than 4 clicks = too deep.
  1. Check for orphan pages: In GSC, go to Links → Internal Links. Pages with zero or very few internal links may be orphaned and hard for Google to find.
  1. Review your URL structure: Do URLs reflect logical hierarchy (/services/consulting/)? Or are they flat and meaningless (/page-id-4829)?

Breadcrumb Implementation

What are breadcrumbs?
Navigation showing page hierarchy: Home > Category > Subcategory > Page

Why they matter for SEO:

  • Help Google understand site structure
  • Can appear in search results (enhanced visibility)
  • Improve user navigation
  • Create internal links up the hierarchy

Breadcrumb schema markup:

{
  "@context": "https://schema.org",
  "@type": "BreadcrumbList",
  "itemListElement": [
    {
      "@type": "ListItem",
      "position": 1,
      "name": "Home",
      "item": "https://example.com/"
    },
    {
      "@type": "ListItem",
      "position": 2,
      "name": "Kitchen",
      "item": "https://example.com/kitchen/"
    },
    {
      "@type": "ListItem",
      "position": 3,
      "name": "Refrigerators",
      "item": "https://example.com/kitchen/refrigerators/"
    },
    {
      "@type": "ListItem",
      "position": 4,
      "name": "Samsung French Door RF28"
    }
  ]
}

Implementation:

  • Display visible breadcrumbs on page
  • Add BreadcrumbList schema
  • Ensure breadcrumb links are crawlable
  • Breadcrumbs should match URL hierarchy

Faceted Navigation SEO

What is faceted navigation?
Filters on category pages: color, size, price, brand, etc. Each combination can create a new URL.

The problem:
A category with 5 facets of 10 options each = 100,000 possible URL combinations. This causes:

  • Crawl budget waste
  • Duplicate/thin content
  • Index bloat

Faceted navigation solutions:

Approach When to Use Implementation
<strong>Noindex, follow</strong> Filtered pages shouldn't rank Meta robots noindex
<strong>Canonical to base</strong> Filters don't change main content Canonical tag to unfiltered page
<strong>Robots.txt block</strong> Prevent crawling entirely Disallow filter parameters
<strong>Parameter handling</strong> Control in GSC GSC URL Parameters
<strong>AJAX/JavaScript</strong> Filters don't create URLs No URL change on filter

Recommended approach for most e-commerce:

  1. Allow indexing of valuable filtered pages (top brands, popular combinations)
  2. Noindex low-value filtered pages (obscure combinations)
  3. Canonical to base category for most filters
  4. Use consistent parameter order in URLs

Example decisions:

  • /shoes/?color=red → Noindex (color alone isn’t valuable)
  • /shoes/?brand=nike → Index (brand pages can rank)
  • /shoes/?color=red&size=10&brand=nike → Noindex (too specific)

Hub and Spoke Model

Concept:
Hub page (comprehensive, authoritative) links to and from spoke pages (detailed, specific).

[SPOKE: Running Shoes]
                    ↑↓
[SPOKE: Trail] ←→ [HUB: Athletic Shoes] ←→ [SPOKE: Cross Training]
                    ↑↓
           [SPOKE: Walking Shoes]

Hub page characteristics:

  • Comprehensive overview of topic/category
  • Links to all related spoke pages
  • Targets broader, higher-competition keywords
  • Regularly updated as spokes added
  • Higher internal link count (from spokes)

Spoke page characteristics:

  • Deep dive into specific subtopic
  • Links back to hub
  • Links to related spokes
  • Targets specific, long-tail keywords
  • Can be more focused and detailed

SEO benefits:

  • Hub accumulates authority from all spokes
  • Clear topical relationships
  • Prevents orphan pages
  • Supports both head terms and long-tail

Implementation:

  1. Identify your main topic areas (hubs)
  2. Map subtopics under each (spokes)
  3. Create or designate hub pages
  4. Ensure bidirectional linking (hub ↔ spoke)
  5. Add spoke-to-spoke links where relevant

Internal Link Audit Process

Step 1: Crawl your site
Use Screaming Frog, Sitebulb, or Ahrefs Site Audit to crawl all pages.

Step 2: Export internal link data
Get: URL, internal links in, internal links out, crawl depth

Step 3: Identify problems

Problem What to Look For Fix
<strong>Orphan pages</strong> 0 internal links in Add links from relevant pages
<strong>Deep pages</strong> Crawl depth 5+ Link from higher pages
<strong>Low-link important pages</strong> Priority pages with few links Add internal links
<strong>Over-linked low-value pages</strong> Thin pages with many links Reduce links or noindex
<strong>Broken internal links</strong> 404 internal links Fix or remove

Step 4: Map ideal link distribution
Priority pages should have proportionally more internal links:

  • Homepage: Links to all main categories
  • Category pages: Links from homepage, to products/posts
  • High-value products/content: Multiple link paths
  • Low-value/thin pages: Fewer links

Step 5: Create internal linking plan

  • Add contextual links in content
  • Improve navigation/menus
  • Add related content modules
  • Create hub pages linking to clusters

Tools for internal link analysis:

  • Screaming Frog (crawl data, link graphs)
  • Sitebulb (visual link flow)
  • Ahrefs Site Audit (internal link reports)
  • GSC Internal Links report (basic counts)

Site Architecture Principles

Principle Good Example Bad Example
<strong>Shallow depth</strong> Homepage → Category → Product (3 clicks) Homepage → Department → Category → Subcategory → Product (5 clicks)
<strong>Clear hierarchy</strong> /shoes/running/nike-air-max/ /product?id=4829&cat=17
<strong>Logical grouping</strong> All running shoes under /running/ Running shoes scattered across random URLs
<strong>Strategic internal linking</strong> Top pages have most internal links Internal links distributed randomly
<strong>No orphan pages</strong> Every page has multiple paths to it Pages only accessible via sitemap
<strong>Consistent navigation</strong> Same main nav across all pages Different navigation on different sections

Crawl Depth and PageRank Flow

How link authority flows:

Homepage (most authority)
    │
    ├── Main Category 1 (high authority)
    │   ├── Subcategory A (medium authority)
    │   │   ├── Page 1 (lower authority)
    │   │   └── Page 2 (lower authority)
    │   └── Subcategory B (medium authority)
    │
    ├── Main Category 2 (high authority)
    │
    └── About/Contact (lower priority, still reachable)

Key concept: Pages closer to homepage (fewer clicks away) typically receive more internal PageRank and are crawled more frequently.

Implication: Put your most important pages close to the homepage. Don’t bury important content 5+ clicks deep.


The Conversation

Omar, runs a large home improvement e-commerce site, 5,000 products across 50 categories, site has grown organically over 10 years without architectural planning, seeing crawling and indexing issues, some products not appearing in search, advanced business knowledge but limited technical SEO: We have about 5,000 products. Some categories have 500 products, others have 20. We’ve added categories over time whenever it made sense. Now we’re having problems. GSC shows thousands of products as “Discovered but not indexed.” New products take weeks to show up in Google. I think our site structure is a mess but don’t know how to fix it.

SDC SEO Brain: Ten years of organic growth without architectural planning is a common story, and the symptoms you describe (slow indexing, “Discovered but not indexed”) are classic signs of poor architecture. Let me diagnose. How deep is your navigation? If I’m on your homepage, how many clicks to reach a product?

Omar: Let me think… Homepage → Department (like “Kitchen”) → Category (like “Appliances”) → Subcategory (like “Refrigerators”) → Product. So 4 clicks minimum, but sometimes there’s another layer, so 5.

SDC SEO Brain: Four to five clicks is borderline. For a large catalog, some depth is necessary, but you want your most important products within 3 clicks. More critically, how many products are linked only from their subcategory page? No cross-links, no “related products,” no collection pages?

Omar: Most products are only in one subcategory. We don’t have a lot of cross-linking.

SDC SEO Brain: That’s likely your main problem. With 5,000 products and minimal cross-linking, most products have maybe 1-2 internal links pointing to them (their subcategory page, maybe a sitemap). Google assigns low priority to pages with few internal links. It discovers them but doesn’t prioritize crawling or indexing.

Omar: How do we add more internal links to 5,000 products? That sounds impossible.

SDC SEO Brain: You don’t add them manually one by one. You create structures that generate internal links programmatically:

Related products: On every product page, show 4-6 related products. Those are internal links.

Customers also bought: Another set of internal links per product.

Category features: On category pages, feature specific products, creating links from higher-authority category pages to individual products.

Collection/curated pages: Create pages like “Best Refrigerators Under $1000” that link to qualifying products.

Cross-category linking: A refrigerator might also appear on a “Kitchen Essentials” collection page.

Omar: We have related products but they’re auto-generated and often irrelevant.

SDC SEO Brain: Irrelevant related products don’t help much. If your refrigerator page shows related products that are random microwaves, it’s not building topical connection. Ideally, related products should be: genuinely related (other refrigerators, or complementary items like ice makers), and potentially manually curated for top products.

Omar: What about our URL structure? It’s a mess. Some products are /product/12345, others are /kitchen/appliances/product-name/.

SDC SEO Brain: Inconsistent URL structures signal disorganization to Google and make internal linking more complex. Ideal URL structure for e-commerce:

/category/subcategory/product-name/
Example: /kitchen/refrigerators/samsung-french-door-rf28/

This creates:

  • Hierarchy visible in URL
  • Category relationships explicit
  • Breadcrumb structure natural
  • Clean, readable URLs

Changing URLs now is disruptive (requires redirects, can temporarily impact rankings), but for a major restructure, it may be worth it.

Omar: Would we need to redirect thousands of URLs?

SDC SEO Brain: Yes, if you change URL structure. Every old URL needs a 301 redirect to its new location. This is technically complex and has short-term ranking impact, but for a fundamentally broken architecture, it’s often necessary. The alternative is living with a messy structure forever.

Omar: What about our category structure? We have 50 categories but they’re not well organized.

SDC SEO Brain: Fifty categories is a lot. How are they organized? Do you have parent categories (departments) with child categories, or are all 50 at the same level?

Omar: Mostly same level. Some things are weirdly categorized. We have both “Cooking Appliances” and “Kitchen Appliances” and “Stoves” and “Ranges” as separate categories.

SDC SEO Brain: That fragmentation confuses both users and Google. Overlapping categories cause products to be miscategorized, internal linking to be diluted, and Google to be uncertain which category page to rank for related queries.

Restructure into:

  • Fewer main departments (8-12)
  • Logical subcategories within each (50 total subcategories is fine)
  • Clear parent-child relationships
  • No overlapping category names

Example structure:

Kitchen
├── Cooking Appliances (stoves, ranges, ovens, microwaves)
├── Refrigeration (refrigerators, freezers, wine coolers)
├── Dishwashers
└── Small Appliances (blenders, toasters, etc.)

Omar: That’s a major project. Where do we even start?

SDC SEO Brain: Start with diagnosis and planning before execution:

Step 1: Crawl your site
Use Screaming Frog or Sitebulb to crawl everything. Analyze:

  • Crawl depth (how deep are products?)
  • Internal link counts per page
  • Orphan pages
  • Duplicate content
  • URL patterns

Step 2: Map current vs ideal structure
Document current categories and where products live. Design ideal structure. Map old to new.

Step 3: Prioritize by impact
Not all 5,000 products need fixing at once. Start with:

  • Top revenue products
  • Products with most search potential
  • Categories with worst structure

Step 4: Execute in phases
Restructure one department at a time. Set up redirects. Monitor impact before proceeding.

Omar: How long before we see results from restructuring?

SDC SEO Brain: Architecture changes take time to propagate. Google needs to:

  • Recrawl affected pages (weeks)
  • Process new internal link structure (weeks)
  • Re-evaluate page importance based on new links (weeks to months)

Expect 2-4 months before seeing meaningful ranking changes from major restructuring. Some improvements (like adding related products) can show faster results.


FAQ

Q: What’s the ideal click depth for pages?
A: Important pages should be reachable in 3 clicks or fewer from the homepage. Less important pages can be 4 clicks. Anything 5+ clicks deep is at risk of being undervalued by Google.

Q: How do I find orphan pages?
A: Crawl your site with Screaming Frog or Sitebulb and look for pages with zero internal links. Also check GSC → Links → Internal Links for pages with very few links.

Q: Should I use flat or hierarchical URL structures?
A: For most sites, hierarchical URLs (/category/subcategory/page/) are clearer for users and Google. Flat URLs (/page-name/) work for simple sites but don’t convey structure.

Q: Does changing site architecture hurt SEO?
A: Short-term, yes. Restructuring with URL changes causes temporary ranking fluctuations. Long-term, good architecture improves rankings. Use 301 redirects for all URL changes and expect 1-3 months of adjustment.

Q: How many internal links should a page have?
A: There’s no perfect number. Priority pages should have more internal links than low-priority pages. The important thing is proportionality: your homepage might have 50-100 outbound internal links; a product page might have 20-30.


Summary

Site architecture is foundational SEO. Poor architecture causes indexing problems, authority dilution, and confused topical signals. Fixing architecture often has more impact than content or link building.

Key architecture principles:

  • Important pages within 3 clicks of homepage
  • Logical grouping of related content
  • Internal links proportional to page importance
  • No orphan pages (every page has multiple paths)
  • Clear URL hierarchy reflecting site structure

Internal linking distributes authority. Pages with more internal links from authoritative pages receive more “PageRank.” Homepage → Category → Product structure naturally flows authority downward.

Common architecture problems:

  • Too deep: Important pages 5+ clicks from homepage
  • Orphan pages: Products only accessible via sitemap
  • Fragmented categories: Overlapping, poorly organized categories
  • Inconsistent URLs: No clear pattern or hierarchy
  • Weak cross-linking: No related products, collections, or contextual links

Architecture changes require patience. Restructuring with redirects takes 2-4 months to fully impact rankings. Plan carefully, execute in phases, monitor results.

Crawl your site to diagnose. Tools like Screaming Frog reveal crawl depth, orphan pages, internal link distribution, and URL patterns. Diagnosis before action.


Sources