Your Site Architecture Is a Confession
    Back to Articles
    Algorithm Science and Technical SEO

    Your Site Architecture Is a Confession

    Michael McDougald
    October 18, 2024

    Your Site Architecture Is a Confession

    Every website has a site architecture. Most are accidents. Marketing needs a landing page, so someone adds one. Blog posts accumulate without any linking strategy. Service pages multiply like weeds in a garden nobody tends. Through all of it, the architecture that emerges tells Google exactly what you understand about search.

    It is, whether you intended it or not, a confession.

    I've audited hundreds of websites over fifteen years. I can tell you within ten minutes of looking at a site's internal linking structure, URL hierarchy, and crawl depth what the previous agency understood about SEO, what they faked, and what they never considered. The architecture never lies. Every decision about where a page lives, what links to it, and how many clicks separate it from the homepage is a priority declaration that Google's algorithms read as clearly as you read this sentence.

    What Google Actually Measures When It Reads Your Site Architecture

    Most SEO advice about site architecture stops at "keep it flat" and "use internal links." That's like building a house by saying "use walls." The algorithm does something far more specific, and the 2024 Google API documentation leak confirmed what many of us suspected.

    Google calculates a metric called onsiteProminence, which measures "the importance of the document within its site." It computes this by propagating simulated traffic from the homepage and from pages that already receive high search clicks. Put simply, Google runs a simulation of how a real person would navigate your website, starting from your homepage and your highest-traffic pages, and uses that simulation to decide how important each page actually is to you.

    This isn't PageRank in the traditional sense. This is Google asking a more pointed question: if someone started at your homepage and clicked through your site naturally, which pages would they reach first, and which pages would they never find?

    The leaked documentation also revealed a three-tier index structure. The most important, frequently updated content sits in Base, stored on flash memory with the fastest access and highest crawl priority. Content of medium importance lands in Zeppelins on SSDs. The pages Google considers least important get filed in Landfills on slow hard drives, where they are crawled infrequently and ranked reluctantly. Your site architecture is one of the primary signals that determines which tier your pages land in. A page buried six clicks from your homepage, with two internal links pointing to it, is telling Google to put it in the Landfill. And Google listens.

    The Reasonable Surfer Is Judging Your Internal Links

    This is where most agencies fail their clients. They know internal links matter, so they scatter them everywhere. Footer links, sidebar widgets, "related posts" modules that link to random content. They treat all internal links as if they carry equal weight. They don't.

    Google holds a patent called the Reasonable Surfer Model (US8117209B1) that describes exactly how the algorithm assigns different weights to different links on the same page. The patent identifies specific factors: the position of the link on the page (above the fold carries more weight than below), the font size and visual prominence of the link, the number of words in the anchor text, and whether the link is in editorial content versus a navigation list. A link in the first paragraph of your body copy, using descriptive anchor text, passes significantly more weight than a link buried in a footer alongside two hundred other links.

    What this means for your site architecture is uncomfortable. Every internal link you place is a vote, and Google is reading the conviction behind that vote. A contextual link from a high-traffic blog post to your core service page, placed in the opening paragraphs with relevant anchor text, is a loud vote of confidence. A footer link to the same page, surrounded by every other page on the site, is a whisper that gets lost in the noise. Your linking patterns confess your priorities. If your most important revenue pages receive the majority of their internal links from footers and sidebars instead of editorial content, you're confessing that nobody on your team understood where link equity actually flows.

    I wrote about how common SEO mistakes compound over years of neglect. Bad internal linking architecture is one of the slowest, most invisible mistakes a site can make, precisely because it doesn't break anything visibly. The pages still load. The links still work. But the authority distribution is silently wrong, and the algorithm notices.

    Flat Architecture Is Not a Strategy

    The most repeated piece of site architecture advice on the internet is "keep your architecture flat." Every page within three clicks of the homepage. It's been repeated so often that people follow it without understanding why it existed in the first place.

    Flat architecture matters because of crawl depth. Pages closer to the homepage get crawled more frequently. A page at depth two might get crawled daily, while a page at depth six might get crawled once a month or never. Google's own documentation confirms that crawl budget is finite and allocated based on signals including internal link prominence. So yes, keeping important pages close to the homepage helps with crawl priority.

    But flatness alone is not architecture. It's just proximity. Real architecture creates a hierarchy that maps to topical relationships. Your pages should not just be close to the homepage. They should be organized into clusters that tell Google how your expertise is structured. A pillar page on technical SEO that links to supporting pages about crawl budget, site speed, and structured data creates a topical cluster that signals depth and authority. The same pages scattered randomly across your blog, all equally close to the homepage but with no logical relationship between them, create noise instead of signal.

    This is where the topical authority conversation intersects with architecture. You can't build topical authority without architectural intent. Publishing fifty posts about the same subject does nothing if those posts don't link to each other in a way that creates a navigable, hierarchical structure. The algorithm doesn't just count pages on a topic. It evaluates whether those pages are organized in a way that suggests genuine expertise, and that evaluation starts with how you structured them architecturally.

    The Architectural Signals That Separate Amateurs from Professionals

    I can diagnose a website's entire SEO history by looking at four architectural elements.

    URL structure is the most visible confession. Clean, hierarchical URLs like /services/technical-seo/site-audits tell Google and users where a page sits in the organizational hierarchy. URLs like /page-id-4837 or /blog/2023/04/15/post-title confess that nobody thought about information architecture when the site was built. The URL structure should mirror the navigational hierarchy, and the navigational hierarchy should mirror the topical structure of your expertise. When all three align, Google receives a clear, consistent signal about what your site is and how it's organized.

    Orphan pages, those with no internal links pointing to them, are architectural abandonment. They confess that content was created without a distribution strategy. Google's crawlers follow internal links to discover pages. An orphan page is a page you're asking Google to ignore, whether you meant to or not.

    Breadcrumbs reinforce your hierarchy. They provide a secondary navigational signal that tells both users and search engines exactly where a page sits within your structure. When breadcrumbs align with your URL structure and your main navigation, you're sending three consistent architectural signals. When they contradict each other, you're confessing confusion.

    And sitemaps, both XML for search engines and HTML for users, are your opportunity to declare your full architectural intent. An XML sitemap that includes every page on your site, organized by priority and update frequency, tells Google what you consider important. A missing or outdated sitemap confesses neglect.

    What I See When I Audit Site Architecture

    The patterns are remarkably consistent. A company hires an agency that builds them a beautiful website. The design is clean. The copy is polished. The site launches and everyone celebrates. Six months later, they wonder why organic traffic hasn't materialized.

    The answer is almost always architectural. The blog sits in a subfolder with no internal links to the service pages. The service pages link to each other haphazardly, creating loops instead of hierarchy. The most important revenue-generating pages are four or five clicks from the homepage because the navigation was designed for aesthetics, not for crawl priority. The homepage links to an "About" page and a "Contact" page but not to the core service categories that should be receiving the most link equity.

    I see sites where the blog has three thousand posts and not a single one links to a service page. The content team was told to "write for SEO" but nobody told them that content without architectural intent is just noise. I see sites where every page in the footer links to every other page, creating a web of links so dense that no single link carries meaningful weight. I see URL structures that were changed during a redesign without redirects, silently severing every internal link that pointed to the old paths.

    Every one of these patterns is a confession. The missing blog-to-service links confess that the content strategy and the business strategy were never connected. The footer link soup confesses that someone read "internal links help SEO" and implemented it literally without understanding the Reasonable Surfer patent. The broken redirects confess that the redesign team didn't include an SEO strategist, or worse, included one who didn't understand that architecture is one of the signals Google's quality systems evaluate.

    Architecture Is Strategy Made Visible

    Your site architecture is not a technical detail to be delegated to a developer. It is a strategic document. It declares what your business considers important, how your expertise is organized, and whether you understand the system you're trying to rank in. Google's algorithms, from onsiteProminence simulations to Reasonable Surfer link weighting to three-tier index placement, are reading your architecture as a signal of quality and intent.

    The sites that rank well, the ones that maintain visibility through core updates and algorithm shifts, share a common trait. Their architecture is intentional. Every page has a purpose, a place in the hierarchy, and a network of internal links that distribute authority according to business priority. Their URL structures mirror their navigation, which mirrors their topical clusters, which mirrors their actual expertise.

    The sites that struggle share a different trait. Their architecture is an accident. It grew organically, which is another way of saying it grew without strategy. And every time Google crawls it, the algorithm sees the confession: this site was built by people who didn't understand how search actually works.

    You can fix it. But you have to start by reading your own confession honestly.

    Michael McDougald is the founder of Right Thing SEO, a Nashville-based agency that builds search strategies on algorithm research, not guesswork. If your site architecture needs an honest audit, start the conversation here.

    MM

    Michael McDougald

    Founder of Right Thing SEO, a math-driven SEO agency based in Nashville and Sarasota. Michael has spent 15+ years helping businesses achieve sustainable organic growth through data-driven strategies.

    Learn more about Michael →

    Ready to Stop the Fall?

    Get a free SEO assessment and discover what's holding your site back.