Technical SEO for News Publishers

Slides from my talk at the News and Editorial SEO Summit 2022, where I spoke about key aspects of technical SEO for news publishers.

More info: https://newsseo.io

Barry Adams

October 26, 2021

Transcript

  1. @badams 1. Crawler (Googlebot)
     ➢ URL discovery
     ➢ URL prioritisation
     ➢ URL de-duplication
     ➢ Queue management
     ➢ HTTP response parsing
     ➢ TTFB monitoring
     ➢ Resource management
     ➢ … ?
  2. Optimise Crawling (2)
     • Serve correct HTTP status codes
       ➢ 200 OK
       ➢ 301 / 302 Redirects
       ➢ 304 Not Modified
       ➢ 401 / 403 Permission Issues
       ➢ 404 / 410 Not Found/Gone
       ➢ 5xx Error
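As a rough illustration of the status-code groups on this slide, the sketch below buckets a response code into the same categories so that a crawl log or site audit can be summarised by group. The function name `describe_status` and the bucket labels are hypothetical, and the grouping reflects generic HTTP semantics only; it makes no claim about how Googlebot itself weighs each code.

```python
from http import HTTPStatus


def describe_status(code: int) -> str:
    """Bucket an HTTP status code into the groups listed on the slide.

    The bucket names are illustrative labels, not an official taxonomy.
    """
    if code == HTTPStatus.OK:  # 200
        return "ok"
    if code in (HTTPStatus.MOVED_PERMANENTLY, HTTPStatus.FOUND):  # 301 / 302
        return "redirect"
    if code == HTTPStatus.NOT_MODIFIED:  # 304
        return "not-modified"
    if code in (HTTPStatus.UNAUTHORIZED, HTTPStatus.FORBIDDEN):  # 401 / 403
        return "permission-issue"
    if code in (HTTPStatus.NOT_FOUND, HTTPStatus.GONE):  # 404 / 410
        return "not-found-or-gone"
    if 500 <= code <= 599:  # any 5xx
        return "server-error"
    return "other"


if __name__ == "__main__":
    for sample in (200, 301, 304, 403, 410, 503):
        print(sample, describe_status(sample))
```

Counting responses per bucket over a day of server logs is one simple way to spot, for example, a sudden rise in 5xx responses served to crawlers.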
  3. Optimise Crawling (3)
     • ALL resources consume crawl budget
       ➢ Not just HTML pages
       ➢ Reduce HTTP requests per page
     • Google AdsBot can consume crawl budget
       ➢ Double-check your Google Ads campaigns
     • Link equity (PageRank) impacts crawl budget
       ➢ More link equity = more crawl budget
  4. 2. Indexer
     ➢ Index selection
     ➢ HTML tokenisation & parsing
     ➢ Rendering (+++)
     ➢ Meta tag processing
     ➢ Canonicalisation
     ➢ Index sanitation
     ➢ Calculating PageRank
     ➢ Quality evaluations
     ➢ … ?
  5. Optimise Extraction (1)
     • Clean HTML;
       ➢ Yes, really!
       ➢ There is a max HTML size Google will parse
         - Speculation: ~1 MB
       ➢ Less clutter = easier parsing
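A minimal sketch of how the size point could be checked in practice: measure the UTF-8 byte size of a page's HTML and compare it against a threshold. The helper name `html_size_report` is hypothetical, and the default limit of 1,000,000 bytes simply mirrors the slide's own speculation of roughly 1 MB; it is not a documented Google limit.

```python
def html_size_report(html: str, limit_bytes: int = 1_000_000) -> dict:
    """Report the UTF-8 size of an HTML document against a threshold.

    limit_bytes defaults to ~1 MB, which is the slide's speculation,
    not a confirmed parsing limit.
    """
    size = len(html.encode("utf-8"))
    return {"bytes": size, "within_limit": size <= limit_bytes}


if __name__ == "__main__":
    page = "<html><head><title>Example</title></head><body></body></html>"
    print(html_size_report(page))
```

Running this over a sample of article templates gives a quick view of which page types carry the most markup.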
  6. Optimise Extraction (2)
     • Clean <head>;
       ➢ Critical meta tags high in the <head>
         - Title & description
         - Open Graph
         - Canonical, hreflang & mobile alternate
         - Structured Data
       ➢ Internal CSS & JS lower in the <head>
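The ordering advice on this slide can be checked mechanically. The sketch below uses Python's standard `html.parser` to walk the `<head>` and list any critical tag that appears after the first CSS or JS asset. The names `HeadOrderChecker` and `late_critical_tags`, and the exact set of tags treated as "critical", are assumptions made for illustration based on the slide's list.

```python
from html.parser import HTMLParser
from typing import Dict, List, Optional, Tuple


def _classify(tag: str, attrs: Dict[str, Optional[str]]) -> Tuple[str, str]:
    """Return (kind, label), where kind is 'critical', 'asset' or 'other'."""
    if tag == "title":
        return "critical", "title"
    if tag == "meta":
        name = (attrs.get("name") or "").lower()
        prop = (attrs.get("property") or "").lower()
        if name == "description":
            return "critical", "meta description"
        if prop.startswith("og:"):
            return "critical", f"meta {prop}"
        return "other", ""
    if tag == "link":
        rels = (attrs.get("rel") or "").lower().split()
        if "stylesheet" in rels:
            return "asset", "stylesheet"
        if "canonical" in rels:
            return "critical", "link canonical"
        if "alternate" in rels:  # hreflang and mobile alternates
            return "critical", "link alternate"
        return "other", ""
    if tag == "script":
        if (attrs.get("type") or "").lower() == "application/ld+json":
            return "critical", "script ld+json"  # structured data
        return "asset", "script"
    if tag == "style":
        return "asset", "style"
    return "other", ""


class HeadOrderChecker(HTMLParser):
    """Collect critical tags that appear after the first CSS/JS asset."""

    def __init__(self) -> None:
        super().__init__()
        self.in_head = False
        self.seen_asset = False
        self.late_critical: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "head":
            self.in_head = True
            return
        if not self.in_head:
            return
        kind, label = _classify(tag, dict(attrs))
        if kind == "asset":
            self.seen_asset = True
        elif kind == "critical" and self.seen_asset:
            self.late_critical.append(label)

    def handle_endtag(self, tag):
        if tag == "head":
            self.in_head = False


def late_critical_tags(html: str) -> List[str]:
    checker = HeadOrderChecker()
    checker.feed(html)
    return checker.late_critical


if __name__ == "__main__":
    sample = (
        '<html><head><title>T</title>'
        '<link rel="stylesheet" href="a.css">'
        '<link rel="canonical" href="https://example.com/">'
        '</head><body></body></html>'
    )
    print(late_critical_tags(sample))
```

An empty result means every critical tag found sits above the first stylesheet, inline style or script in the `<head>`.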
  7. Optimise Extraction (3)
     • Uninterrupted article HTML;
       ➢ Article to start at <h1> headline and continue in one clean block of HTML
       ➢ Bells & whistles can be added via CSS and client-side JS
  8. Optimise Semantics
     • Well-written content;
       ➢ Easily identifiable entities and relationships
     • Semantic HTML;
       ➢ Enables Google to separate style & boilerplate from content
     • Structured Data;
       ➢ Makes page contents explicitly clear
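To make the structured data point concrete, here is a minimal sketch that builds a schema.org `NewsArticle` JSON-LD snippet. The function name `news_article_jsonld` and all sample values are hypothetical, and only a handful of properties are included; a real article would normally carry more (for example image and dateModified).

```python
import json


def news_article_jsonld(headline: str, date_published: str, author_name: str) -> str:
    """Build a small NewsArticle JSON-LD <script> tag (illustrative subset)."""
    data = {
        "@context": "https://schema.org",
        "@type": "NewsArticle",
        "headline": headline,
        "datePublished": date_published,  # ISO 8601 timestamp
        "author": {"@type": "Person", "name": author_name},
    }
    # Escape "</" so a value containing "</script>" cannot end the tag early;
    # "\/" is a valid JSON escape for "/".
    payload = json.dumps(data).replace("</", "<\\/")
    return '<script type="application/ld+json">' + payload + "</script>"


if __name__ == "__main__":
    print(news_article_jsonld("Example headline", "2021-10-26T09:00:00+00:00", "Jane Doe"))
```

Generating the snippet from the same fields that populate the visible headline and byline helps keep the markup and the page content consistent.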
  9. Core Web Vitals & AMP
     • CWV are measured from the page version a user interacts with;
       ➢ This is often the AMP version
     • AMP has a performance cheat advantage;
       ➢ Preloading & prerendering from the AMP Cache
     • AMP no longer required for Top Stories on mobile;
       ➢ Does this mean non-AMP can rank?
  10. Structured Data
      • Constantly evolving schemas
      • New rich snippets in SERPs
      https://sitebulb.com/structured-data-history/
  11. SEO Review & Monitoring
      • Little Warden https://littlewarden.com/
      • SEO Info https://weeblr.com/doc/products.seoinfo/current/overview/
      • SEOBrowse https://seobrowse.com/
  12. Barry Adams
      ➢ Doing SEO since 1998
      ➢ Specialist in News SEO & Tech SEO
      ➢ Newsletter: SEOforGoogleNews.com