(Almost) Everything You Need To Know About Crawling, Indexing, and Especially Rendering in Google

Slides from my talk at Friends of Search 2022 in Amsterdam and Brussels where I spoke about crawling, indexing, and rendering in Google's search ecosystem.

Barry Adams

June 14, 2022

Transcript

  1. @badams #FOS22
    (Almost) Everything You Need To Know About Crawling, Indexing, and Rendering in Google
    Barry Adams, June 2022
  2. Optimise Crawling
    • Serve correct HTTP status codes:
      ➢ 200 OK
      ➢ 301 / 302 Redirects
      ➢ 304 Not Modified
      ➢ 401 / 403 Permission Issues
      ➢ 404 / 410 Not Found / Gone
      ➢ 5xx Error
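To make the status code groups on this slide concrete, here is a minimal Python sketch that buckets response codes, for example from a server log. The names `classify_status` and `summarise` and the bucket labels are hypothetical; they mirror the slide's groupings and are not a Google-defined taxonomy.

```python
from collections import Counter


def classify_status(code: int) -> str:
    """Map an HTTP status code to one of the groups listed on the slide."""
    if code == 200:
        return "ok"
    if code in (301, 302):
        return "redirect"
    if code == 304:
        return "not_modified"
    if code in (401, 403):
        return "permission_issue"
    if code in (404, 410):
        return "not_found_or_gone"
    if 500 <= code <= 599:
        return "server_error"
    # Anything the slide does not mention lands here.
    return "other"


def summarise(codes):
    """Count how many responses fall into each group."""
    return Counter(classify_status(code) for code in codes)
```

Running `summarise` over the status codes served to Googlebot gives a quick view of how much crawling ends in redirects or errors.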
  3. Optimise Crawling
    • ALL resources consume crawl budget:
      ➢ Not just HTML pages
      ➢ Reduce HTTP requests per page
  4. Optimise Crawling
    • ALL resources consume crawl budget:
      ➢ Not just HTML pages
      ➢ Reduce HTTP requests per page
    • AdsBot can consume crawl budget:
      ➢ Double-check your Google Ads campaigns
  5. Optimise Crawling
    • ALL resources consume crawl budget:
      ➢ Not just HTML pages
      ➢ Reduce HTTP requests per page
    • AdsBot can consume crawl budget:
      ➢ Double-check your Google Ads campaigns
    • Link equity (PageRank) impacts crawl budget:
      ➢ More link equity = more crawl budget
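Since every resource counts against crawl budget, one way to get a feel for "HTTP requests per page" is to count the resources a page references in its HTML. The sketch below uses only Python's standard `html.parser`. The class and function names are hypothetical. It counts only statically referenced scripts, stylesheets, images, and iframes, so anything requested later by JavaScript or CSS (fonts, XHR calls) is not included.

```python
from html.parser import HTMLParser


class ResourceCounter(HTMLParser):
    """Collect resources that a page references directly in its HTML."""

    def __init__(self):
        super().__init__()
        self.resources = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag in ("script", "img", "iframe") and attributes.get("src"):
            self.resources.append((tag, attributes["src"]))
        elif tag == "link" and attributes.get("href"):
            rel = (attributes.get("rel") or "").lower().split()
            if "stylesheet" in rel:
                self.resources.append((tag, attributes["href"]))


def count_resources(html: str) -> int:
    """Return the number of statically referenced sub-resources."""
    parser = ResourceCounter()
    parser.feed(html)
    parser.close()
    return len(parser.resources)
```

Comparing this count across page templates can show where trimming requests is likely to pay off most.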
  6. Two Stages* of Indexing
    [Diagram: Crawler, Indexer, Ranker, with stages labelled 1 and 2]
    *At least – indexing is a collection of interconnected processes
  7. What happens during Rendering in your Browser?
    [Diagram: HTML, CSS, HTML Parser, CSS Parser, DOM Tree, CSSOM, Render Tree, Layout, Painting, Display]
  8. JavaScript
    [Diagram: HTML, CSS, HTML Parser, CSS Parser, DOM Tree, CSSOM, Render Tree, Layout, Painting, Display, with JavaScript appearing once]
  9. JavaScript…
    [Diagram: HTML, CSS, HTML Parser, CSS Parser, DOM Tree, CSSOM, Render Tree, Layout, Painting, Display, with JavaScript appearing twice]
  10. JavaScript…
    [Diagram: HTML, CSS, HTML Parser, CSS Parser, DOM Tree, CSSOM, Render Tree, Layout, Painting, Display, with JavaScript appearing three times]
  11. JavaScript…
    [Diagram: HTML, CSS, HTML Parser, CSS Parser, DOM Tree, CSSOM, Render Tree, Layout, Painting, Display, with JavaScript appearing four times]
  12. Google’s Rendering as part of Indexing
    [Diagram: HTML, CSS, HTML Parser, CSS Parser, DOM Tree, CSSOM, Render Tree, Layout, Painting, Display, with JavaScript appearing three times]
  13. Rendering allows Google to…
    • … load all metadata, content, and links on a webpage
    • … understand the page’s layout and content hierarchy
    • … evaluate the usability and quality of the webpage
  14. Rendering Issues
    • JavaScript inserts invalid HTML in the <head>:
      ➢ <body> tags in the <head> break Google’s processing of meta tags
  15. Rendering Issues
    • JavaScript inserts invalid HTML in the <head>:
      ➢ <body> tags in the <head> break Google’s processing of meta tags
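One way to check for the <head> problem described here is to scan the HTML for elements that do not belong in the <head>. The Python sketch below is a rough illustration with hypothetical names. It assumes you already have the rendered HTML as a string (for example, saved from a headless browser), and it treats only the HTML standard's metadata elements as valid inside <head>.

```python
from html.parser import HTMLParser

# Elements the HTML standard permits inside <head> (metadata content).
ALLOWED_IN_HEAD = {
    "base", "link", "meta", "noscript", "script", "style", "template", "title",
}


class HeadChecker(HTMLParser):
    """Record tags inside <head> that are not metadata elements."""

    def __init__(self):
        super().__init__()
        self.in_head = False
        self.offenders = []

    def handle_starttag(self, tag, attrs):
        if tag == "head":
            self.in_head = True
        elif self.in_head and tag not in ALLOWED_IN_HEAD:
            self.offenders.append(tag)

    def handle_endtag(self, tag):
        if tag == "head":
            self.in_head = False


def invalid_head_tags(html: str):
    """Return the names of body-only tags found between <head> and </head>."""
    checker = HeadChecker()
    checker.feed(html)
    checker.close()
    return checker.offenders
```

Unlike a browser, this parser does not close the <head> implicitly when it meets a body element, which is what makes the offending tags visible.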
  16. Rendering Issues
    • HTML vs Render mismatch:
      ➢ Different content in raw HTML vs fully rendered page
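A mismatch between raw and rendered HTML can be spotted by extracting the same signals from both versions and comparing them. The Python sketch below is one possible approach, not a description of how Google does it. The chosen signals (title, meta robots, canonical, link count) and all names are assumptions, and the rendered HTML is assumed to come from a headless browser or similar tool.

```python
from html.parser import HTMLParser


class SignalExtractor(HTMLParser):
    """Pull a few SEO-relevant signals out of an HTML document."""

    def __init__(self):
        super().__init__()
        self.signals = {"title": "", "robots": None, "canonical": None, "links": 0}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and (attributes.get("name") or "").lower() == "robots":
            self.signals["robots"] = attributes.get("content")
        elif tag == "link" and "canonical" in (attributes.get("rel") or "").lower().split():
            self.signals["canonical"] = attributes.get("href")
        elif tag == "a" and attributes.get("href"):
            self.signals["links"] += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.signals["title"] += data.strip()


def extract(html: str) -> dict:
    parser = SignalExtractor()
    parser.feed(html)
    parser.close()
    return parser.signals


def mismatches(raw_html: str, rendered_html: str) -> dict:
    """Return {signal: (raw_value, rendered_value)} for every signal that differs."""
    raw, rendered = extract(raw_html), extract(rendered_html)
    return {key: (raw[key], rendered[key]) for key in raw if raw[key] != rendered[key]}
```

An empty result means the checked signals agree; any entry shows the raw value next to the rendered one.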
  17. Optimise Rendering
    • Don’t rely on Google’s rendering:
      ➢ Use SSR & CDN caching
    • Minimise page weight:
      ➢ Fewer page resources = better use of crawl budget, faster load speed & CWV, less chance of rendering issues
    • Optimise your HTML source:
      ➢ Think about where <script> tags exist and what they do when their code is executed
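To start thinking about where <script> tags sit, it can help to list them with their position and loading behaviour. The Python sketch below is a heuristic with hypothetical names. It treats a classic script as parser-blocking unless it has a `src` together with `async` or `defer`, treats module scripts as deferred, and skips data blocks such as JSON-LD based on a simplified list of JavaScript types.

```python
from html.parser import HTMLParser

# Simplified assumption: script types that browsers execute as JavaScript.
JS_TYPES = {"", "text/javascript", "application/javascript", "module"}


class ScriptAudit(HTMLParser):
    """List executable <script> tags with their position and blocking behaviour."""

    def __init__(self):
        super().__init__()
        self.in_head = False
        self.scripts = []

    def handle_starttag(self, tag, attrs):
        if tag == "head":
            self.in_head = True
        elif tag == "script":
            attributes = dict(attrs)
            script_type = (attributes.get("type") or "").strip().lower()
            if script_type not in JS_TYPES:
                return  # data block (e.g. JSON-LD), never executed
            has_src = bool(attributes.get("src"))
            deferred = script_type == "module" or (
                has_src and ("async" in attributes or "defer" in attributes)
            )
            self.scripts.append({
                "src": attributes.get("src"),
                "in_head": self.in_head,
                "blocking": not deferred,
            })

    def handle_endtag(self, tag):
        if tag == "head":
            self.in_head = False


def audit_scripts(html: str):
    audit = ScriptAudit()
    audit.feed(html)
    audit.close()
    return audit.scripts
```

Blocking scripts in the <head> are usually the first candidates to review, since they run before the rest of the document is parsed.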
  18. Optimise Indexing
    • Optimise your page layouts:
      ➢ Prominent content & links are more valuable for users & Google
    • Improve internal linking:
      ➢ More PageRank = higher chance of indexing
    • Improve your content:
      ➢ Google has no obligation to index all your pages
      ➢ Make it worth Google’s while…
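To see how internal linking distributes PageRank, the textbook power-iteration algorithm can be run on a small internal link graph. The Python sketch below is the classic simplified formulation, not Google's current system. The function name, the damping factor of 0.85, and the example graph are illustrative assumptions.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank over a dict of {page: [pages it links to]}."""
    pages = set(links) | {target for targets in links.values() for target in targets}
    count = len(pages)
    rank = {page: 1.0 / count for page in pages}
    for _ in range(iterations):
        # Every page starts each round with the same small baseline share.
        new_rank = {page: (1.0 - damping) / count for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page without outgoing links spreads its rank evenly.
                for target in pages:
                    new_rank[target] += damping * rank[page] / count
        rank = new_rank
    return rank


# Hypothetical internal link graph: "orphan" links out but nothing links to it.
site = {
    "home": ["a", "b"],
    "a": ["home"],
    "b": ["home", "a"],
    "orphan": ["home"],
}
ranks = pagerank(site)
```

In this toy graph the page with no internal links pointing at it ends up with the lowest score, which matches the slide's point about internal linking.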
  19. Edge SEO
    • CDNs store cached versions of your webpages:
      ➢ Global coverage with edge nodes worldwide
      ➢ Usually also results in faster crawling and better CWV
    • You manipulate your CDN-cached pages:
      ➢ Cloud Workers enable a range of functionality
    • Googlebot crawls & indexes the changed CDN-cached pages:
      ➢ Your ‘original’ website remains unchanged
      ➢ Google only sees the changed CDN webpages
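The idea of changing the cached page at the edge while the origin stays untouched can be illustrated with a small HTML rewrite. Real edge workers run inside a CDN's own runtime, so the Python sketch below is only a stand-in for the rewrite logic. The function name, its parameters, and the example URL are hypothetical.

```python
import re
from html import escape


def rewrite_at_edge(origin_html, new_title=None, canonical_url=None):
    """Return a modified copy of the origin HTML; the input is left unchanged."""
    html = origin_html
    if new_title is not None:
        # Replace the first <title> element with the new, escaped title.
        html = re.sub(
            r"<title>.*?</title>",
            lambda match: f"<title>{escape(new_title)}</title>",
            html,
            count=1,
            flags=re.IGNORECASE | re.DOTALL,
        )
    if canonical_url is not None:
        # Insert a canonical link just before the closing </head> tag.
        tag = f'<link rel="canonical" href="{escape(canonical_url, quote=True)}">'
        html = re.sub(
            r"</head>",
            lambda match: tag + match.group(0),
            html,
            count=1,
            flags=re.IGNORECASE,
        )
    return html


origin = "<html><head><title>Old</title></head><body></body></html>"
served = rewrite_at_edge(
    origin, new_title="New & Better", canonical_url="https://example.com/page"
)
```

Here `origin` plays the role of the unchanged website and `served` is what a crawler would receive from the edge.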
  20. Why Edge SEO?
    • Faster deployment:
      ➢ Bypass your developers’ lengthy queues
      ➢ ‘Ask forgiveness, not permission’
      ➢ No reliance on client-side JavaScript
    • No CMS constraints:
      ➢ Change pages directly regardless of your CMS capabilities
    • Testing:
      ➢ Perform narrow tests on specific site sections
      ➢ A/B testing for SEO
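A/B testing for SEO generally splits pages rather than visitors, because a crawler should see the same version of a given URL on every visit. The Python sketch below shows one possible way to assign URLs to buckets deterministically with a hash. The function name, the experiment label, and the 50/50 split are illustrative assumptions.

```python
import hashlib


def assign_bucket(url_path, experiment="title-test", variant_share=0.5):
    """Deterministically place a URL path in the 'variant' or 'control' group."""
    digest = hashlib.sha256(f"{experiment}:{url_path}".encode("utf-8")).hexdigest()
    # Turn the first 32 bits of the hash into a fraction in [0, 1).
    fraction = int(digest[:8], 16) / 0x100000000
    return "variant" if fraction < variant_share else "control"
```

Because the bucket depends only on the experiment label and the path, the edge layer can apply a change to the variant pages without storing any assignment state.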
  21. Barry Adams
    ➢ Doing SEO since 1998
    ➢ Specialist in Technical SEO & News SEO
    ➢ Newsletter: SEOforGoogleNews.com