SlideShare una empresa de Scribd logo
1 de 70
Behemoth SEOSearch Strategy For Huge Websites
@pip_net
Download Slides: clk.me/behemoth
Philipp Klöckner
Angel Investor & Advisor
@pip_net
2005 2010 2015 2019
Behemoth SEOSearch Strategy For Huge Websites
@pip_net
Most Behemoths Are Aggregation Websites with 1M+ Pages
Vertical Search Engines
• i.e. Comparison Shopping
Engines (CSEs) and Meta-
Search Engines
• Scraping and aggregating
price/fare and product
information
• Partly relying on affiliate data
and feeds
Classifieds
• Real Estate, Cars, Jobs,
Holiday Rentals, General
Classifieds
• Aggregating user-generated
or previously published
offers/ads
• Content usually expires after
certain timeframe
Marketplaces
• Aggregating supply
(product/service feeds) and
demand at the same time
• Supplies often have several
points of sale and syndicate
data
Social Networks & Forums
• Vast amounts of user
generated content
• Insufficient control over quality
and information architechture
Most of these are „Intermediaries“ doing „Search“ and implicitly violate Guidelines.
Advantages & Challenges of Aggregators
ChallengesAdvantages
• Aggregation attracts demand (users)
through superior availability,
assortment (choice) and competition
(price)
• High degree of automation
• Both market sides may create lots of
content, data and value
• Extremely scalable and capital
efficient
• Consequently build network effects
and moats over time…
• …and become hyper-profitable and
well defendable
• Automation potentially creates billions
of documents
• Quality of content/inventory is
extremely diverse
• Panda/Core algorithm sparked a
structural decline of the whole sector
• Google positions own verticals above
SERPs
• Aggregators may potentially violate
different Google Guidelines:
• Dupe Content (int/ext)
• Thin Content
• Affiliate Content
• Indexable Search
Thin
Affiliate
Duplicate
Content
Excessive
Page
Growth
Medic Panda
SERP in
SERPs
Thin &
Empty
Pages
Useful Advice
For Very Big Websites
But It‘s Has Gotten A Lot Better Recently…
“…there’s some really good stuff here. But there’s
also some really shady or iffy stuff here as well…
and we don’t know like how we should treat things
over all. That might be the case.” @JohnMu
Comparison Search has been in Structural Decline for the Past Decade
Panda 1.0
“YOU HAVE STOLEN
MY DREAMS AND MY
CHILDHOOD WITH
YOUR EMPTY
Navigating an
aggregation website
through Panda
PANDA HUGGER
Comparison Search has been in Structural Decline for the Past Decade
Panda 1.0
Well… Everyone but Two Players
Idealo.de
Ladenzeile.de
Classical Search Engine Optimisation Framework
SEO
Content Popularity Technical SEO
• Inventory
• Text
• Rich Media
• Video
• Advice
• Structured Data
• Tools & Apps
• Interactive Content
• Links
• Mentions
• Brand Search
• Comp. Brand
Search
• Direct Type-Ins
• Sharing
• All available signals
• Internal Linking
• URL Design
• Indexing
• Heading Tags
• Href Lang Setup
• Structured Data
• HTTPS/HTTP2
Search Engine Optimisation Post-Panda (2011)
SEO
Content Popularity Technical SEO
• Inventory
• Text
• Rich Media
• Video
• Advice
• Structured Data
• Tools & Apps
• Interactive Content
• …
• Links
• Mentions
• Brand Search
• Comp. Brand
Search
• Direct Type-Ins
• Sharing
• All available signals
• Internal Linking
• URL Design
• Indexing
• Heading Tags
• Href Lang Setup
• Structured Data
• HTTPS/HTTP2
User Experience
• Bounce Rate
• Back To SERP
• Dwell Time
• Retention
• Trust
• Search Journey
• Satisfaction of Intent
PageSpeed
*
* 2011 Major Google Update named after Engineer Panda Navneet
Search Engine Optimisation Today (2019)
SEO
Content Popularity Tech SEO User Experience
The Future of Search Engine Optimisation
SEO
C P T User Experience
http://clk.me/smx19
Focus Areas of Concern for Huge Websites
SEO
Content Popularity Technical SEO
• Inventory
• Text
• Rich Media
• Video
• Advice
• Structured Data
• Tools & Apps
• Interactive Content
• …
• Links
• Mentions
• Brand Search
• Comp. Brand
Search
• Direct Type-Ins
• Sharing
• All available signals
• Internal Linking
• URL Design
• Indexing
• Heading Tags
• Href Lang Setup
• Structured Data
• HTTPS/HTTP2
User Experience
• Bounce Rate
• Back To SERP
• Dwell Time
• Retention
• Trust
• Search Journey
• Satisfaction of Intent
PageSpeed
*
* 2011 Major Google Update named after Engineer Panda Navneet
Today we‘ll learn:
1. Index Management
2. Crawl Budget
Optimisation with
internal Linking
3. Making Users Happy!

4. Practise with Case
Studies
Theory: Typical Page Quality (Qp) over Number of Pages (np)
np
Qp
Homepage
Category
Category+Brand
Facetted Search
Thin Catalogue (low inventory)
Dupe Content page „no results“ page
highestlowestmediorceuseful
400.000200.000 300.000100.000
Page Quality (Qp) can
be defined as content
richness, engagement,
ultimateley how useful
the page is to the user.
But also its revenue
potential.
PROBLEM:
Since Panda (2011) this
structure has become toxic.
TIME FOR A PANDA DIET!
Theory: Typical Page Quality (Qp) over Number of Pages (np)
np
Qp
highestlowestmediorceuseful
400.000200.000 300.000100.000
Average Quality
😞
Quality Threshold (mediocre and better)
NOINDEX
(320.000)
INDEX
(80.000)
New Average Quality
QTY
INCREASE
Panda Diet:
Let‘s cut some crap!
Quality Threshold
RANKINGS
Page Quality (Qp) can
be defined as content
richness, engagement,
ultimateley how useful
the page is to the user.
But also its revenue
potential.
Identifying Low Quality Pages by Page-Type
Easy NOINDEX Targets
• „no results“ pages
• Few results pages (set item threshold)
• Single review pages, other low-quality UGC
• Bulk product pages
• Any dupe pages
• Facetted search w/o search demand
• Out of stock pages
• Expired offers/ads
• Parameters, etc…
If your site has more indexed pages than things on sale – you‘re
doing it wrong!
ME DOING THE PANDA DIET
Identifying Low Quality Pages: Data Driven Approach
Data to support page quality decisions
• Revenue distribution on landing pages (Google Analytics)
• Engagement and commercial metrics per page-type
• Conversion rate related to inventory count
• Demand-Data (Search Volume, PPC traffic, navigational traffic)
• „Indexation Gap“ (Sitemaps, Submitted vs. Indexed)
• Crawling Activity (Server Logs)
• Hint: Consider using De-Indexing sitemaps to accelerate Panda Diet
Theory: Typical Page Quality (Qp) over Number of Pages (np)
np
Qp
highestlowestmediorceuseful
400.000200.000 300.000100.000
Truth is:
This curve doesn‘t look
like this…
Page Quality (Qp) can
be defined as content
richness, engagement,
ultimateley how useful
the page is to the user.
But also its revenue
potential.
Theory: Typical Page Quality (Qp) over Number of Pages (np)
np
Qp
highestlowestmediorceuseful
400.000200.000 300.000100.000
Truth is:
This curve doesn‘t look
like this…
BUT: More like THIS!
Page Quality (Qp) can
be defined as content
richness, engagement,
ultimateley how useful
the page is to the user.
But also its revenue
potential.
Theory: ACTUAL Page Quality (Qp) over Number of Pages (np)
np
Qp
highestlowestmediorceuseful
400.000200.000 300.000100.000
Truth is:
This curve doesn‘t look
like this…
BUT: More like THIS!
ACTUALLY… like THIS!
Page Quality (Qp) can
be defined as content
richness, engagement,
ultimateley how useful
the page is to the user.
But also its revenue
potential.
Theory: ACTUAL Page Quality (Qp) over Number of Pages (np)
np
Qp
highestlowestmediorceuseful
400.000200.000 300.000100.000
Page Quality (Qp) can
be defined as content
richness, engagement,
ultimateley how useful
the page is to the user.
These pages typically…
• Never saw a visit, nor any conversions (GA Organic Langing
Pages)
• Aren‘t crawled any longer, as Google wont rank them anyway
(logs)
• Are not being considered for indexation (GSC Sitemaps Monitor)
While 100% of your revenue is here!
A Proper Cut: Extreme Panda Diet
The Result of Removing 997 out of 1,000 Pages
Dev Fuckup
How To Deal With Duplicate Content
Reliable Solutions
1. Avoid it! Internally and externally (Double Serving, Affiliate Content, Syndication)
2. Identify it! (Ryte Reports, „Quotation Searches“, HTML Improvements in GSC, etc)
3. Rewrite or enrich content
4. NOINDEX
5. Enforce Canoncial URL via 301 (lookup, fix, truncate – „Canonical for Adults“)
(http://example.com/landing/?page=2&affID=anet ==301==> https://www.example.com/landing/)
Post & Pray Solutions (these might or might not work perfectly)
1. Canonical Tag
2. GSC Parameter Handling
3. Robots.txt
Bot Recognition (Switch)
Crawling-
friendly
website
Fully
functional
website
Tip: Surf Amazon side-by-side as Googlebot vs Real User
If Noindex: Consequently „Orphanize“ Pages
Home
One Two Three
If Noindex: Consequently „Orphanize“ Pages
Home
One Two Three
NOINDEX
If Noindex: Consequently „Orphanize“ Pages
Home
One Two Three
NOINDEX
Viable solutions for link removal
• Nofollow
• Dynamic Serving („Cloaking“)
• Client-side JS
• PRG Pattern
• Forms/Buttons
Get Rid Of Pagination (Entirely)
Pagination Best Practise
• Pagination is a stupid offline concept
• More items, less pages, less problems
• Users like comprehensive pages (A/B Test)
• NOINDEX pagination if possible
• Remove links to those pages
• No pagination pages – no problem
• Make sure discovery remains intact
No one, ever…
This useless shit… Gone (for Bots at least)
Social Profile
Links
Locale Selector
Keep these on you Homepage or About Us, but not on every page.
(If they are important for the user, why are they in the footer?)
Product Detail Pages
46
Even Product/Offer Detail Pages Might Be Low-Quality
5x
?
0,1% of Pages
Case Study: How to identify the least valuable pages?
1. Out of Stock Handling: (OoS pages generate lots of html pages and poor UX)
1. If OoS for good: 301 to most similar page (parent category) or 410 if no alternative
2. (If potentially restocked keep page alive (200), offer restock alert and/or alternatives)
2. Facetted Search (Filters) & Indexable Site Search
1. Set minimum item threshold to define a „good“ search result page that doesn‘t look like a SERP
2. Build clusters where possible (typos, plurals, refined queries, entities)
3. Apply quality thresholds (Dwell time, Bounce rate, conversion) to SERP in SERP pages (indexing int. Search)
3. Pagination
1. Show more items per page (3x more items = 1/3 of pages)
2. Best solution for pagination: no pagination
4. PDP (product detail page) reduction
1. Get better at understanding shelf huggers and bestsellers using your data
2. Advanced: Predict page performance with machine learning (OEM, price, category, attributes, etc)
3. Merge variants into master products (sizes, patterns, color, etc)
5. Reviews & FAQ: Use Overview pages for reviews & questions, don‘t index single pieces of content
6. Don‘t built a self-fulfilling prophecy
1. Allow triggers for re-indexation (ppc traffic, navigational demand, etc)
Internal Search Makes Inventory Accessible
Million $
Mistake
Internal Search Makes Inventory Accessible
Put Your Site Search In A Prominent Place!
Case Study: How to identify the least valuable pages?
Pinterest: Dupe Content Clusterfuck
https://www.pinterest.com/pin/554083560398205192/
https://www.pinterest.de/pin/554083560398205192/
https://www.pinterest.at/pin/554083560398205192/
https://www.pinterest.fr/pin/554083560398205192/
https://www.pinterest.es/pin/554083560398205192/
https://www.pinterest.pt/pin/554083560398205192/
https://www.pinterest.se/pin/554083560398205192/
https://www.pinterest.dk/pin/554083560398205192/
https://www.pinterest.no/pin/554083560398205192/
https://www.pinterest.ch/pin/554083560398205192/
https://www.pinterest.ie/pin/554083560398205192/
https://www.pinterest.ch/pin/554083560398205192/
https://www.pinterest.id/pin/554083560398205192/
https://www.pinterest.it/pin/554083560398205192/
https://www.pinterest.ru/pin/554083560398205192/
+ 2 dozen more locales….
Pinterest: Internationalization
RE-PINS – Adding Insult To Injury!
https://www.pinterest.de/pin/243475923592500876/https://www.pinterest.de/pin/241013017546674029/ INDEXABLEINDEXABLE
New
URL!
Pinterest: Millions of Dead Files
Boards
Pins
Home Fave Places My Style
INDEXABLE
Quick Reminder (10 Years ago…)
2009!
Master of Soft 404s
Case Study: How to identify the least valuable pages?
1. Facebook Index Coverage: Accessibility vs. Page Quality
2. Inactive/Empty Groups, Pages, Users, Places
3. Privacy-aware users (or create incentive to share public to improve LP value)
4. Use Engagement as a quality metric for post-URLs (doesn‘t get much better than this)
5. Marketplace (See Advanced Panda Diet)
6. Expired Events
7. …
Case Study: How to identify the least valuable pages?
1. Facebook Index Coverage: Accessibility vs. Page Quality
2. Inactive/Empty Groups, Pages, Users, Places
3. Privacy-aware users (or create incentive to share public to improve LP value)
4. Use Engagement as a quality metric for post-URLs (doesn‘t get much better than this)
5. Marketplace (See Advanced Panda Diet)
6. Expired Events
7. …
63
Crawling Efficiency & Internal Linking
Links from GSC or
Crawling Tools
64
Crawling Efficiency & Internal Linking
Balance: Algorithmic Internal Linking for 1.000 Pages
1. New York
2. London
3. Paris
4. Rome
5. Amsterdam
6. Milan
7. Barcelona
8. Prague
9. Dublin
10. Berlin
1. Munich
2. Warzaw
3. Madrid
4. Copenhagen
5. Stockholm
6. San Francisco
7. Toronto
8. Hamburg
9. Rio de Janeiro
10. Cairo
1. Seattle
2. Marrakesh
3. Sofia
4. Wroclaw
5. Helsinki
6. Vancouver
7. Hanover
8. Marseille
9. Alicante
10. Edinburgh
First Tier
Top 10
This class of pages gets 1.000 Links each
Second Tier: Random
10 out of Top 100
This class of pages gets 100 Links each
Third Tier: Random
10 out of Top 1.000
This class of pages gets 10 Links each
• Shuffle container 2+3, but keep static per page
• Include relevance score/silos/topical proximity to improve UX
66
Fix Internal Linking Using Bestseller Lists
1. Standard Sorting: Popularity
2. Dyn. Bestseller Lists for Prioritization
3. „New Arrivals“ for Discovery
4. Related Products für Completeness
5. Breadcrumb for Bottom Up Prio
6. Prio über Sitemap: Ask Santa about it!
SEO EfficiencyTM
*
*
*
*
The key to extremely big websites: Trim them for Efficiency!
100x
2200x
THANK YOU!
Frequently Asked Questions
How isn‘t this cloaking?
I‘m afraid I could lose all my long-tail revenue. *mimimi*
Should I remove all those pages in one drastic move? Wouldn‘t Google see that as a weakness?
Should I really dynamically switch/flap index directives?
How does GoogleBot discover my content without pagination?
1. It doesn‘t alter user experience 2. It only makes Google‘s job easier 3. Take a look at Amazon, bro
1. There‘s usually no data confirming the long-tail 2. Rankings are usually not lost but substituted by
superior pages 3. Google actually prefers pages with good UX over the most specific result (Hummingbird,
RankBrain instead of perfect title string match)
It‘s always a good time to do the right thing!
I think you should. See
above.
If you need pagination for discovery, you‘ve got bigger fish to fry. Seriously…
What to remember…
1. We‘re doing this for 10 years (Pre-Panda) now and it has never backfired
2. This is most important if your website has more than 100.000 pages
3. Index Bloat: Millions of indexed HTML documents are not an asset but a
liability. Indexing everything is inefficient by definition.
4. 80 % (actually 95%) of your website usually is dead weight. And it‘s
pulling down your best pages.
5. Analyse your potential with an organic landing page report
6. There‘s no black and white, but a reasonable amount of grey which
should be defined by data
7. Non-transactional content is (most likely) overrated. (Inventory=Content)

Más contenido relacionado

La actualidad más candente

BrightonSEO March 2021 | Dan Taylor, Image Entity Tags
BrightonSEO March 2021 | Dan Taylor, Image Entity TagsBrightonSEO March 2021 | Dan Taylor, Image Entity Tags
BrightonSEO March 2021 | Dan Taylor, Image Entity TagsDan Taylor
 
Semantic Content Networks - Ranking Websites on Google with Semantic SEO
Semantic Content Networks - Ranking Websites on Google with Semantic SEOSemantic Content Networks - Ranking Websites on Google with Semantic SEO
Semantic Content Networks - Ranking Websites on Google with Semantic SEOKoray Tugberk GUBUR
 
SEO for Ecommerce: A Comprehensive Guide
SEO for Ecommerce: A Comprehensive GuideSEO for Ecommerce: A Comprehensive Guide
SEO for Ecommerce: A Comprehensive GuideAdam Audette
 
Understanding Semantic Search and AI Content to Drive Growth in 2023 March 2023
Understanding Semantic Search and AI Content to Drive Growth in 2023 March 2023Understanding Semantic Search and AI Content to Drive Growth in 2023 March 2023
Understanding Semantic Search and AI Content to Drive Growth in 2023 March 2023TysonStockton1
 
Automating Google Lighthouse
Automating Google LighthouseAutomating Google Lighthouse
Automating Google LighthouseHamlet Batista
 
How to Automatically Subcategorise Your Website Automatically With Python
How to Automatically Subcategorise Your Website Automatically With PythonHow to Automatically Subcategorise Your Website Automatically With Python
How to Automatically Subcategorise Your Website Automatically With Pythonsearchsolved
 
Keyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic WebKeyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic WebBill Slawski
 
Negotiating crawl budget with googlebots
Negotiating crawl budget with googlebotsNegotiating crawl budget with googlebots
Negotiating crawl budget with googlebotsDawn Anderson MSc DigM
 
Coronavirus and Future of SEO: Digital Marketing and Remote Culture
Coronavirus and Future of SEO: Digital Marketing and Remote CultureCoronavirus and Future of SEO: Digital Marketing and Remote Culture
Coronavirus and Future of SEO: Digital Marketing and Remote CultureKoray Tugberk GUBUR
 
The Reason Behind Semantic SEO: Why does Google Avoid the Word PageRank?
The Reason Behind Semantic SEO: Why does Google Avoid the Word PageRank?The Reason Behind Semantic SEO: Why does Google Avoid the Word PageRank?
The Reason Behind Semantic SEO: Why does Google Avoid the Word PageRank?Koray Tugberk GUBUR
 
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...Koray Tugberk GUBUR
 
Paige Hobart - How to do GOOD Keyword Research - Search Advertising Show 2021
Paige Hobart - How to do GOOD Keyword Research - Search Advertising Show 2021Paige Hobart - How to do GOOD Keyword Research - Search Advertising Show 2021
Paige Hobart - How to do GOOD Keyword Research - Search Advertising Show 2021Paige Hobart
 
AI-powered Semantic SEO by Koray GUBUR
AI-powered Semantic SEO by Koray GUBURAI-powered Semantic SEO by Koray GUBUR
AI-powered Semantic SEO by Koray GUBURAnton Shulke
 
Visual Search Tools and Tactics by Crystal Carter at MozCon 2022
Visual Search Tools and Tactics by Crystal Carter at MozCon 2022Visual Search Tools and Tactics by Crystal Carter at MozCon 2022
Visual Search Tools and Tactics by Crystal Carter at MozCon 2022Crystal J Carter
 
SEO & Patents Vrtualcon v. 3.0
SEO & Patents Vrtualcon v. 3.0SEO & Patents Vrtualcon v. 3.0
SEO & Patents Vrtualcon v. 3.0Bill Slawski
 
MozCon 2022: Why Real Expertise is the Most Important Ranking Factor of Them ...
MozCon 2022: Why Real Expertise is the Most Important Ranking Factor of Them ...MozCon 2022: Why Real Expertise is the Most Important Ranking Factor of Them ...
MozCon 2022: Why Real Expertise is the Most Important Ranking Factor of Them ...Lily Ray
 
Google Sheets For SEO - Tom Pool - London SEO Meetup XL
Google Sheets For SEO - Tom Pool - London SEO Meetup XLGoogle Sheets For SEO - Tom Pool - London SEO Meetup XL
Google Sheets For SEO - Tom Pool - London SEO Meetup XLTom Pool
 
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021Chris Green
 
How to Incorporate ML in your SERP Analysis, Lazarina Stoy -BrightonSEO Oct, ...
How to Incorporate ML in your SERP Analysis, Lazarina Stoy -BrightonSEO Oct, ...How to Incorporate ML in your SERP Analysis, Lazarina Stoy -BrightonSEO Oct, ...
How to Incorporate ML in your SERP Analysis, Lazarina Stoy -BrightonSEO Oct, ...LazarinaStoyanova
 

La actualidad más candente (20)

BrightonSEO March 2021 | Dan Taylor, Image Entity Tags
BrightonSEO March 2021 | Dan Taylor, Image Entity TagsBrightonSEO March 2021 | Dan Taylor, Image Entity Tags
BrightonSEO March 2021 | Dan Taylor, Image Entity Tags
 
Semantic Content Networks - Ranking Websites on Google with Semantic SEO
Semantic Content Networks - Ranking Websites on Google with Semantic SEOSemantic Content Networks - Ranking Websites on Google with Semantic SEO
Semantic Content Networks - Ranking Websites on Google with Semantic SEO
 
SEO for Ecommerce: A Comprehensive Guide
SEO for Ecommerce: A Comprehensive GuideSEO for Ecommerce: A Comprehensive Guide
SEO for Ecommerce: A Comprehensive Guide
 
Understanding Semantic Search and AI Content to Drive Growth in 2023 March 2023
Understanding Semantic Search and AI Content to Drive Growth in 2023 March 2023Understanding Semantic Search and AI Content to Drive Growth in 2023 March 2023
Understanding Semantic Search and AI Content to Drive Growth in 2023 March 2023
 
Automating Google Lighthouse
Automating Google LighthouseAutomating Google Lighthouse
Automating Google Lighthouse
 
Entity seo
Entity seoEntity seo
Entity seo
 
How to Automatically Subcategorise Your Website Automatically With Python
How to Automatically Subcategorise Your Website Automatically With PythonHow to Automatically Subcategorise Your Website Automatically With Python
How to Automatically Subcategorise Your Website Automatically With Python
 
Keyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic WebKeyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic Web
 
Negotiating crawl budget with googlebots
Negotiating crawl budget with googlebotsNegotiating crawl budget with googlebots
Negotiating crawl budget with googlebots
 
Coronavirus and Future of SEO: Digital Marketing and Remote Culture
Coronavirus and Future of SEO: Digital Marketing and Remote CultureCoronavirus and Future of SEO: Digital Marketing and Remote Culture
Coronavirus and Future of SEO: Digital Marketing and Remote Culture
 
The Reason Behind Semantic SEO: Why does Google Avoid the Word PageRank?
The Reason Behind Semantic SEO: Why does Google Avoid the Word PageRank?The Reason Behind Semantic SEO: Why does Google Avoid the Word PageRank?
The Reason Behind Semantic SEO: Why does Google Avoid the Word PageRank?
 
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
 
Paige Hobart - How to do GOOD Keyword Research - Search Advertising Show 2021
Paige Hobart - How to do GOOD Keyword Research - Search Advertising Show 2021Paige Hobart - How to do GOOD Keyword Research - Search Advertising Show 2021
Paige Hobart - How to do GOOD Keyword Research - Search Advertising Show 2021
 
AI-powered Semantic SEO by Koray GUBUR
AI-powered Semantic SEO by Koray GUBURAI-powered Semantic SEO by Koray GUBUR
AI-powered Semantic SEO by Koray GUBUR
 
Visual Search Tools and Tactics by Crystal Carter at MozCon 2022
Visual Search Tools and Tactics by Crystal Carter at MozCon 2022Visual Search Tools and Tactics by Crystal Carter at MozCon 2022
Visual Search Tools and Tactics by Crystal Carter at MozCon 2022
 
SEO & Patents Vrtualcon v. 3.0
SEO & Patents Vrtualcon v. 3.0SEO & Patents Vrtualcon v. 3.0
SEO & Patents Vrtualcon v. 3.0
 
MozCon 2022: Why Real Expertise is the Most Important Ranking Factor of Them ...
MozCon 2022: Why Real Expertise is the Most Important Ranking Factor of Them ...MozCon 2022: Why Real Expertise is the Most Important Ranking Factor of Them ...
MozCon 2022: Why Real Expertise is the Most Important Ranking Factor of Them ...
 
Google Sheets For SEO - Tom Pool - London SEO Meetup XL
Google Sheets For SEO - Tom Pool - London SEO Meetup XLGoogle Sheets For SEO - Tom Pool - London SEO Meetup XL
Google Sheets For SEO - Tom Pool - London SEO Meetup XL
 
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
 
How to Incorporate ML in your SERP Analysis, Lazarina Stoy -BrightonSEO Oct, ...
How to Incorporate ML in your SERP Analysis, Lazarina Stoy -BrightonSEO Oct, ...How to Incorporate ML in your SERP Analysis, Lazarina Stoy -BrightonSEO Oct, ...
How to Incorporate ML in your SERP Analysis, Lazarina Stoy -BrightonSEO Oct, ...
 

Similar a Behemoth SEO: Search Strategy for Huge Websites

Basic guide to SEO
Basic guide to SEOBasic guide to SEO
Basic guide to SEOShruti Goel
 
Search Engine Optimisation: A High Level View
Search Engine Optimisation: A High Level ViewSearch Engine Optimisation: A High Level View
Search Engine Optimisation: A High Level Viewjustin spratt
 
Search Engine Optimization (SEO)
Search Engine Optimization (SEO)Search Engine Optimization (SEO)
Search Engine Optimization (SEO)Christopher Mbinda
 
TCDrupal 2018: SEO! Snippets! Schema!
TCDrupal 2018: SEO! Snippets! Schema! TCDrupal 2018: SEO! Snippets! Schema!
TCDrupal 2018: SEO! Snippets! Schema! Diane Kulseth
 
Breaking News: Google Panda Update Affects 48 Million Daily Searches
Breaking News: Google Panda Update Affects 48 Million Daily SearchesBreaking News: Google Panda Update Affects 48 Million Daily Searches
Breaking News: Google Panda Update Affects 48 Million Daily SearchesDemandWave
 
Maximizing Your SEO Results Seminar 2-14-2013
Maximizing Your SEO Results Seminar 2-14-2013Maximizing Your SEO Results Seminar 2-14-2013
Maximizing Your SEO Results Seminar 2-14-2013Top Floor Technologies
 
Search Engine Optimization (SEO) 101
Search Engine Optimization (SEO) 101Search Engine Optimization (SEO) 101
Search Engine Optimization (SEO) 101pointit
 
Semantic Search
Semantic SearchSemantic Search
Semantic SearchEmily Hill
 
SEO - What is it?
SEO - What is it?SEO - What is it?
SEO - What is it?Woj Kwasi
 
Blogging for business
Blogging for businessBlogging for business
Blogging for businessEmily Hill
 
A brief history of seo
A brief history of seoA brief history of seo
A brief history of seoyousayjump
 
3 ½ Simple Ways to Improve SEO - Practical Ways to Rank Higher
3 ½ Simple Ways to Improve SEO - Practical Ways to Rank Higher3 ½ Simple Ways to Improve SEO - Practical Ways to Rank Higher
3 ½ Simple Ways to Improve SEO - Practical Ways to Rank HigherPardot
 
Seo services in lucknow king of digital marketing gaurav dubey
Seo services in lucknow king of digital marketing   gaurav dubeySeo services in lucknow king of digital marketing   gaurav dubey
Seo services in lucknow king of digital marketing gaurav dubeyKing of Digital Marketing
 
Small Business SEO Tips and Strategies For 2013 - Chaosmap.com
Small Business SEO Tips and Strategies For 2013 - Chaosmap.comSmall Business SEO Tips and Strategies For 2013 - Chaosmap.com
Small Business SEO Tips and Strategies For 2013 - Chaosmap.comJon Rognerud Chaosmap Digital
 
How to SEO a Terrific - and Profitable - User Experience
How to SEO a Terrific - and Profitable - User ExperienceHow to SEO a Terrific - and Profitable - User Experience
How to SEO a Terrific - and Profitable - User ExperienceBrightEdge
 
Innovation Melange: Introduction to SEO
Innovation Melange: Introduction to SEOInnovation Melange: Introduction to SEO
Innovation Melange: Introduction to SEODominik Berger
 

Similar a Behemoth SEO: Search Strategy for Huge Websites (20)

Basic guide to SEO
Basic guide to SEOBasic guide to SEO
Basic guide to SEO
 
Search Engine Optimisation: A High Level View
Search Engine Optimisation: A High Level ViewSearch Engine Optimisation: A High Level View
Search Engine Optimisation: A High Level View
 
Keywords
KeywordsKeywords
Keywords
 
Search Engine Optimization (SEO)
Search Engine Optimization (SEO)Search Engine Optimization (SEO)
Search Engine Optimization (SEO)
 
TCDrupal 2018: SEO! Snippets! Schema!
TCDrupal 2018: SEO! Snippets! Schema! TCDrupal 2018: SEO! Snippets! Schema!
TCDrupal 2018: SEO! Snippets! Schema!
 
Breaking News: Google Panda Update Affects 48 Million Daily Searches
Breaking News: Google Panda Update Affects 48 Million Daily SearchesBreaking News: Google Panda Update Affects 48 Million Daily Searches
Breaking News: Google Panda Update Affects 48 Million Daily Searches
 
Digital Marketing Basics
Digital Marketing BasicsDigital Marketing Basics
Digital Marketing Basics
 
Maximizing Your SEO Results Seminar 2-14-2013
Maximizing Your SEO Results Seminar 2-14-2013Maximizing Your SEO Results Seminar 2-14-2013
Maximizing Your SEO Results Seminar 2-14-2013
 
Search Engine Optimization (SEO) 101
Search Engine Optimization (SEO) 101Search Engine Optimization (SEO) 101
Search Engine Optimization (SEO) 101
 
Semantic Search
Semantic SearchSemantic Search
Semantic Search
 
SEO - What is it?
SEO - What is it?SEO - What is it?
SEO - What is it?
 
Seo Made Easy
Seo Made EasySeo Made Easy
Seo Made Easy
 
Blogging for business
Blogging for businessBlogging for business
Blogging for business
 
A brief history of seo
A brief history of seoA brief history of seo
A brief history of seo
 
3 ½ Simple Ways to Improve SEO - Practical Ways to Rank Higher
3 ½ Simple Ways to Improve SEO - Practical Ways to Rank Higher3 ½ Simple Ways to Improve SEO - Practical Ways to Rank Higher
3 ½ Simple Ways to Improve SEO - Practical Ways to Rank Higher
 
Seo services in lucknow king of digital marketing gaurav dubey
Seo services in lucknow king of digital marketing   gaurav dubeySeo services in lucknow king of digital marketing   gaurav dubey
Seo services in lucknow king of digital marketing gaurav dubey
 
Digital marketing
Digital marketingDigital marketing
Digital marketing
 
Small Business SEO Tips and Strategies For 2013 - Chaosmap.com
Small Business SEO Tips and Strategies For 2013 - Chaosmap.comSmall Business SEO Tips and Strategies For 2013 - Chaosmap.com
Small Business SEO Tips and Strategies For 2013 - Chaosmap.com
 
How to SEO a Terrific - and Profitable - User Experience
How to SEO a Terrific - and Profitable - User ExperienceHow to SEO a Terrific - and Profitable - User Experience
How to SEO a Terrific - and Profitable - User Experience
 
Innovation Melange: Introduction to SEO
Innovation Melange: Introduction to SEOInnovation Melange: Introduction to SEO
Innovation Melange: Introduction to SEO
 

Más de Philipp Klöckner

SEO in the Age of Artificial Intelligence | How AI influences Search
SEO in the Age of Artificial Intelligence | How AI influences SearchSEO in the Age of Artificial Intelligence | How AI influences Search
SEO in the Age of Artificial Intelligence | How AI influences SearchPhilipp Klöckner
 
Growth Hacking for Startups 2017 (OMX Austria)
Growth Hacking for Startups 2017 (OMX Austria)Growth Hacking for Startups 2017 (OMX Austria)
Growth Hacking for Startups 2017 (OMX Austria)Philipp Klöckner
 
B2B Marketing - Internetworld Kongress 2017 München
B2B Marketing - Internetworld Kongress 2017 MünchenB2B Marketing - Internetworld Kongress 2017 München
B2B Marketing - Internetworld Kongress 2017 MünchenPhilipp Klöckner
 
Relaunch & SEO: Best Practice, Checklists, Stolpersteine
Relaunch & SEO: Best Practice, Checklists, StolpersteineRelaunch & SEO: Best Practice, Checklists, Stolpersteine
Relaunch & SEO: Best Practice, Checklists, StolpersteinePhilipp Klöckner
 
SEO: SERPs im Wandel - SMX Munich 2017
SEO: SERPs im Wandel - SMX Munich 2017SEO: SERPs im Wandel - SMX Munich 2017
SEO: SERPs im Wandel - SMX Munich 2017Philipp Klöckner
 
Fast Growing Companies: 10 SEO Lessons Learned
Fast Growing Companies: 10 SEO Lessons LearnedFast Growing Companies: 10 SEO Lessons Learned
Fast Growing Companies: 10 SEO Lessons LearnedPhilipp Klöckner
 
Kauffman Foundation Report: Poor Long-Term Returns from Venture Capital
Kauffman Foundation Report: Poor Long-Term Returns from Venture CapitalKauffman Foundation Report: Poor Long-Term Returns from Venture Capital
Kauffman Foundation Report: Poor Long-Term Returns from Venture CapitalPhilipp Klöckner
 
Online Marketing for Startups | 2015 Philipp Klöckner @Plug and Play Accelerator
Online Marketing for Startups | 2015 Philipp Klöckner @Plug and Play AcceleratorOnline Marketing for Startups | 2015 Philipp Klöckner @Plug and Play Accelerator
Online Marketing for Startups | 2015 Philipp Klöckner @Plug and Play AcceleratorPhilipp Klöckner
 
Competitive Intelligence: Wettbewerbsbeobachtung im SEO und Online Marketing
Competitive Intelligence: Wettbewerbsbeobachtung im SEO und Online MarketingCompetitive Intelligence: Wettbewerbsbeobachtung im SEO und Online Marketing
Competitive Intelligence: Wettbewerbsbeobachtung im SEO und Online MarketingPhilipp Klöckner
 

Más de Philipp Klöckner (9)

SEO in the Age of Artificial Intelligence | How AI influences Search
SEO in the Age of Artificial Intelligence | How AI influences SearchSEO in the Age of Artificial Intelligence | How AI influences Search
SEO in the Age of Artificial Intelligence | How AI influences Search
 
Growth Hacking for Startups 2017 (OMX Austria)
Growth Hacking for Startups 2017 (OMX Austria)Growth Hacking for Startups 2017 (OMX Austria)
Growth Hacking for Startups 2017 (OMX Austria)
 
B2B Marketing - Internetworld Kongress 2017 München
B2B Marketing - Internetworld Kongress 2017 MünchenB2B Marketing - Internetworld Kongress 2017 München
B2B Marketing - Internetworld Kongress 2017 München
 
Relaunch & SEO: Best Practice, Checklists, Stolpersteine
Relaunch & SEO: Best Practice, Checklists, StolpersteineRelaunch & SEO: Best Practice, Checklists, Stolpersteine
Relaunch & SEO: Best Practice, Checklists, Stolpersteine
 
SEO: SERPs im Wandel - SMX Munich 2017
SEO: SERPs im Wandel - SMX Munich 2017SEO: SERPs im Wandel - SMX Munich 2017
SEO: SERPs im Wandel - SMX Munich 2017
 
Fast Growing Companies: 10 SEO Lessons Learned
Fast Growing Companies: 10 SEO Lessons LearnedFast Growing Companies: 10 SEO Lessons Learned
Fast Growing Companies: 10 SEO Lessons Learned
 
Kauffman Foundation Report: Poor Long-Term Returns from Venture Capital
Kauffman Foundation Report: Poor Long-Term Returns from Venture CapitalKauffman Foundation Report: Poor Long-Term Returns from Venture Capital
Kauffman Foundation Report: Poor Long-Term Returns from Venture Capital
 
Online Marketing for Startups | 2015 Philipp Klöckner @Plug and Play Accelerator
Online Marketing for Startups | 2015 Philipp Klöckner @Plug and Play AcceleratorOnline Marketing for Startups | 2015 Philipp Klöckner @Plug and Play Accelerator
Online Marketing for Startups | 2015 Philipp Klöckner @Plug and Play Accelerator
 
Competitive Intelligence: Wettbewerbsbeobachtung im SEO und Online Marketing
Competitive Intelligence: Wettbewerbsbeobachtung im SEO und Online MarketingCompetitive Intelligence: Wettbewerbsbeobachtung im SEO und Online Marketing
Competitive Intelligence: Wettbewerbsbeobachtung im SEO und Online Marketing
 

Último

Social media, ppt. Features, characteristics
Social media, ppt. Features, characteristicsSocial media, ppt. Features, characteristics
Social media, ppt. Features, characteristicswasim792942
 
Labour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptxLabour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptxelizabethella096
 
Brand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdfBrand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdftbatkhuu1
 
What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?riteshhsociall
 
Branding strategies of new company .pptx
Branding strategies of new company .pptxBranding strategies of new company .pptx
Branding strategies of new company .pptxVikasTiwari846641
 
Social Media Marketing PPT-Includes Paid media
Social Media Marketing PPT-Includes Paid mediaSocial Media Marketing PPT-Includes Paid media
Social Media Marketing PPT-Includes Paid mediaadityabelde2
 
Major SEO Trends in 2024 - Banyanbrain Digital
Major SEO Trends in 2024 - Banyanbrain DigitalMajor SEO Trends in 2024 - Banyanbrain Digital
Major SEO Trends in 2024 - Banyanbrain DigitalBanyanbrain
 
Unlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich ManuscriptUnlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich Manuscriptelizabethella096
 
How to utilize calculated properties in your HubSpot setups
How to utilize calculated properties in your HubSpot setupsHow to utilize calculated properties in your HubSpot setups
How to utilize calculated properties in your HubSpot setupsssuser4571da
 
Kraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentationKraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentationtbatkhuu1
 
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Martal Group - B2B Lead Gen Agency - Onboarding Overview
Martal Group - B2B Lead Gen Agency - Onboarding OverviewMartal Group - B2B Lead Gen Agency - Onboarding Overview
Martal Group - B2B Lead Gen Agency - Onboarding OverviewMartal Group
 
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort ServiceDelhi Call girls
 

Último (20)

Social media, ppt. Features, characteristics
Social media, ppt. Features, characteristicsSocial media, ppt. Features, characteristics
Social media, ppt. Features, characteristics
 
Labour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptxLabour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptx
 
Brand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdfBrand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdf
 
Turn Digital Reputation Threats into Offense Tactics - Daniel Lemin
Turn Digital Reputation Threats into Offense Tactics - Daniel LeminTurn Digital Reputation Threats into Offense Tactics - Daniel Lemin
Turn Digital Reputation Threats into Offense Tactics - Daniel Lemin
 
Creator Influencer Strategy Master Class - Corinne Rose Guirgis
Creator Influencer Strategy Master Class - Corinne Rose GuirgisCreator Influencer Strategy Master Class - Corinne Rose Guirgis
Creator Influencer Strategy Master Class - Corinne Rose Guirgis
 
What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?
 
The Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison KaltmanThe Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison Kaltman
 
Branding strategies of new company .pptx
Branding strategies of new company .pptxBranding strategies of new company .pptx
Branding strategies of new company .pptx
 
Social Media Marketing PPT-Includes Paid media
Social Media Marketing PPT-Includes Paid mediaSocial Media Marketing PPT-Includes Paid media
Social Media Marketing PPT-Includes Paid media
 
Major SEO Trends in 2024 - Banyanbrain Digital
Major SEO Trends in 2024 - Banyanbrain DigitalMajor SEO Trends in 2024 - Banyanbrain Digital
Major SEO Trends in 2024 - Banyanbrain Digital
 
Brand Strategy Master Class - Juntae DeLane
Brand Strategy Master Class - Juntae DeLaneBrand Strategy Master Class - Juntae DeLane
Brand Strategy Master Class - Juntae DeLane
 
No Cookies No Problem - Steve Krull, Be Found Online
No Cookies No Problem - Steve Krull, Be Found OnlineNo Cookies No Problem - Steve Krull, Be Found Online
No Cookies No Problem - Steve Krull, Be Found Online
 
Unlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich ManuscriptUnlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich Manuscript
 
Foundation First - Why Your Website and Content Matters - David Pisarek
Foundation First - Why Your Website and Content Matters - David PisarekFoundation First - Why Your Website and Content Matters - David Pisarek
Foundation First - Why Your Website and Content Matters - David Pisarek
 
How to utilize calculated properties in your HubSpot setups
How to utilize calculated properties in your HubSpot setupsHow to utilize calculated properties in your HubSpot setups
How to utilize calculated properties in your HubSpot setups
 
Kraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentationKraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentation
 
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
 
Martal Group - B2B Lead Gen Agency - Onboarding Overview
Martal Group - B2B Lead Gen Agency - Onboarding OverviewMartal Group - B2B Lead Gen Agency - Onboarding Overview
Martal Group - B2B Lead Gen Agency - Onboarding Overview
 
Podcast Marketing Master Class - Roger Nairn
Podcast Marketing Master Class - Roger NairnPodcast Marketing Master Class - Roger Nairn
Podcast Marketing Master Class - Roger Nairn
 
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
 

Behemoth SEO: Search Strategy for Huge Websites

  • 1. Behemoth SEOSearch Strategy For Huge Websites @pip_net Download Slides: clk.me/behemoth
  • 2. Philipp Klöckner Angel Investor & Advisor @pip_net 2005 2010 2015 2019
  • 3.
  • 4. Behemoth SEOSearch Strategy For Huge Websites @pip_net
  • 5. Most Behemoths Are Aggregation Websites with 1M+ Pages Vertical Search Engines • i.e. Comparison Shopping Engines (CSEs) and Meta- Search Engines • Scraping and aggregating price/fare and product information • Partly relying on affiliate data and feeds Classifieds • Real Estate, Cars, Jobs, Holiday Rentals, General Classifieds • Aggregating user-generated or previously published offers/ads • Content usually expires after certain timeframe Marketplaces • Aggregating supply (product/service feeds) and demand at the same time • Supplies often have several points of sale and syndicate data Social Networks & Forums • Vast amounts of user generated content • Insufficient control over quality and information architechture Most of these are „Intermediaries“ doing „Search“ and implicitly violate Guidelines.
  • 6. Advantages & Challenges of Aggregators ChallengesAdvantages • Aggregation attracts demand (users) through superior availability, assortment (choice) and competition (price) • High degree of automation • Both market sides may create lots of content, data and value • Extremely scalable and capital efficient • Consequently build network effects and moats over time… • …and become hyper-profitable and well defendable • Automation potentially creates billions of documents • Quality of content/inventory is extremely diverse • Panda/Core algorithm sparked a structural decline of the whole sector • Google positions own verticals above SERPs • Aggregators may potentially violate different Google Guidelines: • Dupe Content (int/ext) • Thin Content • Affiliate Content • Indexable Search
  • 8.
  • 9. Useful Advice For Very Big Websites
  • 10. But It‘s Has Gotten A Lot Better Recently… “…there’s some really good stuff here. But there’s also some really shady or iffy stuff here as well… and we don’t know like how we should treat things over all. That might be the case.” @JohnMu
  • 11. Comparison Search has been in Structural Decline for the Past Decade Panda 1.0
  • 12. “YOU HAVE STOLEN MY DREAMS AND MY CHILDHOOD WITH YOUR EMPTY
  • 13.
  • 16. Comparison Search has been in Structural Decline for the Past Decade Panda 1.0
  • 17. Well… Everyone but Two Players Idealo.de Ladenzeile.de
  • 18. Classical Search Engine Optimisation Framework SEO Content Popularity Technical SEO • Inventory • Text • Rich Media • Video • Advice • Structured Data • Tools & Apps • Interactive Content • Links • Mentions • Brand Search • Comp. Brand Search • Direct Type-Ins • Sharing • All available signals • Internal Linking • URL Design • Indexing • Heading Tags • Href Lang Setup • Structured Data • HTTPS/HTTP2
  • 19. Search Engine Optimisation Post-Panda (2011) SEO Content Popularity Technical SEO • Inventory • Text • Rich Media • Video • Advice • Structured Data • Tools & Apps • Interactive Content • … • Links • Mentions • Brand Search • Comp. Brand Search • Direct Type-Ins • Sharing • All available signals • Internal Linking • URL Design • Indexing • Heading Tags • Href Lang Setup • Structured Data • HTTPS/HTTP2 User Experience • Bounce Rate • Back To SERP • Dwell Time • Retention • Trust • Search Journey • Satisfaction of Intent PageSpeed * * 2011 Major Google Update named after Engineer Panda Navneet
  • 20. Search Engine Optimisation Today (2019) SEO Content Popularity Tech SEO User Experience
  • 21. The Future of Search Engine Optimisation SEO C P T User Experience
  • 23. Focus Areas of Concern for Huge Websites SEO Content Popularity Technical SEO • Inventory • Text • Rich Media • Video • Advice • Structured Data • Tools & Apps • Interactive Content • … • Links • Mentions • Brand Search • Comp. Brand Search • Direct Type-Ins • Sharing • All available signals • Internal Linking • URL Design • Indexing • Heading Tags • Href Lang Setup • Structured Data • HTTPS/HTTP2 User Experience • Bounce Rate • Back To SERP • Dwell Time • Retention • Trust • Search Journey • Satisfaction of Intent PageSpeed * * 2011 Major Google Update named after Engineer Panda Navneet
  • 24. Today we‘ll learn: 1. Index Management 2. Crawl Budget Optimisation with internal Linking 3. Making Users Happy!  4. Practise with Case Studies
  • 25. Theory: Typical Page Quality (Qp) over Number of Pages (np) np Qp Homepage Category Category+Brand Facetted Search Thin Catalogue (low inventory) Dupe Content page „no results“ page highestlowestmediorceuseful 400.000200.000 300.000100.000 Page Quality (Qp) can be defined as content richness, engagement, ultimateley how useful the page is to the user. But also its revenue potential. PROBLEM: Since Panda (2011) this structure has become toxic.
  • 26. TIME FOR A PANDA DIET!
  • 27. Theory: Typical Page Quality (Qp) over Number of Pages (np) np Qp highestlowestmediorceuseful 400.000200.000 300.000100.000 Average Quality 😞 Quality Threshold (mediocre and better) NOINDEX (320.000) INDEX (80.000) New Average Quality QTY INCREASE Panda Diet: Let‘s cut some crap! Quality Threshold RANKINGS Page Quality (Qp) can be defined as content richness, engagement, ultimateley how useful the page is to the user. But also its revenue potential.
  • 28. Identifying Low Quality Pages by Page-Type Easy NOINDEX Targets • „no results“ pages • Few results pages (set item threshold) • Single review pages, other low-quality UGC • Bulk product pages • Any dupe pages • Facetted search w/o search demand • Out of stock pages • Expired offers/ads • Parameters, etc… If your site has more indexed pages than things on sale – you‘re doing it wrong!
  • 29. ME DOING THE PANDA DIET
  • 30. Identifying Low Quality Pages: Data Driven Approach Data to support page quality decisions • Revenue distribution on landing pages (Google Analytics) • Engagement and commercial metrics per page-type • Conversion rate related to inventory count • Demand-Data (Search Volume, PPC traffic, navigational traffic) • „Indexation Gap“ (Sitemaps, Submitted vs. Indexed) • Crawling Activity (Server Logs) • Hint: Consider using De-Indexing sitemaps to accelerate Panda Diet
  • 31. Theory: Typical Page Quality (Qp) over Number of Pages (np) np Qp highestlowestmediorceuseful 400.000200.000 300.000100.000 Truth is: This curve doesn‘t look like this… Page Quality (Qp) can be defined as content richness, engagement, ultimateley how useful the page is to the user. But also its revenue potential.
  • 32. Theory: Typical Page Quality (Qp) over Number of Pages (np) np Qp highestlowestmediorceuseful 400.000200.000 300.000100.000 Truth is: This curve doesn‘t look like this… BUT: More like THIS! Page Quality (Qp) can be defined as content richness, engagement, ultimateley how useful the page is to the user. But also its revenue potential.
  • 33. Theory: ACTUAL Page Quality (Qp) over Number of Pages (np) np Qp highestlowestmediorceuseful 400.000200.000 300.000100.000 Truth is: This curve doesn‘t look like this… BUT: More like THIS! ACTUALLY… like THIS! Page Quality (Qp) can be defined as content richness, engagement, ultimateley how useful the page is to the user. But also its revenue potential.
  • 34. Theory: ACTUAL Page Quality (Qp) over Number of Pages (np) np Qp highestlowestmediorceuseful 400.000200.000 300.000100.000 Page Quality (Qp) can be defined as content richness, engagement, ultimateley how useful the page is to the user. These pages typically… • Never saw a visit, nor any conversions (GA Organic Langing Pages) • Aren‘t crawled any longer, as Google wont rank them anyway (logs) • Are not being considered for indexation (GSC Sitemaps Monitor) While 100% of your revenue is here!
  • 35. A Proper Cut: Extreme Panda Diet
  • 36. The Result of Removing 997 out of 1,000 Pages
  • 38. How To Deal With Duplicate Content Reliable Solutions 1. Avoid it! Internally and externally (Double Serving, Affiliate Content, Syndication) 2. Identify it! (Ryte Reports, „Quotation Searches“, HTML Improvements in GSC, etc) 3. Rewrite or enrich content 4. NOINDEX 5. Enforce Canoncial URL via 301 (lookup, fix, truncate – „Canonical for Adults“) (http://example.com/landing/?page=2&affID=anet ==301==> https://www.example.com/landing/) Post & Pray Solutions (these might or might not work perfectly) 1. Canonical Tag 2. GSC Parameter Handling 3. Robots.txt
  • 39. Bot Recognition (Switch) Crawling- friendly website Fully functional website Tip: Surf Amazon side-by-side as Googlebot vs Real User
  • 40. If Noindex: Consequently „Orphanize“ Pages Home One Two Three
  • 41. If Noindex: Consequently „Orphanize“ Pages Home One Two Three NOINDEX
  • 42. If Noindex: Consequently „Orphanize“ Pages Home One Two Three NOINDEX Viable solutions for link removal • Nofollow • Dynamic Serving („Cloaking“) • Client-side JS • PRG Pattern • Forms/Buttons
  • 43. Get Rid Of Pagination (Entirely) Pagination Best Practise • Pagination is a stupid offline concept • More items, less pages, less problems • Users like comprehensive pages (A/B Test) • NOINDEX pagination if possible • Remove links to those pages • No pagination pages – no problem • Make sure discovery remains intact No one, ever…
  • 44. This useless shit… Gone (for Bots at least) Social Profile Links Locale Selector Keep these on you Homepage or About Us, but not on every page. (If they are important for the user, why are they in the footer?)
  • 46. 46 Even Product/Offer Detail Pages Might Be Low-Quality 5x ? 0,1% of Pages
  • 47. Case Study: How to identify the least valuable pages? 1. Out of Stock Handling: (OoS pages generate lots of html pages and poor UX) 1. If OoS for good: 301 to most similar page (parent category) or 410 if no alternative 2. (If potentially restocked keep page alive (200), offer restock alert and/or alternatives) 2. Facetted Search (Filters) & Indexable Site Search 1. Set minimum item threshold to define a „good“ search result page that doesn‘t look like a SERP 2. Build clusters where possible (typos, plurals, refined queries, entities) 3. Apply quality thresholds (Dwell time, Bounce rate, conversion) to SERP in SERP pages (indexing int. Search) 3. Pagination 1. Show more items per page (3x more items = 1/3 of pages) 2. Best solution for pagination: no pagination 4. PDP (product detail page) reduction 1. Get better at understanding shelf huggers and bestsellers using your data 2. Advanced: Predict page performance with machine learning (OEM, price, category, attributes, etc) 3. Merge variants into master products (sizes, patterns, color, etc) 5. Reviews & FAQ: Use Overview pages for reviews & questions, don‘t index single pieces of content 6. Don‘t built a self-fulfilling prophecy 1. Allow triggers for re-indexation (ppc traffic, navigational demand, etc)
  • 48. Internal Search Makes Inventory Accessible Million $ Mistake
  • 49. Internal Search Makes Inventory Accessible
  • 50. Put Your Site Search In A Prominent Place!
  • 51. Case Study: How to identify the least valuable pages?
  • 52. Pinterest: Dupe Content Clusterfuck https://www.pinterest.com/pin/554083560398205192/ https://www.pinterest.de/pin/554083560398205192/ https://www.pinterest.at/pin/554083560398205192/ https://www.pinterest.fr/pin/554083560398205192/ https://www.pinterest.es/pin/554083560398205192/ https://www.pinterest.pt/pin/554083560398205192/ https://www.pinterest.se/pin/554083560398205192/ https://www.pinterest.dk/pin/554083560398205192/ https://www.pinterest.no/pin/554083560398205192/ https://www.pinterest.ch/pin/554083560398205192/ https://www.pinterest.ie/pin/554083560398205192/ https://www.pinterest.ch/pin/554083560398205192/ https://www.pinterest.id/pin/554083560398205192/ https://www.pinterest.it/pin/554083560398205192/ https://www.pinterest.ru/pin/554083560398205192/ + 2 dozen more locales….
  • 54. RE-PINS – Adding Insult To Injury! https://www.pinterest.de/pin/243475923592500876/https://www.pinterest.de/pin/241013017546674029/ INDEXABLEINDEXABLE New URL!
  • 55.
  • 56.
  • 57. Pinterest: Millions of Dead Files Boards Pins Home Fave Places My Style INDEXABLE
  • 58. Quick Reminder (10 Years ago…) 2009!
  • 60. Case Study: How to identify the least valuable pages? 1. Facebook Index Coverage: Accessibility vs. Page Quality 2. Inactive/Empty Groups, Pages, Users, Places 3. Privacy-aware users (or create incentive to share public to improve LP value) 4. Use Engagement as a quality metric for post-URLs (doesn‘t get much better than this) 5. Marketplace (See Advanced Panda Diet) 6. Expired Events 7. …
  • 61.
  • 62. Case Study: How to identify the least valuable pages? 1. Facebook Index Coverage: Accessibility vs. Page Quality 2. Inactive/Empty Groups, Pages, Users, Places 3. Privacy-aware users (or create incentive to share public to improve LP value) 4. Use Engagement as a quality metric for post-URLs (doesn‘t get much better than this) 5. Marketplace (See Advanced Panda Diet) 6. Expired Events 7. …
  • 63. 63 Crawling Efficiency & Internal Linking Links from GSC or Crawling Tools
  • 64. 64 Crawling Efficiency & Internal Linking
  • 65. Balance: Algorithmic Internal Linking for 1.000 Pages 1. New York 2. London 3. Paris 4. Rome 5. Amsterdam 6. Milan 7. Barcelona 8. Prague 9. Dublin 10. Berlin 1. Munich 2. Warzaw 3. Madrid 4. Copenhagen 5. Stockholm 6. San Francisco 7. Toronto 8. Hamburg 9. Rio de Janeiro 10. Cairo 1. Seattle 2. Marrakesh 3. Sofia 4. Wroclaw 5. Helsinki 6. Vancouver 7. Hanover 8. Marseille 9. Alicante 10. Edinburgh First Tier Top 10 This class of pages gets 1.000 Links each Second Tier: Random 10 out of Top 100 This class of pages gets 100 Links each Third Tier: Random 10 out of Top 1.000 This class of pages gets 10 Links each • Shuffle container 2+3, but keep static per page • Include relevance score/silos/topical proximity to improve UX
  • 66. 66 Fix Internal Linking Using Bestseller Lists 1. Standard Sorting: Popularity 2. Dyn. Bestseller Lists for Prioritization 3. „New Arrivals“ for Discovery 4. Related Products für Completeness 5. Breadcrumb for Bottom Up Prio 6. Prio über Sitemap: Ask Santa about it!
  • 67. SEO EfficiencyTM * * * * The key to extremely big websites: Trim them for Efficiency! 100x 2200x
  • 69. Frequently Asked Questions How isn‘t this cloaking? I‘m afraid I could lose all my long-tail revenue. *mimimi* Should I remove all those pages in one drastic move? Wouldn‘t Google see that as a weakness? Should I really dynamically switch/flap index directives? How does GoogleBot discover my content without pagination? 1. It doesn‘t alter user experience 2. It only makes Google‘s job easier 3. Take a look at Amazon, bro 1. There‘s usually no data confirming the long-tail 2. Rankings are usually not lost but substituted by superior pages 3. Google actually prefers pages with good UX over the most specific result (Hummingbird, RankBrain instead of perfect title string match) It‘s always a good time to do the right thing! I think you should. See above. If you need pagination for discovery, you‘ve got bigger fish to fry. Seriously…
  • 70. What to remember… 1. We‘re doing this for 10 years (Pre-Panda) now and it has never backfired 2. This is most important if your website has more than 100.000 pages 3. Index Bloat: Millions of indexed HTML documents are not an asset but a liability. Indexing everything is inefficient by definition. 4. 80 % (actually 95%) of your website usually is dead weight. And it‘s pulling down your best pages. 5. Analyse your potential with an organic landing page report 6. There‘s no black and white, but a reasonable amount of grey which should be defined by data 7. Non-transactional content is (most likely) overrated. (Inventory=Content)

Notas del editor

  1. Pruning a tree, window-dressing, very fair
  2. Users look for overview, not one random page