SlideShare una empresa de Scribd logo
1 de 85
Descargar para leer sin conexión
Site Search Analytics
in a Nutshell
Louis Rosenfeld
lou@louisrosenfeld.com • @louisrosenfeld
Webdagane • 10 September 2013
Hello, my name is Lou
www.louisrosenfeld.com | www.rosenfeldmedia.com
Let’s look at the data
No, let’s look at the real data
Critical elements in bold: IP address, time/date stamp, query, and # of
results:
XXX.XXX.X.104 - - [10/Jul/2006:10:25:46 -0800]
"GET /search?access=p&entqr=0
&output=xml_no_dtd&sort=date%3AD%3AL
%3Ad1&ud=1&site=AllSites&ie=UTF-8
&client=www&oe=UTF-8&proxystylesheet=www&
q=lincense+plate&ip=XXX.XXX.X.104 HTTP/1.1"
200 971 0 0.02
XXX.XXX.X.104 - - [10/Jul/2006:10:25:48 -0800]
"GET /searchaccess=p&entqr=0
&output=xml_no_dtd&sort=date%3AD%3AL
%3Ad1&ie=UTF-8&client=www&
q=license+plate&ud=1&site=AllSites
&spell=1&oe=UTF-8&proxystylesheet=www&
ip=XXX.XXX.X.104 HTTP/1.1" 200 8283 146 0.16
No, let’s look at the real data
Critical elements in bold: IP address, time/date stamp, query, and # of
results:
XXX.XXX.X.104 - - [10/Jul/2006:10:25:46 -0800]
"GET /search?access=p&entqr=0
&output=xml_no_dtd&sort=date%3AD%3AL
%3Ad1&ud=1&site=AllSites&ie=UTF-8
&client=www&oe=UTF-8&proxystylesheet=www&
q=lincense+plate&ip=XXX.XXX.X.104 HTTP/1.1"
200 971 0 0.02
XXX.XXX.X.104 - - [10/Jul/2006:10:25:48 -0800]
"GET /searchaccess=p&entqr=0
&output=xml_no_dtd&sort=date%3AD%3AL
%3Ad1&ie=UTF-8&client=www&
q=license+plate&ud=1&site=AllSites
&spell=1&oe=UTF-8&proxystylesheet=www&
ip=XXX.XXX.X.104 HTTP/1.1" 200 8283 146 0.16
What are users
searching?
No, let’s look at the real data
Critical elements in bold: IP address, time/date stamp, query, and # of
results:
XXX.XXX.X.104 - - [10/Jul/2006:10:25:46 -0800]
"GET /search?access=p&entqr=0
&output=xml_no_dtd&sort=date%3AD%3AL
%3Ad1&ud=1&site=AllSites&ie=UTF-8
&client=www&oe=UTF-8&proxystylesheet=www&
q=lincense+plate&ip=XXX.XXX.X.104 HTTP/1.1"
200 971 0 0.02
XXX.XXX.X.104 - - [10/Jul/2006:10:25:48 -0800]
"GET /searchaccess=p&entqr=0
&output=xml_no_dtd&sort=date%3AD%3AL
%3Ad1&ie=UTF-8&client=www&
q=license+plate&ud=1&site=AllSites
&spell=1&oe=UTF-8&proxystylesheet=www&
ip=XXX.XXX.X.104 HTTP/1.1" 200 8283 146 0.16
What are users
searching?
How often are
users failing?
SSA is semantically rich data, and...
SSA is semantically rich data, and...
Queries
sorted by
frequency
...what users want--in their own words
A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents
meet the needs of your most important audiences
A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents
meet the needs of your most important audiences
Not all queries are
distributed equally
A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents
meet the needs of your most important audiences
A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents
meet the needs of your most important audiences
Nor do they
diminish gradually
A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents
meet the needs of your most important audiences
A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents
meet the needs of your most important audiences
80/20 rule isn’t
quite accurate
(and the tail is quite long)
(and the tail is quite long)
(and the tail is quite long)
(and the tail is quite long)
(and the tail is quite long)
The Long Tail is
much longer than
you’d suspect
The Zipf Distribution, textually
Some things you can do with SSA
1.Make it harder to get lost in deep content
2.Make search smarter
3.Reduce jargon
4.Learn how your audiences differ
5.Know when to publish what
6.Own and enjoy your failures
7.Avoid disaster
8.Predict the future
#1
Make it harder to get lost
Start with basic SSA data:
queries and query frequency
Percent: volume
of search activity
for a unique
query during a
particular time
period
Cumulative
Percent:
running sum of
percentages
Tease out common content types
Tease out common content types
Tease out common content types
Took an hour to...
• Analyze top 50 queries (20% of all search activity)
• Ask and iterate: “what kind of content would users be
looking for when they searched these terms?”
• Add cumulative percentages
Result: prioritized list of potential content types
#1) application: 11.77%
#2) reference: 10.5%
#3) instructions: 8.6%
#4) main/navigation pages: 5.91%
#5) contact info: 5.79%
#6) news/announcements: 4.27%
Clear content types lead to
better contextual navigation
artist descriptions
album reviews
album pages
artist biosdiscography
TV listings
#2
Make search smarter
Clear content types improve
search performance
Clear content types improve
search performance
Clear content types improve
search performance
Content objects
related to products
Clear content types improve
search performance
Content objects
related to products
Raw search results
Contextualizing “advanced” features
Session data suggest
progression and context
Session data suggest
progression and context
search session patterns
1. solar energy
2. how solar energy works
Session data suggest
progression and context
search session patterns
1. solar energy
2. how solar energy works
search session patterns
1. solar energy
2. energy
Session data suggest
progression and context
search session patterns
1. solar energy
2. how solar energy works
search session patterns
1. solar energy
2. energy
search session patterns
1. solar energy
2. solar energy charts
Session data suggest
progression and context
search session patterns
1. solar energy
2. how solar energy works
search session patterns
1. solar energy
2. energy
search session patterns
1. solar energy
2. solar energy charts
search session patterns
1. solar energy
2. explain solar energy
Session data suggest
progression and context
search session patterns
1. solar energy
2. how solar energy works
search session patterns
1. solar energy
2. energy
search session patterns
1. solar energy
2. solar energy charts
search session patterns
1. solar energy
2. explain solar energy
search session patterns
1. solar energy
2. solar energy news
Recognizing
proper nouns,
dates, and
unique ID#s
#3
Reduce jargon
Saving the brand by killing jargon
at a community college
Jargon related to online education: FlexEd, COD,
College on Demand
Marketing’s solution: expensive campaign to
educate public (via posters, brochures)
The Numbers
(from SSA):
Result: content relabeled, money saved
query rank query
#22 online*
#101 COD
#259 College on Demand
#389 FlexTrack
*“online”part of 213 queries
#4
Learn how your audiences differ
Who cares about what?
Who cares about what?
Who cares about what?
Who cares about what?
Why analyze queries by audience?
Fortify your personas with data
Learn about differences between audiences
• Open University “Enquirers”: 16 of 25 queries
are for subjects not taught at OU
• Open University Students: search for course
codes, topics dealing with completing program
Determine what’s commonly important to all
audiences (these queries better work well)
#5
Know when to publish what
Interest in the
football team:
going...
Interest in the
football team:
going...
...going...
Interest in the
football team:
going...
...going...
gone
Interest in the
football team:
going...
...going...
gone
Time to
study!
Before
Tax Day
After
Tax Day
#6
Own and enjoy your failures
Failed navigation?
Examining unexpected searching
Look for places
searches happen
beyond main page
What’s going on?
• Navigational
failure?
• Content failure?
• Something else?
Where navigation is failing
(“Professional Resources” page)
Do users and
AIGA mean
different
things by
“Professional
Resources”?
Comparing what users find
and what they want
Comparing what users find
and what they want
Failed business goals?
Developing custom metrics
Netflix asks
1. Which movies most frequently searched? (query count)
2. Which of them most frequently clicked through? (MDP views)
3. Which of them least frequently added to queue? (queue adds)
Failed business goals?
Developing custom metrics
Netflix asks
1. Which movies most frequently searched? (query count)
2. Which of them most frequently clicked through? (MDP views)
3. Which of them least frequently added to queue? (queue adds)
Failed business goals?
Developing custom metrics
Netflix asks
1. Which movies most frequently searched? (query count)
2. Which of them most frequently clicked through? (MDP views)
3. Which of them least frequently added to queue? (queue adds)
#7
Avoid disasters
The new and improved search engine
that wasn’t
Vanguard used SSA to help benchmark
existing search engine’s performance and
help select new engine
New search engine “performed” poorly
But IT needed
convincing
to delay
launch
Information Architect &
Dev Team Meeting
Search seems
to have a few
problems… Nah
.
Where’s the
proof?
You can’t tell
for sure.
What to do?
Test performance of common queries
“Before and after” testing using two sets of
metrics
1.Relevance: how reliably the search engine
returns the best matches first
2.Precision: proportion of relevant results
clustered at the top of the list
Old engine (target) and new compared
Note: low relevance and high precision scores are optimal
More on Vanguard case study: http://bit.ly/D3B8c
Old engine (target) and new compared
Note: low relevance and high precision scores are optimal
More on Vanguard case study: http://bit.ly/D3B8c
uh-oh
Old engine (target) and new compared
Note: low relevance and high precision scores are optimal
More on Vanguard case study: http://bit.ly/D3B8c
uh-oh better
#8
Predict the future
Shaping the
FinancialTimes’ editorial agenda
FT compares these
• Spiking queries
for proper nouns
(i.e., people and
companies)
• Recent editorial
coverage of
people and
companies
Discrepancy?
• Breaking story?!
• Let the
editors
know!
Seed your
Can SSA bring us together?
Lou’s TABLE OF
OVERGENERALIZED
DICHOTOMIES
Web Analytics User Experience
What they analyze
Users' behaviors (what's
happening)
Users' intentions and
motives (why those things
happen)
What methods they
employ
Quantitative methods to
determine what's happening
Qualitative methods for
explaining why things
happen
What they're trying
to achieve
Helps the organization meet
goals (expressed as KPI)
Helps users achieve goals
(expressed as tasks or
topics of interest)
How they use data
Measure performance (goal-
driven analysis)
Uncover patterns and
surprises (emergent
analysis)
What kind of data
they use
Statistical data ("real" data
in large volumes, full of
errors)
Descriptive data (in small
volumes, generated in lab
environment, full of errors)
Lands End and SKUs
Lands End and SKUs
SKU: # 39072-2AH1
Use SSA to start work
on a site report card
Use SSA to start work
on a site report card
SSA helps
determine common
information needs
Read this
	

 Search Analytics forYour Site:
Conversations with
Your Customers
by Louis Rosenfeld
(Rosenfeld Media, 2011)
www.rosenfeldmedia.com
Use code
WEBDAGENE2013
for 20% off all
Rosenfeld Media books
Louis Rosenfeld
lou@louisrosenfeld.com
www.louisrosenfeld.com
www.rosenfeldmedia.com
www.slideshare.net/lrosenfeld
@louisrosenfeld
@rosenfeldmedia
Say hello

Más contenido relacionado

Similar a Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
ALTER WAY
 

Similar a Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013) (20)

Site search analytics workshop presentation
Site search analytics workshop presentationSite search analytics workshop presentation
Site search analytics workshop presentation
 
Search Analytics for Content Strategists
Search Analytics for Content StrategistsSearch Analytics for Content Strategists
Search Analytics for Content Strategists
 
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information ArchitectureUsing Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
 
Search Analytics for Fun and Profit
Search Analytics for Fun and ProfitSearch Analytics for Fun and Profit
Search Analytics for Fun and Profit
 
Search Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your CustomersSearch Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your Customers
 
Professional Information Research
Professional Information ResearchProfessional Information Research
Professional Information Research
 
Beyond User Research
Beyond User ResearchBeyond User Research
Beyond User Research
 
Personalized Search-Building a prototype to infer the user's interest
Personalized Search-Building a prototype to infer the user's interestPersonalized Search-Building a prototype to infer the user's interest
Personalized Search-Building a prototype to infer the user's interest
 
Analytics in the Cloud
Analytics in the CloudAnalytics in the Cloud
Analytics in the Cloud
 
Search Analytics For Content Strategists @CSofNYC
Search Analytics For Content Strategists @CSofNYCSearch Analytics For Content Strategists @CSofNYC
Search Analytics For Content Strategists @CSofNYC
 
Introduction to Microsoft Search #SRC101 #365EduCon 20211214
Introduction to Microsoft Search #SRC101 #365EduCon 20211214Introduction to Microsoft Search #SRC101 #365EduCon 20211214
Introduction to Microsoft Search #SRC101 #365EduCon 20211214
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
 
Creating a Single View: Overview and Analysis
Creating a Single View: Overview and AnalysisCreating a Single View: Overview and Analysis
Creating a Single View: Overview and Analysis
 
Tech Essentials - UP Edition
Tech Essentials - UP EditionTech Essentials - UP Edition
Tech Essentials - UP Edition
 
Site Search Analytics eMetrics Madrid 2009
Site Search Analytics eMetrics Madrid 2009Site Search Analytics eMetrics Madrid 2009
Site Search Analytics eMetrics Madrid 2009
 
Introduction to Search #m365chicago
Introduction to Search #m365chicagoIntroduction to Search #m365chicago
Introduction to Search #m365chicago
 
SRC101 Introduction to Search #365EDUCon
SRC101 Introduction to Search #365EDUConSRC101 Introduction to Search #365EDUCon
SRC101 Introduction to Search #365EDUCon
 
Online research and research skills
Online research and research skillsOnline research and research skills
Online research and research skills
 
Opra W2&4 Tech Essentials
Opra W2&4 Tech EssentialsOpra W2&4 Tech Essentials
Opra W2&4 Tech Essentials
 
Search Analytics: Powerful diagnostics for your site
Search Analytics:  Powerful diagnostics for your siteSearch Analytics:  Powerful diagnostics for your site
Search Analytics: Powerful diagnostics for your site
 

Más de webdagene

Más de webdagene (20)

Hverdagen til kidsa. Hva kan vi lære? Med Kristin Magnussen
Hverdagen til kidsa. Hva kan vi lære? Med Kristin MagnussenHverdagen til kidsa. Hva kan vi lære? Med Kristin Magnussen
Hverdagen til kidsa. Hva kan vi lære? Med Kristin Magnussen
 
Samfunnsoppdrag på sosialt vis
Samfunnsoppdrag på sosialt visSamfunnsoppdrag på sosialt vis
Samfunnsoppdrag på sosialt vis
 
Building bridges on diversity: What the fight for civil rights can teach us a...
Building bridges on diversity: What the fight for civil rights can teach us a...Building bridges on diversity: What the fight for civil rights can teach us a...
Building bridges on diversity: What the fight for civil rights can teach us a...
 
The voice of the future (en) – med Cheryl Platz
The voice of the future (en) – med Cheryl PlatzThe voice of the future (en) – med Cheryl Platz
The voice of the future (en) – med Cheryl Platz
 
Hvorfor personvern er viktig for kommunikasjon – med Eva Jarbekk
Hvorfor personvern er viktig for kommunikasjon – med Eva JarbekkHvorfor personvern er viktig for kommunikasjon – med Eva Jarbekk
Hvorfor personvern er viktig for kommunikasjon – med Eva Jarbekk
 
Slik kommuniserer du til hele hjernen (sv) – med Erik Modig
Slik kommuniserer du til hele hjernen (sv) – med Erik ModigSlik kommuniserer du til hele hjernen (sv) – med Erik Modig
Slik kommuniserer du til hele hjernen (sv) – med Erik Modig
 
Digital innovasjon i praksis – med Klara Vatn
Digital innovasjon i praksis – med Klara VatnDigital innovasjon i praksis – med Klara Vatn
Digital innovasjon i praksis – med Klara Vatn
 
Hvordan bruke UX i design av hardware – med Marius Aabel
Hvordan bruke UX i design av hardware – med Marius AabelHvordan bruke UX i design av hardware – med Marius Aabel
Hvordan bruke UX i design av hardware – med Marius Aabel
 
Om å bryte tabuer på Snapchat – med Tale Maria Krohn Engvik
Om å bryte tabuer på Snapchat – med Tale Maria Krohn EngvikOm å bryte tabuer på Snapchat – med Tale Maria Krohn Engvik
Om å bryte tabuer på Snapchat – med Tale Maria Krohn Engvik
 
Enkel og effektiv brukertesting – med Ida Aalen
Enkel og effektiv brukertesting – med Ida AalenEnkel og effektiv brukertesting – med Ida Aalen
Enkel og effektiv brukertesting – med Ida Aalen
 
Ten realities of the internet of things – ​Alexandra Deschamps-Sonsino
Ten realities of the internet of things –  ​Alexandra Deschamps-SonsinoTen realities of the internet of things –  ​Alexandra Deschamps-Sonsino
Ten realities of the internet of things – ​Alexandra Deschamps-Sonsino
 
Internett. Hva nå? – med Jostein Magnussen
Internett. Hva nå? – med Jostein MagnussenInternett. Hva nå? – med Jostein Magnussen
Internett. Hva nå? – med Jostein Magnussen
 
Nysgjerrighet som drivkraft – med Louise Fuchs
Nysgjerrighet som drivkraft – med Louise FuchsNysgjerrighet som drivkraft – med Louise Fuchs
Nysgjerrighet som drivkraft – med Louise Fuchs
 
Scaling service design and the challenge of problem-caring – Sanjay Poyzer
Scaling service design and the challenge of problem-caring – Sanjay PoyzerScaling service design and the challenge of problem-caring – Sanjay Poyzer
Scaling service design and the challenge of problem-caring – Sanjay Poyzer
 
5 grep for kundeorientering i en digital hverdag. – med Guro Røberg og Ove Dalen
5 grep for kundeorientering i en digital hverdag. – med Guro Røberg og Ove Dalen5 grep for kundeorientering i en digital hverdag. – med Guro Røberg og Ove Dalen
5 grep for kundeorientering i en digital hverdag. – med Guro Røberg og Ove Dalen
 
Ten realities of the internet of things - ​Alexandra Deschamps-Sonsino
Ten realities of the internet of things -  ​Alexandra Deschamps-SonsinoTen realities of the internet of things -  ​Alexandra Deschamps-Sonsino
Ten realities of the internet of things - ​Alexandra Deschamps-Sonsino
 
Hvordan menneskesentrisk teknologi endrer kundeopplevelsen – med Claude Marie...
Hvordan menneskesentrisk teknologi endrer kundeopplevelsen – med Claude Marie...Hvordan menneskesentrisk teknologi endrer kundeopplevelsen – med Claude Marie...
Hvordan menneskesentrisk teknologi endrer kundeopplevelsen – med Claude Marie...
 
Understanding humans – Leah Reich
Understanding humans – Leah ReichUnderstanding humans – Leah Reich
Understanding humans – Leah Reich
 
The dark net – Jamie Bartlett
The dark net – Jamie BartlettThe dark net – Jamie Bartlett
The dark net – Jamie Bartlett
 
UX of Story: Designing the Future of Storytelling – Mandy Mandelstein
UX of Story: Designing the Future of Storytelling  – Mandy MandelsteinUX of Story: Designing the Future of Storytelling  – Mandy Mandelstein
UX of Story: Designing the Future of Storytelling – Mandy Mandelstein
 

Último

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

  • 1. Site Search Analytics in a Nutshell Louis Rosenfeld lou@louisrosenfeld.com • @louisrosenfeld Webdagane • 10 September 2013
  • 2. Hello, my name is Lou www.louisrosenfeld.com | www.rosenfeldmedia.com
  • 3. Let’s look at the data
  • 4. No, let’s look at the real data Critical elements in bold: IP address, time/date stamp, query, and # of results: XXX.XXX.X.104 - - [10/Jul/2006:10:25:46 -0800] "GET /search?access=p&entqr=0 &output=xml_no_dtd&sort=date%3AD%3AL %3Ad1&ud=1&site=AllSites&ie=UTF-8 &client=www&oe=UTF-8&proxystylesheet=www& q=lincense+plate&ip=XXX.XXX.X.104 HTTP/1.1" 200 971 0 0.02 XXX.XXX.X.104 - - [10/Jul/2006:10:25:48 -0800] "GET /searchaccess=p&entqr=0 &output=xml_no_dtd&sort=date%3AD%3AL %3Ad1&ie=UTF-8&client=www& q=license+plate&ud=1&site=AllSites &spell=1&oe=UTF-8&proxystylesheet=www& ip=XXX.XXX.X.104 HTTP/1.1" 200 8283 146 0.16
  • 5. No, let’s look at the real data Critical elements in bold: IP address, time/date stamp, query, and # of results: XXX.XXX.X.104 - - [10/Jul/2006:10:25:46 -0800] "GET /search?access=p&entqr=0 &output=xml_no_dtd&sort=date%3AD%3AL %3Ad1&ud=1&site=AllSites&ie=UTF-8 &client=www&oe=UTF-8&proxystylesheet=www& q=lincense+plate&ip=XXX.XXX.X.104 HTTP/1.1" 200 971 0 0.02 XXX.XXX.X.104 - - [10/Jul/2006:10:25:48 -0800] "GET /searchaccess=p&entqr=0 &output=xml_no_dtd&sort=date%3AD%3AL %3Ad1&ie=UTF-8&client=www& q=license+plate&ud=1&site=AllSites &spell=1&oe=UTF-8&proxystylesheet=www& ip=XXX.XXX.X.104 HTTP/1.1" 200 8283 146 0.16 What are users searching?
  • 6. No, let’s look at the real data Critical elements in bold: IP address, time/date stamp, query, and # of results: XXX.XXX.X.104 - - [10/Jul/2006:10:25:46 -0800] "GET /search?access=p&entqr=0 &output=xml_no_dtd&sort=date%3AD%3AL %3Ad1&ud=1&site=AllSites&ie=UTF-8 &client=www&oe=UTF-8&proxystylesheet=www& q=lincense+plate&ip=XXX.XXX.X.104 HTTP/1.1" 200 971 0 0.02 XXX.XXX.X.104 - - [10/Jul/2006:10:25:48 -0800] "GET /searchaccess=p&entqr=0 &output=xml_no_dtd&sort=date%3AD%3AL %3Ad1&ie=UTF-8&client=www& q=license+plate&ud=1&site=AllSites &spell=1&oe=UTF-8&proxystylesheet=www& ip=XXX.XXX.X.104 HTTP/1.1" 200 8283 146 0.16 What are users searching? How often are users failing?
  • 7. SSA is semantically rich data, and...
  • 8. SSA is semantically rich data, and... Queries sorted by frequency
  • 9. ...what users want--in their own words
  • 10. A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences
  • 11. A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences Not all queries are distributed equally
  • 12. A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences
  • 13. A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences Nor do they diminish gradually
  • 14. A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences
  • 15. A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences 80/20 rule isn’t quite accurate
  • 16. (and the tail is quite long)
  • 17. (and the tail is quite long)
  • 18. (and the tail is quite long)
  • 19. (and the tail is quite long)
  • 20. (and the tail is quite long) The Long Tail is much longer than you’d suspect
  • 22. Some things you can do with SSA 1.Make it harder to get lost in deep content 2.Make search smarter 3.Reduce jargon 4.Learn how your audiences differ 5.Know when to publish what 6.Own and enjoy your failures 7.Avoid disaster 8.Predict the future
  • 23. #1 Make it harder to get lost
  • 24. Start with basic SSA data: queries and query frequency Percent: volume of search activity for a unique query during a particular time period Cumulative Percent: running sum of percentages
  • 25. Tease out common content types
  • 26. Tease out common content types
  • 27. Tease out common content types Took an hour to... • Analyze top 50 queries (20% of all search activity) • Ask and iterate: “what kind of content would users be looking for when they searched these terms?” • Add cumulative percentages Result: prioritized list of potential content types #1) application: 11.77% #2) reference: 10.5% #3) instructions: 8.6% #4) main/navigation pages: 5.91% #5) contact info: 5.79% #6) news/announcements: 4.27%
  • 28. Clear content types lead to better contextual navigation artist descriptions album reviews album pages artist biosdiscography TV listings
  • 30. Clear content types improve search performance
  • 31. Clear content types improve search performance
  • 32. Clear content types improve search performance Content objects related to products
  • 33. Clear content types improve search performance Content objects related to products Raw search results
  • 36. Session data suggest progression and context search session patterns 1. solar energy 2. how solar energy works
  • 37. Session data suggest progression and context search session patterns 1. solar energy 2. how solar energy works search session patterns 1. solar energy 2. energy
  • 38. Session data suggest progression and context search session patterns 1. solar energy 2. how solar energy works search session patterns 1. solar energy 2. energy search session patterns 1. solar energy 2. solar energy charts
  • 39. Session data suggest progression and context search session patterns 1. solar energy 2. how solar energy works search session patterns 1. solar energy 2. energy search session patterns 1. solar energy 2. solar energy charts search session patterns 1. solar energy 2. explain solar energy
  • 40. Session data suggest progression and context search session patterns 1. solar energy 2. how solar energy works search session patterns 1. solar energy 2. energy search session patterns 1. solar energy 2. solar energy charts search session patterns 1. solar energy 2. explain solar energy search session patterns 1. solar energy 2. solar energy news
  • 43. Saving the brand by killing jargon at a community college Jargon related to online education: FlexEd, COD, College on Demand Marketing’s solution: expensive campaign to educate public (via posters, brochures) The Numbers (from SSA): Result: content relabeled, money saved query rank query #22 online* #101 COD #259 College on Demand #389 FlexTrack *“online”part of 213 queries
  • 44. #4 Learn how your audiences differ
  • 49. Why analyze queries by audience? Fortify your personas with data Learn about differences between audiences • Open University “Enquirers”: 16 of 25 queries are for subjects not taught at OU • Open University Students: search for course codes, topics dealing with completing program Determine what’s commonly important to all audiences (these queries better work well)
  • 50. #5 Know when to publish what
  • 51.
  • 52. Interest in the football team: going...
  • 53. Interest in the football team: going... ...going...
  • 54. Interest in the football team: going... ...going... gone
  • 55. Interest in the football team: going... ...going... gone Time to study!
  • 56.
  • 58.
  • 60. #6 Own and enjoy your failures
  • 61. Failed navigation? Examining unexpected searching Look for places searches happen beyond main page What’s going on? • Navigational failure? • Content failure? • Something else?
  • 62. Where navigation is failing (“Professional Resources” page) Do users and AIGA mean different things by “Professional Resources”?
  • 63. Comparing what users find and what they want
  • 64. Comparing what users find and what they want
  • 65. Failed business goals? Developing custom metrics Netflix asks 1. Which movies most frequently searched? (query count) 2. Which of them most frequently clicked through? (MDP views) 3. Which of them least frequently added to queue? (queue adds)
  • 66. Failed business goals? Developing custom metrics Netflix asks 1. Which movies most frequently searched? (query count) 2. Which of them most frequently clicked through? (MDP views) 3. Which of them least frequently added to queue? (queue adds)
  • 67. Failed business goals? Developing custom metrics Netflix asks 1. Which movies most frequently searched? (query count) 2. Which of them most frequently clicked through? (MDP views) 3. Which of them least frequently added to queue? (queue adds)
  • 69. The new and improved search engine that wasn’t Vanguard used SSA to help benchmark existing search engine’s performance and help select new engine New search engine “performed” poorly But IT needed convincing to delay launch Information Architect & Dev Team Meeting Search seems to have a few problems… Nah . Where’s the proof? You can’t tell for sure.
  • 70. What to do? Test performance of common queries “Before and after” testing using two sets of metrics 1.Relevance: how reliably the search engine returns the best matches first 2.Precision: proportion of relevant results clustered at the top of the list
  • 71. Old engine (target) and new compared Note: low relevance and high precision scores are optimal More on Vanguard case study: http://bit.ly/D3B8c
  • 72. Old engine (target) and new compared Note: low relevance and high precision scores are optimal More on Vanguard case study: http://bit.ly/D3B8c uh-oh
  • 73. Old engine (target) and new compared Note: low relevance and high precision scores are optimal More on Vanguard case study: http://bit.ly/D3B8c uh-oh better
  • 75. Shaping the FinancialTimes’ editorial agenda FT compares these • Spiking queries for proper nouns (i.e., people and companies) • Recent editorial coverage of people and companies Discrepancy? • Breaking story?! • Let the editors know! Seed your
  • 76. Can SSA bring us together?
  • 77. Lou’s TABLE OF OVERGENERALIZED DICHOTOMIES Web Analytics User Experience What they analyze Users' behaviors (what's happening) Users' intentions and motives (why those things happen) What methods they employ Quantitative methods to determine what's happening Qualitative methods for explaining why things happen What they're trying to achieve Helps the organization meet goals (expressed as KPI) Helps users achieve goals (expressed as tasks or topics of interest) How they use data Measure performance (goal- driven analysis) Uncover patterns and surprises (emergent analysis) What kind of data they use Statistical data ("real" data in large volumes, full of errors) Descriptive data (in small volumes, generated in lab environment, full of errors)
  • 78.
  • 79.
  • 81. Lands End and SKUs SKU: # 39072-2AH1
  • 82. Use SSA to start work on a site report card
  • 83. Use SSA to start work on a site report card SSA helps determine common information needs
  • 84. Read this Search Analytics forYour Site: Conversations with Your Customers by Louis Rosenfeld (Rosenfeld Media, 2011) www.rosenfeldmedia.com Use code WEBDAGENE2013 for 20% off all Rosenfeld Media books