Detecting Good Abandonment in Mobile Search

Kyle Williams Julia Kiseleva Aidan C. Crook
Imed Zitouni Ahmed Hassan Awadallah Madian Khabsa
Pennsylvania State University
Eindhoven University of Technology
Microsoft
WWW’16, Montréal, Québec, Canada

Mobile Search
• More and more popular: 2008 → 31%, 2013 → 63%
• Mobile search differs from traditional search [Human et al., 2009]
• On mobile, users are satisfied by the SERP itself [Li et al., 2009]
• Mobile screens are much smaller
• Mobile devices are used on the go

Search Engines need to adapt
And to Evaluate!

[Figure: example mobile SERP showing a Knowledge Pane, Image Answers, and Organic Results (snippets)]


Evaluating User Satisfaction
• We need metrics to evaluate user satisfaction
• Good abandonment [Human et al., 2009]:
Mobile: 36% of abandoned queries were likely good
Desktop: 14.3%
• Traditional methods use implicit signals: clicks and dwell time

These signals don’t work for abandoned queries, which have no clicks

Our Main Research Problem
In the absence of clicks, what is the relationship
between a user's gestures and satisfaction and can we
use gestures to detect satisfaction and good
abandonment?

Research Questions
• RQ1: What SERP elements are the sources of good
abandonment in mobile search?
• RQ2: Do a user's gestures provide signals that can be used
to detect satisfaction and good abandonment in mobile
search?
• RQ3: Which user gestures provide the strongest signals for
satisfaction and good abandonment?

Research Questions
Addressed with:
• User study
• Crowdsourcing

User Study Participants
• 60 participants
• Age: 25.53 +/- 5.42 years
• Gender: 75% male, 25% female
• Language: 55% English, 45% other
• Education: 82% Computer Science, 8% Electrical Engineering, 2% Mathematics, 8% other

User Study Design
• Video Instructions (same for all participants)
• Tasks:
1. A conversion between the imperial and metric systems
2. Determining if it was a good time to phone a friend in another
part of the world
3. Finding the score from a recent game of the user’s favorite
sports team
4. Finding the user's favorite celebrity's hair color
5. Finding the CEO of a company that lost most of its value in the
last 10 years

Example task: "Find out what is the hair color of your favorite celebrity"

Questionnaire
• Were you able to complete the task?
o Yes/No
• Where did you find the answer?
o Answer Box, Image, SERP, Visited Website
• Which query led you to finding the answer?
o First, Second, Third, >= Fourth
• How satisfied are you with your experience in this task?
o 5-point Likert scale
• Did you put in a lot of effort to complete the task?

5 Tasks
~20 Minutes

User Study Data
• Total queries – 607 → 563
• Abandoned queries – 576 → 461
• Potential abandonment tasks – 274

Binary labels

Crowdsourcing Procedure
Random sample of abandoned queries from the search logs of a
personal digital assistant during one week in June 2015 (no query
suggestion)

Crowdsourcing Procedure
Example judgment task:
Query: Peniston
Previous Query: third eroics

Crowdsourcing Data
• Total number of queries – 3,895
• Judgment agreement (3 judgments per query) – 73%
• After filtering: SAT – 1,565 and DSAT – 1,924

RQ1: Reasons for Good Abandonment
[Figure: Mean of Satisfaction]

Query and Session Features
Session features:
• Session duration
• Number of queries in session
Query features:
• Index of query within session
• Time to next query
• Query length (number of words)
• Is this query a reformulation
• Was this query reformulated
Click features:
• Click count
• Number of SAT clicks (> 30 sec)
• Number of back-click clicks (< 30 sec)
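A minimal sketch of how the three click features could be computed from per-click dwell times. The function name and the input format (a list of dwell times in seconds) are illustrative assumptions, not taken from the talk; only the 30-second threshold comes from the slide.

```python
SAT_DWELL_SECONDS = 30.0  # threshold from the slide: SAT click > 30 sec


def click_features(dwell_times_seconds):
    """Compute click features for one query.

    dwell_times_seconds: hypothetical input, one dwell time per click.
    A click at exactly 30 s is counted as neither SAT nor back-click,
    because the slide uses strict inequalities on both sides.
    """
    sat_clicks = sum(1 for d in dwell_times_seconds if d > SAT_DWELL_SECONDS)
    back_clicks = sum(1 for d in dwell_times_seconds if d < SAT_DWELL_SECONDS)
    return {
        "click_count": len(dwell_times_seconds),
        "sat_clicks": sat_clicks,
        "back_clicks": back_clicks,
    }


features = click_features([45.0, 5.0, 30.0])
```

For an abandoned query the input list is empty and all three features are zero, which is why click-based signals carry no information in that case.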

Baseline 1: Click & Dwell
B1: Click & Dwell with no Reformulation
• Click with dwell time > 30 sec
• No reformulation

Baseline 2: Optimistic
B2: Optimistic
• No click
• No reformulation
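Read as decision rules, baselines B1 (a click with dwell > 30 sec and no reformulation) and B2 (no click and no reformulation) could be sketched as below. This is one interpretation of the slide diagrams, assuming that each rule labels a query as satisfied when its conditions hold; the function names and argument format are hypothetical.

```python
SAT_DWELL_SECONDS = 30.0  # dwell threshold shown on the B1 slide


def b1_click_dwell(click_dwells_seconds, was_reformulated):
    """B1: satisfied if some click has dwell > 30 s and the query was not reformulated."""
    has_sat_click = any(d > SAT_DWELL_SECONDS for d in click_dwells_seconds)
    return has_sat_click and not was_reformulated


def b2_optimistic(click_dwells_seconds, was_reformulated):
    """B2: satisfied if the query has no clicks and was not reformulated."""
    return len(click_dwells_seconds) == 0 and not was_reformulated


# An abandoned query (no clicks) that was not reformulated:
abandoned_b1 = b1_click_dwell([], was_reformulated=False)
abandoned_b2 = b2_optimistic([], was_reformulated=False)
```

Under this reading, B1 can never mark an abandoned query as satisfied, while B2 marks every non-reformulated abandoned query as satisfied.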

Baseline 3: Query-Session Model
B3: Query-Session Model: training a Random Forest on session, query and click features

Gesture Features (1)
• Swipe-related viewport features:
o up swipes and down swipes
o changes in swipe direction
o swiped distance in pixels and average swiped distance
o swipe distance divided by time spent on the SERP

• Time to focus
o Time to focus on answer
o Time to focus on organic search results
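The swipe-related features listed on this slide can be illustrated with a short sketch. The input format is an assumption made for the example: each swipe is a hypothetical `(direction, distance_px)` pair with direction `"up"` or `"down"`, in chronological order. The talk does not specify how swipes were logged.

```python
def swipe_features(swipes, time_on_serp_seconds):
    """Compute swipe-related features for one SERP view.

    swipes: list of (direction, distance_px) pairs (hypothetical log format).
    time_on_serp_seconds: total time spent on the SERP.
    """
    up_swipes = sum(1 for direction, _ in swipes if direction == "up")
    down_swipes = sum(1 for direction, _ in swipes if direction == "down")
    # A direction change is counted whenever consecutive swipes differ.
    direction_changes = sum(
        1 for (prev, _), (cur, _) in zip(swipes, swipes[1:]) if prev != cur
    )
    total_distance = sum(distance for _, distance in swipes)
    average_distance = total_distance / len(swipes) if swipes else 0.0
    distance_per_second = (
        total_distance / time_on_serp_seconds if time_on_serp_seconds > 0 else 0.0
    )
    return {
        "up_swipes": up_swipes,
        "down_swipes": down_swipes,
        "direction_changes": direction_changes,
        "total_distance_px": total_distance,
        "average_distance_px": average_distance,
        "distance_per_second": distance_per_second,
    }


example = swipe_features([("down", 300), ("down", 200), ("up", 100)], 10.0)
```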

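GF (2): Attributed Reading Time
Viewing time is attributed to an element in proportion to the share of the viewport it occupies:
• 3 seconds at 33% of the viewport → 1 s
• 6 seconds at 66% of the viewport → 4 s
• 2 seconds at 20% of the viewport → 0.4 s
Attributed reading time: 1 s + 4 s + 0.4 s = 5.4 s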
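The worked example on this slide (3 s at 33% of the viewport, 6 s at 66%, 2 s at 20%, giving 1 s + 4 s + 0.4 s = 5.4 s) suggests that each viewing interval is weighted by the fraction of the viewport the element occupies. A minimal sketch under that assumption follows; the function name and the interval format are hypothetical, and the slide's rounded 33% and 66% are treated as exact thirds.

```python
def attributed_reading_time(intervals):
    """Sum of (seconds visible) x (fraction of viewport occupied).

    intervals: iterable of (seconds, viewport_fraction) pairs,
    a hypothetical representation of viewport logs.
    """
    return sum(seconds * fraction for seconds, fraction in intervals)


# Slide example, with 33% and 66% taken as 1/3 and 2/3.
slide_intervals = [(3.0, 1 / 3), (6.0, 2 / 3), (2.0, 0.2)]
total_seconds = attributed_reading_time(slide_intervals)  # about 5.4 s
```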

GF (3): Attributed Reading Time Per Pixel
• Attributed reading time: 5.4 s
• Pixel area: 400 px × 300 px
• Attributed reading time per pixel: 5.4 s / (400 px × 300 px) = 0.045 ms/px²
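The per-pixel figure on this slide is the attributed reading time divided by the element's area: 5.4 s over 400 × 300 pixels is 0.045 ms per square pixel. A small sketch of that arithmetic, with a hypothetical function name:

```python
def reading_time_per_pixel_ms(attributed_seconds, width_px, height_px):
    """Attributed reading time normalised by element area, in ms per square pixel."""
    area = width_px * height_px
    if area <= 0:
        raise ValueError("element area must be positive")
    return attributed_seconds * 1000.0 / area


per_pixel = reading_time_per_pixel_ms(5.4, 400, 300)  # about 0.045 ms/px^2
```

Normalising by area lets a small answer box and a large one be compared on the same scale.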

Models: Detecting Good Abandonment
M1: Gesture Model:
Training Random Forest based on gesture features
M2: Gesture Model + Query and Session Features:
Training Random Forest based on gesture, query and session features

RQ2: Are gestures useful? (1)
On abandoned user study data only (148 SAT queries and 313 DSAT queries):
[Results table]

On crowdsourced data:
[Results table]

On all user study data:
[Results table]
Gesture features are useful for detecting user satisfaction in general!

Conclusions
• RQ1: What SERP elements are the sources of good abandonment in
mobile search?
Answers, images and snippets
• RQ2: Do a user's gestures provide signals that can be used to detect
satisfaction and good abandonment in mobile search?
Yes
• RQ3: Which user gestures provide the strongest signals for satisfaction
and good abandonment?
Time spent interacting with answers is positively correlated with satisfaction.
Swipe actions and time spent on the SERP are negatively correlated.

• Answers, images and snippets are
potential sources of good
abandonment
• User gestures provide useful signals to
detect good abandonment
• Time spent interacting with answers is
positively correlated with satisfaction.
Swipe actions and time spent on the SERP
are negatively correlated
Questions?
