A Multi-armed Bandit Approach to Online Spatial Task Assignment

Umair ul Hassan, Edward Curry
A Multi-armed Bandit Approach to
Online Spatial Task Assignment
11th IEEE International Conference on Ubiquitous Intelligence and Computing
December 9-12, 2014
Ayodya Resort, Bali, Indonesia

Agenda
 Spatial Crowdsourcing
 Task Assignment
 Multi-armed Bandit
 SpatialUCB
 Experiments & Results
 Conclusion
 Q&A
4 July 20162

Spatial Crowdsourcing
Tasks require physical travel to a location.
4 July 20163
Please provide recent photos of this place.

4 July 20164
Overview of interacting agents
WorkersRequesters
Spatial Crowdsourcing Platform
Worker
Assignment & Filtering
Response
Aggregation & Filtering
submit tasks
receive results
feedback
assign tasks
submit responses

Task Assignment
Pull Method vs Push Method
4 July 20165
Platform
Submit tasks
Worker selects tasks
Platform
Submit tasks
Server assigns tasks
Our focusStarvation & Search Friction

Assumptions
A1) Workers do not actively visit the platform to seek tasks.
A2) Tasks arrivals are dynamic: task arrival time is unknown
A3) The outcome of an assignment is stochastic: worker may
accept or reject an assigned task
4 July 20166
Please provide recent photos of this place.

IMIRT Framework
Intelligent Models for Iterative Routing of Tasks (A1)
4 July 20167
IMIRT Framework
Router
ProfilerWorker
Models
Interface
assignment
update
outcomeexpectation
wj ti
wj-1
ti+1
wj+1
ti+2
Find The Best
Assignment

Offline Task Assignment
4 July 20168
Assignment
indicator
Acceptance
inidcator
Optimization
Constraints
Optimization
Objective
Number of
tasks
Number of
workers

Online Task Assignment
 Either tasks or workers arrive dynamically (A2)
 Existing research
4 July 20169

Online Task Assignment
 Assignment under uncertainty (A3)
Workers may accept or reject an assigned task
Exploration-Exploitation Trade-off
4 July 201610
Select worker to learn
about acceptance
behaviour Select
worker that has highest
expectation of successful outcome

Multi-armed Bandit
Which arm should be played to maximize reward?
4 July 201611
antigavin@flickr
“Bandit problems embody in
essential form a conflict evident in all
human action: choosing actions
which yield immediate reward vs.
choosing actions (e.g. acquiring
information or preparing the ground)
whose benefit will come only later.”
— P. Whittle (1980)

Multi-armed Bandit
Application of multi-armed bandit model
4 July 201612
Clinical
Trials
Ad
Placement
Adaptive
Routing
Stock
Investment

Multi-armed Bandit
 Assignment under uncertainty (R3)
Workers may accept or reject an assigned task
4 July 201613
Exploration
Exploitation
Trade-off
antigavin@flickr
What if we knew that reward of
each machine is dependent on
the time of day
(Contextual Bandit)

SpatialUCB
 Multi-armed bandit approach to task assignment
 Optimistic assignment under uncertainty
 Exploits relationship between spatial context and task
acceptance
4 July 201614
Start
Wait for new task
Observe spatial
features of task
and all workers
Calculate expectation of
success for all workers
Assign worker
with highest
expectation
Observe
outcome for the
assignment
Use ridge regression to update model
coefficients for assigned worker

Gowalla Dataset
 Location based Social Network
 User = Worker
 Check-in Spot = Spatial Task
 Highlight Spot = Non Spatial Task
Characteristics of selected 90 workers
4 July 201615
Users 9,183
Spots 30,367
Highlights 2,767
Check-ins 357,753

Experiment 1
 Simulate a worker as Binomial stochastic process
 Simulate 90 workers and 5,000 tasks
Probability of success based on Gowalla dataset
ASR (Assignment Success Rate)
 Context free algorithms
4 July 201616

Experiment 2
 Assignment with spatial context
ASR (Assignment Success Rate)
ATD (Average Travel Distance)
4 July 201617
5k tuning tasks 26k testing tasks

Conclusion
 Multi-armed bandit is an appropriate tool for modelling the
task assignment problem in spatial crowdsourcing
 SpatialUCB performs better for learning task acceptance
behaviour based on the spatial contextual information
 We plan to extend SpatialUCB to combinatorial
assignments
4 July 201618

Thank You
Umair ul Hassan
umair.ulhassan@insight-centre.org
www.insight-centre.org

A Multi-armed Bandit Approach to Online Spatial Task Assignment

Recomendados

Recomendados

Más contenido relacionado

Similar a A Multi-armed Bandit Approach to Online Spatial Task Assignment

Similar a A Multi-armed Bandit Approach to Online Spatial Task Assignment (8)

Más de Umair ul Hassan

Más de Umair ul Hassan (8)

Último

Último (20)

A Multi-armed Bandit Approach to Online Spatial Task Assignment

Notas del editor