In this presentation, I describe a system that uses crowdsourcing, computer vision, machine learning, and Google Street View to collect sidewalk accessibility data.
7. Characterizing Sidewalk
Accessibility at Scale
using Google Street View, Crowdsourcing, and
Automated Methods
Kotaro Hara | Project Sidewalk (PI: Prof. Jon Froehlich)
makeability lab
16. The lack of street-level
accessibility information can
have a significant impact on
the independence and
mobility of citizens
cf. Nuernberger, 2008; Thapar et al., 2004
27. Amazon Mechanical Turk is an online labor market
where you can hire workers to complete small tasks
28. [Screenshot: browsing available tasks on Amazon Mechanical Turk]
29. Task: Find the company name from an email domain
$0.02 per task
Task interface
30. [Screenshot of the Mechanical Turk task interface]
Task: University of Maryland: Help make our sidewalks more accessible for wheelchair users with Google Maps
Requester: Kotaro Hara | Time allotted: 3 hours | Timer: 00:07:00 of 3 hours
Crowdsourcing Data Collection
Hara K., Le V., and Froehlich J.E. [ASSETS 2012, CHI 2013]
Crowdsourcing | Image Labeling
42. Computer vision + verification is
cheaper but less accurate
Manual labeling is accurate,
but labor intensive
Design Principles
43. Computer vision + verification is
cheaper but less accurate
(not true for easy tasks)
Manual labeling is accurate,
but labor intensive
Design Principles
63. Washington D.C. | Baltimore | Los Angeles | Saskatoon
Total Area: 11.3 km²
Intersections: 1,086
Curb Ramps: 2,877
Missing Curb Ramps: 647
Avg. GSV Data Age: 2.2 yr*
* At the time of downloading data in summer 2013
Scraper
64. How well does GSV data reflect
the current state of the physical
world?
75. Deformable Part Models
Felzenszwalb et al. 2008
Automatic Curb Ramp Detection
http://www.cs.berkeley.edu/~rbg/latent/
Root filter | Part filters | Displacement cost
76. Automatic Curb Ramp Detection
Multiple redundant
detection boxes
Detected Labels
Stage 1: Deformable Part Model
Correct: 1 | False Positives: 12 | Misses: 0
77. Automatic Curb Ramp Detection
Curb ramps shouldn’t be
in the sky or on roofs
Correct: 1 | False Positives: 12 | Misses: 0
Detected Labels
Stage 1: Deformable Part Model
79. Automatic Curb Ramp Detection
Detected Labels
Stage 3: SVM-based Refinement
Filter out labels based on
their size, color, and position.
Correct: 1 | False Positives: 5 | Misses: 0
89. Occlusion | Illumination | Scale | Viewpoint Variation | Structures Similar to Curb Ramps | Curb Ramp Design Variation
Automatic Curb Ramp Detection
CURB RAMP DETECTION IS A HARD PROBLEM
92. Automatic Task Allocation | Features to Assess Scene Difficulty for CV
The number of streets connected at an intersection
Depth information to estimate road width and variance in distance
Top-down images to assess the complexity of an intersection
The number of detections and their confidence values
93. Automatic Task Allocation | Features to Assess Scene Difficulty for CV
The number of streets from metadata
Depth information to estimate road width and variance in distance
Top-down images to assess the complexity of an intersection
The number of detections and their confidence values
94. Depth information to estimate road width and variance in distance
Automatic Task Allocation | Features to Assess Scene Difficulty for CV
95. Automatic Task Allocation | Features to Assess Scene Difficulty for CV
The number of streets from metadata
Depth information to estimate road width and variance in distance
Top-down images to assess the complexity of an intersection
The number of detections and their confidence values
96. Google Maps | Styled Maps
Top-down images to assess the complexity of an intersection
Automatic Task Allocation | Features to Assess Scene Difficulty for CV
97. Automatic Task Allocation | Features to Assess Scene Difficulty for CV
The number of streets from metadata
Depth information to estimate road width and variance in distance
Top-down images to assess the complexity of an intersection
CV output: the number of detections and their confidence values
107. Recruited workers from Amazon Mechanical Turk
Used 1,046 GSV images (40 used for gold-standard insertion)
Evaluation
STUDY METHOD: APPROACH
108. RESULTS
Evaluation
                         Labeling Tasks    Verification Tasks
# of distinct turkers:   242               161
# of HITs completed:     1,270             582
# of tasks completed:    6,350             4,820
# of tasks allocated:    769               277
We used Monte Carlo simulations for evaluation
109. Evaluation | Labeling Accuracy and Time Cost
ACCURACY and COST (TIME)
[Charts: accuracy measures (%) and task completion time per scene (s) for Manual Labeling, CV and Manual Verification, and Tohme (遠目, "remote eye"). Error bars are standard deviations.]
Precision: Manual Labeling 84% | CV and Manual Verification 68% | Tohme 83%
Recall: Manual Labeling 88% | CV and Manual Verification 58% | Tohme 86%
F-measure: Manual Labeling 86% | CV and Manual Verification 63% | Tohme 84%
Task completion time per scene: Manual Labeling 94 s | CV and Manual Verification 42 s | Tohme 81 s
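For reference, the three accuracy measures charted above are the standard detection metrics; a minimal sketch of how they are computed from true positive (tp), false positive (fp), and false negative (fn) counts (the example counts are hypothetical):

```python
def precision_recall_f1(tp, fp, fn):
    """Standard detection accuracy measures shown in the charts above."""
    precision = tp / (tp + fp)   # fraction of reported curb ramps that are real
    recall = tp / (tp + fn)      # fraction of real curb ramps that were found
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return precision, recall, f1

# Hypothetical counts giving ~83% precision and 86% recall (F ≈ 84%, like Tohme):
# print(precision_recall_f1(tp=86, fp=18, fn=14))
```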
110. Evaluation | Labeling Accuracy and Time Cost
[Same charts as the previous slide, highlighting cost: Manual Labeling 94 s, CV and Manual Verification 42 s, Tohme (遠目, "remote eye") 81 s per scene. Error bars are standard deviations.]
Tohme: 13% reduction in cost relative to manual labeling, at comparable accuracy.
ACCURACY and COST (TIME)
111. Evaluation | Smart Task Allocator
[Diagram: svControl (Automatic Task Allocation) routes each scene to svVerify (Manual Label Verification) or svLabel (Manual Labeling).]
~80% of svVerify tasks were correctly routed
~50% of svLabel tasks were correctly routed
112. Evaluation | Smart Task Allocator
[Diagram: svControl (Automatic Task Allocation) routes each scene to svVerify (Manual Label Verification) or svLabel (Manual Labeling).]
If svControl worked perfectly, Tohme's cost would drop to 28% of a manual labeling approach alone.
128. 8,209 intersections in DC
BACK OF THE ENVELOPE CALCULATIONS
Manually labeling GSV with our custom interfaces
would take 214 hours
With Tohme, this drops to 184 hours
We think we can do better
129. makeability lab
Smart task management can improve the efficiency of a
semi-automatic crowd-powered system
Takeaway
We can combine crowdsourcing and automated
methods to collect accessibility data from Street View
130. FUTURE WORK: COMPUTER VISION
Context integration & scene understanding
3D-data integration
Improve training & sample size
Mensuration
This work is supported by a Faculty Research Award
makeability lab
133. THE CROWD-POWERED STREET VIEW ACCESSIBILITY TEAM!
Kotaro Hara Jin Sun Victoria Le Robert Moore Sean Pannella
Jonah Chazan David Jacobs Jon Froehlich
Zachary Lawrence
Graduate Student
Undergraduate
High School
Professor
Thanks!
@kotarohara_en | kotaro@cs.umd.edu
Editor's notes
My name is Kotaro Hara. Today, I will talk about how we can use automated methods and crowdsourcing to collect accessibility information about cities
I want to tell you a story…
Imagine that you and a friend are on a walk. You’re both somewhat unfamiliar with the area.
Suddenly, in the middle of the sidewalk, you encounter a fire hydrant
-- Image Reference
http://www.iconsdb.com/black-icons/fire-hydrant-icon.html
In this case, you manage to go around because there is a driveway, but they are temporarily forced onto the street, which is dangerous.
Now, you get to the end of the block and discover that there is no curb cut. You are forced to turn around and find another way.
The problem is not only the sidewalks remain inaccessible, but there are currently few mechanisms to find out about the accessibility of a route in advance
-- Quote from paper
The problem is not just that sidewalk accessibility fundamentally affects where and how people travel in cities but also that there are few, if any, mechanisms to determine accessible areas of a city a priori
-- What Jon wrote
The problem is not just that there are inaccessible areas of cities but that there are currently few methods for us to determine them a priori
According to the most recent US Census (2010), roughly 30.6 million adults have physical disabilities that affect their ambulatory activities [128].
-----
Flickr: 3627562740_c74f7bfb82_o.jpg
Of these, nearly half report using an assistive aid such as a wheelchair (3.6 million) or a cane, crutches, or walker (11.6 million)
According to Cabinet Office data, the total in Japan is 3.663 million people.
----
Flickr: 14816521847_5c3c7af348_o.jpg
Despite comprehensive civil rights legislation for Americans with disabilities (e.g., [9,75]), many city streets, sidewalks, and businesses in the US remain inaccessible [90,96,120].
The lack of street-level accessibility information can have a significant negative impact on the independence and mobility of citizens [99,120].
99: Nuernberger, A. (2008). Presenting accessibility to mobility-impaired travelers. Doctoral dissertation, University of California, Berkeley.
120: Thapar, N., Warner, G., Drainoni, M., Williams, S., Ditchfield, H., Wierbicky, J., & Nesathurai, S. (2004). A pilot study of functional access to public buildings and facilities for persons with impairments. Disability and Rehabilitation, 26(5), 280-289.
So we would like to develop technologies such as an accessibility-aware navigation system. It would show an accessible path instead of the shortest path, based on your mobility level.
We also want to build an application that lets you visualize the accessibility of a city, so you can quickly compare which areas are more accessible. We need geo-data to build these.
To do this, we need a lot of data about accessibility. Our group’s goal is to collect and deliver street-level accessibility data for every city in the world.
-- Image
http://www.flickr.com/photos/rgb12/6225459696/lightbox/
Traditionally, information about a neighborhood has been gathered by volunteers or government organizations through physical audits.
However, this is time-consuming and expensive.
Mobile crowdsourcing such as SeeClickFix.com
And NYC 311 allows citizens to report neighborhood sidewalk accessibility issues.
But this requires people to be on-site
Our approach is different though complementary: use Google Street View as a massive data source…
Today, I am going to talk about how we can use crowdsourcing and automated methods to collect accessibility data from Google Street View.
Amazon Mechanical Turk is an online labor market where you can hire workers to complete small tasks.
For example, if you are a worker, you can go to Amazon’s website to browse through available tasks
Choose one of the tasks. For example, this task is about finding the company name from an email domain. You can get 2 cents for completing a task through this web interface.
We recruit crowd workers from Amazon Mechanical Turk. For those of you who don't know Mechanical Turk, it is an online labor market where you can work or recruit workers to perform small tasks over the Internet.
Using this platform, we recruit workers to work on our task. We developed this interface where you can see Google Street View imagery and label, in this case, an obstacle in the path.
We showed that this is an effective method, but it is labor intensive.
To more efficiently find accessibility attributes, we turned to computer vision, which is used for applications like face detection.
Different attributes affect sidewalk accessibility for people with mobility impairment. For example, presence of curb ramps, surface conditions, obstacles, steep gradients, and more.
And removed even more errors
Computer vision is not perfect. There are false positives, which can be fixed by verification, and it misses curb ramps, which humans need to label.
Here you see detected curb ramps as green boxes on top of the Street View image (advance to the next slide to play).
The question is: can we achieve the same or better accuracy at a lower time cost compared to manual labeling?
5 min
To do this, we developed a system called Tohme. It combines the two approaches.
This is the overview of the system. A custom web scraper collects a dataset including Street View images. A computer vision-based detector finds curb ramps.
So we designed a smart task allocator.
It routes detection results to a cheap manual verification workflow to remove false positive errors. However, since our verification task does not allow workers to fix false negatives, curb ramps that are missed never get detected.
So if the allocator predicts false negatives, it passes tasks to the manual labeling workflow.
We get a Street View image.
We run a detector
Then extract features.
Our task allocator predicts the presence of false negatives. If it predicts no false negatives, it allocates the task to the verification workflow.
Another example.
Run a detector
Extract features.
If the allocator predicts false negative, then it passes the task to the labeling workflow.
Let’s first talk about our web scraper
We scraped GSV panoramas and metadata from the intersections, along with their accompanying 3D point cloud data and top-down Google Maps imagery. These datasets are used to train the automatic task allocator.
_AUz5cV_ofocoDbesxY3Kw
-dlUzxwCI_-k5RbGw6IlEg
0C6PG3Zpuwz11kZKfG_vUg
D-2VNbhqOqYAKTU0hFneIw
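The custom scraper itself isn't detailed in these slides; as a rough illustration under modern assumptions, the Street View Static API can fetch crops of a panorama by ID (the pano ID and API key below are placeholders):

```python
import requests

# Hedged sketch: fetch crops of a Street View panorama via the modern
# Street View Static API, a stand-in for the custom 2013 scraper.
STATIC_API = "https://maps.googleapis.com/maps/api/streetview"

def fetch_panorama_crop(pano_id, heading, api_key, size="640x640"):
    """Download one JPEG crop of a panorama at a given camera heading."""
    params = {
        "pano": pano_id,     # panorama ID, e.g. from scraped metadata
        "heading": heading,  # camera heading in degrees [0, 360)
        "pitch": -10,        # look slightly downward toward the sidewalk
        "fov": 90,           # horizontal field of view in degrees
        "size": size,
        "key": api_key,      # placeholder API key
    }
    resp = requests.get(STATIC_API, params=params, timeout=30)
    resp.raise_for_status()
    return resp.content      # raw JPEG bytes

# Four crops cover a full 360-degree panorama:
# for h in (0, 90, 180, 270):
#     jpeg = fetch_panorama_crop("EXAMPLE_PANO_ID", h, "YOUR_API_KEY")
```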
Because sidewalk infrastructure can vary in design and appearance across cities and countries, we included four regions: Washington DC, Baltimore, Los Angeles, and Saskatoon.
We also looked at different types of city areas.
Blue regions represent dense urban areas, and red regions represent residential areas.
In all, we had 11.3 square kilometers. There were 1,086 intersections. We found 2,877 curb ramps and 647 missing curb ramps based on the ground truth data. Average Street View image age was 2.2 years old.
(pause) But how well does Street View data reflect the current state of curb ramp infrastructure?
To answer this question, we compared Street View intersections with physical intersections
First, we physically visited intersections and took multiple pictures.
The areas included four subset regions and consisted of 273 intersections.
We then counted the numbers of curb ramps and missing curb ramps in both datasets and evaluated their concordance.
As a result, we observed over 97% agreement between Google Street View and the real world. The small disagreement was due to construction.
Moving on to our dataset
To train and evaluate our computer vision program, 2 members of our research team manually labeled curb ramps in Street View images. In total, we collected 2,877 curb ramp labels.
Our computer vision component has three parts.
We experimented with various object detection methods. We chose to build on top of a framework called DPM (Deformable Part Models), one of the most successful approaches in object detection.
DPM models a target object and its parts with histogram of oriented gradients (HOG) features. It also models the spatial relationship between the parts.
DPM sweeps through an entire image and detects areas that look like a curb ramp. Detections are shown as red boxes. The numbers of correct detections and errors are shown in this table. There are some redundant labels, such as overlapping boxes.
h7ZW0_VasRt3vhevz1mjeg
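Full DPM adds part filters and deformation costs on top of a root filter; as a flavor of the sliding-window machinery underneath, here is a hedged sketch that scores HOG features of each window against a weight vector `w` (a placeholder for trained weights), using scikit-image:

```python
import numpy as np
from skimage.feature import hog

# Sketch of a DPM-style "root filter" pass: slide a window over the image
# and score its HOG features against learned weights w. Real DPM also
# scores part filters with displacement costs; omitted here for brevity.

def sliding_window_scores(gray, w, win=(64, 64), step=16):
    """Return (row, col, score) for each window position.
    gray: 2D grayscale image; w: weight vector matching the HOG size
    (1764 features for a 64x64 window with these settings)."""
    H, W = gray.shape
    scores = []
    for r in range(0, H - win[0] + 1, step):
        for c in range(0, W - win[1] + 1, step):
            patch = gray[r:r + win[0], c:c + win[1]]
            feat = hog(patch, orientations=9,
                       pixels_per_cell=(8, 8), cells_per_block=(2, 2))
            scores.append((r, c, float(np.dot(w, feat))))
    return scores  # threshold these scores to get candidate detections
```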
And there shouldn’t be curb ramps in the sky.
We use non-maxima suppression to remove overlapping labels, and 3D point cloud data to remove curb ramps that are not on ground level. Note that this 3D data is coarse, so we cannot identify the detailed structure of curb ramps.
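Non-maxima suppression is a standard step; a minimal sketch over (x, y, w, h, score) boxes, assuming a 0.5 intersection-over-union threshold:

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x, y, w, h)."""
    ax2, ay2 = a[0] + a[2], a[1] + a[3]
    bx2, by2 = b[0] + b[2], b[1] + b[3]
    iw = max(0, min(ax2, bx2) - max(a[0], b[0]))
    ih = max(0, min(ay2, by2) - max(a[1], b[1]))
    inter = iw * ih
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union else 0.0

def non_max_suppression(boxes, iou_thresh=0.5):
    """Keep only the highest-scoring box among overlapping detections.
    boxes: list of (x, y, w, h, score) tuples."""
    kept = []
    for box in sorted(boxes, key=lambda b: b[4], reverse=True):
        if all(iou(box[:4], k[:4]) < iou_thresh for k in kept):
            kept.append(box)
    return kept
```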
We get a cleaner result, but we still have some errors. We try to remove them by utilizing other information, such as the size of a bounding box and RGB color information.
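The exact refinement features aren't spelled out here; a hedged sketch of the idea with scikit-learn, using illustrative per-detection cues (relative box size, position in the image, mean color), not Tohme's exact ones:

```python
import numpy as np
from sklearn.svm import SVC

# Sketch of SVM-based refinement: classify each remaining detection as a
# true curb ramp or a false positive from simple cues. The features and
# training details below are illustrative.

def detection_features(img, box):
    """img: HxWx3 RGB array; box: (x, y, w, h). Returns a feature vector."""
    x, y, w, h = box
    H, W = img.shape[:2]
    patch = img[y:y + h, x:x + w]
    mean_rgb = patch.reshape(-1, 3).mean(axis=0)        # average color cue
    return np.array([w / W, h / H,                      # relative size
                     (x + w / 2) / W, (y + h / 2) / H,  # relative position
                     *(mean_rgb / 255.0)])              # normalized RGB

# Training data would come from detections matched against ground truth:
# clf = SVC(kernel="rbf").fit(X_train, y_train)
# kept = [b for b in boxes if clf.predict([detection_features(img, b)])[0] == 1]
```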
This is the final result with computer vision alone.
I will talk about how we can combine crowdsourcing and automated methods to collect curb ramp data from Google Street View efficiently.
Today, I will focus on how algorithmic work management plays a role in this process.
And removed even more errors
Our precision-recall curve is less than ideal.
For our system, we set the confidence threshold to emphasize recall over precision, because false positives are easier to correct.
We observed various image properties that could cause computer vision to make errors, including occlusion, illumination, scale, viewpoint variation, structures similar to curb ramps, and variation in the design of curb ramps.
That’s what we do with the task allocator.
We used the following features.
To assess the complexity of intersections, we used street cardinality from the metadata.
Depth data
It allows us to estimate the size of a street, which is useful because the farther away a curb ramp is, the harder it is to detect.
We also assessed the complexity of each intersection with top-down imagery.
Because the appearance of curb ramps varies more at irregular intersections, computer vision tends to miss curb ramps there. For example, the intersection on the right is arguably more complex than the one on the left.
We also used the number of detection boxes, their positions, and their confidence values to see how confused the computer vision program was.
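Putting these together, a hedged sketch of the allocator: build a scene-difficulty feature vector and predict whether CV likely missed a curb ramp, routing to svLabel if so and svVerify otherwise (the classifier choice here is illustrative, not necessarily the paper's model):

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Sketch of the smart task allocator (svControl): predict whether the CV
# stage likely produced false negatives in a scene, then route the task.
# (Top-down imagery complexity features are omitted for brevity.)

def scene_features(n_streets, depths, detections):
    """n_streets: street cardinality from metadata.
    depths: array of ground distances (proxy for road width/variance).
    detections: list of (box, confidence) pairs from the CV stage."""
    confs = [c for _, c in detections] or [0.0]
    return np.array([
        n_streets,               # intersection complexity from metadata
        float(np.mean(depths)),  # proxy for road width
        float(np.var(depths)),   # variance in distance
        len(detections),         # number of CV detections
        float(np.mean(confs)),   # mean CV confidence
    ])

# clf = RandomForestClassifier().fit(X_train, y_has_false_negative)
# route = "svLabel" if clf.predict([scene_features(...)])[0] else "svVerify"
```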
Our manual labeling tool allows people to control the viewing angle. You select the curb ramp button at the top and label the target. We collect outline labels of curb ramps to gather rich data for training computer vision.
Let’s talk about the verification task
Here you see detected curb ramps as green boxes on top of the Street View image (advance to the next slide to play).
The question is: can we achieve the same or better accuracy at a lower time cost compared to manual labeling?
We compare the performance of manual labeling without smart task allocation, computer vision plus verification without smart task allocation, and finally Tohme.
We measured accuracy and average task completion time of each workflow.
Turkers completed over 6,300 labeling tasks and 4,800 verification tasks, and we used Monte Carlo simulations for evaluation.
On the left, I show accuracy. On the right, I show cost. We want accuracy to be high, and cost to be low.
For the manual labeling approach alone, our accuracy measures are 84–86%, at 94 seconds per intersection.
For CV + manual verification, accuracy dropped substantially, but so did the time cost, by more than half.
Now, for Tohme, we saw accuracies similar to the manual baseline approach.
217 of 277 tasks were correctly routed to svVerify.
We recruited multiple workers to work on labeling tasks and verification tasks. We evaluated the results with Monte Carlo simulations.
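A hedged sketch of what such a Monte Carlo evaluation can look like (the paper's exact protocol may differ): repeatedly sample one worker's result per scene at random and average the resulting accuracy:

```python
import random
import statistics

def monte_carlo_f1(results_by_scene, trials=1000):
    """results_by_scene: {scene_id: [(tp, fp, fn), ...]}, one tuple per
    worker who completed that scene. Returns mean and stdev of F-measure."""
    f1s = []
    for _ in range(trials):
        tp = fp = fn = 0
        for worker_results in results_by_scene.values():
            t, f, n = random.choice(worker_results)  # sample one worker
            tp, fp, fn = tp + t, fp + f, fn + n
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1s.append(2 * prec * rec / (prec + rec) if prec + rec else 0.0)
    return statistics.mean(f1s), statistics.stdev(f1s)
```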
Let’s see how turkers labeled.
In general, their labels were high quality
Even with a difficult scene with shadows, they labeled correctly most of the time.
But sometimes there were errors.
For example, this person labeled a driveway as a curb ramp.
And some were a little lazy.
They labeled two curb ramps with a single label.
Here are some examples.
With only computer vision, there are false positive detections.
With human verification, errors get corrected.
Based on the shapefile downloaded from data.dc.gov, there are 8,209 intersections in DC
Manual labeling: 94 s per intersection × 8,209 intersections ≈ 214 hours
Tohme: 81 s per intersection × 8,209 intersections ≈ 184 hours
----
Source:
http://data.dc.gov/Metadata.aspx?id=2106
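The arithmetic can be checked directly (94 s and 81 s per intersection come from the evaluation above):

```python
# Reproduce the back-of-the-envelope numbers from the slide.
intersections = 8209
manual_hours = 94 * intersections / 3600  # ≈ 214.3 h, reported as 214 hours
tohme_hours = 81 * intersections / 3600   # ≈ 184.7 h, reported as 184 hours
print(f"{manual_hours:.1f} h manual vs. {tohme_hours:.1f} h with Tohme")
```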
(i) Context integration. While we use some context information in Tohme (e.g., 3D-depth data, intersection complexity inference), we are exploring methods to include broader contextual cues about buildings, traffic signal poles, crosswalks, and pedestrians as well as the precise location of corners from top-down map imagery.
(ii) 3D-data integration. Due to low resolution and noise, we currently use 3D point cloud data as a ground plane mask rather than as a feature for our CV algorithms. We plan to explore approaches that combine the 3D and 2D imagery to increase scene structure understanding (e.g., [28]). If higher-resolution depth data becomes available, it may allow us to directly detect the presence of a curb or corner, which would likely improve our results.
(iii) Training. Our CV algorithms are currently trained using GSV scenes from all eight city regions in our dataset. Given the variation in curb ramp appearance across geographic areas, we expect that performance could be improved if we trained and tested per city.