SlideShare una empresa de Scribd logo
1 de 35
Managing Data Quality in
         OpenStreetMap


TOOLS FOR AN ACTIVE
MAPPING COMMUNITY

NC GIS CONFERENCE 2013



    This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see:
    http://creativecommons.org/licenses/by-sa/3.0/
Overview
                            2

 The Short History of the OpenStreetMap
   Revolution

 Assessing Open Source Data Quality


 Overview of Tools


 Creating Tools that Matter


NC GIS Conference 2013                     23 February 2013
Overview: Key Questions
                                    3

 How can crowd-sourced projects manage data
   quality effectively?

 What tools exist for monitoring data quality in
   OpenStreetMap?

 What conclusions can be drawn about existing tools?


 What is the future of data quality in crowd-sourced
   projects?
NC GIS Conference 2013                             23 February 2013
OpenStreetMap is…
                                 4




 A freely-editable map of the world
   unconstrained by proprietary ownership

 “Wikipedia for maps”




NC GIS Conference 2013                       23 February 2013
The Origins of OpenStreetMap
                              5



 OpenStreetMap.org domain registered by Steve
  Coast in 2004
 Project originated in the United Kingdom, where…
   Crown copyright on geospatial data

   Little, or no public domain data

 Simple goal to create a free, publicly-available
  database of street centerlines


NC GIS Conference 2013                      23 February 2013
OpenStreetMap is…
                                 6




 A freely-editable map of the world
   unconstrained by proprietary ownership

 “Wikipedia for maps”




NC GIS Conference 2013                       23 February 2013
Looks like…a wiki
                                 7




NC GIS Conference 2013                       23 February 2013
Wiki-based Documentation!
                         8




NC GIS Conference 2013

                                         23 February 2013
Milestones in OpenStreetMap History
                             9

 2004 - OpenStreetMap.org registered by Steve Coast
 2005 – Map Limehouse, 1st OpenStreetMap mapping
    party
   2005 – 1000 registered OpenStreetMap users
   2006 – OpenStreetMap Foundation established
   2007 – 5 million ways in OSM database
   2007 – 10,000 registered OpenStreetMap users
   2008 - TIGER data import for the US completed
   2009 - 100,000 registered OpenStreetMap users
   2010 - 200,000 registered OpenStreetMap users
   2012 – ~670,000 registered OpenStreetMap users

NC GIS Conference 2013                          23 February 2013
OpenStreetMap User Growth
                                          10
One million registered users worldwide!




 NC GIS Conference 2013                         23 February 2013
OpenStreetMap Growth in User Edits
                         11




NC GIS Conference 2013                 23 February 2013
OpenStreetMap Database Growth
                           12




NC GIS Conference 2013                  23 February 2013
Data Quality in Crowd-sourced Projects
                                                            13

 Goodchild & Li: Identified three mechanisms for
   Quality Assurance

       Crowd-sourcing

       Social

       Geographic


Goodchild, Michael F., and Linna Li. "Assuring the quality of volunteered geographic information."
Spatial Statistics 1 (2012): 110-120.


NC GIS Conference 2013                                                                               23 February 2013
Crowd-sourced Approach to Data Quality
                                                        14

 Based on Surowiecki’s “Wisdom of the Crowd”
   Multiple users converge around consensus solutions that
    might escape an individual
   Many independent observations reinforce the validity of a
    single observation
   Concurrence on observed features (e.g. “It’s a bridge.”)

   Convergence on the truth



      The group validates observations & corrects errors



   Surowiecki, J., 2005. The Wisdom of Crowds. Anchor, New York.

NC GIS Conference 2013                                             23 February 2013
Social Approach to Data Quality
                             15

 Through practices, users acquire reputations
 Users with good reputations are trusted
 Trust and reputation are indicators of stewardship
 As the project evolves, social leadership becomes
   more formalized.

 The Data Working Group of OpenStreetMap fullfills
  this function
 Email lists supplement social stewardship


NC GIS Conference 2013                        23 February 2013
Geographic Tools for Data Quality
                                   16

 Geographic approach draws on formal geographic
   theory:
      Spatial neighbors & auto-correlation (Moran statistics)
      Christaller’s Central Place Theory
      Descriptive Statistics
      Inferential Statistics & Analysis of Variance (ANOVA)
      Richardson plots of linear measurements
      Cluster analysis, e.g. k-means
 These approaches have not been widely adopted for
   use in the OpenStreetMap project…yet

NC GIS Conference 2013                                     23 February 2013
A Quick Survey of Data Quality Tools
                               17

 Two types of tools are in widespread use:


      Error Detection Tools

      Monitoring Tools




NC GIS Conference 2013                        23 February 2013
Error Detection Tools: Keep Right
                             18




NC GIS Conference 2013                      23 February 2013
Error Detection Tools: Map Dust
                             19




NC GIS Conference 2013                     23 February 2013
Error Detection Tools: OpenStreetBugs




NC GIS Conference 2013                 23 February 2013
Error Detection Tools: No Name
                             21




NC GIS Conference 2013                     23 February 2013
Error Detection Tools: MapRoulette
                           22




NC GIS Conference 2013                    23 February 2013
Monitoring Tools
                                23




NC GIS Conference 2013                      23 February 2013
Monitoring Tools: OpenStreetMap Watch List
                  (OWL)
                         24




NC GIS Conference 2013            23 February 2013
Monitoring Tools: GeoFabrik Map Compare
                         25




NC GIS Conference 2013           23 February 2013
Monitoring Tools: Who Did It
                               26




NC GIS Conference 2013                           23 February 2013
Monitoring Tools: ITO TIGER Reviewed
                         27




NC GIS Conference 2013              23 February 2013
Monitoring Tools: ITO TIGER Reviewed
                         28




NC GIS Conference 2013              23 February 2013
Monitoring Tools: Green Means Go
                          29




NC GIS Conference 2013                  23 February 2013
Monitoring Tools: Who’s Around Me
                          30




NC GIS Conference 2013                  23 February 2013
Social Controls
                                31

 OpenStreetMap - Data Working Group (DWG)
   Resolving disputes between users

   Processes & protocols for data imports

   Investigates copyright infringement

   Deals with issues of vandalism and fraud

   Suspends or closes user accounts (in case of abuse)

   IP blocking (in case of abuse)




NC GIS Conference 2013                              23 February 2013
How do Social Methods Treat Vandalism?
                                32

 OpenStreetMap is not immune from malicious intent
   Copyright infringement (e.g. copying from Google Maps)

   Graffiti

   Disputes & “Edit Wars” (e.g. Kashmir region, Palestine)

   Spam

 Tools for Managing Vandalism
   Detect using daily diffs

   UserActivity – batch comparison of two versions of the
    database
   Revert – undo changeset to previous version

   Virtual Ban


NC GIS Conference 2013                                 23 February 2013
Summary Review
                                 33

 Three methods for data quality control
   Crowd-sourced

   Social

   Geographic

 OpenStreetMap has crowd-sourced and social tools
   for managing data quality
      Error & Monitoring tools
      Data Working Group - Social
 Geographic methods are experimental at this time
 Increasingly complete geographic features will lead
   to better tools
NC GIS Conference 2013                        23 February 2013
Lessons Learned about OSM Data Quality
                                                       34

 Successive editing by multiple users can improve
   accuracy…up to a point
      Haklay suggests that few improvements are made beyond the
       13th edit
      Semantic differences are not easy to resolve – “Tag wars”
      Obscure edits do not always get corrected if there are no local
       mappers that take ownership
 Social approaches will acquire more authority
   Are part-time, volunteer staffers enough to guarantee data
    quality?
   What are appropriate metrics for trust and reputation?

     Haklay, M. 2010. How Good is volunteered geographical information? a comparative study of OpenStreetMap and
     Ordnance Survey Datasets. Environment & Planning B: Planning and Design 37 (4), 682-703g
NC GIS Conference 2013                                                                           23 February 2013
Thank You
                                                                   35

 Questions?




 Steven Johnson
   (e) stevejohnson@deloitte.com

   (t) @geomantic




             This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see:
             http://creativecommons.org/licenses/by-sa/3.0/




NC GIS Conference 2013                                                                                              23 February 2013

Más contenido relacionado

La actualidad más candente

Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08Anita Graser
 
Geographic information system
Geographic information systemGeographic information system
Geographic information systemDhaval Jalalpara
 
The Application of GIS in Urban Planning
The Application of GIS in Urban PlanningThe Application of GIS in Urban Planning
The Application of GIS in Urban Planningagungwah
 
Geodatabase with GIS & RS
Geodatabase with GIS & RSGeodatabase with GIS & RS
Geodatabase with GIS & RSMohammed_82
 
Future of GIS, Moving to the Enterprise Platform
Future of GIS, Moving to the Enterprise PlatformFuture of GIS, Moving to the Enterprise Platform
Future of GIS, Moving to the Enterprise PlatformSSP Innovations
 
Geographic information system
Geographic information systemGeographic information system
Geographic information systemSumanta Das
 
GIS and Petroleum Land Management
GIS and Petroleum Land ManagementGIS and Petroleum Land Management
GIS and Petroleum Land Managementwlgardnerjr
 
Geographical information system in transportation planning
Geographical information system in transportation planning Geographical information system in transportation planning
Geographical information system in transportation planning shayiqRashid
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GISJoe Larson
 
Gis powerpoint
Gis powerpointGis powerpoint
Gis powerpointkaushdave
 
Open source health gis presentation final
Open source health gis  presentation finalOpen source health gis  presentation final
Open source health gis presentation finalJISC GECO
 
Why Does GIS Matter
Why Does GIS MatterWhy Does GIS Matter
Why Does GIS MatterSong Gao
 
Geographic Information Systems in the Oil & Gas Industry
Geographic Information Systems in the Oil & Gas IndustryGeographic Information Systems in the Oil & Gas Industry
Geographic Information Systems in the Oil & Gas IndustryFrancois Viljoen
 
A Study of the Development and Distribution of Open Geospatial Data in Japane...
A Study of the Development and Distribution of Open Geospatial Data in Japane...A Study of the Development and Distribution of Open Geospatial Data in Japane...
A Study of the Development and Distribution of Open Geospatial Data in Japane...Toshikazu Seto
 
MODERN trends of GIS
MODERN trends of GISMODERN trends of GIS
MODERN trends of GISVAISHALI JAIN
 
CKANへの空間情報機能拡張実装の試み
CKANへの空間情報機能拡張実装の試みCKANへの空間情報機能拡張実装の試み
CKANへの空間情報機能拡張実装の試みYoichi Kayama
 

La actualidad más candente (20)

Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
 
Geographic information system
Geographic information systemGeographic information system
Geographic information system
 
The Application of GIS in Urban Planning
The Application of GIS in Urban PlanningThe Application of GIS in Urban Planning
The Application of GIS in Urban Planning
 
Geodatabase with GIS & RS
Geodatabase with GIS & RSGeodatabase with GIS & RS
Geodatabase with GIS & RS
 
Future of GIS, Moving to the Enterprise Platform
Future of GIS, Moving to the Enterprise PlatformFuture of GIS, Moving to the Enterprise Platform
Future of GIS, Moving to the Enterprise Platform
 
Geographic information system
Geographic information systemGeographic information system
Geographic information system
 
GIS and Petroleum Land Management
GIS and Petroleum Land ManagementGIS and Petroleum Land Management
GIS and Petroleum Land Management
 
Introduction To GIS
Introduction To GISIntroduction To GIS
Introduction To GIS
 
Geographical information system in transportation planning
Geographical information system in transportation planning Geographical information system in transportation planning
Geographical information system in transportation planning
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GIS
 
Gis powerpoint
Gis powerpointGis powerpoint
Gis powerpoint
 
Open source health gis presentation final
Open source health gis  presentation finalOpen source health gis  presentation final
Open source health gis presentation final
 
Gis
GisGis
Gis
 
Why Does GIS Matter
Why Does GIS MatterWhy Does GIS Matter
Why Does GIS Matter
 
survey paper 2
survey paper 2survey paper 2
survey paper 2
 
Geographic Information Systems in the Oil & Gas Industry
Geographic Information Systems in the Oil & Gas IndustryGeographic Information Systems in the Oil & Gas Industry
Geographic Information Systems in the Oil & Gas Industry
 
Get Big Geo Data
Get Big Geo DataGet Big Geo Data
Get Big Geo Data
 
A Study of the Development and Distribution of Open Geospatial Data in Japane...
A Study of the Development and Distribution of Open Geospatial Data in Japane...A Study of the Development and Distribution of Open Geospatial Data in Japane...
A Study of the Development and Distribution of Open Geospatial Data in Japane...
 
MODERN trends of GIS
MODERN trends of GISMODERN trends of GIS
MODERN trends of GIS
 
CKANへの空間情報機能拡張実装の試み
CKANへの空間情報機能拡張実装の試みCKANへの空間情報機能拡張実装の試み
CKANへの空間情報機能拡張実装の試み
 

Similar a OpenStreetMap Data Quality

Exploratory analysis of OpenStreetMap for land use classification
Exploratory analysis of OpenStreetMap for land use classificationExploratory analysis of OpenStreetMap for land use classification
Exploratory analysis of OpenStreetMap for land use classificationJacinto Estima
 
GIS for geophysics.pptx
GIS for geophysics.pptxGIS for geophysics.pptx
GIS for geophysics.pptxThomasHundasa1
 
Land information system in Nepal
Land information system in NepalLand information system in Nepal
Land information system in NepalQust04
 
Thesispresentatie maart
Thesispresentatie maartThesispresentatie maart
Thesispresentatie maartRobin De Croon
 
oWE-QGIS_Training-March2022
oWE-QGIS_Training-March2022oWE-QGIS_Training-March2022
oWE-QGIS_Training-March2022AbdilbasitHamid
 
MoWE-QGIS_Training-March2022-Day1_AM.pptx
MoWE-QGIS_Training-March2022-Day1_AM.pptxMoWE-QGIS_Training-March2022-Day1_AM.pptx
MoWE-QGIS_Training-March2022-Day1_AM.pptxAbdilbasitHamid
 
Converting Relational to Graph Databases
Converting Relational to Graph DatabasesConverting Relational to Graph Databases
Converting Relational to Graph DatabasesAntonio Maccioni
 
Gis Day Presentation 2010 - ACCC - Expanded Version
Gis Day Presentation 2010 - ACCC - Expanded VersionGis Day Presentation 2010 - ACCC - Expanded Version
Gis Day Presentation 2010 - ACCC - Expanded Versionpdcaris
 
Arc news - Fall-2015
Arc news - Fall-2015Arc news - Fall-2015
Arc news - Fall-2015what3words
 
New way for GIS Development(Gaia3D)
New way for  GIS Development(Gaia3D)New way for  GIS Development(Gaia3D)
New way for GIS Development(Gaia3D)slhead1
 
Introduction to Geographic Information System (GIS)
Introduction to Geographic Information System (GIS)Introduction to Geographic Information System (GIS)
Introduction to Geographic Information System (GIS)Shashank Singh
 
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...David Rozas
 
Open Source based GIS devlopment cases by Gaia3D_20150417
Open Source based GIS devlopment cases by Gaia3D_20150417Open Source based GIS devlopment cases by Gaia3D_20150417
Open Source based GIS devlopment cases by Gaia3D_20150417BJ Jang
 
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...Grant McKenzie
 
Spatial Analysis and Geomatics
Spatial Analysis and GeomaticsSpatial Analysis and Geomatics
Spatial Analysis and GeomaticsRich Heimann
 
COST Actions: ENERGIC, Mapping and the citizen sensor.
COST Actions: ENERGIC,  Mapping and the citizen sensor.COST Actions: ENERGIC,  Mapping and the citizen sensor.
COST Actions: ENERGIC, Mapping and the citizen sensor.Vyron
 

Similar a OpenStreetMap Data Quality (20)

Understanding the Volunteer in VGI
Understanding the Volunteer in VGIUnderstanding the Volunteer in VGI
Understanding the Volunteer in VGI
 
Exploratory analysis of OpenStreetMap for land use classification
Exploratory analysis of OpenStreetMap for land use classificationExploratory analysis of OpenStreetMap for land use classification
Exploratory analysis of OpenStreetMap for land use classification
 
GIS for geophysics.pptx
GIS for geophysics.pptxGIS for geophysics.pptx
GIS for geophysics.pptx
 
Land information system in Nepal
Land information system in NepalLand information system in Nepal
Land information system in Nepal
 
Thesispresentatie maart
Thesispresentatie maartThesispresentatie maart
Thesispresentatie maart
 
oWE-QGIS_Training-March2022
oWE-QGIS_Training-March2022oWE-QGIS_Training-March2022
oWE-QGIS_Training-March2022
 
MoWE-QGIS_Training-March2022-Day1_AM.pptx
MoWE-QGIS_Training-March2022-Day1_AM.pptxMoWE-QGIS_Training-March2022-Day1_AM.pptx
MoWE-QGIS_Training-March2022-Day1_AM.pptx
 
Converting Relational to Graph Databases
Converting Relational to Graph DatabasesConverting Relational to Graph Databases
Converting Relational to Graph Databases
 
Gis Day Presentation 2010 - ACCC - Expanded Version
Gis Day Presentation 2010 - ACCC - Expanded VersionGis Day Presentation 2010 - ACCC - Expanded Version
Gis Day Presentation 2010 - ACCC - Expanded Version
 
Arc news - Fall-2015
Arc news - Fall-2015Arc news - Fall-2015
Arc news - Fall-2015
 
New way for GIS Development(Gaia3D)
New way for  GIS Development(Gaia3D)New way for  GIS Development(Gaia3D)
New way for GIS Development(Gaia3D)
 
AAG panel discussion on great lakes africa
AAG panel discussion on great lakes africaAAG panel discussion on great lakes africa
AAG panel discussion on great lakes africa
 
Introduction to Geographic Information System (GIS)
Introduction to Geographic Information System (GIS)Introduction to Geographic Information System (GIS)
Introduction to Geographic Information System (GIS)
 
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data...
 
Open Source based GIS devlopment cases by Gaia3D_20150417
Open Source based GIS devlopment cases by Gaia3D_20150417Open Source based GIS devlopment cases by Gaia3D_20150417
Open Source based GIS devlopment cases by Gaia3D_20150417
 
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
Coerced Geographic Information: The Not-so-voluntary Side of User-generated G...
 
Smart Citizens
Smart CitizensSmart Citizens
Smart Citizens
 
Smart Citizens
Smart CitizensSmart Citizens
Smart Citizens
 
Spatial Analysis and Geomatics
Spatial Analysis and GeomaticsSpatial Analysis and Geomatics
Spatial Analysis and Geomatics
 
COST Actions: ENERGIC, Mapping and the citizen sensor.
COST Actions: ENERGIC,  Mapping and the citizen sensor.COST Actions: ENERGIC,  Mapping and the citizen sensor.
COST Actions: ENERGIC, Mapping and the citizen sensor.
 

Último

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 

Último (20)

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 

OpenStreetMap Data Quality

  • 1. Managing Data Quality in OpenStreetMap TOOLS FOR AN ACTIVE MAPPING COMMUNITY NC GIS CONFERENCE 2013 This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see: http://creativecommons.org/licenses/by-sa/3.0/
  • 2. Overview 2  The Short History of the OpenStreetMap Revolution  Assessing Open Source Data Quality  Overview of Tools  Creating Tools that Matter NC GIS Conference 2013 23 February 2013
  • 3. Overview: Key Questions 3  How can crowd-sourced projects manage data quality effectively?  What tools exist for monitoring data quality in OpenStreetMap?  What conclusions can be drawn about existing tools?  What is the future of data quality in crowd-sourced projects? NC GIS Conference 2013 23 February 2013
  • 4. OpenStreetMap is… 4  A freely-editable map of the world unconstrained by proprietary ownership  “Wikipedia for maps” NC GIS Conference 2013 23 February 2013
  • 5. The Origins of OpenStreetMap 5  OpenStreetMap.org domain registered by Steve Coast in 2004  Project originated in the United Kingdom, where…  Crown copyright on geospatial data  Little, or no public domain data  Simple goal to create a free, publicly-available database of street centerlines NC GIS Conference 2013 23 February 2013
  • 6. OpenStreetMap is… 6  A freely-editable map of the world unconstrained by proprietary ownership  “Wikipedia for maps” NC GIS Conference 2013 23 February 2013
  • 7. Looks like…a wiki 7 NC GIS Conference 2013 23 February 2013
  • 8. Wiki-based Documentation! 8 NC GIS Conference 2013 23 February 2013
  • 9. Milestones in OpenStreetMap History 9  2004 - OpenStreetMap.org registered by Steve Coast  2005 – Map Limehouse, 1st OpenStreetMap mapping party  2005 – 1000 registered OpenStreetMap users  2006 – OpenStreetMap Foundation established  2007 – 5 million ways in OSM database  2007 – 10,000 registered OpenStreetMap users  2008 - TIGER data import for the US completed  2009 - 100,000 registered OpenStreetMap users  2010 - 200,000 registered OpenStreetMap users  2012 – ~670,000 registered OpenStreetMap users NC GIS Conference 2013 23 February 2013
  • 10. OpenStreetMap User Growth 10 One million registered users worldwide! NC GIS Conference 2013 23 February 2013
  • 11. OpenStreetMap Growth in User Edits 11 NC GIS Conference 2013 23 February 2013
  • 12. OpenStreetMap Database Growth 12 NC GIS Conference 2013 23 February 2013
  • 13. Data Quality in Crowd-sourced Projects 13  Goodchild & Li: Identified three mechanisms for Quality Assurance  Crowd-sourcing  Social  Geographic Goodchild, Michael F., and Linna Li. "Assuring the quality of volunteered geographic information." Spatial Statistics 1 (2012): 110-120. NC GIS Conference 2013 23 February 2013
  • 14. Crowd-sourced Approach to Data Quality 14  Based on Surowiecki’s “Wisdom of the Crowd”  Multiple users converge around consensus solutions that might escape an individual  Many independent observations reinforce the validity of a single observation  Concurrence on observed features (e.g. “It’s a bridge.”)  Convergence on the truth  The group validates observations & corrects errors Surowiecki, J., 2005. The Wisdom of Crowds. Anchor, New York. NC GIS Conference 2013 23 February 2013
  • 15. Social Approach to Data Quality 15  Through practices, users acquire reputations  Users with good reputations are trusted  Trust and reputation are indicators of stewardship  As the project evolves, social leadership becomes more formalized.  The Data Working Group of OpenStreetMap fullfills this function  Email lists supplement social stewardship NC GIS Conference 2013 23 February 2013
  • 16. Geographic Tools for Data Quality 16  Geographic approach draws on formal geographic theory:  Spatial neighbors & auto-correlation (Moran statistics)  Christaller’s Central Place Theory  Descriptive Statistics  Inferential Statistics & Analysis of Variance (ANOVA)  Richardson plots of linear measurements  Cluster analysis, e.g. k-means  These approaches have not been widely adopted for use in the OpenStreetMap project…yet NC GIS Conference 2013 23 February 2013
  • 17. A Quick Survey of Data Quality Tools 17  Two types of tools are in widespread use:  Error Detection Tools  Monitoring Tools NC GIS Conference 2013 23 February 2013
  • 18. Error Detection Tools: Keep Right 18 NC GIS Conference 2013 23 February 2013
  • 19. Error Detection Tools: Map Dust 19 NC GIS Conference 2013 23 February 2013
  • 20. Error Detection Tools: OpenStreetBugs NC GIS Conference 2013 23 February 2013
  • 21. Error Detection Tools: No Name 21 NC GIS Conference 2013 23 February 2013
  • 22. Error Detection Tools: MapRoulette 22 NC GIS Conference 2013 23 February 2013
  • 23. Monitoring Tools 23 NC GIS Conference 2013 23 February 2013
  • 24. Monitoring Tools: OpenStreetMap Watch List (OWL) 24 NC GIS Conference 2013 23 February 2013
  • 25. Monitoring Tools: GeoFabrik Map Compare 25 NC GIS Conference 2013 23 February 2013
  • 26. Monitoring Tools: Who Did It 26 NC GIS Conference 2013 23 February 2013
  • 27. Monitoring Tools: ITO TIGER Reviewed 27 NC GIS Conference 2013 23 February 2013
  • 28. Monitoring Tools: ITO TIGER Reviewed 28 NC GIS Conference 2013 23 February 2013
  • 29. Monitoring Tools: Green Means Go 29 NC GIS Conference 2013 23 February 2013
  • 30. Monitoring Tools: Who’s Around Me 30 NC GIS Conference 2013 23 February 2013
  • 31. Social Controls 31  OpenStreetMap - Data Working Group (DWG)  Resolving disputes between users  Processes & protocols for data imports  Investigates copyright infringement  Deals with issues of vandalism and fraud  Suspends or closes user accounts (in case of abuse)  IP blocking (in case of abuse) NC GIS Conference 2013 23 February 2013
  • 32. How do Social Methods Treat Vandalism? 32  OpenStreetMap is not immune from malicious intent  Copyright infringement (e.g. copying from Google Maps)  Graffiti  Disputes & “Edit Wars” (e.g. Kashmir region, Palestine)  Spam  Tools for Managing Vandalism  Detect using daily diffs  UserActivity – batch comparison of two versions of the database  Revert – undo changeset to previous version  Virtual Ban NC GIS Conference 2013 23 February 2013
  • 33. Summary Review 33  Three methods for data quality control  Crowd-sourced  Social  Geographic  OpenStreetMap has crowd-sourced and social tools for managing data quality  Error & Monitoring tools  Data Working Group - Social  Geographic methods are experimental at this time  Increasingly complete geographic features will lead to better tools NC GIS Conference 2013 23 February 2013
  • 34. Lessons Learned about OSM Data Quality 34  Successive editing by multiple users can improve accuracy…up to a point  Haklay suggests that few improvements are made beyond the 13th edit  Semantic differences are not easy to resolve – “Tag wars”  Obscure edits do not always get corrected if there are no local mappers that take ownership  Social approaches will acquire more authority  Are part-time, volunteer staffers enough to guarantee data quality?  What are appropriate metrics for trust and reputation? Haklay, M. 2010. How Good is volunteered geographical information? a comparative study of OpenStreetMap and Ordnance Survey Datasets. Environment & Planning B: Planning and Design 37 (4), 682-703g NC GIS Conference 2013 23 February 2013
  • 35. Thank You 35  Questions?  Steven Johnson  (e) stevejohnson@deloitte.com  (t) @geomantic This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see: http://creativecommons.org/licenses/by-sa/3.0/ NC GIS Conference 2013 23 February 2013