Harpster, J. - Open data on buildings with satellite imagery processing

What are we going to cover?
• Project Description
• Motivation
• Solution Overview
• Extracting Buildings for OpenStreetMap
• What’s next
• Conclusions

Building Extraction Project Overview
Why is Microsoft investing in this?
• Consistent side by side Loss against competitors for Base
Map
• Lack of Building footprints is #1 contributor
• Metrics show users like building polygons on base maps
• Reverse geocoding scenarios
• Cortana geofencing: “What building am I in?”
• OpenStreetMap is the largest source of buildings data by
far
• ~315M and growing by 1000s per day
• “Free” source of data if we follow the guidelines

Open Source at Microsoft
 Microsoft is the largest Open
Source contributor to GitHub
 2,620 public repos
 10,966 contributors
 902 teams
 The Open Source shift at MS makes
Opening data on GitHub easy

Map Tiles
• Images from Bing Satellite Layer
• Image map tiles are extracted from the world’s
Mercator projection using quadkeys
• Images are of 256x256 pixels size, jpeg
compressed
• The maximal level of detail available is 23
• Going one level deeper =>
• 2x better resolution -> more details
• 4x more data to process and store
• Detail level 19 properties:
• Resolution is 0.3m/pixel (~1 foot/pixel)
• The whole US is covered
• ~2.7 billion tiles in mainland US
Level 18 Level 19 Level 20

Scale is Huge
• Extracting from ~2.7 billion L19 tiles in mainland US in 1
month, need to process 1000 images per second
• 60TB of imagery for the US
• Toolkits & Platforms that makes this possible:
• CNTK – Microsoft Open Source Cognitive Toolkit:
• A unified deep-learning toolkit
• Philly Cluster:
• Internal GPU compute cluster for to large-scale DNN training
• COSMOS (aka AzureDB):
• A Microsoft internal Big Data platform service
• Our platform of choice for storing massive datasets

Creating the Training Set
• Once the tiles are isolated, training pairs are created
• Label masks are created in Æther:
• For each training tile find the overlapping label polygons by sending a query request to spatial DB
• Render building polygon on the label tile images – converts lat/long coordinates into the pixel
domain
• Creating uncertainty/weight areas around building enables better training
• Training loss function can be modified to account for errors around building edges
since the labels aren’t perfect
Text
Training pair
Uncertainty area

Postprocessing – Creating Building Polygons
• The process of converting building pixel predictions into the building polygons
defined by latitude/longitude pair vertices
• Processing stages included:
• Finetuning and de-noising raw DNN building predictions
• Conversion into representative polygons that approximate the building shapes
• Subtracting known source satellite image offsets in the resulting polygons
• Executed Azure Æther because of easy integration with Python image libraries
Raw predictions
Finetuning
Noise Removal Creating polygons
Storing into a textual stream
WKT formatParcels utilization

The Final Pipeline – 64 GPUs
Buildings Extraction in the United
States
• Support for autoscaling resources
• Parametrization of different jobs
• Auto restart with no data loss
Imagery data : United States = 60
TB;
Final Buildings Processing time:
United States = 2 weeks

Errors – Pacific county, WA
• Beach and waves identified as
buildings
• We didn’t have enough negative
examples in the training set
• Proper buildings were not
detected
• The imagery is of lower or
different resolution
• There were not enough positive
images in the training set

Seattle Downtown
• Boats & piers recognized as buildings
• We didn’t have enough negative examples
in the training set in coastal areas
• Parts of highway recognized as buildings
• Need to add road intersection
postprocessing
• Shadows can cause problems
• In high density areas buildings can put
shadows on other buildings
• Need more high-density area training
samples

Other issues rendering false positives
• We need to spend more time
working on the training set
• Snow spots and clouds can
generate false positive predictions

What else is possible?
We are only beginning
Possible data sets include:
• National Level Road Networks
• Buildings for additional countries
• Water & coastline extractions
• High accuracy lanes & road markings
• Off Nadir (oblique) segmentation

Roads – Extracted in days,
not months or weeks

Paved Attribute coverage
Some countries have a significant number of
unpaved roads
 In Australia OSM has a sparse attribute coverage:
 320K km of road missing surface=
 790K km of road correctly tagged
 Applied computer vision to detect the type:
 Classifier that identifies if an image contains paved road
 Classifier Precision/Recall = 92%/90% (per tile)
 Voting logic brings this to 98%/99% measured in km
Unpaved
Paved
PavedPaved
Unpaved
Unpaved
UnpavedUnpavedUnpavedUnpaved
Unpaved Paved

Building Extraction - Canada
• Canada spans on 12.5 billion L19 tiles
(US spans across 2.7 billion tiles)
• Optimization:
• L12 tiles intersecting:
• Expanded populated places and
neighborhoods by 10km
• Expanded roads by 2km
• This method could be validated on US
• Create the evaluation sets
• Iterate improvements targeting the error
space
Camera Id Canada US old US new
222/225 0.04% 1.30% 1.36%
217 0.00% 76.34% 0.79%
138 80.42% 10.87% 88.34%
235 3.89% 0.15% 0.22%
No image 15.64% 10.95% 9.25%
Model trained on
High focus
imagery
Low focus
imagery
Tiles camera distribution

Water & Coastlines
• L16 or L17 tiles extraction
• 64x or 256x smaller data space compared
to buildings extraction
• We can use the same deep net model –
or simplify
• No polygonization necessary

Oblique Imagery
• Taken from the flying plane/drone
• 4 directions
• Coverage is limited to top US cities
• Potential for 3D reconstruction
• 45 different shots of the same location
• Could help semantic understanding
• Building heights

Oblique Road Horizontal Signalization
• Image resolution 10300x7700
• 6cm/px close areas , 8.5 cm/px far
areas
• Compared to 32cm/px – building
extraction
• Turn restriction and lane
detection
• Apply Deep Neural Nets
• Lane Markings
• Car, bike, bus, pedestrian, parking, etc
Bottom road Upper road

Open up all the building data – for free
 We released all the extracted
buildings on GitHub
 Amplifies the value of the data by
making it Open
 Adheres to OSM community
guidelines
 Encourages additional contributions
 To date over 75K downloads of the
building data

Applications & Summary
• Population Estimations
• Neighborhood delineations
• Telecommunications – Mobile Signal Targeting
• Urban Planning
• Transportation & Land use Planning
The value of the data are amplified when Open
Everywhere we have training data we can apply deep
learning

Harpster, J. - Open data on buildings with satellite imagery processing

Harpster, J. - Open data on buildings with satellite imagery processing

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (10)

Similar a Harpster, J. - Open data on buildings with satellite imagery processing

Similar a Harpster, J. - Open data on buildings with satellite imagery processing (20)

Más de OECDregions

Más de OECDregions (20)

Último

Último (20)

Harpster, J. - Open data on buildings with satellite imagery processing