SlideShare una empresa de Scribd logo
1 de 31
Machine Learning for Imagery
Abel Coronado & Jimena Juarez
Imagery Pipeline
Imagery Pipeline
Imagery Pipeline
In order to have a road map.
We build an abstract machine learning pipeline for Imagery
Based on:
IBM CRISP-DM
Cross Industry Standard Process for Data Mining
&
Microsoft Data Science lifecycle
InKyung Choi; Refactoring
Imagery Pipeline
Imagery Pipeline
Pilot Project Pipeline
Establish the problem to be solved
Establish the problem to be solved
Monitor the growth of cities of Mexico (Urban and Rural), which would generate a more timely
input for :
• Cartography update
• Estimation of the population in non-census years
• Related with:
• SDG Indicator 11.3.1: Ratio of land consumption rate to population growth rate
• SDG Indicator 15.3.1: Proportion of land that is degraded over total land area
Identify satellite data to address the problem
Identify satellite data to address the problem
In our case, the data to monitor the growth of cities can be Landsat images (NASA & USGS):
• They are open data.
• There is a constant record since the 70s, although they are available from 1985 to date.
• The spatial resolution of Landsat images is 30 meters.
• Temporal resolution is 16 days.
• Spectral resolution is 6 spectral bands (range of wavelength measured by the passive
sensors of the Landsat Satellites)
Identify satellite data to address the problem
Identify satellite data to address the problem
Identify satellite data to address the problem
Identify satellite data to address the problem
Identify satellite data to address the problem
Identify satellite data to address the problem
Translate the problem into statistical problem
Translate the problem into statistical problem
We define that is a classification problem:
• Unit of analysis: 1km x 1km squares covering the whole country.
• 3 classes were assigned:
• Urban
• Rural
• None of the above
Obtain ground-truth data
Obtain ground-truth data
Obtain layers of geographical information:
• Georeferenced Population Census (Block Level Aggregation)
• Georeferenced Economic Census (Economic Unit Level)
• Visual Inspection
Translate the problem into statistical problem
Obtain ground-truth data
Build the polygons (1 km – 1 km)
1’ 975, 719 Polygons
Translate the problem into statistical problem
Obtain ground-truth data
Assign Labels
Obtain satellite imagery data
Obtain satellite imagery data
• 32 TeraBytes of Images in external discs.
• March 4, 2019
• The images are ARD, Analysis Ready Data.
• In essence ARD means that the pixels of the entire time serie are aligned.
• Which are processed to combine sensors and make comparisons at different times.
Obtain satellite imagery data
Obtain satellite imagery data
• 34 Years
Obtain satellite imagery data
Obtain satellite imagery data
• 2010 same year of the Census
Obtain satellite imagery data
Obtain satellite imagery data
• 2010 same year of the Census
2,737,273,075 pixels – 35 Gb
Integrate ground-truth data with satellite data
Integrate ground-truth data with satellite data
1 km x 1 km tagged polygon
33 x 33 pixels
30 meters of resolution
33 Pixels
6Bands
6 swir 2------
5 swir 1------
4 nir---------
3 red---------
2 green-------
1 blue--------
Python
Function
Multidimensional Array
NumPy
Take a Random Sample
Take a Random Sample
Define features and generate a dataset to be used for analysis
Define features and generate a dataset to be used for analysis
• Feature Engineering
30,000 tagged regions
1km x 1km
33 pixels x 33 pixels x 6 Bands
30,000
4,305
30,000 rows in a data matrix
?
Define features and generate a dataset to be used for analysis
Define features and generate a dataset to be used for analysis
• Feature Engineering
30,000 tagged regions
1km x 1km
33 pixels x 33 pixels x 6 Bands
30,000
4,305
30,000 rows in a data matrix
Auto Machine Learning
https://epistasislab.github.io/tpot/
Evaluate ML
Random Sample
30,000 Elements
Machine Learning
Result of TPOT
2010 National
Classification
76% Accuracy
Training
2010 Geomedian
Classification
30,000
4,305
30,000 rows in a data matrix
Detailed results; First Iteration
Detailed results; First Iteration
Next Steps
Second Iteration
At this moment finishing second iteration
Improve 2 or 3 %
Next Steps
Next Steps
• Finish a second iteration, no later than two weeks.
• The grid is also an important innovation and is evolving, it is likely
that we will have a new version very soon and we should update
the classification.
• Apply the model to un-labelled years, for example 2019 first
semester.
• Validate a sample of the un-labelled year, requesting support for
the area of visual interpretation of the geography division, to have
a measure of product quality.
• Hold meetings with the area of sociodemographic statistics,
involve them in the exercise to improve the potential benefit in the
population estimate.
• Hold meetings with the area of statistical research, to receive
feedback and observations that can improve the product.
• Hold meetings with the cartographic update area, to receive
feedback.
Next Steps
• To the last week of October
• Write a first draft of the document that explains the process carried out. Probably until the results of the second
iteration and some results of meetings with stakeholders.
GRACIAS!
Machine learning and Satellite Images

Más contenido relacionado

La actualidad más candente

Weather factor landslide hazard
Weather factor landslide hazardWeather factor landslide hazard
Weather factor landslide hazardAlfonso Crisci
 
Luo-IGARSS2011-2385.ppt
Luo-IGARSS2011-2385.pptLuo-IGARSS2011-2385.ppt
Luo-IGARSS2011-2385.pptgrssieee
 
Identifying Land Patterns from Satellite Images using Deep Learning
Identifying Land Patterns from Satellite Images using Deep LearningIdentifying Land Patterns from Satellite Images using Deep Learning
Identifying Land Patterns from Satellite Images using Deep LearningSoumyadeep Debnath
 
Swiss Territorial Data Lab - geo Data Science - colloque FHNW
Swiss Territorial Data Lab - geo Data Science - colloque FHNWSwiss Territorial Data Lab - geo Data Science - colloque FHNW
Swiss Territorial Data Lab - geo Data Science - colloque FHNWRaphael Rollier
 
Processing of raw astronomical data of large volume by map reduce model
Processing of raw astronomical data of large volume by map reduce modelProcessing of raw astronomical data of large volume by map reduce model
Processing of raw astronomical data of large volume by map reduce modelSergey Gerasimov
 
GEM Risk: main achievements during the first implementation phase
GEM Risk: main achievements during the first implementation phaseGEM Risk: main achievements during the first implementation phase
GEM Risk: main achievements during the first implementation phaseGlobal Earthquake Model Foundation
 
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryUsing Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryAstraea, Inc.
 
Leland Wilkinson, H2O.ai - The Grammar of Graphics and the Future of Big Data...
Leland Wilkinson, H2O.ai - The Grammar of Graphics and the Future of Big Data...Leland Wilkinson, H2O.ai - The Grammar of Graphics and the Future of Big Data...
Leland Wilkinson, H2O.ai - The Grammar of Graphics and the Future of Big Data...Sri Ambati
 
State of the Map US 2018: Analytic Support to Mapping Contributors
State of the Map US 2018: Analytic Support to Mapping ContributorsState of the Map US 2018: Analytic Support to Mapping Contributors
State of the Map US 2018: Analytic Support to Mapping Contributorsrlewis48
 
Spectral_classification_of_WorldView2_multiangle_sequence.pptx
Spectral_classification_of_WorldView2_multiangle_sequence.pptxSpectral_classification_of_WorldView2_multiangle_sequence.pptx
Spectral_classification_of_WorldView2_multiangle_sequence.pptxgrssieee
 
Automated Summarisation of Big Data, useR! 2018
Automated Summarisation of Big Data, useR! 2018  Automated Summarisation of Big Data, useR! 2018
Automated Summarisation of Big Data, useR! 2018 Amy Stringer
 
The Seismic Hazard Modeller’s Toolkit: An Open-Source Library for the Const...
The Seismic Hazard Modeller’s Toolkit:  An Open-Source Library for the Const...The Seismic Hazard Modeller’s Toolkit:  An Open-Source Library for the Const...
The Seismic Hazard Modeller’s Toolkit: An Open-Source Library for the Const...Global Earthquake Model Foundation
 
2nd e-ROSA Stakeholder Workshop: By, EO Based Global Public goods
2nd e-ROSA Stakeholder Workshop: By, EO Based Global Public goods2nd e-ROSA Stakeholder Workshop: By, EO Based Global Public goods
2nd e-ROSA Stakeholder Workshop: By, EO Based Global Public goodse-ROSA
 
How to set achievable tree canopy goals
How to set achievable tree canopy goalsHow to set achievable tree canopy goals
How to set achievable tree canopy goalsJosh Behounek
 
07 a70110 remotesensingandgisapplications
07 a70110 remotesensingandgisapplications07 a70110 remotesensingandgisapplications
07 a70110 remotesensingandgisapplicationsimaduddin91
 
Bayesian Hilbert Maps for Dynamic Continuous Occupancy Mapping
Bayesian Hilbert Maps for Dynamic Continuous Occupancy MappingBayesian Hilbert Maps for Dynamic Continuous Occupancy Mapping
Bayesian Hilbert Maps for Dynamic Continuous Occupancy MappingRansalu Senanayake
 

La actualidad más candente (18)

Weather factor landslide hazard
Weather factor landslide hazardWeather factor landslide hazard
Weather factor landslide hazard
 
Luo-IGARSS2011-2385.ppt
Luo-IGARSS2011-2385.pptLuo-IGARSS2011-2385.ppt
Luo-IGARSS2011-2385.ppt
 
Identifying Land Patterns from Satellite Images using Deep Learning
Identifying Land Patterns from Satellite Images using Deep LearningIdentifying Land Patterns from Satellite Images using Deep Learning
Identifying Land Patterns from Satellite Images using Deep Learning
 
Swiss Territorial Data Lab - geo Data Science - colloque FHNW
Swiss Territorial Data Lab - geo Data Science - colloque FHNWSwiss Territorial Data Lab - geo Data Science - colloque FHNW
Swiss Territorial Data Lab - geo Data Science - colloque FHNW
 
Processing of raw astronomical data of large volume by map reduce model
Processing of raw astronomical data of large volume by map reduce modelProcessing of raw astronomical data of large volume by map reduce model
Processing of raw astronomical data of large volume by map reduce model
 
GEM Risk: main achievements during the first implementation phase
GEM Risk: main achievements during the first implementation phaseGEM Risk: main achievements during the first implementation phase
GEM Risk: main achievements during the first implementation phase
 
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryUsing Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite Imagery
 
Leland Wilkinson, H2O.ai - The Grammar of Graphics and the Future of Big Data...
Leland Wilkinson, H2O.ai - The Grammar of Graphics and the Future of Big Data...Leland Wilkinson, H2O.ai - The Grammar of Graphics and the Future of Big Data...
Leland Wilkinson, H2O.ai - The Grammar of Graphics and the Future of Big Data...
 
Casos de uso do FME no IBGE
Casos de uso do FME no IBGECasos de uso do FME no IBGE
Casos de uso do FME no IBGE
 
State of the Map US 2018: Analytic Support to Mapping Contributors
State of the Map US 2018: Analytic Support to Mapping ContributorsState of the Map US 2018: Analytic Support to Mapping Contributors
State of the Map US 2018: Analytic Support to Mapping Contributors
 
Spectral_classification_of_WorldView2_multiangle_sequence.pptx
Spectral_classification_of_WorldView2_multiangle_sequence.pptxSpectral_classification_of_WorldView2_multiangle_sequence.pptx
Spectral_classification_of_WorldView2_multiangle_sequence.pptx
 
Automated Summarisation of Big Data, useR! 2018
Automated Summarisation of Big Data, useR! 2018  Automated Summarisation of Big Data, useR! 2018
Automated Summarisation of Big Data, useR! 2018
 
The Seismic Hazard Modeller’s Toolkit: An Open-Source Library for the Const...
The Seismic Hazard Modeller’s Toolkit:  An Open-Source Library for the Const...The Seismic Hazard Modeller’s Toolkit:  An Open-Source Library for the Const...
The Seismic Hazard Modeller’s Toolkit: An Open-Source Library for the Const...
 
2nd e-ROSA Stakeholder Workshop: By, EO Based Global Public goods
2nd e-ROSA Stakeholder Workshop: By, EO Based Global Public goods2nd e-ROSA Stakeholder Workshop: By, EO Based Global Public goods
2nd e-ROSA Stakeholder Workshop: By, EO Based Global Public goods
 
How to set achievable tree canopy goals
How to set achievable tree canopy goalsHow to set achievable tree canopy goals
How to set achievable tree canopy goals
 
Big data in GIS Environment
Big data in GIS Environment Big data in GIS Environment
Big data in GIS Environment
 
07 a70110 remotesensingandgisapplications
07 a70110 remotesensingandgisapplications07 a70110 remotesensingandgisapplications
07 a70110 remotesensingandgisapplications
 
Bayesian Hilbert Maps for Dynamic Continuous Occupancy Mapping
Bayesian Hilbert Maps for Dynamic Continuous Occupancy MappingBayesian Hilbert Maps for Dynamic Continuous Occupancy Mapping
Bayesian Hilbert Maps for Dynamic Continuous Occupancy Mapping
 

Similar a Machine learning and Satellite Images

Integrating eo with official statistics using machine learning in mexico geo ...
Integrating eo with official statistics using machine learning in mexico geo ...Integrating eo with official statistics using machine learning in mexico geo ...
Integrating eo with official statistics using machine learning in mexico geo ...Abel Alejandro Coronado Iruegas
 
Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2University of Salerno
 
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...Gloria Re Calegari
 
Crowd sourcing gis for global urban area mapping
Crowd sourcing gis for global urban area mappingCrowd sourcing gis for global urban area mapping
Crowd sourcing gis for global urban area mappingHiroyuki Miyazaki
 
Christian jensen advanced routing in spatial networks using big data
Christian jensen advanced routing in spatial networks using big dataChristian jensen advanced routing in spatial networks using big data
Christian jensen advanced routing in spatial networks using big datajins0618
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...Ian Foster
 
flat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimationflat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimationLuís Moreira-Matias
 
Geospatial Open Data and Urban Growth Modelling for Evidence-based Decision M...
Geospatial Open Data and Urban Growth Modelling for Evidence-based Decision M...Geospatial Open Data and Urban Growth Modelling for Evidence-based Decision M...
Geospatial Open Data and Urban Growth Modelling for Evidence-based Decision M...Piyush Yadav
 
IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079ibankuk
 
Do we need a new standard for visualizing the invisible?
Do we need a new standard for visualizing the invisible?Do we need a new standard for visualizing the invisible?
Do we need a new standard for visualizing the invisible?SANGHEE SHIN
 
Singapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart NationSingapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart NationMonica Moran
 
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...NSW Environment and Planning
 
12 SuperAI on Supercomputers
12 SuperAI on Supercomputers12 SuperAI on Supercomputers
12 SuperAI on SupercomputersRCCSRENKEI
 

Similar a Machine learning and Satellite Images (20)

Integrating eo with official statistics using machine learning in mexico geo ...
Integrating eo with official statistics using machine learning in mexico geo ...Integrating eo with official statistics using machine learning in mexico geo ...
Integrating eo with official statistics using machine learning in mexico geo ...
 
Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2Carpita metulini 111220_dssr_bari_version2
Carpita metulini 111220_dssr_bari_version2
 
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
 
IEEE Mapathon_2022_BMS CE_Sep.pdf
IEEE Mapathon_2022_BMS CE_Sep.pdfIEEE Mapathon_2022_BMS CE_Sep.pdf
IEEE Mapathon_2022_BMS CE_Sep.pdf
 
Crowd sourcing gis for global urban area mapping
Crowd sourcing gis for global urban area mappingCrowd sourcing gis for global urban area mapping
Crowd sourcing gis for global urban area mapping
 
Christian jensen advanced routing in spatial networks using big data
Christian jensen advanced routing in spatial networks using big dataChristian jensen advanced routing in spatial networks using big data
Christian jensen advanced routing in spatial networks using big data
 
Multimedia Mining
Multimedia Mining Multimedia Mining
Multimedia Mining
 
ONS local presents clustering
ONS local presents clusteringONS local presents clustering
ONS local presents clustering
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
flat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimationflat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimation
 
Geospatial Open Data and Urban Growth Modelling for Evidence-based Decision M...
Geospatial Open Data and Urban Growth Modelling for Evidence-based Decision M...Geospatial Open Data and Urban Growth Modelling for Evidence-based Decision M...
Geospatial Open Data and Urban Growth Modelling for Evidence-based Decision M...
 
IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079
 
Do we need a new standard for visualizing the invisible?
Do we need a new standard for visualizing the invisible?Do we need a new standard for visualizing the invisible?
Do we need a new standard for visualizing the invisible?
 
Singapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart NationSingapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart Nation
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Hadoop PDF
Hadoop PDFHadoop PDF
Hadoop PDF
 
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
Exploration in the House 2015: NSW Seamless Geology Project: Progress to date...
 
12 SuperAI on Supercomputers
12 SuperAI on Supercomputers12 SuperAI on Supercomputers
12 SuperAI on Supercomputers
 
ML & Decision Making
ML & Decision MakingML & Decision Making
ML & Decision Making
 

Más de Abel Alejandro Coronado Iruegas

Analisis del Sentimiento en el Estado de Animo de los Tuiteros en Mexico
Analisis del Sentimiento en el Estado de Animo de los Tuiteros en MexicoAnalisis del Sentimiento en el Estado de Animo de los Tuiteros en Mexico
Analisis del Sentimiento en el Estado de Animo de los Tuiteros en MexicoAbel Alejandro Coronado Iruegas
 
Ejemplos de Proyectos de Ciencia de Datos y Big Data en el INEGI
Ejemplos de Proyectos de Ciencia de Datos y Big Data en el INEGIEjemplos de Proyectos de Ciencia de Datos y Big Data en el INEGI
Ejemplos de Proyectos de Ciencia de Datos y Big Data en el INEGIAbel Alejandro Coronado Iruegas
 

Más de Abel Alejandro Coronado Iruegas (20)

Mobility Master Class.pdf
Mobility Master Class.pdfMobility Master Class.pdf
Mobility Master Class.pdf
 
Live UAEMex Cubo de Datos Geoespaciales de Mexico
Live UAEMex Cubo de Datos Geoespaciales de MexicoLive UAEMex Cubo de Datos Geoespaciales de Mexico
Live UAEMex Cubo de Datos Geoespaciales de Mexico
 
Cubo de datos uaemex
Cubo de datos uaemexCubo de datos uaemex
Cubo de datos uaemex
 
Geo Big Data 4 Datalab
Geo Big Data 4 DatalabGeo Big Data 4 Datalab
Geo Big Data 4 Datalab
 
Catedra INEGI Big Data en IBERO
Catedra INEGI Big Data en IBEROCatedra INEGI Big Data en IBERO
Catedra INEGI Big Data en IBERO
 
El Cubo de Datos Geoespaciales de Mexico
El Cubo de Datos Geoespaciales de MexicoEl Cubo de Datos Geoespaciales de Mexico
El Cubo de Datos Geoespaciales de Mexico
 
No Sql
No SqlNo Sql
No Sql
 
Cubo de Datos Geoespaciales de Mexico
Cubo de Datos Geoespaciales de MexicoCubo de Datos Geoespaciales de Mexico
Cubo de Datos Geoespaciales de Mexico
 
Congreso UAA 2018 Animo Tuitero 2 0
Congreso UAA 2018 Animo Tuitero 2 0Congreso UAA 2018 Animo Tuitero 2 0
Congreso UAA 2018 Animo Tuitero 2 0
 
Analisis del Sentimiento en el Estado de Animo de los Tuiteros en Mexico
Analisis del Sentimiento en el Estado de Animo de los Tuiteros en MexicoAnalisis del Sentimiento en el Estado de Animo de los Tuiteros en Mexico
Analisis del Sentimiento en el Estado de Animo de los Tuiteros en Mexico
 
Ejemplos de Proyectos de Ciencia de Datos y Big Data en el INEGI
Ejemplos de Proyectos de Ciencia de Datos y Big Data en el INEGIEjemplos de Proyectos de Ciencia de Datos y Big Data en el INEGI
Ejemplos de Proyectos de Ciencia de Datos y Big Data en el INEGI
 
Big data big opportunities
Big data big opportunitiesBig data big opportunities
Big data big opportunities
 
Big data taller inegi sedesol
Big data taller inegi sedesolBig data taller inegi sedesol
Big data taller inegi sedesol
 
INEGI ESS big data workshop
INEGI ESS big data workshopINEGI ESS big data workshop
INEGI ESS big data workshop
 
Taller de Big Data y Ciencia de Datos en COLMEX dia 2
Taller de Big Data y Ciencia de Datos en COLMEX dia 2Taller de Big Data y Ciencia de Datos en COLMEX dia 2
Taller de Big Data y Ciencia de Datos en COLMEX dia 2
 
Taller de Big Data y Ciencia de Datos en COLMEX dia 1
Taller de Big Data y Ciencia de Datos en COLMEX dia 1 Taller de Big Data y Ciencia de Datos en COLMEX dia 1
Taller de Big Data y Ciencia de Datos en COLMEX dia 1
 
Big data lead colmex
Big data lead colmexBig data lead colmex
Big data lead colmex
 
Explorando Big Data y Ciencia de Datos con GPUs
Explorando Big Data y Ciencia de Datos con GPUsExplorando Big Data y Ciencia de Datos con GPUs
Explorando Big Data y Ciencia de Datos con GPUs
 
Geo Big Data 2015
Geo Big Data 2015 Geo Big Data 2015
Geo Big Data 2015
 
Anatomía de un proyecto de Big Data
Anatomía de un proyecto de Big DataAnatomía de un proyecto de Big Data
Anatomía de un proyecto de Big Data
 

Último

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 

Último (20)

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 

Machine learning and Satellite Images

  • 1. Machine Learning for Imagery Abel Coronado & Jimena Juarez
  • 3. Imagery Pipeline Imagery Pipeline In order to have a road map. We build an abstract machine learning pipeline for Imagery Based on: IBM CRISP-DM Cross Industry Standard Process for Data Mining & Microsoft Data Science lifecycle InKyung Choi; Refactoring
  • 6. Establish the problem to be solved Establish the problem to be solved Monitor the growth of cities of Mexico (Urban and Rural), which would generate a more timely input for : • Cartography update • Estimation of the population in non-census years • Related with: • SDG Indicator 11.3.1: Ratio of land consumption rate to population growth rate • SDG Indicator 15.3.1: Proportion of land that is degraded over total land area
  • 7. Identify satellite data to address the problem Identify satellite data to address the problem In our case, the data to monitor the growth of cities can be Landsat images (NASA & USGS): • They are open data. • There is a constant record since the 70s, although they are available from 1985 to date. • The spatial resolution of Landsat images is 30 meters. • Temporal resolution is 16 days. • Spectral resolution is 6 spectral bands (range of wavelength measured by the passive sensors of the Landsat Satellites)
  • 8. Identify satellite data to address the problem Identify satellite data to address the problem
  • 9. Identify satellite data to address the problem Identify satellite data to address the problem
  • 10. Identify satellite data to address the problem Identify satellite data to address the problem
  • 11. Translate the problem into statistical problem Translate the problem into statistical problem We define that is a classification problem: • Unit of analysis: 1km x 1km squares covering the whole country. • 3 classes were assigned: • Urban • Rural • None of the above
  • 12. Obtain ground-truth data Obtain ground-truth data Obtain layers of geographical information: • Georeferenced Population Census (Block Level Aggregation) • Georeferenced Economic Census (Economic Unit Level) • Visual Inspection
  • 13. Translate the problem into statistical problem Obtain ground-truth data Build the polygons (1 km – 1 km) 1’ 975, 719 Polygons
  • 14. Translate the problem into statistical problem Obtain ground-truth data Assign Labels
  • 15. Obtain satellite imagery data Obtain satellite imagery data • 32 TeraBytes of Images in external discs. • March 4, 2019 • The images are ARD, Analysis Ready Data. • In essence ARD means that the pixels of the entire time serie are aligned. • Which are processed to combine sensors and make comparisons at different times.
  • 16. Obtain satellite imagery data Obtain satellite imagery data • 34 Years
  • 17. Obtain satellite imagery data Obtain satellite imagery data • 2010 same year of the Census
  • 18. Obtain satellite imagery data Obtain satellite imagery data • 2010 same year of the Census 2,737,273,075 pixels – 35 Gb
  • 19. Integrate ground-truth data with satellite data Integrate ground-truth data with satellite data 1 km x 1 km tagged polygon 33 x 33 pixels 30 meters of resolution 33 Pixels 6Bands 6 swir 2------ 5 swir 1------ 4 nir--------- 3 red--------- 2 green------- 1 blue-------- Python Function Multidimensional Array NumPy
  • 20. Take a Random Sample Take a Random Sample
  • 21. Define features and generate a dataset to be used for analysis Define features and generate a dataset to be used for analysis • Feature Engineering 30,000 tagged regions 1km x 1km 33 pixels x 33 pixels x 6 Bands 30,000 4,305 30,000 rows in a data matrix ?
  • 22. Define features and generate a dataset to be used for analysis Define features and generate a dataset to be used for analysis • Feature Engineering 30,000 tagged regions 1km x 1km 33 pixels x 33 pixels x 6 Bands 30,000 4,305 30,000 rows in a data matrix
  • 24. Evaluate ML Random Sample 30,000 Elements Machine Learning Result of TPOT 2010 National Classification 76% Accuracy Training 2010 Geomedian Classification 30,000 4,305 30,000 rows in a data matrix
  • 25. Detailed results; First Iteration Detailed results; First Iteration
  • 27. Second Iteration At this moment finishing second iteration Improve 2 or 3 %
  • 28. Next Steps Next Steps • Finish a second iteration, no later than two weeks. • The grid is also an important innovation and is evolving, it is likely that we will have a new version very soon and we should update the classification. • Apply the model to un-labelled years, for example 2019 first semester. • Validate a sample of the un-labelled year, requesting support for the area of visual interpretation of the geography division, to have a measure of product quality. • Hold meetings with the area of sociodemographic statistics, involve them in the exercise to improve the potential benefit in the population estimate. • Hold meetings with the area of statistical research, to receive feedback and observations that can improve the product. • Hold meetings with the cartographic update area, to receive feedback.
  • 29. Next Steps • To the last week of October • Write a first draft of the document that explains the process carried out. Probably until the results of the second iteration and some results of meetings with stakeholders.