SlideShare una empresa de Scribd logo
1 de 17
Feature Geo Analytics and
Big Data Processing:
Hybrid Approaches for Earth Science
and Real-Time Decision Making
Mansour Raad, Erik Hoel, Michael Park, Adam
Mollenkopf, Dawn J. Wright
Environmental Systems Research Institute (aka Esri)
IN12A-01 (Invited)
AGU Fall Meeting, 12 December 2016
What is Feature Geo Analytics?
A new way of processing spatiotemporal data designed for WEB-
BASED big data by leveraging distributed analytics and storage
• Works with existing GIS data and tabular data
• Designed to perform both spatial and temporal analysis
• Uses familiar workflows to complete complex analyses
• “Hybridity” - integrating open-source frameworks on clusters to run analytics
Feature Geo Analytics
Geoprocessing
Distributed analytics and storage
Feature Geo Analytics
Portal
Web GIS Layers
newmore extends
Solve New Problems
Run analytics:
• against data too big for a single desktop machine
- Buffer 8.2 million points or thousands of polygons in a little over a minute
- billions of observations of ship movements ingested via GeoEvent
• designed to gain insight into both spatial and temporal patterns
• against massive collections in a scalable manner
• and meet time constraints
months weeks days hours minutes
Geo Analytics Architectural Overview
Portal
Web GIS Layers
Un-Managed Data
New Web GIS Layers
Register large data stores, then distribute
spatial analysis across cluster of machines
for parallel processing
Store and/or deploy to web
Web GIS layers
via Pro, Portal,
Python Notebooks,
or the REST API
Managed Data
Relational
Data Store
Spatiotemporal
Data Store
Files
Files
Delimited Files EnterpriseShapefiles Big Data Stores
Server
Cluster
Rich Collection of (Web) Analysis Tools
Summarize Data
Aggregate Points
Summarize Nearby
Summarize Within
Reconstruct Tracks
Join Features
Find Locations
Find Existing Locations
Find Similar Locations
Analyze Patterns
Calculate Density
Find Hot Spots
Create Space Time Cube
Use Proximity
Create Buffers
Manage Data
Extract Data
* Temporally aware tools
Aggregate Points
Summarize Nearby
Summarize Within
Find Existing Locations
Find Similar Locations
Calculate Density
Find Hot Spots
Create Buffers
Extract Data
Analytical Overview: Aggregating and Summarizing
• Spatial Joins
• Space-time slices
• Spatiotemporal joins
Target Features Join Features Intermediate Result Final Result
Analytical Overview: Aggregating and Summarizing
Temporal Relationships on Intervals
• Points into Bins
Analytical Overview: Aggregating and Summarizing
Aggregation – Polygons vs Cells
Aggregation By Polygons Aggregation By Cells
• Reconstruct Tracks
- Summarize time-enabled points into tracks
Analytical Overview: Aggregating and Summarizing
Use Case: Hurricane Tracts
• Hurricane dataset
- 120,000 points, ~100 years
- Each point has:
- ID number
- Location
- Date
- Wind speed and pressure attributes
- Problems?
- Difficult to visualize that many points
- Difficult to visualize hurricane path
“Hybridity” for Distributed Computation
See also www.esri.com/software/open
“Hybridity” for Distributed Computation
See also www.esri.com/software/open
Real-Time GIS Performance
ArcGIS 10.4
10s of thousands of e/s
ArcGIS Spatiotemporal
Big Data Store
DesktopWeb Device
ArcGIS Server
4,000
e/s
Ingestion
GeoEvent
4,000
e/s
Visualization
Live and Historic
Aggregates & Features
Enhanced Map and
Feature Service
• Ingest high-velocity real-
time data
• Observations in a Big Data
Store
• Visualize high-velocity,
high-volume data
- as an AGGREGATION,
- as discrete FEATURES,
- live & HISTORICALLY
• Visualizations CAN scale
Stream Service
Stream Layer
3,000
e/s
Live Features
Geo Analytics Performance
Spatiotemporal
Big Data Store
Discussion groups at geonet.esri.com
Step 1. Click orange “Join in” button to create your
account.
Step 2. Join the Big Data or Sciences groups
Step 3. Contribute to AGU conversations!
Mansour Raad, Esri Big Data Team
mraad@esri.com
thunderheadxpler.blogspot.com
github.com/mraad
@mraad
For Questions/Discussion

Más contenido relacionado

La actualidad más candente

Scalable Data Analytics and Visualization with Cloud Optimized Services
Scalable Data Analytics and Visualization with Cloud Optimized ServicesScalable Data Analytics and Visualization with Cloud Optimized Services
Scalable Data Analytics and Visualization with Cloud Optimized ServicesGlobus
 
ERFEG Seminar Fall 2008
ERFEG Seminar Fall 2008ERFEG Seminar Fall 2008
ERFEG Seminar Fall 2008shirabay
 
From analogue to digital history
From analogue to digital historyFrom analogue to digital history
From analogue to digital historyKatrina Navickas
 
Magellen: Geospatial Analytics on Spark by Ram Sriharsha
Magellen: Geospatial Analytics on Spark by Ram SriharshaMagellen: Geospatial Analytics on Spark by Ram Sriharsha
Magellen: Geospatial Analytics on Spark by Ram SriharshaSpark Summit
 
Processing Geospatial Data At Scale @locationtech
Processing Geospatial Data At Scale @locationtechProcessing Geospatial Data At Scale @locationtech
Processing Geospatial Data At Scale @locationtechRob Emanuele
 
Big data mega surveys pushing the boundaries
Big data   mega surveys pushing the boundariesBig data   mega surveys pushing the boundaries
Big data mega surveys pushing the boundariesGeodata AS
 
Big Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC SystemsBig Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC SystemsHPCC Systems
 
Computation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop ClusterComputation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop ClusterAbhishek Sagar
 
1Spatial: Edinburgh FME World Tour: Performance tips
1Spatial: Edinburgh FME World Tour: Performance tips1Spatial: Edinburgh FME World Tour: Performance tips
1Spatial: Edinburgh FME World Tour: Performance tips1Spatial
 
Kansa SAA 2014 Archaeological Data on Vastly Different Scales
Kansa SAA 2014 Archaeological Data on Vastly Different ScalesKansa SAA 2014 Archaeological Data on Vastly Different Scales
Kansa SAA 2014 Archaeological Data on Vastly Different Scalesdinaa_proj
 
Snow cover assessment tool using Python
Snow cover assessment tool using PythonSnow cover assessment tool using Python
Snow cover assessment tool using PythonPrasun Kumar Gupta
 
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projectsEnabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projectsRob Emanuele
 
NJ Wildlife Habitat Finder
NJ Wildlife Habitat FinderNJ Wildlife Habitat Finder
NJ Wildlife Habitat FinderDan Ford
 
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...Viet-Trung TRAN
 
Deadline-aware MapReduce Job Scheduling with Dynamic Resource Availability
Deadline-aware MapReduce Job Scheduling with Dynamic Resource AvailabilityDeadline-aware MapReduce Job Scheduling with Dynamic Resource Availability
Deadline-aware MapReduce Job Scheduling with Dynamic Resource AvailabilityJAYAPRAKASH JPINFOTECH
 

La actualidad más candente (20)

Real Time Geodemographics
Real Time GeodemographicsReal Time Geodemographics
Real Time Geodemographics
 
Scalable Data Analytics and Visualization with Cloud Optimized Services
Scalable Data Analytics and Visualization with Cloud Optimized ServicesScalable Data Analytics and Visualization with Cloud Optimized Services
Scalable Data Analytics and Visualization with Cloud Optimized Services
 
Advancing Scientific Data Support in ArcGIS
Advancing Scientific Data Support in ArcGISAdvancing Scientific Data Support in ArcGIS
Advancing Scientific Data Support in ArcGIS
 
ERFEG Seminar Fall 2008
ERFEG Seminar Fall 2008ERFEG Seminar Fall 2008
ERFEG Seminar Fall 2008
 
From analogue to digital history
From analogue to digital historyFrom analogue to digital history
From analogue to digital history
 
Magellen: Geospatial Analytics on Spark by Ram Sriharsha
Magellen: Geospatial Analytics on Spark by Ram SriharshaMagellen: Geospatial Analytics on Spark by Ram Sriharsha
Magellen: Geospatial Analytics on Spark by Ram Sriharsha
 
Processing Geospatial Data At Scale @locationtech
Processing Geospatial Data At Scale @locationtechProcessing Geospatial Data At Scale @locationtech
Processing Geospatial Data At Scale @locationtech
 
Big data mega surveys pushing the boundaries
Big data   mega surveys pushing the boundariesBig data   mega surveys pushing the boundaries
Big data mega surveys pushing the boundaries
 
Big Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC SystemsBig Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC Systems
 
Computation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop ClusterComputation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop Cluster
 
1Spatial: Edinburgh FME World Tour: Performance tips
1Spatial: Edinburgh FME World Tour: Performance tips1Spatial: Edinburgh FME World Tour: Performance tips
1Spatial: Edinburgh FME World Tour: Performance tips
 
Kansa SAA 2014 Archaeological Data on Vastly Different Scales
Kansa SAA 2014 Archaeological Data on Vastly Different ScalesKansa SAA 2014 Archaeological Data on Vastly Different Scales
Kansa SAA 2014 Archaeological Data on Vastly Different Scales
 
Snow cover assessment tool using Python
Snow cover assessment tool using PythonSnow cover assessment tool using Python
Snow cover assessment tool using Python
 
linkIn_CVPR15
linkIn_CVPR15linkIn_CVPR15
linkIn_CVPR15
 
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projectsEnabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
 
Murphy presentation
Murphy presentationMurphy presentation
Murphy presentation
 
Reading HDF family of formats via NetCDF-Java / CDM
Reading HDF family of formats via NetCDF-Java / CDMReading HDF family of formats via NetCDF-Java / CDM
Reading HDF family of formats via NetCDF-Java / CDM
 
NJ Wildlife Habitat Finder
NJ Wildlife Habitat FinderNJ Wildlife Habitat Finder
NJ Wildlife Habitat Finder
 
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
 
Deadline-aware MapReduce Job Scheduling with Dynamic Resource Availability
Deadline-aware MapReduce Job Scheduling with Dynamic Resource AvailabilityDeadline-aware MapReduce Job Scheduling with Dynamic Resource Availability
Deadline-aware MapReduce Job Scheduling with Dynamic Resource Availability
 

Similar a Feature Geo Analytics and Big Data Processing: Hybrid Approaches for Earth Science and Real-Time Decision Support

Nye forskninsgresultater inden for geo-spatiale data af Christian S. Jensen, AAU
Nye forskninsgresultater inden for geo-spatiale data af Christian S. Jensen, AAUNye forskninsgresultater inden for geo-spatiale data af Christian S. Jensen, AAU
Nye forskninsgresultater inden for geo-spatiale data af Christian S. Jensen, AAUInfinIT - Innovationsnetværket for it
 
Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Ian Foster
 
Geospatial Sensor Networks and Partitioning Data
Geospatial Sensor Networks and Partitioning DataGeospatial Sensor Networks and Partitioning Data
Geospatial Sensor Networks and Partitioning DataAlexMiowski
 
Data Centric HPC for Numerical Weather Forecasting
Data Centric HPC for Numerical Weather ForecastingData Centric HPC for Numerical Weather Forecasting
Data Centric HPC for Numerical Weather ForecastingJames Arnold Faeldon
 
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars Joel Saltz
 
PEARC17: Visual exploration and analysis of time series earthquake data
PEARC17: Visual exploration and analysis of time series earthquake dataPEARC17: Visual exploration and analysis of time series earthquake data
PEARC17: Visual exploration and analysis of time series earthquake dataAmit Chourasia
 
CitySprint Fleetmapper use case -Big Data Bootcamp
CitySprint  Fleetmapper use case -Big Data BootcampCitySprint  Fleetmapper use case -Big Data Bootcamp
CitySprint Fleetmapper use case -Big Data BootcampEduard Lazar
 
Analysis Ready Data workshop - OGC presentation
Analysis Ready Data workshop - OGC presentation Analysis Ready Data workshop - OGC presentation
Analysis Ready Data workshop - OGC presentation George Percivall
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSEd Dodds
 
Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack Srinath Perera
 
HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010Cloudera, Inc.
 
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...Accumulo Summit
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataJoel Saltz
 
Geo Analytics Canada Overview - May 2020
Geo Analytics Canada Overview - May 2020Geo Analytics Canada Overview - May 2020
Geo Analytics Canada Overview - May 2020GEO Analytics Canada
 

Similar a Feature Geo Analytics and Big Data Processing: Hybrid Approaches for Earth Science and Real-Time Decision Support (20)

Introduction to Google Earth Engine .pptx
Introduction to Google Earth Engine .pptxIntroduction to Google Earth Engine .pptx
Introduction to Google Earth Engine .pptx
 
Nye forskninsgresultater inden for geo-spatiale data af Christian S. Jensen, AAU
Nye forskninsgresultater inden for geo-spatiale data af Christian S. Jensen, AAUNye forskninsgresultater inden for geo-spatiale data af Christian S. Jensen, AAU
Nye forskninsgresultater inden for geo-spatiale data af Christian S. Jensen, AAU
 
Software for the Hydrographic ocean
Software for the Hydrographic oceanSoftware for the Hydrographic ocean
Software for the Hydrographic ocean
 
Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013
 
ArcGIS and Multi-D: Tools & Roadmap
ArcGIS and Multi-D: Tools & RoadmapArcGIS and Multi-D: Tools & Roadmap
ArcGIS and Multi-D: Tools & Roadmap
 
CLIM Program: Remote Sensing Workshop, Distributed Access and Analysis: NASA ...
CLIM Program: Remote Sensing Workshop, Distributed Access and Analysis: NASA ...CLIM Program: Remote Sensing Workshop, Distributed Access and Analysis: NASA ...
CLIM Program: Remote Sensing Workshop, Distributed Access and Analysis: NASA ...
 
Geospatial Sensor Networks and Partitioning Data
Geospatial Sensor Networks and Partitioning DataGeospatial Sensor Networks and Partitioning Data
Geospatial Sensor Networks and Partitioning Data
 
Data Centric HPC for Numerical Weather Forecasting
Data Centric HPC for Numerical Weather ForecastingData Centric HPC for Numerical Weather Forecasting
Data Centric HPC for Numerical Weather Forecasting
 
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
 
PEARC17: Visual exploration and analysis of time series earthquake data
PEARC17: Visual exploration and analysis of time series earthquake dataPEARC17: Visual exploration and analysis of time series earthquake data
PEARC17: Visual exploration and analysis of time series earthquake data
 
CitySprint Fleetmapper use case -Big Data Bootcamp
CitySprint  Fleetmapper use case -Big Data BootcampCitySprint  Fleetmapper use case -Big Data Bootcamp
CitySprint Fleetmapper use case -Big Data Bootcamp
 
Analysis Ready Data workshop - OGC presentation
Analysis Ready Data workshop - OGC presentation Analysis Ready Data workshop - OGC presentation
Analysis Ready Data workshop - OGC presentation
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
 
Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack
 
HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010
 
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
Accumulo Summit 2016: GeoMesa: Using Accumulo for Optimized Spatio-Temporal P...
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor Data
 
Geo Analytics Canada Overview - May 2020
Geo Analytics Canada Overview - May 2020Geo Analytics Canada Overview - May 2020
Geo Analytics Canada Overview - May 2020
 
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
 
ACCESS-Opt_Overview
ACCESS-Opt_OverviewACCESS-Opt_Overview
ACCESS-Opt_Overview
 

Más de Dawn Wright

Geospatial as an Accelerator of Impact: Already Converging!
Geospatial as an Accelerator of Impact: Already Converging!Geospatial as an Accelerator of Impact: Already Converging!
Geospatial as an Accelerator of Impact: Already Converging!Dawn Wright
 
Ease Leads to Exposure , Exposure Leads to Adoption
Ease Leads to Exposure, Exposure Leads to AdoptionEase Leads to Exposure, Exposure Leads to Adoption
Ease Leads to Exposure , Exposure Leads to AdoptionDawn Wright
 
Data for the Blue Future: New Collaborations for Progress
Data for the Blue Future: New Collaborations for ProgressData for the Blue Future: New Collaborations for Progress
Data for the Blue Future: New Collaborations for ProgressDawn Wright
 
AGU Sharing Science - Social Media Tips
AGU Sharing Science - Social Media TipsAGU Sharing Science - Social Media Tips
AGU Sharing Science - Social Media TipsDawn Wright
 
The Perils and Promise of Environmental Data Science
The Perils and Promise of Environmental Data ScienceThe Perils and Promise of Environmental Data Science
The Perils and Promise of Environmental Data ScienceDawn Wright
 
Discovery, Technology, Hope: Colorado College Roberts Symposium
Discovery, Technology, Hope: Colorado College Roberts SymposiumDiscovery, Technology, Hope: Colorado College Roberts Symposium
Discovery, Technology, Hope: Colorado College Roberts SymposiumDawn Wright
 
Marie Tharp, Giants of Tectonophysics Session, American Geophysical Union
Marie Tharp, Giants of Tectonophysics Session, American Geophysical UnionMarie Tharp, Giants of Tectonophysics Session, American Geophysical Union
Marie Tharp, Giants of Tectonophysics Session, American Geophysical UnionDawn Wright
 
52 Million Points and Counting: A New Stratification Approach for Mapping Glo...
52 Million Points and Counting: A New Stratification Approach for Mapping Glo...52 Million Points and Counting: A New Stratification Approach for Mapping Glo...
52 Million Points and Counting: A New Stratification Approach for Mapping Glo...Dawn Wright
 
Discovery, Technology, Hope
Discovery, Technology, HopeDiscovery, Technology, Hope
Discovery, Technology, HopeDawn Wright
 
Toward Easy Export of Imagery Products and Feature Classes as Training Data f...
Toward Easy Export of Imagery Products and Feature Classes as Training Data f...Toward Easy Export of Imagery Products and Feature Classes as Training Data f...
Toward Easy Export of Imagery Products and Feature Classes as Training Data f...Dawn Wright
 
Integrated GIS/Machine-Learning Workflows - Seagrass Use Case
Integrated GIS/Machine-Learning Workflows - Seagrass Use CaseIntegrated GIS/Machine-Learning Workflows - Seagrass Use Case
Integrated GIS/Machine-Learning Workflows - Seagrass Use CaseDawn Wright
 
Swells, Soundings, and Sustainability in the Oceans
Swells, Soundings, and Sustainability  in the OceansSwells, Soundings, and Sustainability  in the Oceans
Swells, Soundings, and Sustainability in the OceansDawn Wright
 
University of Redlands Symposium 2018
University of Redlands Symposium 2018University of Redlands Symposium 2018
University of Redlands Symposium 2018Dawn Wright
 
Your Knowledge, Our Community, the Ocean's Resilience
Your Knowledge, Our Community, the Ocean's ResilienceYour Knowledge, Our Community, the Ocean's Resilience
Your Knowledge, Our Community, the Ocean's ResilienceDawn Wright
 
Socialspatial Research for Communities: Telling the Story of People and Place
Socialspatial Research for Communities: Telling the Story of People and PlaceSocialspatial Research for Communities: Telling the Story of People and Place
Socialspatial Research for Communities: Telling the Story of People and PlaceDawn Wright
 
Ocean Solutions, Earth Solutions
Ocean Solutions, Earth SolutionsOcean Solutions, Earth Solutions
Ocean Solutions, Earth SolutionsDawn Wright
 
Ecological Marine Units: A New Public-Private Partnership for the Global Ocean
Ecological Marine Units: A New Public-Private Partnership for the Global OceanEcological Marine Units: A New Public-Private Partnership for the Global Ocean
Ecological Marine Units: A New Public-Private Partnership for the Global OceanDawn Wright
 
A Dark Side to Data-Centric Geography? Where are the Reward Systems?
A Dark Side to Data-Centric Geography? Where are the Reward Systems?A Dark Side to Data-Centric Geography? Where are the Reward Systems?
A Dark Side to Data-Centric Geography? Where are the Reward Systems?Dawn Wright
 
Esri and the Scientific Community
Esri and the Scientific CommunityEsri and the Scientific Community
Esri and the Scientific CommunityDawn Wright
 
Latest Developments in Oceanographic Applications of GIS, including Near-real...
Latest Developments in Oceanographic Applications of GIS, including Near-real...Latest Developments in Oceanographic Applications of GIS, including Near-real...
Latest Developments in Oceanographic Applications of GIS, including Near-real...Dawn Wright
 

Más de Dawn Wright (20)

Geospatial as an Accelerator of Impact: Already Converging!
Geospatial as an Accelerator of Impact: Already Converging!Geospatial as an Accelerator of Impact: Already Converging!
Geospatial as an Accelerator of Impact: Already Converging!
 
Ease Leads to Exposure , Exposure Leads to Adoption
Ease Leads to Exposure, Exposure Leads to AdoptionEase Leads to Exposure, Exposure Leads to Adoption
Ease Leads to Exposure , Exposure Leads to Adoption
 
Data for the Blue Future: New Collaborations for Progress
Data for the Blue Future: New Collaborations for ProgressData for the Blue Future: New Collaborations for Progress
Data for the Blue Future: New Collaborations for Progress
 
AGU Sharing Science - Social Media Tips
AGU Sharing Science - Social Media TipsAGU Sharing Science - Social Media Tips
AGU Sharing Science - Social Media Tips
 
The Perils and Promise of Environmental Data Science
The Perils and Promise of Environmental Data ScienceThe Perils and Promise of Environmental Data Science
The Perils and Promise of Environmental Data Science
 
Discovery, Technology, Hope: Colorado College Roberts Symposium
Discovery, Technology, Hope: Colorado College Roberts SymposiumDiscovery, Technology, Hope: Colorado College Roberts Symposium
Discovery, Technology, Hope: Colorado College Roberts Symposium
 
Marie Tharp, Giants of Tectonophysics Session, American Geophysical Union
Marie Tharp, Giants of Tectonophysics Session, American Geophysical UnionMarie Tharp, Giants of Tectonophysics Session, American Geophysical Union
Marie Tharp, Giants of Tectonophysics Session, American Geophysical Union
 
52 Million Points and Counting: A New Stratification Approach for Mapping Glo...
52 Million Points and Counting: A New Stratification Approach for Mapping Glo...52 Million Points and Counting: A New Stratification Approach for Mapping Glo...
52 Million Points and Counting: A New Stratification Approach for Mapping Glo...
 
Discovery, Technology, Hope
Discovery, Technology, HopeDiscovery, Technology, Hope
Discovery, Technology, Hope
 
Toward Easy Export of Imagery Products and Feature Classes as Training Data f...
Toward Easy Export of Imagery Products and Feature Classes as Training Data f...Toward Easy Export of Imagery Products and Feature Classes as Training Data f...
Toward Easy Export of Imagery Products and Feature Classes as Training Data f...
 
Integrated GIS/Machine-Learning Workflows - Seagrass Use Case
Integrated GIS/Machine-Learning Workflows - Seagrass Use CaseIntegrated GIS/Machine-Learning Workflows - Seagrass Use Case
Integrated GIS/Machine-Learning Workflows - Seagrass Use Case
 
Swells, Soundings, and Sustainability in the Oceans
Swells, Soundings, and Sustainability  in the OceansSwells, Soundings, and Sustainability  in the Oceans
Swells, Soundings, and Sustainability in the Oceans
 
University of Redlands Symposium 2018
University of Redlands Symposium 2018University of Redlands Symposium 2018
University of Redlands Symposium 2018
 
Your Knowledge, Our Community, the Ocean's Resilience
Your Knowledge, Our Community, the Ocean's ResilienceYour Knowledge, Our Community, the Ocean's Resilience
Your Knowledge, Our Community, the Ocean's Resilience
 
Socialspatial Research for Communities: Telling the Story of People and Place
Socialspatial Research for Communities: Telling the Story of People and PlaceSocialspatial Research for Communities: Telling the Story of People and Place
Socialspatial Research for Communities: Telling the Story of People and Place
 
Ocean Solutions, Earth Solutions
Ocean Solutions, Earth SolutionsOcean Solutions, Earth Solutions
Ocean Solutions, Earth Solutions
 
Ecological Marine Units: A New Public-Private Partnership for the Global Ocean
Ecological Marine Units: A New Public-Private Partnership for the Global OceanEcological Marine Units: A New Public-Private Partnership for the Global Ocean
Ecological Marine Units: A New Public-Private Partnership for the Global Ocean
 
A Dark Side to Data-Centric Geography? Where are the Reward Systems?
A Dark Side to Data-Centric Geography? Where are the Reward Systems?A Dark Side to Data-Centric Geography? Where are the Reward Systems?
A Dark Side to Data-Centric Geography? Where are the Reward Systems?
 
Esri and the Scientific Community
Esri and the Scientific CommunityEsri and the Scientific Community
Esri and the Scientific Community
 
Latest Developments in Oceanographic Applications of GIS, including Near-real...
Latest Developments in Oceanographic Applications of GIS, including Near-real...Latest Developments in Oceanographic Applications of GIS, including Near-real...
Latest Developments in Oceanographic Applications of GIS, including Near-real...
 

Último

Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Silpa
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Silpa
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Silpa
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxANSARKHAN96
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....muralinath2
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body Areesha Ahmad
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Silpa
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.Silpa
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptxSilpa
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIADr. TATHAGAT KHOBRAGADE
 

Último (20)

Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 

Feature Geo Analytics and Big Data Processing: Hybrid Approaches for Earth Science and Real-Time Decision Support

  • 1. Feature Geo Analytics and Big Data Processing: Hybrid Approaches for Earth Science and Real-Time Decision Making Mansour Raad, Erik Hoel, Michael Park, Adam Mollenkopf, Dawn J. Wright Environmental Systems Research Institute (aka Esri) IN12A-01 (Invited) AGU Fall Meeting, 12 December 2016
  • 2. What is Feature Geo Analytics? A new way of processing spatiotemporal data designed for WEB- BASED big data by leveraging distributed analytics and storage • Works with existing GIS data and tabular data • Designed to perform both spatial and temporal analysis • Uses familiar workflows to complete complex analyses • “Hybridity” - integrating open-source frameworks on clusters to run analytics
  • 3. Feature Geo Analytics Geoprocessing Distributed analytics and storage Feature Geo Analytics Portal Web GIS Layers newmore extends
  • 4. Solve New Problems Run analytics: • against data too big for a single desktop machine - Buffer 8.2 million points or thousands of polygons in a little over a minute - billions of observations of ship movements ingested via GeoEvent • designed to gain insight into both spatial and temporal patterns • against massive collections in a scalable manner • and meet time constraints months weeks days hours minutes
  • 5. Geo Analytics Architectural Overview Portal Web GIS Layers Un-Managed Data New Web GIS Layers Register large data stores, then distribute spatial analysis across cluster of machines for parallel processing Store and/or deploy to web Web GIS layers via Pro, Portal, Python Notebooks, or the REST API Managed Data Relational Data Store Spatiotemporal Data Store Files Files Delimited Files EnterpriseShapefiles Big Data Stores Server Cluster
  • 6. Rich Collection of (Web) Analysis Tools Summarize Data Aggregate Points Summarize Nearby Summarize Within Reconstruct Tracks Join Features Find Locations Find Existing Locations Find Similar Locations Analyze Patterns Calculate Density Find Hot Spots Create Space Time Cube Use Proximity Create Buffers Manage Data Extract Data * Temporally aware tools Aggregate Points Summarize Nearby Summarize Within Find Existing Locations Find Similar Locations Calculate Density Find Hot Spots Create Buffers Extract Data
  • 7. Analytical Overview: Aggregating and Summarizing • Spatial Joins • Space-time slices
  • 8. • Spatiotemporal joins Target Features Join Features Intermediate Result Final Result Analytical Overview: Aggregating and Summarizing
  • 10. • Points into Bins Analytical Overview: Aggregating and Summarizing
  • 11. Aggregation – Polygons vs Cells Aggregation By Polygons Aggregation By Cells
  • 12. • Reconstruct Tracks - Summarize time-enabled points into tracks Analytical Overview: Aggregating and Summarizing
  • 13. Use Case: Hurricane Tracts • Hurricane dataset - 120,000 points, ~100 years - Each point has: - ID number - Location - Date - Wind speed and pressure attributes - Problems? - Difficult to visualize that many points - Difficult to visualize hurricane path
  • 14. “Hybridity” for Distributed Computation See also www.esri.com/software/open
  • 15. “Hybridity” for Distributed Computation See also www.esri.com/software/open
  • 16. Real-Time GIS Performance ArcGIS 10.4 10s of thousands of e/s ArcGIS Spatiotemporal Big Data Store DesktopWeb Device ArcGIS Server 4,000 e/s Ingestion GeoEvent 4,000 e/s Visualization Live and Historic Aggregates & Features Enhanced Map and Feature Service • Ingest high-velocity real- time data • Observations in a Big Data Store • Visualize high-velocity, high-volume data - as an AGGREGATION, - as discrete FEATURES, - live & HISTORICALLY • Visualizations CAN scale Stream Service Stream Layer 3,000 e/s Live Features Geo Analytics Performance Spatiotemporal Big Data Store
  • 17. Discussion groups at geonet.esri.com Step 1. Click orange “Join in” button to create your account. Step 2. Join the Big Data or Sciences groups Step 3. Contribute to AGU conversations! Mansour Raad, Esri Big Data Team mraad@esri.com thunderheadxpler.blogspot.com github.com/mraad @mraad For Questions/Discussion

Notas del editor

  1. “hybrid” in that ArcGIS Server integrates open-source big data frameworks such as Apache Hadoop and Apache Spark on the cluster in order to run the analytics
  2. Building blocks of this approach
  3. buffer 8.2 million points or thousands of polygons in a little over a minute Meet time constraints, especially against the next NSF proposal deadlines
  4. These “feature geo analytics” tools run in both batch and streaming spatial analysis mode as distributed computations across a cluster of servers on typical “big” data sets, where static data exist in traditional geospatial formats (e.g., shapefile) locally on a disk or file share, attached as static spatiotemporal big data stores, or streamed in near-real-time. In other words, the approach registers large datasets or data stores with ArcGIS Enterprise (Server), then distributes analysis across a cluster of machines for parallel processing. We aim to register large data stores / data sets with ArcGIS Server, then distribute analysis across a cluster of machines for parallel processing Many frameworks/technologies exist for distributing computation E.g., Hadoop, MapReduce, Spark Spark: processes distributed data in memory; Supports MapReduce programming model Includes additional framework level distributed algorithms ArcGIS Server integrates these technologies on a cluster to solve analytic problems
  5. Due to lack of time, will focus on Aggregation and Summarizing
  6. Many frameworks/technologies exist for distributing computation E.g., Hadoop, MapReduce, Spark Spark: processes distributed data in memory; Supports MapReduce programming model Includes additional framework level distributed algorithms ArcGIS Server integrates these technologies on a cluster to solve analytic problems
  7. For fast, dynamic queries, integrate Cloudera Impala which is an open-source query engine that runs on Apache Hadoop (Hadoop Distributed File System). Delivers fast SQL processing on HDFS Read/write data in HDFS using Impala Write code in Python, Java, Scala (like C, ”scalable language”) ArcPy helps you to perform geographic data analysis in Python By the way, you’ll need at least 8 CPU cores 16 Gb RAM (32 Gb is better) 512 Gb Solid State Drive (1 Tb is better)
  8. e/s = events per second We aim to register large data stores / data sets with ArcGIS Server, then distribute analysis across a cluster of machines for parallel processing Performance example: buffer 8.2 million points or thousands of polygons in a little over a minute, Coming: ~250,000 writes to disk per second across 5 nodes Many frameworks/technologies exist for distributing computation E.g., Hadoop, MapReduce, Spark Spark: processes distributed data in memory; Supports MapReduce programming model Includes additional framework level distributed algorithms ArcGIS Server integrates these technologies on a cluster to solve analytic problems