Data helping guide decisions in COVID-19 response | Hennepin County
Established an analytics infrastructure to monitor
Number of cases, hospitalizations and deaths
Key county response activities (vaccine administration, PPE delivery, small business support, etc.)
"Official statistics" - referencing ACS / census data here?
Had an inkling that graphs could help us with this. In early 2020 we piloted a few graph technologies
CosmosDB (Microsoft shop)
TigerGraph
Internuntius consulting
Partnership with Carlson Analytics Lab (University of Minnesota)
Successfully modelled interaction between SNAP (food stamp) recipients and community demographics to inform the placement of food shelves.
With these promising results we began to work on a database to collect metrics on the impact of COVID in our community.
It was obvious to everyone that the situation with COVID was changing quickly. Day to day changes in data availability are hard to handle for any organization, and we don't have the most mature tech stack in the world. We knew we needed to follow an iterative process with fast cycles.
Step 1) Implement a new idea, like adding new data, a new summary measure, or improved functionality to the user-facing dashboard
Step 2) Gather feedback on our implementation, and identify gaps alongside subject matter experts
Step 3) Use that feedback to fix issues or plan future improvements
As the project came together, these iterations got closer and closer together. The schemaless nature of the graph storage was a key factor in increasing our development speed.
Here's a look at our final product: an interactive dashboard built in Power BI, using a Neo4j database as its main back-end data store.
This was a win in multiple ways.
First, it confirms our ability to write Cypher queries efficient enough, in both execution time and storage space, to support these types of reports (two resources an interactive dashboard has in short supply).
Second, it showcases our ability to aggregate geography and date hierarchies at any desired summary level.
On the right side, we show a monthly indicator at city-level geographies, but that same rollup can be done for quarters or years, for the full county or for commissioner districts, simply by attaching relationships to those "dimensional" nodes.
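As a sketch of what that rollup looks like in Cypher (the node labels, relationship types, and property names here are illustrative, not our exact schema):

```cypher
// Monthly indicator values rolled up to the city level.
// Swapping :City for :CommissionerDistrict, or :Month for :Quarter,
// changes the summary level without restructuring any tables.
MATCH (v:IndicatorValue)-[:FOR_GEOGRAPHY]->(c:City),
      (v)-[:FOR_PERIOD]->(m:Month)
WHERE v.indicator = 'covid_cases'
RETURN c.name AS city, m.name AS month, sum(v.value) AS total
ORDER BY city, month
```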
One of the first benefits we realized with our COVID graph DB is that a storage model that closely follows the logical structure of the data is easier to use. Development is faster, and technical details are easier to explain to business partners. Questions like "can we summarize this at the county level?" or "can I see all the housing measures for this city?" become a simple modification to the query rather than a complicated join or, worse, a complete rewrite of tables to fit unexpected schemas.
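For example, a question like "all the housing measures for this city" becomes a one-clause pattern change rather than a new join (again, labels and names are illustrative):

```cypher
// All housing-category measures attached to a single city.
MATCH (c:City {name: 'Minneapolis'})<-[:FOR_GEOGRAPHY]-(v:IndicatorValue)
      -[:IN_CATEGORY]->(:Category {name: 'Housing'})
RETURN v.indicator AS measure, v.value AS value
```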
1) Graph relationships - Easier to communicate the capabilities and limitations of each data point
https://commons.wikimedia.org/wiki/File:Jenga_distorted.jpg
We said earlier that the volatile nature of the problem, especially early in the pandemic, impressed upon us the need to iterate rapidly. The schemaless data storage of the graph db was instrumental in this.
We have some measures that have been around for years, some that became available just as the first COVID cases appeared in Minnesota, and others that weren't available for months after.
As the pandemic wound down, some indicators stopped reporting. Others changed summary levels or collection methodology.
Since we weren't pinned down by a particular table schema, we could quickly add new indicators, remove old ones, and modify relationships to keep ourselves on track.
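In practice, adding a new indicator or retiring an old one is a couple of Cypher statements rather than a schema migration (indicator and label names below are hypothetical):

```cypher
// Add a newly available indicator and attach it to the level it reports at.
MERGE (i:Indicator {name: 'vaccine_doses_administered'})
MERGE (g:GeographyLevel {name: 'zip_code'})
MERGE (i)-[:REPORTED_AT]->(g)
```

```cypher
// Retire an indicator that stopped reporting, relationships included.
MATCH (i:Indicator {name: 'discontinued_measure'})
DETACH DELETE i
```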
Cypher / APOC library to load CSVs from "import" folder
Shifted to Python scripts running on the database's server, executing Cypher via the transactional API
Today we run Python scripts remotely, loading data using the Neo4j Python driver
Soon: moving those scripts to the cloud (Databricks on Azure), integrating more closely with existing data pipelines and cloud storage
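A minimal sketch of the current remote-loading approach with the official `neo4j` Python driver; the Cypher query, CSV layout, and connection details are placeholders, not our production pipeline:

```python
import csv
from itertools import islice

# Cypher executed once per batch; UNWIND turns the parameter list into rows.
LOAD_QUERY = """
UNWIND $rows AS row
MERGE (v:IndicatorValue {indicator: row.indicator, period: row.period})
SET v.value = toFloat(row.value)
"""

def batched(iterable, size):
    """Yield lists of up to `size` items from any iterable."""
    it = iter(iterable)
    while batch := list(islice(it, size)):
        yield batch

def load_csv(driver, path, batch_size=1000):
    """Stream a CSV into Neo4j, one write transaction per batch."""
    with open(path, newline="") as f, driver.session() as session:
        for rows in batched(csv.DictReader(f), batch_size):
            session.execute_write(
                lambda tx, rows=rows: tx.run(LOAD_QUERY, rows=rows)
            )

# Usage (assumes a running Neo4j instance and real credentials):
# from neo4j import GraphDatabase
# driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "..."))
# load_csv(driver, "indicators.csv")
```

Batching keeps transactions small, which matters when a daily file has hundreds of thousands of rows.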