Moving Healthcare Analytics to Hadoop to build
better predictive models - Saving Cost and Lives
Dr. Joe Colorafi – Dignity Health CMIO
Dr. Graham Hughes – SAS CMIO
Bill Guise – Dignity Health Senior Director IT
Sunil Kakade – Dignity Health Director IT
June 11th, 2015
Agenda
• Dignity Health
• Emerging Healthcare Landscape
• What outcomes will we enable?
• Big Data Ecosystem
• Hadoop Architecture for Healthcare Analytics
Healthcare Changing Landscape - Business Performance Challenges
Analytics is a mission-critical enterprise capability that drives transparency, insights, collaboration and operational improvements
Business Performance
• Increasing Regulatory Pressure
• Decelerating price growth
• Continuing Cost Pressure
• Shifting Payer Mix
• Rapidly evolving technology
• Deteriorating Case Mix
The potential for Analytics in Healthcare is huge
For example, every year, severe sepsis strikes more than a million Americans. It’s been estimated that between 28 and 50 percent of these people die, far more than the number of U.S. deaths from prostate cancer, breast cancer and AIDS combined.***
*** National Institute of General Medical Sciences
Bringing benefits of Big Data to Health systems
What if Dignity Health’s
• 30 TB of clinical data
• 100 billion rows of data
• 8,000 unique provider users
• 1.2 million medication doses per year
could be turned into
• Right Information
• Right Time
• Right Form
• Right People & Processes
Healthcare Analytics Challenges
Patient
Data is everywhere but trapped in a myriad of silos
• High Complexity
• High Variety
• Fast Data
• Privacy
Healthcare Analytics Challenges
• Legacy Systems
• Rigid data formats
• Unstructured
• Dark Data
Copyright © 2015, SAS Institute Inc. All rights reserved.
BUILDING NEW ANALYTICS CULTURE WITH BIG DATA
• Infinite Volume and Variety of Data
• Disruptive Technology
• New Problem-solving Mindset
• Unrivaled Processing Power
Dignity Health Insights Big Data Ecosystem
One Platform, Many Data Sources, Multiple Workloads, All Consumers
Unified Data Integration – Source Data Once, Analyze Multiple Times
Layer labels:
• Analytic Capability
• SAS - Hadoop Integration
• Open Source Hadoop Platform with Unified Dignity Health processes
• Dignity Health Data Sources
• Open Access Data Exploration
• SAS Intelligence Security
Component labels:
• NoSQL
• Logs
• Social Media
• Sensors
• Legacy Platforms
• Cloudera’s Hadoop Distribution
• EHR
• CERNER
• Lab
• Patient Sat.
• ADT
• MS4 Billing
• Pop Health
• Predictive Analytics
• SAS Enterprise Business Intelligence Platform
• Unified Security Model
• SAS Data Governance
• SAS Visual Analytics Platform
• Text Mining
• Forecasting & Optimization
• Machine Learning
• Real-time Analytics
• Data Science
• Unified Audit and Logging
• HIE
• CMS
• Unified Privacy and Compliance
• Unified Person Master Index
• Unified API Platform
• Unified Enterprise Data Model
• Public Data Sources
• RDBMS
Hadoop and SAS can enable analytics Spectrum
Hadoop and SAS can enable full analytics Spectrum
Roadmap for building the Dignity Health Insights Big Data Hub
1. Distributed Storage/Computing - Hadoop Ecosystem
2. Compliance - Audit & Logging
3. Security - SAS Intelligence
4. Data Governance - DataFlux
5. Analytics - SAS Enterprise Miner
6. SAS Visual Analytics
Dignity Health Insights
Open-source, cloud-based Big Data Ecosystem Architecture
• EHR, Lab, Patient Sat., ADT, Billing, Pop Health, HIE, CMS, Cerner
• Secure FTP – SQOOP – FLUME
• Big Data Ecosystem
• SAS Analytics Products, SAS Visual Analytics
• SAS Intelligence Platform: Role Based, LDAP integrated and Metadata level security
Copyright © 2014, SAS Institute Inc. All rights reserved.
SAS & HADOOP - THE PRAGMATIC APPROACH
• Prepare data IN Hadoop for analytics
• Move data FROM Hadoop into a SAS environment
• Deploy and manage model score code IN Hadoop
• Lift data IN to memory for analytics at scale
• Model data at scale in-memory WITH advanced modeling tools
• Explore data at scale, in-memory WITH data visualization
Use the right approach for what needs to be done!
Copyright © 2012, SAS Institute Inc. All rights reserved.
ENABLING THE DATA TO DECISION LIFECYCLE WITH SAS AND HADOOP ECOSYSTEM
Access & Manage Data
• Advanced data management capabilities (ELT, ETL, DQ, virtualization) enabled for Hadoop
Interactively Explore & Visualize
• Quickly visualize data in Hadoop, discover new patterns, publish reports via web reports, mobile devices, MS Office apps
Analyze & Model
• Uncover patterns and trends in Hadoop data
• Interactive and visual environment for analytics
• Apply domain-specific high-performance analytics
Deploy & Integrate
• Automatically deploy and score analytic models in the parallel environment
• Manage & analyze real-time data
HADOOP + SAS - DESIGN PATTERNS
• Traditional SAS (diagram labels: Data Store, SAS)
• In-Database (diagram labels: Data Store, SAS, Data)
• In-Memory (diagram labels: Data Store, SAS, Data, Memory, Data)
These approaches are complementary & can be combined for maximum effect
Enterprise Miner with Hadoop (Model Dev)
• Small Data Volumes: Enterprise Miner with Access to Hadoop, working against the Hadoop Cluster. Data is pulled to the EM server and computation happens on the EM server.
• Everything Else: Enterprise Miner High Performance Analytics, working against the Hadoop Cluster. Code is pushed to the Hadoop cluster and computation is executed on the cluster.
Flow labels in the diagram: HiveQL, Data, Modeling Code, Results
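The difference between pulling data to the server and pushing code to the cluster can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the nested list `cluster` is a made-up stand-in for data spread across cluster nodes, the function names `mean_pull` and `mean_push` are hypothetical, and nothing here calls SAS, Enterprise Miner, or Hadoop.

```python
from typing import List, Tuple

# Hypothetical stand-in for a cluster: each inner list is the data on one node.
Partition = List[float]


def mean_pull(partitions: List[Partition]) -> float:
    """Pull pattern: copy every row to one server, then compute there."""
    rows = [value for part in partitions for value in part]  # full data movement
    return sum(rows) / len(rows)


def mean_push(partitions: List[Partition]) -> float:
    """Push pattern: run a small function next to each partition, combine summaries."""

    def local_summary(part: Partition) -> Tuple[float, int]:
        # Only a (sum, count) pair leaves each node, not the rows themselves.
        return sum(part), len(part)

    summaries = [local_summary(part) for part in partitions]
    total = sum(s for s, _ in summaries)
    count = sum(n for _, n in summaries)
    return total / count


cluster = [[2.0, 4.0], [6.0], [8.0, 10.0, 12.0]]
pulled = mean_pull(cluster)
pushed = mean_push(cluster)
```

Both functions return the same mean. The push version moves one small summary per node instead of every row, which is why it suits the larger data volumes.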
VISUAL ANALYTICS WITH SAS + HADOOP
• Small Data Volumes: SMP Architecture (LASR Analytic Server, Hadoop Cluster). Data is pulled into memory on the single machine.
• Everything Else: MPP Architecture (LASR Analytic Server, Hadoop Cluster). Data is pulled in parallel from the Hadoop cluster data nodes directly to the LASR worker nodes.
Biosurveillance: Sepsis Analytics with Hadoop and SAS Visual Analytics
Analyze Sepsis Alerts
• By Mortality Rate
• By Provider Response
• By Length of Stay
• By Facilities
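As a rough illustration of the breakdowns listed for sepsis alerts, the sketch below groups a handful of made-up alert records by facility and computes mortality rate, mean length of stay, and mean provider response time. The record fields (`facility`, `died`, `los_days`, `response_min`) and all values are hypothetical and are not taken from the presentation. The deck describes doing this kind of analysis in SAS Visual Analytics, not in Python.

```python
from collections import defaultdict
from statistics import mean

# Made-up alert records; field names and values are assumptions for illustration.
alerts = [
    {"facility": "A", "died": False, "los_days": 4, "response_min": 30},
    {"facility": "A", "died": True, "los_days": 10, "response_min": 90},
    {"facility": "B", "died": False, "los_days": 6, "response_min": 45},
    {"facility": "B", "died": False, "los_days": 2, "response_min": 15},
]


def summarize_by_facility(records):
    """Group alerts by facility and compute the per-facility measures."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec["facility"]].append(rec)

    summary = {}
    for facility, recs in groups.items():
        summary[facility] = {
            "alerts": len(recs),
            # True counts as 1, so the sum is the number of deaths.
            "mortality_rate": sum(r["died"] for r in recs) / len(recs),
            "mean_los_days": mean(r["los_days"] for r in recs),
            "mean_response_min": mean(r["response_min"] for r in recs),
        }
    return summary


summary = summarize_by_facility(alerts)
```

Swapping the grouping key from `facility` to a provider or unit field would give the other cuts on the slide with the same function shape.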
Dignity Health Insights
Cloud-based, Security-Compliant Big Data Architecture
• EHR, Lab, Patient Sat., ADT, Billing, Pop Health, HIE, CMS, Cerner
• Secure FTP – SQOOP – FLUME
• HADOOP Ecosystem
• SAS Analytics Products, SAS Visual Analytics
• SAS Intelligence Platform: Role Based, LDAP integrated and Metadata level security
The Technology: Dignity Health Insights
Secure Cloud
• Social Community: UI, Authenticate, Submit & Request Data, Navigate & Access Applications, Collaborate & Share Insights
• Packaged Analytic Applications & Actionable Insights: Predictive Models, Benchmarks, Actions/Alerting – Clinical, Administrative, Operations, Financial, Quality, Gaming Theory
• Analytic Tools Foundation: Data Connectivity, Data Quality, Visualization, Segmentation, Data Mining, Forecasting, Audit Trails, NLP, Machine Intelligence
• Storage & Data: Bladed Environment - EDW, ODS, Marts, Hadoop + Customer Data, In-Memory Databases, Virtual Data Marts
Data
• EHR, Lab, Patient Sat., Reg/Adt, Billing, Other
Insights
• Patients
• EHR
• Call Center
• Visualization
• MS Office
• Regulatory reporting
• Partners
Users
• Patients
• Clinicians
• Managers
• Analysts
• IT Staff
Any Customer, Any Channel, Any Device
Any Input Source: Internal/External
Any Service: Any vertical or horizontal slice of stack
Any Output Destination: Internal/External, 3rd Party App
Dignity Health Insights
One Platform, Many Data Sources, Multiple Workloads, All Consumers
• Manage Financial Risks & Incentives
• Proactively Manage Care Quality & Outcomes
• Improve Efficiency of Care Delivery
• Population Health and Engage Patients
Capabilities used in Sepsis Biosurveillance
• Open Source Big Data Platform
• User Authentication with Dignity Security System
• Audit and Logging
• Integration with Registration and Clinical systems
• SAS Enterprise Business Intelligence
• SAS Visual Analytics
• SAS Data Quality tools
• Mobile Delivery Pilot
Desired Export
Dignity Health Insights use cases are endless
• Patient readmission reduction Predictive Model
• Broad System Exploration with Speed – COMPASS
• Legacy Reporting System modernization
• Pharmacy Analytics
• UMPI – Universal Master Patient Index
24

More Related Content

What's hot

Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterpriseUsing Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
DataWorks Summit
 
SDM (Standardized Data Management) - A Dynamic Adaptive Ingestion Frameworks ...
SDM (Standardized Data Management) - A Dynamic Adaptive Ingestion Frameworks ...SDM (Standardized Data Management) - A Dynamic Adaptive Ingestion Frameworks ...
SDM (Standardized Data Management) - A Dynamic Adaptive Ingestion Frameworks ...
DataWorks Summit
 
The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022
Kai Wähner
 

What's hot (20)

Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterpriseUsing Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake House
 
Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...
Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...
Behind the Buzzword: Understanding Customer Data Platforms in the Light of Pr...
 
利用Denodo平台安全地进行数据共享
利用Denodo平台安全地进行数据共享利用Denodo平台安全地进行数据共享
利用Denodo平台安全地进行数据共享
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
 
Microsoft Fabric.pptx
Microsoft Fabric.pptxMicrosoft Fabric.pptx
Microsoft Fabric.pptx
 
MongoDB World 2019: The Journey of Migration from Oracle to MongoDB at Rakuten
MongoDB World 2019: The Journey of Migration from Oracle to MongoDB at RakutenMongoDB World 2019: The Journey of Migration from Oracle to MongoDB at Rakuten
MongoDB World 2019: The Journey of Migration from Oracle to MongoDB at Rakuten
 
Change Data Streaming Patterns for Microservices With Debezium
Change Data Streaming Patterns for Microservices With Debezium Change Data Streaming Patterns for Microservices With Debezium
Change Data Streaming Patterns for Microservices With Debezium
 
Building a Big Data Pipeline
Building a Big Data PipelineBuilding a Big Data Pipeline
Building a Big Data Pipeline
 
Data In AI_한화시스템_김유신.pdf
Data In AI_한화시스템_김유신.pdfData In AI_한화시스템_김유신.pdf
Data In AI_한화시스템_김유신.pdf
 
Architecting a datalake
Architecting a datalakeArchitecting a datalake
Architecting a datalake
 
Neo4j 4.1 overview
Neo4j 4.1 overviewNeo4j 4.1 overview
Neo4j 4.1 overview
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
 
A short introduction to Canis Major
A short introduction to Canis MajorA short introduction to Canis Major
A short introduction to Canis Major
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Power BI Architecture
Power BI ArchitecturePower BI Architecture
Power BI Architecture
 
SDM (Standardized Data Management) - A Dynamic Adaptive Ingestion Frameworks ...
SDM (Standardized Data Management) - A Dynamic Adaptive Ingestion Frameworks ...SDM (Standardized Data Management) - A Dynamic Adaptive Ingestion Frameworks ...
SDM (Standardized Data Management) - A Dynamic Adaptive Ingestion Frameworks ...
 
Introduction to NOSQL databases
Introduction to NOSQL databasesIntroduction to NOSQL databases
Introduction to NOSQL databases
 
The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022
 
Overview on Azure Machine Learning
Overview on Azure Machine LearningOverview on Azure Machine Learning
Overview on Azure Machine Learning
 

Viewers also liked

Using Big Data for Improved Healthcare Operations and Analytics
Using Big Data for Improved Healthcare Operations and AnalyticsUsing Big Data for Improved Healthcare Operations and Analytics
Using Big Data for Improved Healthcare Operations and Analytics
Perficient, Inc.
 
Sanjeevani ehr product-brochure
Sanjeevani ehr product-brochure Sanjeevani ehr product-brochure
Sanjeevani ehr product-brochure
Santosh Kedhari
 
media pembelajaran matematika SDN 1 GLAGAH kelas 4 semester 1
media pembelajaran matematika SDN 1 GLAGAH kelas 4 semester 1media pembelajaran matematika SDN 1 GLAGAH kelas 4 semester 1
media pembelajaran matematika SDN 1 GLAGAH kelas 4 semester 1
gita12005202
 

Viewers also liked (20)

Health Insurance Predictive Analysis with Hadoop and Machine Learning. JULIEN...
Health Insurance Predictive Analysis with Hadoop and Machine Learning. JULIEN...Health Insurance Predictive Analysis with Hadoop and Machine Learning. JULIEN...
Health Insurance Predictive Analysis with Hadoop and Machine Learning. JULIEN...
 
Hadoop Enabled Healthcare
Hadoop Enabled HealthcareHadoop Enabled Healthcare
Hadoop Enabled Healthcare
 
Using Big Data for Improved Healthcare Operations and Analytics
Using Big Data for Improved Healthcare Operations and AnalyticsUsing Big Data for Improved Healthcare Operations and Analytics
Using Big Data for Improved Healthcare Operations and Analytics
 
4 Essential Lessons for Adopting Predictive Analytics in Healthcare
4 Essential Lessons for Adopting Predictive Analytics in Healthcare4 Essential Lessons for Adopting Predictive Analytics in Healthcare
4 Essential Lessons for Adopting Predictive Analytics in Healthcare
 
Sanjeevani ehr product-brochure
Sanjeevani ehr product-brochure Sanjeevani ehr product-brochure
Sanjeevani ehr product-brochure
 
SAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleSAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at Scale
 
Ehr models, standards and semantic interoperability
Ehr models, standards and semantic interoperabilityEhr models, standards and semantic interoperability
Ehr models, standards and semantic interoperability
 
SAS Modernization architectures - Big Data Analytics
SAS Modernization architectures - Big Data AnalyticsSAS Modernization architectures - Big Data Analytics
SAS Modernization architectures - Big Data Analytics
 
media pembelajaran matematika SDN 1 GLAGAH kelas 4 semester 1
media pembelajaran matematika SDN 1 GLAGAH kelas 4 semester 1media pembelajaran matematika SDN 1 GLAGAH kelas 4 semester 1
media pembelajaran matematika SDN 1 GLAGAH kelas 4 semester 1
 
( Big ) Data Management - Collect - Global concepts in 5 slides
( Big ) Data Management - Collect - Global concepts in 5 slides( Big ) Data Management - Collect - Global concepts in 5 slides
( Big ) Data Management - Collect - Global concepts in 5 slides
 
HPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare TransformationHPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare Transformation
 
Big Data in Medicine
Big Data in MedicineBig Data in Medicine
Big Data in Medicine
 
RHadoop
RHadoopRHadoop
RHadoop
 
BigData in Health Care Systems with IOT
BigData in Health Care Systems with IOTBigData in Health Care Systems with IOT
BigData in Health Care Systems with IOT
 
Electronic Health Records Implementation
Electronic Health Records ImplementationElectronic Health Records Implementation
Electronic Health Records Implementation
 
Lower Total Cost of Care and Gain Valuable Patient Insights through Predictiv...
Lower Total Cost of Care and Gain Valuable Patient Insights through Predictiv...Lower Total Cost of Care and Gain Valuable Patient Insights through Predictiv...
Lower Total Cost of Care and Gain Valuable Patient Insights through Predictiv...
 
Big data in healthcare
Big data in healthcareBig data in healthcare
Big data in healthcare
 
The US Healthcare Industry
The US Healthcare IndustryThe US Healthcare Industry
The US Healthcare Industry
 
Big Data Analytics in Healthcare
Big Data Analytics in HealthcareBig Data Analytics in Healthcare
Big Data Analytics in Healthcare
 
Running Spark and MapReduce together in Production
Running Spark and MapReduce together in ProductionRunning Spark and MapReduce together in Production
Running Spark and MapReduce together in Production
 

Similar to Moving Health Care Analytics to Hadoop to Build a Better Predictive Model

Thesis blending big data and cloud -epilepsy global data research and inform...
Thesis  blending big data and cloud -epilepsy global data research and inform...Thesis  blending big data and cloud -epilepsy global data research and inform...
Thesis blending big data and cloud -epilepsy global data research and inform...
Anup Singh
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
Raul Chong
 

Similar to Moving Health Care Analytics to Hadoop to Build a Better Predictive Model (20)

Webinar: Leveraging big data in life sciences & healthcare
Webinar: Leveraging big data in life sciences & healthcareWebinar: Leveraging big data in life sciences & healthcare
Webinar: Leveraging big data in life sciences & healthcare
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
 
Sapiens data science and snowflake data warehouse
Sapiens data science and snowflake data warehouseSapiens data science and snowflake data warehouse
Sapiens data science and snowflake data warehouse
 
Thesis blending big data and cloud -epilepsy global data research and inform...
Thesis  blending big data and cloud -epilepsy global data research and inform...Thesis  blending big data and cloud -epilepsy global data research and inform...
Thesis blending big data and cloud -epilepsy global data research and inform...
 
Expand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big DataExpand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big Data
 
ABD209_Accelerating the Speed of Innovation with a Data Sciences Data & Analy...
ABD209_Accelerating the Speed of Innovation with a Data Sciences Data & Analy...ABD209_Accelerating the Speed of Innovation with a Data Sciences Data & Analy...
ABD209_Accelerating the Speed of Innovation with a Data Sciences Data & Analy...
 
Big Data
Big DataBig Data
Big Data
 
Data-driven Healthcare
Data-driven HealthcareData-driven Healthcare
Data-driven Healthcare
 
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks
 
IoT Crash Course Hadoop Summit SJ
IoT Crash Course Hadoop Summit SJIoT Crash Course Hadoop Summit SJ
IoT Crash Course Hadoop Summit SJ
 
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoop
 
SAP HANA in Healthcare: Real-Time Big Data Analysis
SAP HANA in Healthcare: Real-Time Big Data AnalysisSAP HANA in Healthcare: Real-Time Big Data Analysis
SAP HANA in Healthcare: Real-Time Big Data Analysis
 
Accelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success StoriesAccelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success Stories
 
Lesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptxLesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptx
 
Big Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesBig Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture Capabilities
 
Top 10 data science technologies
Top 10 data science technologiesTop 10 data science technologies
Top 10 data science technologies
 
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop Platform
 

More from DataWorks Summit

HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
DataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Moving Health Care Analytics to Hadoop to Build a Better Predictive Model

  • 1. Moving Healthcare Analytics to Hadoop to build better predictive models - Saving Cost and Lives. Dr. Joe Colorafi – Dignity Health CMIO; Dr. Graham Hughes – SAS CMIO; Bill Guise – Dignity Health Senior Director IT; Sunil Kakade – Dignity Health Director IT. June 11th, 2015
  • 2. Agenda: Dignity Health; Emerging Healthcare Landscape; What outcomes will we enable?; Big Data Ecosystem; Hadoop Architecture for Healthcare Analytics
  • 3. 3
  • 4. 4
  • 5. Healthcare Changing Landscape - Business Performance Challenges. Analytics is a mission-critical enterprise capability to drive transparency, insights, collaboration, and operational improvements. Pressures on business performance: increasing regulatory pressure; decelerating price growth; continuing cost pressure; shifting payer mix; rapidly evolving technology; deteriorating case mix
  • 6. For example, every year, severe sepsis strikes more than a million Americans. It’s been estimated that between 28 and 50 percent of these people die, far more than the number of U.S. deaths from prostate cancer, breast cancer and AIDS combined***. The potential for Analytics in Healthcare is huge. *** National Institute of General Medical Sciences
  • 7. Bringing benefits of Big Data to Health systems. What if Dignity Health’s 30 TB of clinical data (100 billion rows of data, 8000 unique provider users, 1.2 million meds doses/year) could be turned into the right information, at the right time, in the right form, for the right people and processes?
  • 8. Healthcare Analytics Challenges. Patient: Data is everywhere but trapped in a myriad of silos. High Complexity; High Variety; Fast Data; Privacy
  • 9. Healthcare Analytics Challenges: Legacy Systems; Rigid data formats; Unstructured; Dark Data
  • 10. Building New Analytics Culture with Big Data: Infinite Volume and Variety of Data; Unrivaled Processing Power; Disruptive Technology; New Problem-solving Mindset. Copyright © 2015, SAS Institute Inc. All rights reserved.
  • 11. Dignity Health Insights Big Data Ecosystem: One Platform, Many Data Sources, Multiple Workloads, All Consumers. Dignity Health Data Sources: EHR, CERNER, Lab, Patient Sat., ADT, MS4, Billing, Pop Health, HIE, CMS, Public Data Sources, RDBMS, NoSQL, Logs, Social Media, Sensors, Legacy Platforms. Unified Data Integration – Source Data Once, Analyze Multiple Times. Open Source Hadoop Platform with Unified Dignity Health processes (Cloudera’s Hadoop Distribution): Unified Security Model; Unified Audit and Logging; Unified Privacy and Compliance; Unified Person Master Index; Unified API Platform; Unified Enterprise Data Model. SAS - Hadoop Integration: SAS Enterprise Business Intelligence Platform; SAS Data Governance; SAS Visual Analytics Platform; SAS Intelligence Security. Analytic Capability: Predictive Analytics; Text Mining; Forecasting & Optimization; Machine Learning; Real-time Analytics; Data Science. Open Access Data Exploration
  • 12. Hadoop and SAS can enable the full analytics spectrum
  • 13. Roadmap of building Dignity Health Insights Big Data Hub: 1. Distributed Storage/Computing - Hadoop Ecosystem; 2. Compliance - Audit & Logging; 3. Security - SAS Intelligence; 4. Data Governance - DataFlux; 5. Analytics - SAS Enterprise Miner; 6. SAS Visual Analytics
  • 14. Big Data Ecosystem Architecture. Dignity Health Insights: Open Source; Based in the cloud. Data sources: EHR, Cerner, Lab, Patient Sat., ADT, Billing, Pop Health, HIE, CMS. Ingestion: Secure FTP – SQOOP – FLUME into the Big Data Ecosystem. Analytics: SAS Analytics Products; SAS Visual Analytics. Security: SAS Intelligence Platform with role-based, LDAP-integrated, and metadata-level security
  • 15. SAS & Hadoop - The Pragmatic Approach. Use the right approach for what needs to be done! Prepare data IN Hadoop for analytics; Move data FROM Hadoop into a SAS environment; Deploy and manage model score code IN Hadoop; Lift data IN to memory for analytics at scale; Model data at scale in-memory WITH advanced modeling tools; Explore data at scale, in-memory WITH data visualization. Copyright © 2014, SAS Institute Inc. All rights reserved.
  • 16. Enabling the Data to Decision Lifecycle with SAS and Hadoop Ecosystem. Access & Manage Data: advanced data management capabilities (ELT, ETL, DQ, virtualization) enabled for Hadoop. Interactively Explore & Visualize: quickly visualize data in Hadoop, discover new patterns, publish reports via web reports, mobile devices, MS Office apps. Analyze & Model: uncover patterns and trends in Hadoop data; interactive and visual environment for analytics; apply domain-specific high-performance analytics. Deploy & Integrate: automatically deploy and score analytic models in the parallel environment; manage & analyze real-time data. Copyright © 2012, SAS Institute Inc. All rights reserved.
  • 17. Hadoop + SAS - Design Patterns: Traditional SAS; In-Database; In-Memory (each diagram pairs a Data Store with SAS). These approaches are complementary & can be combined for maximum effect
  • 18. Enterprise Miner with Hadoop (Model Dev). Small Data Volumes: Enterprise Miner Access to Hadoop; data is pulled to the EM server and computation happens on the EM server. Everything Else: Enterprise Miner High Performance Analytics; code is pushed to the Hadoop cluster and computation is executed on the cluster. Diagram labels: HiveQL, Data, Modeling Code, Results
  • 19. Visual Analytics with SAS + Hadoop. Small Data Volumes: SMP Architecture (LASR Analytic Server, Hadoop Cluster); data is pulled into memory on the single machine. Everything Else: MPP Architecture (LASR Analytic Server, Hadoop Cluster); data is pulled in parallel from the Hadoop cluster data nodes directly to the LASR worker nodes
  • 20. Sepsis Analytics with Hadoop and SAS Visual Analytics. Biosurveillance: analyze sepsis alerts by mortality rate, by provider response, by length of stay, by facilities
  • 21. Cloud-based, Security-Compliant Big Data Architecture (Dignity Health Insights). Data sources: EHR, Cerner, Lab, Patient Sat., ADT, Billing, Pop Health, HIE, CMS. Ingestion: Secure FTP – SQOOP – FLUME into the HADOOP Ecosystem. Analytics: SAS Analytics Products; SAS Visual Analytics. Security: SAS Intelligence Platform with role-based, LDAP-integrated, and metadata-level security
  • 22. The Technology: Dignity Health Insights. One Platform, Many Data Sources, Multiple Workloads, All Consumers. Stack layers: Social Community UI (Authenticate, Submit & Request Data, Navigate & Access Applications, Collaborate & Share Insights); Packaged Analytic Applications & Actionable Insights (Predictive Models, Benchmarks, Actions/Alerting – Clinical, Administrative, Operations, Financial, Quality, Gaming Theory); Analytic Tools Foundation (Data Connectivity, Data Quality, Visualization, Segmentation, Data Mining, Forecasting, Audit Trails, NLP, Machine Intelligence); Storage & Data (Bladed Environment - EDW, ODS, Marts, Hadoop + Customer Data, In-Memory Databases, Virtual Data Marts); Secure Cloud. Data: EHR, Lab, Patient Sat., Reg/Adt, Billing, Other. Insights: Patients, EHR, Call Center, Visualization, MS Office, Regulatory reporting, Partners. Users: Patients, Clinicians, Managers, Analysts, IT Staff. Any Customer, Any Channel, Any Device; Any Input Source Internal/External; Any Service: any vertical or horizontal slice of stack; Any Output Destination Internal/External; 3rd Party App; Desired Export. Goals: Manage Financial Risks & Incentives; Proactively Manage Care Quality & Outcomes; Improve Efficiency of Care Delivery; Population Health and Engage Patients. Capabilities used in Sepsis Biosurveillance: Open Source Big Data Platform; User Authentication with Dignity Security System; Audit and Logging; Integration with Registration and Clinical systems; SAS Enterprise Business Intelligence; SAS Visual Analytics; SAS Data Quality tools; Mobile Delivery Pilot
  • 23. Dignity Health Insights use cases are endless: Patient readmission reduction Predictive Model; Broad System Exploration with Speed – COMPASS; Legacy Reporting System modernization; Pharmacy Analytics; UMPI – Universal Master Patient Index
  • 24. 24
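
Slide 20 lists the dimensions along which sepsis alerts are analyzed (mortality rate, provider response, length of stay, facilities). As a rough illustration only, the sketch below groups a handful of synthetic alert records by facility and computes those summary measures in plain Python. The record fields (`facility`, `died`, `length_of_stay_days`, `response_minutes`), the sample values, and the function name are all hypothetical; the deck does not describe the actual data model or the SAS/Hadoop implementation.

```python
from collections import defaultdict
from statistics import mean

# Synthetic records for illustration only; field names are assumptions,
# not the Dignity Health data model.
alerts = [
    {"facility": "A", "died": False, "length_of_stay_days": 4, "response_minutes": 30},
    {"facility": "A", "died": True, "length_of_stay_days": 10, "response_minutes": 90},
    {"facility": "B", "died": False, "length_of_stay_days": 6, "response_minutes": 45},
]


def summarize_by_facility(records):
    """Group alert records by facility and compute simple summary measures."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec["facility"]].append(rec)

    summary = {}
    for facility, recs in groups.items():
        summary[facility] = {
            "alerts": len(recs),
            # Share of alerts in this facility where the patient died.
            "mortality_rate": sum(r["died"] for r in recs) / len(recs),
            "mean_length_of_stay_days": mean(r["length_of_stay_days"] for r in recs),
            "mean_response_minutes": mean(r["response_minutes"] for r in recs),
        }
    return summary


summary = summarize_by_facility(alerts)
for facility, stats in sorted(summary.items()):
    print(facility, stats)
```

On a cluster the same grouping would typically be expressed as an aggregate query or pushed down to the analytics engine rather than run in a single Python process; the sketch only shows the shape of the calculation.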