SlideShare una empresa de Scribd logo
1 de 20
1
Predicting Patient Outcomes in Real-Time at HCA
Presentation by Allison Baker and Cody Hall
Hospital Corporation of America
Department of Data and Analytics, Clinical Services Group
July 20, 2016
2CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
• Introduction to HCA
• Introduction to our team
• Data science pipeline
• Near real-time architecture
• Real-time architecture
• Current POC goals
Overview
3CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
“Above all else, we are committed to the care and improvement of human life. In
recognition of this commitment, we strive to deliver high-quality, cost-effective
healthcare in the communities we serve.” – HCA Mission Statement
• Hospital Corporation of America (HCA) is the leading healthcare provider in the
country
– 169 hospitals
– 116 freestanding surgery centers in 20 states and the U.K.
• Approximately 233,000 employees across the company
• Over 26 million patient encounters each year
• More than 8 million emergency room visits each year
• About 2 million inpatients treated annually
Hospital Corporation of America
4CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
Where We Are
5CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
Data Science and Data Products Teams
Dr. Martin Tobias
Data Scientist
Sandeepkumar Kothiwale
Data Scientist
Allison Baker
Data Scientist
Dr. Nan Chen
Data Scientist
Kunal Marwah
Data Scientist
Gerardo Castro
Data Scientist
Chris Cate
Data Scientist
Igor Ges
Data Product Engineer
Josh Wolter
BI Developer
Dr. Jesse Spencer-Smith
Director of Data Science
Dr. Edmund Jackson
Chief Data Scientist
VP of Data and Analytics
Warren Sadler
Data Product Engineer
Cody Hall
Development Manager of Data Products
Nick Selleh
Application Engineer
6CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
CRISP-DM and Data Science
7CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
• Begin by asking stakeholders and business owners “What business
decisions will be made with the analysis results?”
• Document all project and product features, timelines and code using
GitHub
• Source historical data using Teradata SQL
• Log all data sourcing and data extract steps using DRAKE
• Options
– Continuous integration
– Jenkins to monitor DRAKE builds
Problem Definition and Data Sourcing
8CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
• Run preliminary visualization
• QA data testing for coverage, outliers, abnormalities, format and structural issues,
frequency, duplication and accuracy
• Pre-process data
– Balance outcomes
– Filter patients
– Remove non-data
• Engineer features
Data Manipulation
9CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
• Analytic server
– 64 cores
– 4 Terabytes of hard disk
– 1.5 Terabytes of RAM
• Iterate models
• Evaluate statistics
Modeling
10CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
• Consider
– Re-defining the problem
– Additional modeling
– Additional data sourcing
• Discuss results with clinical owners and
business stakeholders
– Consider additional features
Interpretation and Reporting
11CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
• We can effectively engineer thousands of clinically and statistically relevant
features.
• We can successfully build accurate, complex and sophisticated predictive
models.
• How do we take these models to the patient bedside?
What Now?
12CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
Delivering Value to the Business
13CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
Near Real-Time Tool
• Consists of 3 main components
– Data source (different than historical training source)
– Scoring engine
– User interface
• Shows early value using a minimally viable product-based approach
• Phases POC to include development time for real-time architecture
• Updates in 15 minute batches
• Provides near real-time predictions
• Solicits feedback from facilities, focusing on accuracy and usefulness
14CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
Data Sources are Constantly Changing
15CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
Prediction Product
Facility + Team
Patient
Kafka
Topic
OpenGate
MS
SQL PostgreSQL
Analytic
Store
HDFS Cluster
Predictive Model
• Single POJO .jar
• Clojure (FE library)
ETL
• Independent SQL process
HDFS Cluster
Data Source
• 15 minute batches
• SQL defined
Data Source
• Streaming
• HL7QL defined
• GitHub & Nexus
• Jenkins
• Tableau
Supporting Infrastructure
• PostgreSQL administration
& monitoring
• Docker with Node JS (UI)
User Interface (UI)
• Displays measures + events
• Notifications of predictions
• Prompt for acknowledgement or
dismissal
• On acknowledgement, disable
notifications for 12 hours
Measures + Events:
Vitals
Lab results
Orders
Demographics
Surgery times
Nursing documentations
Prediction
Measures
+ EventsHL-7
Measures
+ Events
& PredictionHL-7
Measures + Events
HL7QL
(Spark)
Kafka
Topic
EDN Predictive Model + ETL
• Clojure (FE library)/Spark job
• PowderKeg
Measures
+ Events
Data Persistence
Near Real-Time System
Real-Time System
16CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
Real-Time Infrastructure
• Continuously consumes HL7 messages from a Kafka topic and parses via Spark and
HL7QL
• Processes (producers) publish messages to Kafka topics (categories) and
subscriptions are made to the topics to process the message feeds
(consumers)
• Apache Spark is the application interface to allow for cloud computing
• HL7 Query Language (HL7QL) parses the messages
• Scores (predicts) on new streaming information
– Runs a .jar file via a Spark process compiled from Clojure code and H2O POJO
• Deploys with Docker
– Container-based application architecture
• Continuously monitors with Jenkins
17CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
18CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
A Proof of Concept Use Case and Goals
Primary:
1. Assess clinical workflow to identify how the model can support the current clinical
processes for treating negative patient outcomes
2. Determine the model’s capability to extract meaningful information from existing
and available patient data and identify patterns that predict the outcome
3. Determine the usefulness of an early prediction model within a clinical workflow
Secondary:
1. Improve the prediction model through incorporation of feedback provided by the
clinical team
2. Maximize the utility of the prediction tool to improve a clinical workflow for the
facility staff
19CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
Summary
20CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
Questions

Más contenido relacionado

La actualidad más candente

Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...
Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...
Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...Databricks
 
Advanced data science algorithms applied to scalable stream processing by Dav...
Advanced data science algorithms applied to scalable stream processing by Dav...Advanced data science algorithms applied to scalable stream processing by Dav...
Advanced data science algorithms applied to scalable stream processing by Dav...Big Data Spain
 
Fast Data Intelligence in the IoT - real-time data analytics with Spark
Fast Data Intelligence in the IoT - real-time data analytics with SparkFast Data Intelligence in the IoT - real-time data analytics with Spark
Fast Data Intelligence in the IoT - real-time data analytics with SparkBas Geerdink
 
Importance of ML Reproducibility & Applications with MLfLow
Importance of ML Reproducibility & Applications with MLfLowImportance of ML Reproducibility & Applications with MLfLow
Importance of ML Reproducibility & Applications with MLfLowDatabricks
 
VP of WW Partners by Alan Chhabra
VP of WW Partners by Alan ChhabraVP of WW Partners by Alan Chhabra
VP of WW Partners by Alan ChhabraBig Data Spain
 
Spark Summit Keynote by Shaun Connolly
Spark Summit Keynote by Shaun ConnollySpark Summit Keynote by Shaun Connolly
Spark Summit Keynote by Shaun ConnollySpark Summit
 
Building data "Py-pelines"
Building data "Py-pelines"Building data "Py-pelines"
Building data "Py-pelines"Rob Winters
 
Jeeves Grows Up: An AI Chatbot for Performance and Quality
Jeeves Grows Up: An AI Chatbot for Performance and QualityJeeves Grows Up: An AI Chatbot for Performance and Quality
Jeeves Grows Up: An AI Chatbot for Performance and QualityDatabricks
 
Mastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott CordoMastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott CordoSpark Summit
 
Real-time Recommendations for Retail: Architecture, Algorithms, and Design
Real-time Recommendations for Retail: Architecture, Algorithms, and DesignReal-time Recommendations for Retail: Architecture, Algorithms, and Design
Real-time Recommendations for Retail: Architecture, Algorithms, and DesignJuliet Hougland
 
Misusing MLflow To Help Deduplicate Data At Scale
Misusing MLflow To Help Deduplicate Data At ScaleMisusing MLflow To Help Deduplicate Data At Scale
Misusing MLflow To Help Deduplicate Data At ScaleDatabricks
 
Experimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsExperimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsDatabricks
 
Wizard Driven AI Anomaly Detection with Databricks in Azure
Wizard Driven AI Anomaly Detection with Databricks in AzureWizard Driven AI Anomaly Detection with Databricks in Azure
Wizard Driven AI Anomaly Detection with Databricks in AzureDatabricks
 
Build Your Own Recommendation Engine
Build Your Own Recommendation EngineBuild Your Own Recommendation Engine
Build Your Own Recommendation EngineSri Ambati
 
Spark Summit Keynote by Seshu Adunuthula
Spark Summit Keynote by Seshu AdunuthulaSpark Summit Keynote by Seshu Adunuthula
Spark Summit Keynote by Seshu AdunuthulaSpark Summit
 
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris RobisonData Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris RobisonDatabricks
 
An Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time ApplicationsAn Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time ApplicationsJohann Schleier-Smith
 
No sql and sql - open analytics summit
No sql and sql - open analytics summitNo sql and sql - open analytics summit
No sql and sql - open analytics summitOpen Analytics
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanycOpen Analytics
 
Optier presentation for open analytics event
Optier presentation for open analytics eventOptier presentation for open analytics event
Optier presentation for open analytics eventOpen Analytics
 

La actualidad más candente (20)

Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...
Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...
Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...
 
Advanced data science algorithms applied to scalable stream processing by Dav...
Advanced data science algorithms applied to scalable stream processing by Dav...Advanced data science algorithms applied to scalable stream processing by Dav...
Advanced data science algorithms applied to scalable stream processing by Dav...
 
Fast Data Intelligence in the IoT - real-time data analytics with Spark
Fast Data Intelligence in the IoT - real-time data analytics with SparkFast Data Intelligence in the IoT - real-time data analytics with Spark
Fast Data Intelligence in the IoT - real-time data analytics with Spark
 
Importance of ML Reproducibility & Applications with MLfLow
Importance of ML Reproducibility & Applications with MLfLowImportance of ML Reproducibility & Applications with MLfLow
Importance of ML Reproducibility & Applications with MLfLow
 
VP of WW Partners by Alan Chhabra
VP of WW Partners by Alan ChhabraVP of WW Partners by Alan Chhabra
VP of WW Partners by Alan Chhabra
 
Spark Summit Keynote by Shaun Connolly
Spark Summit Keynote by Shaun ConnollySpark Summit Keynote by Shaun Connolly
Spark Summit Keynote by Shaun Connolly
 
Building data "Py-pelines"
Building data "Py-pelines"Building data "Py-pelines"
Building data "Py-pelines"
 
Jeeves Grows Up: An AI Chatbot for Performance and Quality
Jeeves Grows Up: An AI Chatbot for Performance and QualityJeeves Grows Up: An AI Chatbot for Performance and Quality
Jeeves Grows Up: An AI Chatbot for Performance and Quality
 
Mastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott CordoMastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott Cordo
 
Real-time Recommendations for Retail: Architecture, Algorithms, and Design
Real-time Recommendations for Retail: Architecture, Algorithms, and DesignReal-time Recommendations for Retail: Architecture, Algorithms, and Design
Real-time Recommendations for Retail: Architecture, Algorithms, and Design
 
Misusing MLflow To Help Deduplicate Data At Scale
Misusing MLflow To Help Deduplicate Data At ScaleMisusing MLflow To Help Deduplicate Data At Scale
Misusing MLflow To Help Deduplicate Data At Scale
 
Experimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsExperimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOps
 
Wizard Driven AI Anomaly Detection with Databricks in Azure
Wizard Driven AI Anomaly Detection with Databricks in AzureWizard Driven AI Anomaly Detection with Databricks in Azure
Wizard Driven AI Anomaly Detection with Databricks in Azure
 
Build Your Own Recommendation Engine
Build Your Own Recommendation EngineBuild Your Own Recommendation Engine
Build Your Own Recommendation Engine
 
Spark Summit Keynote by Seshu Adunuthula
Spark Summit Keynote by Seshu AdunuthulaSpark Summit Keynote by Seshu Adunuthula
Spark Summit Keynote by Seshu Adunuthula
 
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris RobisonData Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
 
An Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time ApplicationsAn Architecture for Agile Machine Learning in Real-Time Applications
An Architecture for Agile Machine Learning in Real-Time Applications
 
No sql and sql - open analytics summit
No sql and sql - open analytics summitNo sql and sql - open analytics summit
No sql and sql - open analytics summit
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanyc
 
Optier presentation for open analytics event
Optier presentation for open analytics eventOptier presentation for open analytics event
Optier presentation for open analytics event
 

Similar a Predicting Patient Outcomes in Real-Time at HCA

Developing and Implementing a QA Plan During Your Legacy Data to S1000D
Developing and Implementing a QA Plan During Your Legacy Data to S1000DDeveloping and Implementing a QA Plan During Your Legacy Data to S1000D
Developing and Implementing a QA Plan During Your Legacy Data to S1000Ddclsocialmedia
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachSoftServe
 
Health Care: Cost Reductions through Data Insights - The Data Analysis Group
Health Care: Cost Reductions through Data Insights - The Data Analysis GroupHealth Care: Cost Reductions through Data Insights - The Data Analysis Group
Health Care: Cost Reductions through Data Insights - The Data Analysis GroupJames Karis
 
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...DataWorks Summit
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Ola Spjuth
 
Predicting Hospital Readmission Using Cascading
Predicting Hospital Readmission Using CascadingPredicting Hospital Readmission Using Cascading
Predicting Hospital Readmission Using CascadingCascading
 
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...Kevin Mader
 
Customer Presentation
Customer PresentationCustomer Presentation
Customer PresentationSplunk
 
Data Café — A Platform For Creating Biomedical Data Lakes
Data Café — A Platform For Creating Biomedical Data LakesData Café — A Platform For Creating Biomedical Data Lakes
Data Café — A Platform For Creating Biomedical Data LakesPradeeban Kathiravelu, Ph.D.
 
Driving Faster Analytics at Symphony Health
Driving Faster Analytics at Symphony HealthDriving Faster Analytics at Symphony Health
Driving Faster Analytics at Symphony HealthPrecisely
 
UCSF Informatics Day 2014 - Jocel Dumlao, "REDCap / MyResearch"
UCSF Informatics Day 2014 - Jocel Dumlao, "REDCap / MyResearch"UCSF Informatics Day 2014 - Jocel Dumlao, "REDCap / MyResearch"
UCSF Informatics Day 2014 - Jocel Dumlao, "REDCap / MyResearch"CTSI at UCSF
 
Medidata AMUG Meeting / Presentation 2013
Medidata AMUG Meeting / Presentation 2013Medidata AMUG Meeting / Presentation 2013
Medidata AMUG Meeting / Presentation 2013Brock Heinz
 
Running Mixed Workloads on Kubernetes at IHME
Running Mixed Workloads on Kubernetes at IHMERunning Mixed Workloads on Kubernetes at IHME
Running Mixed Workloads on Kubernetes at IHMETyrone Grandison
 
Big data's impact on healthcare
Big data's impact on healthcareBig data's impact on healthcare
Big data's impact on healthcareRené Kuipers
 
2013_06_27 Dotmatics UGM
2013_06_27 Dotmatics UGM2013_06_27 Dotmatics UGM
2013_06_27 Dotmatics UGMBob Coner
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...Sean Ekins
 
Proof of Concept & Discovery Phase for Data Analytics Platform
Proof of Concept & Discovery Phase for Data Analytics PlatformProof of Concept & Discovery Phase for Data Analytics Platform
Proof of Concept & Discovery Phase for Data Analytics PlatformRelevantz
 
Solving the Disconnected Data Problem in Healthcare Using MongoDB
Solving the Disconnected Data Problem in Healthcare Using MongoDBSolving the Disconnected Data Problem in Healthcare Using MongoDB
Solving the Disconnected Data Problem in Healthcare Using MongoDBMongoDB
 
predictive analysis and usage in procurement ppt 2017
predictive analysis and usage in procurement  ppt 2017predictive analysis and usage in procurement  ppt 2017
predictive analysis and usage in procurement ppt 2017Prashant Bhatmule
 
How to Restructure and Modernize Active Directory
How to Restructure and Modernize Active DirectoryHow to Restructure and Modernize Active Directory
How to Restructure and Modernize Active DirectoryQuest
 

Similar a Predicting Patient Outcomes in Real-Time at HCA (20)

Developing and Implementing a QA Plan During Your Legacy Data to S1000D
Developing and Implementing a QA Plan During Your Legacy Data to S1000DDeveloping and Implementing a QA Plan During Your Legacy Data to S1000D
Developing and Implementing a QA Plan During Your Legacy Data to S1000D
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
 
Health Care: Cost Reductions through Data Insights - The Data Analysis Group
Health Care: Cost Reductions through Data Insights - The Data Analysis GroupHealth Care: Cost Reductions through Data Insights - The Data Analysis Group
Health Care: Cost Reductions through Data Insights - The Data Analysis Group
 
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...
 
Predicting Hospital Readmission Using Cascading
Predicting Hospital Readmission Using CascadingPredicting Hospital Readmission Using Cascading
Predicting Hospital Readmission Using Cascading
 
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
 
Customer Presentation
Customer PresentationCustomer Presentation
Customer Presentation
 
Data Café — A Platform For Creating Biomedical Data Lakes
Data Café — A Platform For Creating Biomedical Data LakesData Café — A Platform For Creating Biomedical Data Lakes
Data Café — A Platform For Creating Biomedical Data Lakes
 
Driving Faster Analytics at Symphony Health
Driving Faster Analytics at Symphony HealthDriving Faster Analytics at Symphony Health
Driving Faster Analytics at Symphony Health
 
UCSF Informatics Day 2014 - Jocel Dumlao, "REDCap / MyResearch"
UCSF Informatics Day 2014 - Jocel Dumlao, "REDCap / MyResearch"UCSF Informatics Day 2014 - Jocel Dumlao, "REDCap / MyResearch"
UCSF Informatics Day 2014 - Jocel Dumlao, "REDCap / MyResearch"
 
Medidata AMUG Meeting / Presentation 2013
Medidata AMUG Meeting / Presentation 2013Medidata AMUG Meeting / Presentation 2013
Medidata AMUG Meeting / Presentation 2013
 
Running Mixed Workloads on Kubernetes at IHME
Running Mixed Workloads on Kubernetes at IHMERunning Mixed Workloads on Kubernetes at IHME
Running Mixed Workloads on Kubernetes at IHME
 
Big data's impact on healthcare
Big data's impact on healthcareBig data's impact on healthcare
Big data's impact on healthcare
 
2013_06_27 Dotmatics UGM
2013_06_27 Dotmatics UGM2013_06_27 Dotmatics UGM
2013_06_27 Dotmatics UGM
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
 
Proof of Concept & Discovery Phase for Data Analytics Platform
Proof of Concept & Discovery Phase for Data Analytics PlatformProof of Concept & Discovery Phase for Data Analytics Platform
Proof of Concept & Discovery Phase for Data Analytics Platform
 
Solving the Disconnected Data Problem in Healthcare Using MongoDB
Solving the Disconnected Data Problem in Healthcare Using MongoDBSolving the Disconnected Data Problem in Healthcare Using MongoDB
Solving the Disconnected Data Problem in Healthcare Using MongoDB
 
predictive analysis and usage in procurement ppt 2017
predictive analysis and usage in procurement  ppt 2017predictive analysis and usage in procurement  ppt 2017
predictive analysis and usage in procurement ppt 2017
 
How to Restructure and Modernize Active Directory
How to Restructure and Modernize Active DirectoryHow to Restructure and Modernize Active Directory
How to Restructure and Modernize Active Directory
 

Más de Sri Ambati

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxSri Ambati
 
AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek Sri Ambati
 
LLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thSri Ambati
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionSri Ambati
 
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Sri Ambati
 
Risk Management for LLMs
Risk Management for LLMsRisk Management for LLMs
Risk Management for LLMsSri Ambati
 
Open-Source AI: Community is the Way
Open-Source AI: Community is the WayOpen-Source AI: Community is the Way
Open-Source AI: Community is the WaySri Ambati
 
Building Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OBuilding Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OSri Ambati
 
Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Sri Ambati
 
Cutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersCutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersSri Ambati
 
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Sri Ambati
 
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Sri Ambati
 
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...Sri Ambati
 
LLM Interpretability
LLM Interpretability LLM Interpretability
LLM Interpretability Sri Ambati
 
Never Reply to an Email Again
Never Reply to an Email AgainNever Reply to an Email Again
Never Reply to an Email AgainSri Ambati
 
Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Sri Ambati
 
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...Sri Ambati
 
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...Sri Ambati
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneySri Ambati
 

Más de Sri Ambati (20)

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptx
 
AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek
 
LLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5th
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for Production
 
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
 
Risk Management for LLMs
Risk Management for LLMsRisk Management for LLMs
Risk Management for LLMs
 
Open-Source AI: Community is the Way
Open-Source AI: Community is the WayOpen-Source AI: Community is the Way
Open-Source AI: Community is the Way
 
Building Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OBuilding Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2O
 
Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical
 
Cutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersCutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM Papers
 
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
 
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
 
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
 
LLM Interpretability
LLM Interpretability LLM Interpretability
LLM Interpretability
 
Never Reply to an Email Again
Never Reply to an Email AgainNever Reply to an Email Again
Never Reply to an Email Again
 
Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)
 
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
 
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation Journey
 

Último

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 

Último (20)

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 

Predicting Patient Outcomes in Real-Time at HCA

  • 1. 1 Predicting Patient Outcomes in Real-Time at HCA Presentation by Allison Baker and Cody Hall Hospital Corporation of America Department of Data and Analytics, Clinical Services Group July 20, 2016
  • 2. 2CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. • Introduction to HCA • Introduction to our team • Data science pipeline • Near real-time architecture • Real-time architecture • Current POC goals Overview
  • 3. 3CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. “Above all else, we are committed to the care and improvement of human life. In recognition of this commitment, we strive to deliver high-quality, cost-effective healthcare in the communities we serve.” – HCA Mission Statement • Hospital Corporation of America (HCA) is the leading healthcare provider in the country – 169 hospitals – 116 freestanding surgery centers in 20 states and the U.K. • Approximately 233,000 employees across the company • Over 26 million patient encounters each year • More than 8 million emergency room visits each year • About 2 million inpatients treated annually Hospital Corporation of America
  • 4. 4CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. Where We Are
  • 5. 5CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. Data Science and Data Products Teams Dr. Martin Tobias Data Scientist Sandeepkumar Kothiwale Data Scientist Allison Baker Data Scientist Dr. Nan Chen Data Scientist Kunal Marwah Data Scientist Gerardo Castro Data Scientist Chris Cate Data Scientist Igor Ges Data Product Engineer Josh Wolter BI Developer Dr. Jesse Spencer-Smith Director of Data Science Dr. Edmund Jackson Chief Data Scientist VP of Data and Analytics Warren Sadler Data Product Engineer Cody Hall Development Manager of Data Products Nick Selleh Application Engineer
  • 6. 6CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. CRISP-DM and Data Science
  • 7. 7CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. • Begin by asking stakeholders and business owners “What business decisions will be made with the analysis results?” • Document all project and product features, timelines and code using GitHub • Source historical data using Teradata SQL • Log all data sourcing and data extract steps using DRAKE • Options – Continuous integration – Jenkins to monitor DRAKE builds Problem Definition and Data Sourcing
  • 8. 8CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. • Run preliminary visualization • QA data testing for coverage, outliers, abnormalities, format and structural issues, frequency, duplication and accuracy • Pre-process data – Balance outcomes – Filter patients – Remove non-data • Engineer features Data Manipulation
  • 9. 9CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. • Analytic server – 64 cores – 4 Terabytes of hard disk – 1.5 Terabytes of RAM • Iterate models • Evaluate statistics Modeling
  • 10. 10CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. • Consider – Re-defining the problem – Additional modeling – Additional data sourcing • Discuss results with clinical owners and business stakeholders – Consider additional features Interpretation and Reporting
  • 11. 11CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. • We can effectively engineer thousands of clinically and statistically relevant features. • We can successfully build accurate, complex and sophisticated predictive models. • How do we take these models to the patient bedside? What Now?
  • 12. 12CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. Delivering Value to the Business
  • 13. 13CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. Near Real-Time Tool • Consists of 3 main components – Data source (different than historical training source) – Scoring engine – User interface • Shows early value using a minimally viable product-based approach • Phases POC to include development time for real-time architecture • Updates in 15 minute batches • Provides near real-time predictions • Solicits feedback from facilities, focusing on accuracy and usefulness
  • 14. 14CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. Data Sources are Constantly Changing
  • 15. 15CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. Prediction Product Facility + Team Patient Kafka Topic OpenGate MS SQL PostgreSQL Analytic Store HDFS Cluster Predictive Model • Single POJO .jar • Clojure (FE library) ETL • Independent SQL process HDFS Cluster Data Source • 15 minute batches • SQL defined Data Source • Streaming • HL7QL defined • GitHub & Nexus • Jenkins • Tableau Supporting Infrastructure • PostgreSQL administration & monitoring • Docker with Node JS (UI) User Interface (UI) • Displays measures + events • Notifications of predictions • Prompt for acknowledgement or dismissal • On acknowledgement, disable notifications for 12 hours Measures + Events: Vitals Lab results Orders Demographics Surgery times Nursing documentations Prediction Measures + EventsHL-7 Measures + Events & PredictionHL-7 Measures + Events HL7QL (Spark) Kafka Topic EDN Predictive Model + ETL • Clojure (FE library)/Spark job • PowderKeg Measures + Events Data Persistence Near Real-Time System Real-Time System
  • 16. 16CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. Real-Time Infrastructure • Continuously consumes HL7 messages from a Kafka topic and parses via Spark and HL7QL • Processes (producers) publish messages to Kafka topics (categories) and subscriptions are made to the topics to process the message feeds (consumers) • Apache Spark is the application interface to allow for cloud computing • HL7 Query Language (HL7QL) parses the messages • Scores (predicts) on new streaming information – Runs a .jar file via a Spark process compiled from Clojure code and H2O POJO • Deploys with Docker – Container-based application architecture • Continuously monitors with Jenkins
  • 17. 17CONFIDENTIAL - Contains proprietary information. Not intended for external distribution.
  • 18. 18CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. A Proof of Concept Use Case and Goals Primary: 1. Assess clinical workflow to identify how the model can support the current clinical processes for treating negative patient outcomes 2. Determine the model’s capability to extract meaningful information from existing and available patient data and identify patterns that predict the outcome 3. Determine the usefulness of an early prediction model within a clinical workflow Secondary: 1. Improve the prediction model through incorporation of feedback provided by the clinical team 2. Maximize the utility of the prediction tool to improve a clinical workflow for the facility staff
  • 19. 19CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. Summary
  • 20. 20CONFIDENTIAL - Contains proprietary information. Not intended for external distribution. Questions

Notas del editor

  1. Really focusing on the use of Tools Architecture Deployment
  2. Add number of inpatients (~1.8 million) real-time – prediction is used to lengthen the intervention window for therapy. Batch – for operational stuff.
  3. Ask the right question Gather data to support your hypotheses Test your assumptions - Get through this loop as quickly as possible -> h2o makes modeling component straightforward and pain-free. Don’t get caught up on this slide Cross Industry Standard Process for Data Mining, commonly known by its acronym CRISP-DM, was a data mining process model that describes the overall approach to solving business (or clinical) problems with predictive analytics. Working through this process requires both a Business understanding and Data understanding at the forefront of everything. Data preparation Modeling Evaluation Deployment The overall arching goal is to extract knowledge from data, using predictive modeling to visualize and present data with an intelligent awareness of the clinical and/or business consequences
  4. Data science projects begin by asking a clearly defined business question What business decisions will be made using the results of the analysis? What does “done” look like? Establish that the project falls within one of five defined analysis types: Type 1. Classification: Is this A or B? Type 2. Anomaly Detection: Is this unusual? Type 3. Regression: How much/how many? Type 4. Unsupervised Learning: How is it organized? Type 5. Prescriptive: What should I do next? GitHub: web-based tool allowing for version control and SCM Teradata SQL Assistant: Windows-based tool for building and running sql queries against our EDW DRAKE: workflow tool
  5. SQL, R, Clojure Balancing Center and scale Sampling Why do we use R vs. h2o? Engineering Features -> we do FE outside of h2o so pre-processing
  6. Historically we were restricted by the computational availability of our laptops. Nice visualizations for eval results!!!
  7. Weak signal?
  8. Apply the model to real live data and gain clinical feedback on patients we are seeing in our hospitals now Build out infrastructure and architecture to score patients in real-time Preventing negative patient outcomes and saving lives H2o is the harness that runs on the jvm, brining predictive models to the patients’ bedsides
  9. Tableau helps you work with business to solve problems, quickly.
  10. Want to use the model in real life and gain clinical feedback Create a way for model to capture feedback through an application See if the model fits into clinical workflow. Near real-time does not scale
  11. real-time in healthcare means HL7 based messaging. Clojure encapsulates the pojo
  12. Cloudera resilient distributed dataset
  13. Doing all of this on every single commit 4 times an hour (05, 20, 35, 50) the job is started A Docker container is spun up, and a jar is executed Data is retrieved from OpenGate, aggregated and transformed Predictive model is applied Predictions are written to PostgreSQL Logs are stored and execution results are reported
  14. GOAL: The model accurately predicts patient outcomes earlier than those identified through current clinical processes