SlideShare una empresa de Scribd logo
1 de 17
Deep Reinforcement Learning
based Insulin Controller for
Effective Type-1 Diabetic Care
Liliana Cruz Lopez,
Columbia University 2019
Background
● More than 100 million adults in U.S. alone living with diabetes
● This condition results from high blood glucose level due to inadequate natural insulin
(Type 2) or obsolete (Type 1) in the body
● Treatment involves maintaining healthy blood glucose level at all time by injecting
appropriate amount of synthetic insulin at appropriate time
● Two types of insulin are used
○ basal for continuous blood glucose control
○ bolus as a short acting insulin with meal disturbances
CGM and Insulin Pump for Diabetic Control
Key steps in CGM and Insulin Pump based blood
glucose control
● Continuous glucose level monitoring with glucose
sensor
● Algorithm to determine amount and type of insulin to
be delivered at specific time
● Delivering insulin into body through insulin pump
It is a complex problem to keep a healthy level of
blood glucose at all the time!!!
Maintaining Appropriate Blood Glucose Level
● Effective insulin control for diabetic Type-1 patient requires that a healthy-level of
glucose is maintained throughout the day with minimal fluctuations in either direction
○ Inadequate insulin causes high blood sugar (Hyperglycemia) resulting in longer term
complication
○ Excessive insulin leads to low blood sugar (Hypoglycemia) which is often fatal if is too low
● Goal of insulin dependent diabetic care is to administer appropriate amount of insulin
at appropriate time such that glucose level is maintained at near target level without
reaching hypo or hyper level
● Maintenance of “right” amount of blood glucose is a very complex problem because of
lot of day-to-day variability in patient’s condition
○ diet/nutrition changes, exercise amount, exposure to sun, daily life-style etc.
Closed Loop Control System for Optimal Insulin
● Interaction between Glucose and administered Insulin is a closed loop control system
with feedback
○ Typically insulin control modeled as a PID controller to determine optimal insulin that
needs to be administered
● Such model driven approaches have limitations due to complex nature induced by time
varying, non-linear conditions
● Instead we approach this as an AI problem - specifically we consider Reinforcement
Learning model for insulin controller and evaluate its efficacy
Insulin Control as a Reinforcement Learning Model
● We propose Reinforcement Learning based
insulin control where
○ an agent repeatedly interacts with the environment,
each time receiving feedback (reward) for its actions
○ goal of the agent is to learn an optimal policy that
maximizes this feedback (reward) in the long run
● Specifically we propose and evaluate DDPG - a
deep RL framework - for insulin controller
○ suitable for both continuous action and continuous
space
○ Allows us to explicitly account for both basal and
bolus glucose regulation
DDPG Background and Why DDPG
DDPG Background
● Is an off-policy algorithm
● Works with environments with continuous
action spaces
● Is similar to deep Q-learning for continuous
action spaces
● Employs Actor-Critic model
● Learns directly from the observation spaces
through policy gradient method
DDPG as insulin Controller
● Environment: patient-glucose insulin
interaction
● Sate Space: glucose level and meal amount
● Action Space: insulin amount at each time step
which is in a continuous space
DDPG Formulation
- It learns a Q-function and policy:
Q-function:
Policy:
-It uses off-policy data and the Bellman equation to learn the Q-function, and uses the Q-function to learn the policy.
Bellman equation:
Deterministic Gradient Policy (Off-policy):
DDPG Algorithm Implementation
Algorithm Steps
Q function
TD Targets
Greedy Stochastic Policy
Initializations
TD Targets Update
Implementation and Environment
● We evaluate performance of DDPG based insulin controller using synthetic data
generated with well known UVA/Padova model that models human glucose level
● It includes models for 30 virtual patients, 10 adults, 10 adolescents, 10 children
● DDPG insulin controller implemented in Python using OpenAI Gym framework by
extending SimGlucose simulator
● Simulator parameters consists of the following features
○ Meal frequency and amount
○ Patient age, weight, height
Evaluation Framework
● DDPG based Insulin controller vs baseline model based controller (BBController)
to study how well DDPG controller performs under varying/dynamic conditions
● We evaluate under three different scenarios by inducing disturbances in blood
glucose through
○ single meal
○ multiple meals taken frequently in short intervals
○ multiple meals spread across longer intervals
● Rationale behind these choices is to understand if DDPG based controller has any
advantage over typical model based insulin controller under such induced
disturbances in glucose level
Experiment I: Single Meal Disturbance
Disturbance with a single meal of size 30 (CHO value 10) introduced at 7:00am.
● Glucose level rose much higher with BBController before stabilizing to normal range whereas blood glucose fluctuation is
relatively lower with DDPG Controller.
● Achieves tighter control with DDPG due to continuous adjustment based on the environment with purely data driven
approach instead of preset model as with BBController.
BBController DDPG Controller
Experiment II: Multi-Meal Frequent Disturbances
Frequent Multi-meal Disturbance where three meals were taken with shorter gaps between meals
● Blood glucose rose to higher level (>250) for longer duration of interval with BBController and did not react fast
enough after each disturbance as compared to DDPG based controller
● DDPG controller maintained glucose with relatively less fluctuations and closer to target level
● DDPG based controller more suitable for handling rapid and dynamic blood glucose disturbances even alone with just
basal insulin due to continuous reactive nature of the controller
BBController DDPG Controller
Experiment III: Multi-Meal Disturbances Spread
Out
Spread Out Multi-meal Disturbance: This scenario evaluates performance with multiple meal disturbances with longer gaps
between meals
● BBController is able to handle this spread out meal disturbances relatively better compared to when disturbances are
more rapid and dynamic.
● We find that both model based and DDPG controller with basal insulin have similar performance under this scenario. We
need to further investigate if DDPG controller could achieve superior performance when bolus insulin is also combined.
BBController
Conclusion
● Our study is first to propose and evaluate deep RL based insulin controller for Type-1
diabetic blood glucose management
● We implement DDPG based controller using SimGlucose that supports Open AI Gym
framework
● We compare with model based baseline insulin controller (Padova) under three
scenarios
○ Single meal
○ Frequent Multiple meal disturbance
○ Spread out multiple meal disturbance
● Our evaluation indicates that DDPG based controllers are more suitable to handling
rapid and dynamic blood sugar disturbance conditions compared to model based
controller
● DDPG controller is able to achieve more controlled blood glucose level even alone with
basal insulin, possibly due to the continuous reactive nature of such algorithms
Future work
● Results from our initial evaluations are promising that deep RL based insulin
controllers could be more effective in handling rapid fluctuations in blood sugar
● Further evaluate the effectiveness of DDPG based insulin controller by varying
environment variables and additional physical factors such as exercise, life-style
changes
● Incorporate bolus insulin into the DDPG controller action to further improve its
efficacy in reducing blood sugar fluctuations
Acknowledgement
Jinyu Xie, SimGlucose creator
Yuan Zhao, Mobiquity Networks
Dr. Chong Li, Columbia University
InsightZen for infrastructure support.

Más contenido relacionado

La actualidad más candente

Probability Powerpoint
Probability PowerpointProbability Powerpoint
Probability Powerpoint
spike2904
 

La actualidad más candente (20)

Probability Powerpoint
Probability PowerpointProbability Powerpoint
Probability Powerpoint
 
Steepest descent method
Steepest descent methodSteepest descent method
Steepest descent method
 
Decision Tree Learning
Decision Tree LearningDecision Tree Learning
Decision Tree Learning
 
Causal Bayesian Networks
Causal Bayesian NetworksCausal Bayesian Networks
Causal Bayesian Networks
 
Pca ppt
Pca pptPca ppt
Pca ppt
 
Real analysis
Real analysis Real analysis
Real analysis
 
Basic Probability
Basic Probability Basic Probability
Basic Probability
 
Functional analysis
Functional analysis Functional analysis
Functional analysis
 
Group Theory
Group TheoryGroup Theory
Group Theory
 
Decision Tree Learning
Decision Tree LearningDecision Tree Learning
Decision Tree Learning
 
Introduction to Probability and Bayes' Theorom
Introduction to Probability and Bayes' TheoromIntroduction to Probability and Bayes' Theorom
Introduction to Probability and Bayes' Theorom
 
establishing steps of biological control.docx
establishing steps of biological control.docxestablishing steps of biological control.docx
establishing steps of biological control.docx
 
Ch10 molevo
Ch10 molevoCh10 molevo
Ch10 molevo
 
TOPOLOGY and TYPES OF TOPOLOGY PowerPoint
TOPOLOGY and TYPES OF TOPOLOGY PowerPointTOPOLOGY and TYPES OF TOPOLOGY PowerPoint
TOPOLOGY and TYPES OF TOPOLOGY PowerPoint
 
CS6702 graph theory and applications notes pdf book
CS6702 graph theory and applications notes pdf bookCS6702 graph theory and applications notes pdf book
CS6702 graph theory and applications notes pdf book
 
Geometric probability distribution
Geometric probability distributionGeometric probability distribution
Geometric probability distribution
 
Genetic Algorithm
Genetic AlgorithmGenetic Algorithm
Genetic Algorithm
 
Decision tree
Decision treeDecision tree
Decision tree
 
Module 4 part_1
Module 4 part_1Module 4 part_1
Module 4 part_1
 
Initial Value Problems
Initial Value ProblemsInitial Value Problems
Initial Value Problems
 

Similar a Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for Effective Type-1 Diabetic Care

treatment of diabetes mellitus.pptx
treatment of diabetes mellitus.pptxtreatment of diabetes mellitus.pptx
treatment of diabetes mellitus.pptx
Roop
 
Glycemic control in the Intensive Care Units
Glycemic control in the Intensive Care UnitsGlycemic control in the Intensive Care Units
Glycemic control in the Intensive Care Units
Hanna Yudchyts
 
ieee_tbme_apc_nov_2013_May_2014_FINAL
ieee_tbme_apc_nov_2013_May_2014_FINALieee_tbme_apc_nov_2013_May_2014_FINAL
ieee_tbme_apc_nov_2013_May_2014_FINAL
Nick Preiser
 

Similar a Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for Effective Type-1 Diabetic Care (20)

Monitoring Type 1 Diabetes
Monitoring Type 1 DiabetesMonitoring Type 1 Diabetes
Monitoring Type 1 Diabetes
 
GDM Himani (3).pptx
GDM Himani (3).pptxGDM Himani (3).pptx
GDM Himani (3).pptx
 
Glucose monitorimg 2.pdf
Glucose monitorimg 2.pdfGlucose monitorimg 2.pdf
Glucose monitorimg 2.pdf
 
Gestational diabetes mellitus (2)
Gestational diabetes mellitus (2)Gestational diabetes mellitus (2)
Gestational diabetes mellitus (2)
 
Insulin initiation adjustment
Insulin initiation adjustmentInsulin initiation adjustment
Insulin initiation adjustment
 
treatment of diabetes mellitus.pptx
treatment of diabetes mellitus.pptxtreatment of diabetes mellitus.pptx
treatment of diabetes mellitus.pptx
 
Insulin initiation adjustment by Dr Shahjada Selim
Insulin initiation adjustment by Dr Shahjada SelimInsulin initiation adjustment by Dr Shahjada Selim
Insulin initiation adjustment by Dr Shahjada Selim
 
3.Insulin Rx (1)-converted.pptx
3.Insulin Rx  (1)-converted.pptx3.Insulin Rx  (1)-converted.pptx
3.Insulin Rx (1)-converted.pptx
 
Insulin Therapy for Type 2 Diabetes:Update
Insulin Therapy for Type 2 Diabetes:Update Insulin Therapy for Type 2 Diabetes:Update
Insulin Therapy for Type 2 Diabetes:Update
 
Glucose post prandial blood test
Glucose post prandial blood testGlucose post prandial blood test
Glucose post prandial blood test
 
Hyperglycemia management
Hyperglycemia managementHyperglycemia management
Hyperglycemia management
 
Insulin therapy- art of initiation and titration
Insulin therapy- art of initiation and titrationInsulin therapy- art of initiation and titration
Insulin therapy- art of initiation and titration
 
Insulin therapy: art of initiation and titration
Insulin therapy: art of initiation and titration Insulin therapy: art of initiation and titration
Insulin therapy: art of initiation and titration
 
Glycemic control in the Intensive Care Units
Glycemic control in the Intensive Care UnitsGlycemic control in the Intensive Care Units
Glycemic control in the Intensive Care Units
 
Cmg ppt
Cmg pptCmg ppt
Cmg ppt
 
Types of insulin & correction of hyperglycemia
Types of insulin & correction of hyperglycemiaTypes of insulin & correction of hyperglycemia
Types of insulin & correction of hyperglycemia
 
ieee_tbme_apc_nov_2013_May_2014_FINAL
ieee_tbme_apc_nov_2013_May_2014_FINALieee_tbme_apc_nov_2013_May_2014_FINAL
ieee_tbme_apc_nov_2013_May_2014_FINAL
 
Managing Diabetes With Insulin by Dr Shahjada Selim
Managing DiabetesWith Insulin by Dr Shahjada SelimManaging DiabetesWith Insulin by Dr Shahjada Selim
Managing Diabetes With Insulin by Dr Shahjada Selim
 
Proper Use of Diabetes Mellitus Devices
Proper Use of Diabetes Mellitus DevicesProper Use of Diabetes Mellitus Devices
Proper Use of Diabetes Mellitus Devices
 
Low Carbohydrate Diets
Low Carbohydrate DietsLow Carbohydrate Diets
Low Carbohydrate Diets
 

Más de MLconf

Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
MLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
MLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
MLconf
 

Más de MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for Effective Type-1 Diabetic Care

  • 1. Deep Reinforcement Learning based Insulin Controller for Effective Type-1 Diabetic Care Liliana Cruz Lopez, Columbia University 2019
  • 2. Background ● More than 100 million adults in U.S. alone living with diabetes ● This condition results from high blood glucose level due to inadequate natural insulin (Type 2) or obsolete (Type 1) in the body ● Treatment involves maintaining healthy blood glucose level at all time by injecting appropriate amount of synthetic insulin at appropriate time ● Two types of insulin are used ○ basal for continuous blood glucose control ○ bolus as a short acting insulin with meal disturbances
  • 3. CGM and Insulin Pump for Diabetic Control Key steps in CGM and Insulin Pump based blood glucose control ● Continuous glucose level monitoring with glucose sensor ● Algorithm to determine amount and type of insulin to be delivered at specific time ● Delivering insulin into body through insulin pump It is a complex problem to keep a healthy level of blood glucose at all the time!!!
  • 4. Maintaining Appropriate Blood Glucose Level ● Effective insulin control for diabetic Type-1 patient requires that a healthy-level of glucose is maintained throughout the day with minimal fluctuations in either direction ○ Inadequate insulin causes high blood sugar (Hyperglycemia) resulting in longer term complication ○ Excessive insulin leads to low blood sugar (Hypoglycemia) which is often fatal if is too low ● Goal of insulin dependent diabetic care is to administer appropriate amount of insulin at appropriate time such that glucose level is maintained at near target level without reaching hypo or hyper level ● Maintenance of “right” amount of blood glucose is a very complex problem because of lot of day-to-day variability in patient’s condition ○ diet/nutrition changes, exercise amount, exposure to sun, daily life-style etc.
  • 5. Closed Loop Control System for Optimal Insulin ● Interaction between Glucose and administered Insulin is a closed loop control system with feedback ○ Typically insulin control modeled as a PID controller to determine optimal insulin that needs to be administered ● Such model driven approaches have limitations due to complex nature induced by time varying, non-linear conditions ● Instead we approach this as an AI problem - specifically we consider Reinforcement Learning model for insulin controller and evaluate its efficacy
  • 6. Insulin Control as a Reinforcement Learning Model ● We propose Reinforcement Learning based insulin control where ○ an agent repeatedly interacts with the environment, each time receiving feedback (reward) for its actions ○ goal of the agent is to learn an optimal policy that maximizes this feedback (reward) in the long run ● Specifically we propose and evaluate DDPG - a deep RL framework - for insulin controller ○ suitable for both continuous action and continuous space ○ Allows us to explicitly account for both basal and bolus glucose regulation
  • 7. DDPG Background and Why DDPG DDPG Background ● Is an off-policy algorithm ● Works with environments with continuous action spaces ● Is similar to deep Q-learning for continuous action spaces ● Employs Actor-Critic model ● Learns directly from the observation spaces through policy gradient method DDPG as insulin Controller ● Environment: patient-glucose insulin interaction ● Sate Space: glucose level and meal amount ● Action Space: insulin amount at each time step which is in a continuous space
  • 8. DDPG Formulation - It learns a Q-function and policy: Q-function: Policy: -It uses off-policy data and the Bellman equation to learn the Q-function, and uses the Q-function to learn the policy. Bellman equation: Deterministic Gradient Policy (Off-policy):
  • 9. DDPG Algorithm Implementation Algorithm Steps Q function TD Targets Greedy Stochastic Policy Initializations TD Targets Update
  • 10. Implementation and Environment ● We evaluate performance of DDPG based insulin controller using synthetic data generated with well known UVA/Padova model that models human glucose level ● It includes models for 30 virtual patients, 10 adults, 10 adolescents, 10 children ● DDPG insulin controller implemented in Python using OpenAI Gym framework by extending SimGlucose simulator ● Simulator parameters consists of the following features ○ Meal frequency and amount ○ Patient age, weight, height
  • 11. Evaluation Framework ● DDPG based Insulin controller vs baseline model based controller (BBController) to study how well DDPG controller performs under varying/dynamic conditions ● We evaluate under three different scenarios by inducing disturbances in blood glucose through ○ single meal ○ multiple meals taken frequently in short intervals ○ multiple meals spread across longer intervals ● Rationale behind these choices is to understand if DDPG based controller has any advantage over typical model based insulin controller under such induced disturbances in glucose level
  • 12. Experiment I: Single Meal Disturbance Disturbance with a single meal of size 30 (CHO value 10) introduced at 7:00am. ● Glucose level rose much higher with BBController before stabilizing to normal range whereas blood glucose fluctuation is relatively lower with DDPG Controller. ● Achieves tighter control with DDPG due to continuous adjustment based on the environment with purely data driven approach instead of preset model as with BBController. BBController DDPG Controller
  • 13. Experiment II: Multi-Meal Frequent Disturbances Frequent Multi-meal Disturbance where three meals were taken with shorter gaps between meals ● Blood glucose rose to higher level (>250) for longer duration of interval with BBController and did not react fast enough after each disturbance as compared to DDPG based controller ● DDPG controller maintained glucose with relatively less fluctuations and closer to target level ● DDPG based controller more suitable for handling rapid and dynamic blood glucose disturbances even alone with just basal insulin due to continuous reactive nature of the controller BBController DDPG Controller
  • 14. Experiment III: Multi-Meal Disturbances Spread Out Spread Out Multi-meal Disturbance: This scenario evaluates performance with multiple meal disturbances with longer gaps between meals ● BBController is able to handle this spread out meal disturbances relatively better compared to when disturbances are more rapid and dynamic. ● We find that both model based and DDPG controller with basal insulin have similar performance under this scenario. We need to further investigate if DDPG controller could achieve superior performance when bolus insulin is also combined. BBController
  • 15. Conclusion ● Our study is first to propose and evaluate deep RL based insulin controller for Type-1 diabetic blood glucose management ● We implement DDPG based controller using SimGlucose that supports Open AI Gym framework ● We compare with model based baseline insulin controller (Padova) under three scenarios ○ Single meal ○ Frequent Multiple meal disturbance ○ Spread out multiple meal disturbance ● Our evaluation indicates that DDPG based controllers are more suitable to handling rapid and dynamic blood sugar disturbance conditions compared to model based controller ● DDPG controller is able to achieve more controlled blood glucose level even alone with basal insulin, possibly due to the continuous reactive nature of such algorithms
  • 16. Future work ● Results from our initial evaluations are promising that deep RL based insulin controllers could be more effective in handling rapid fluctuations in blood sugar ● Further evaluate the effectiveness of DDPG based insulin controller by varying environment variables and additional physical factors such as exercise, life-style changes ● Incorporate bolus insulin into the DDPG controller action to further improve its efficacy in reducing blood sugar fluctuations
  • 17. Acknowledgement Jinyu Xie, SimGlucose creator Yuan Zhao, Mobiquity Networks Dr. Chong Li, Columbia University InsightZen for infrastructure support.