Big data and machine learning techniques can be useful tools for health care marketers.
Big data refers to large and complex data sets that are difficult to analyze using traditional methods. Machine learning allows systems to learn from data to discover relationships and make predictions. Bayes' theorem provides a framework to update probabilities based on new evidence or observations. Marketers can use these techniques to better understand customers, predict behaviors, and inform strategy through data-driven insights. Questions were invited on applications of these techniques for health care marketing.
1. Big Data in Health Care
What Marketers Need to Know
Tim Gilchrist, May 2014
@timgilchrist
2. Session Goals
• Cover
– Big Data
– Artificial Intelligence / Machine Learning
– How to be an Informed Consumer
– Applications for Marketers
– Questions
2
3. Big Data
The term for a collection of data sets so large and
complex that they become difficult to process
• Many data sources with different formats
• Data with missing values
• Text / Social Media
• Things that don’t fit in Excel
3
6. Bayes
Thomas Bayes (1701 – 7 April 1761) was an English
mathematician and Presbyterian minister, known for
formulating the theorem that bears his name: Bayes'
theorem
6
Bayes theorem uses prior probabilities, combined with new observations to
calculate the probability of a hypothesis being true or false
Bayes is a natural fit to health care due to the presence of hypothesis
(diagnosis) and events (tests / observations)
7. Example
You are all doctors who have administered a
critical test to 50 patients
You know the test is:
– 75% Accurate
– 10% False Positives
7
– 10 of your patients tested positive
– How many are actually sick?
8. Bayes’ Theorem – Mammogram Example
We can present the data as a decision tree representing the
probabilities confronting doctors and patients
8
If we were to take population-level down to the individual level, much more accurate
probabilities would be possible
#1 Mammogram
#2Biopsy#3Time
Observation
Probability
9. What Does this Mean To Marketers?
• Big data is about discovering relationships
9
“Can’t beat a man with some insurance.
I need that health plan baaaby!”
• Then using data-driven insights to inform
strategy
10. Big Data Health Landscape
HIE
Member PCP Specialist
Sees PCP Gets X-Ray Sees Specialist Ambulatory
Outpatient
Analysis / Transformation
EMR
Admission
Discharge
PrescriptionClaims
Plan Data Portal
CarePathOutputsDataTypes
Direct
ConnectionWearable
Telemetry
What is Happening What Will Happen
Social PurchasedLocation
Cell
13. Training / Processing
13
Tweets extracted from
the Twitter Fire Hose with
key words “Health” and
“Plan”
1MM per day
1
@CapoeiraBatuque
“What's the best plan thru
affordable health care. Blue
cross? Blue shield? Health
net? #confused #healthcare”
@bluecalgal
Obama says don't listen to Fox,
why? Obama lied about keep
your dr, health plan, cheaper
than cell phone (bs) keep your dr
Create Training databases
for any classification desired.
+ and – outcomes used here
2
Weka turns Tweets into
numerical code that can be
analyzed by computer
“String to Word Vector”.
uses Naïve Bayes classifier
3
“What's the best plan thru affordable health care. Blue cross?
Blue shield? Health net? #confused #healthcare”
.225.357
.155
.999
ACA
AccessHealth
HealthPlanNow
Pressure
Confused
Dumb
@ Handle
# Followers
# Tweets
Profile
Retweets
Location
Date/Time
Text Analysis
Stemming/Tokenization
Demographic
Analysis
Once classifications are
established, rules can be
applied to new Tweets with
high accuracy ~90+%
4
15. Other Uses
15
• Model who will be your most
valuable:
• Customer
• Facebook follower
• (Really) Determining Sentiment
• Marketing Mix Simulations
• Consumer Facing Predictive
Technology
• Prod development (HIX)
18. Discovering Relationships Between Data
We can use machine learning to form
relationships between sets of data that
are seemingly unrelated (Causal
Relationships):
• Making your bed in the morning and job
satisfaction
• Artificial Christmas trees and family “brag
letters”
• What you buy and why
18
20. Bayes’ Theorem – Mammogram Example
Problem: Estimates of breast cancer over diagnosis range from 25%–
52%. Physicians often misinterpret their own lab results
20
Prior Probability
Chance that a woman will develop breast cancer in her 40s X 1.4%
New Event: Mammography
Ability of mammogram to detect cancer when present Y 75%
False positives Z 10%
Posterior Probability
Revised Probability, given new event (positive mammogram)
xy+z(1-x)
9.6%
A positive mammogram still leaves a 90.4% chance that the test showed something other than
cancer. When biopsies are performed on this age group, 75% are negative. Physicians rarely
consider the other 90.4%
22. Other Reading
Why Most Marketers Will Fail In The Era Of Big
Data
8 Marketers Doing Big Data Right
The big-data revolution in US health care:
Accelerating value and innovation
HITLAB Speaks with Tim Gilchrist, Director of
eBusiness Strategy for WellPoint
22
Example:
Hand out 50 cards
10 cards will say positive
40 will say negative
Ask the group What percentage of positive group are actually sick (%9.6)
Real Doctors claim 70%
Actual chances you have cancer after a positive mammogram are 9.6%
This is so important, but very few doctors take advantage of these facts
Prior probability was only 1%
Imagine the above decision tree with many many branches including patient demographics, family history, radiographic study info, environmental info, etc. The accuracy would climb tremendously
It’s all about understanding the data and using it to make informed decisions.
Millions of women endure unnecessary biopsies because physicians do not understand the data
%80 of biopsies are negative / unnecessary
Bayes at the bedside
But what about marketing?
Just as there are relationships between a positive mammogram and actually being sick, there are relationships between consumer habits and purchasing patterns
What’s the relationship here?
Here is a generalized look at our Big Data landscape
If we could understand the relationships between all these data what would be possible:
When people need to see a doctor before the hospital – way before the claim
When people switch jobs and need new insurance / HIX
When they need OTC meds etc.
There is no other category that how the types and abundance of leading indicators as health!
Most people do not know they ill buy a car 24hours before they do
Over 1MM Tweets per day regarding health. Most of them garbage
Completely unstructured, no labels, etc
Surprisingly, there are many looking for health plans?
Separating the signal from the noise is what machine learning can be very effective at
Socially viable companies can take advantage of the signal and close sales / No voice in social, don’t bother
If we take 50k tweets on health care
remove 70 of those that are sales prospects
and 300 that are not prospects
We have a training set we can use to teach and machine learning algorithm that is 94% accurate
50k becomes a few hundred qualified leads
This is a decision tree that represents a fraction of the math in this problem
These are well-known in consumer marketing but not so much in health care
Public health researchers have made may discoveries in this area (see appendix)
Again, commercial applications lag in machine learning