SlideShare una empresa de Scribd logo
1 de 42
Descargar para leer sin conexión
A NEW PLATFORM FOR A NEW ERA
Data Driven Drugs:
Predictive Models to Improve
Product Quality in Pharmaceuticals
Sarah Aerni, PhD
Senior Data Scientist at Pivotal
saerni@gopivotal.com
Strata RX
September 26, 2013

© Copyright 2013 Pivotal. All rights reserved.

2
The Quantified Patient
Medical History!

Genetics!

Family !
History!

Imaging!
Clinical!
Narratives!

Medications!

Molecular!
Diagnostics!

Lab tests!

Environment!
© Copyright 2013 Pivotal. All rights reserved.

Sensors!
& Mobile!
3
Data driven drugs: From discovery to delivery
Drug discovery
+ development

RICH DATA SOURCES

Clinical
Trials
Distribution and
surveillance

!  Molecular data

–  Cellular drug screens
–  Animal models

!  Clinical data including notes, images,
markers (e.g. genomics, lab results)
!  Sensor and assay data
!  Internal and partner/purchased external
data

Manufacturing !  Contact center data
Marketing

© Copyright 2013 Pivotal. All rights reserved.

!  Patient registries, public and federal
data, clinical partnerships

4
Data integration
How Pivotal can enable industries to
extract new value from data sources

© Copyright 2013 Pivotal. All rights reserved.

5
Successful transformation into a data-driven
enterprise requires a paradigm shift
!  Bring available data sources to a
central location
Integration of a variety of data leads to
new insights

DATA
IS THE NEW
CENTER OF GRAVITY

!  Analyze large volumes of variable
data for richer models
Building models without data movement
reduces time to insight

!  Share data, insights and ideas
Leveraging various expertise will lead to
more relevant business insights
© Copyright 2013 Pivotal. All rights reserved.

Data > Application!
6
Traditional Analytics Processes

If you think databases are only good for storing data
Time-to-Insights

sample

In-memory
statistics
tool

In-memory
optimization
tool

solution

forecast

© Copyright 2013 Pivotal. All rights reserved.

7
Pivotal One: Heritage
Application Fabric

Data Fabric

GemFire

Ingest & Query: very high-capacity &
in-memory
Scale-out storage: HDFS/Object

vFabric
Languages
&
Frameworks

Services

Analytics

Automation: App Provisioning & Life-cycle
Service Registry
Cloud Abstraction (portability)

Cloud Fabric

© Copyright 2013 Pivotal. All rights reserved.

8
Performance Through Parallelism
!  Automatic parallelization

Database

–  Load and query like any database
–  Automatically distributed tables across
nodes
–  No need for manual partitioning or tuning

!  Analytics Optimized:

–  Analytics-oriented query optimization

!  Extremely scalable MPP shared-nothing
architecture

Interconnect
Compute
Storage

Loading

–  All nodes can scan and process in parallel
–  Linear scalability by adding nodes

© Copyright 2013 Pivotal. All rights reserved.

9
Performance Through Parallelism
!  Automatic parallelization

Database

–  Load and query like any database
–  Automatically distributed tables across
nodes
–  No need for manual partitioning or tuning

!  Analytics Optimized:

–  Analytics-oriented query optimization

!  Extremely scalable MPP shared-nothing
architecture
–  All nodes can scan and process in parallel
–  Linear scalability by adding nodes

© Copyright 2013 Pivotal. All rights reserved.

Interconnect
Compute
Storage

ETL

Loadin
File
g
Systems

External Sources: Loading, streaming, etc.

10
Pivotal HD Architecture
Pivotal HD
Enterprise
Resource
Management
& Workflow

Pig, Hive,
Mahout

HBase

Map Reduce

Configure,
Monitor, Manage

Hadoop Virtualization (HVE)

Yarn

Command

HDFS

Zookeeper

Center
Sqoop

Apache

© Copyright 2013 Pivotal. All rights reserved.

Deploy,

Data Loader

Flume

Pivotal HD Enterprise

11
Pivotal HD Architecture
HAWQ– Advanced
Database Services
ANSI SQL + Analytics

Pivotal HD
Enterprise
Resource
Management
& Workflow

Xtension
Framework
HBase

Query
Optimizer

Dynamic Pipelining

Pig, Hive,
Mahout
Map Reduce

Deploy,
Configure,
Monitor, Manage

Hadoop Virtualization (HVE)

Yarn

Command

HDFS

Zookeeper

Center
Sqoop

Apache

© Copyright 2013 Pivotal. All rights reserved.

Catalog
Services

Flume

Data Loader

Pivotal HD Enterprise

HAWQ

12
Leveraging healthcare data to drive predictive and
precision care
Clinical!
Narratives!
Medications!

Decision support

Imaging!

Precision care

Genetics!
Environment!

Labs test!

Cohort identification

Unified data supporting unified risk evaluation, decision-making, etc.
! Acting on full patient and medical profile!
© Copyright 2013 Pivotal. All rights reserved.

13
Traditional Analytics Processes

If you think databases are only good for storing data
Time-to-Insights

sample

In-memory
statistics
tool

In-memory
optimization
tool

solution

forecast

© Copyright 2013 Pivotal. All rights reserved.

14
Analytics with Pivotal

A single address for everything analytics
Time-to-Insights

Forecasting

Clustering

Regression

Optimization
Classification

© Copyright 2013 Pivotal. All rights reserved.

15
Analytics Ecosystem
COMMERCIAL

OPEN SOURCE
MADlib

SAS/ACCESS&
SAS&Scoring&Accelerator&
SAS&High&Performance&
Analy7cs&

In0database&analy6cs&

PL/R,&PL/Python&PL/Java&

© Copyright 2013 Pivotal. All rights reserved.

16
MADlib: Machine Learning at Scale

Collaborators

© Copyright 2013 Pivotal. All rights reserved.

17
Data driven drugs: From discovery to delivery
Drug discovery
+ development

!  Molecular data
Clinical
Trials

Distribution and
surveillance

Marketing

© Copyright 2013 Pivotal. All rights reserved.

–  Cellular drug screens
–  Animal models

!  Clinical data including notes,
images, markers (e.g. genomics,
lab results)
!  Sensor and assay data

!  Internal and partner/purchased
external data
Manufacturing
!  Contact center data
!  Patient registries, public and
federal data, clinical partnerships
18
Manufacturing
Data-driven approaches to tuning a
drug manufacturing process

© Copyright 2013 Pivotal. All rights reserved.

19
Predicting potency in vaccine manufacturing
Customer

Solution

A major pharmaceutical company

• 

Introduced a new data model to make
data accessible and enable analytics

• 

Built automated outlier detection/
correction methods to address manual
data entry quality issues

• 

Devised imputation methods to deal with
data completeness issues

• 

Built predictive models with high accuracy

Business Problem
Predict potency and antigen levels of live
virus vaccines based on manufacturing
sensor data and manual data collected
throughout the process.
Challenges
• 

Customer’s data model was not optimal
for running analytical queries

• 

Manual data quality issues

• 

Data capture was performed with
varying consistency due to high cost
associated with manual data collection

© Copyright 2013 Pivotal. All rights reserved.

20
Building predictive models to improved outcomes in
manufacturing of vaccines
Temp

Counts

Future Looking
Predictive Models

Cell
expansion

Virus
propagation

Duration of step

Time
Warning!
Entered value not
in expected range

© Copyright 2013 Pivotal. All rights reserved.

Pooling into
final product

Backward Looking
Models

21
Enabling predictive models through rearchitecting
Challenges
•  Accessibility
–  Certain parts of the data have
never been used in any predictive
modeling since it is extremely hard
to query them

Cell
expansion

•  Data Integrity
–  Manual data entries are prone to
errors. There is no immediate
feedback to examine the validity of
the values entered

Virus
propagation

•  Data Completeness
–  Manual data entry is time
consuming. There is no feedback
on what data is most useful in
improving the efficiency and
quality and hence no prioritization
of what data should be collected
© Copyright 2013 Pivotal. All rights reserved.

Pooling into
final product

22
Enabling predictive models through rearchitecting
Challenges
•  Accessibility
–  Certain parts of the data have
never been used in any predictive
modeling since it is extremely hard
to query them

Purpose-built data models for rapid
data querying and exploration

•  Data Integrity
–  Manual data entries are prone to
errors. There is no immediate
feedback to examine the validity of
the values entered

Automated data cleansing
techniques

•  Data Completeness
–  Manual data entry is time
consuming. There is no feedback
on what data is most useful in
improving the efficiency and
quality and hence no prioritization
of what data should be collected
© Copyright 2013 Pivotal. All rights reserved.

Opportunities to eliminate collection
of incomplete or non-predictive data

23
Identifying and correcting data integrity problems
Creating automated methods for detection and correction
all data

60

80

100

!  Data integrity problems cause
challenges in modeling

0

20

40

!  Sources of variation in entries
of measurements
1

3

5

7

9

11

13

15

17

19

21

23

–  Variable units of
measurement
–  Manual data entry errors

Approach: Detect the optimal
threshold to separate two
distributions
© Copyright 2013 Pivotal. All rights reserved.

24
Identifying and correcting data integrity problems
Creating automated methods for detection and correction
all data

60

80

100

!  Data integrity problems cause
challenges in modeling

20

40

!  Sources of variation in entries
of measurements
–  Variable units of
measurement
–  Manual data entry errors

0

1

3

5

7

9

11

13

15

17

19

lower half
lower half
upper half

23

!  Approach: Detect the
optimal threshold to
separate two distributions

40
10

20

510 5 20 10
10 15 20
30

lower half

30

Frequency

15
40

50

5020 60

60

upper half

0

0
00

Frequency
Frequency Frequency

21

0.12

0.12
0.12 12

0.14

0.16

0.18

0.20

0.14
0.16
0.18
0.14 newVals[seq(1, maxBreak, 1)] 0.20 22
0.16
14
16
180.18 20 0.20
newVals[seq(1, maxBreak, 1)]
newVals[seq(1, maxBreak, 1)]
newVals[seq(maxBreak + 1, length(newVals), 1)]

© Copyright 2013 Pivotal. All rights reserved.

0.22

0.22
0.22
24

12

14

16

18

20

22

24

newVals[seq(maxBreak + 1, length(newVals), 1)]

25
Identifying and correcting data integrity problems
Creating automated methods for detection and correction

0

20

40

60

80

100

all data

1

3

5

7

9

11

13

15

17

19

lower half
lower half
upper half

23

Foreground

Background

40
10

20

510 5 20 10
10 15 20
30

lower half

30

Frequency

15
40

50

5020 60

60

upper half

0

0
00

Frequency
Frequency Frequency

21

0.12

0.12
0.12 12

0.14

0.16

0.18

0.20

0.14
0.16
0.18
0.14 newVals[seq(1, maxBreak, 1)] 0.20 22
0.16
14
16
180.18 20 0.20
newVals[seq(1, maxBreak, 1)]
newVals[seq(1, maxBreak, 1)]
newVals[seq(maxBreak + 1, length(newVals), 1)]

© Copyright 2013 Pivotal. All rights reserved.

0.22

0.22
0.22
24

12

14

16

18

20

22

24

newVals[seq(maxBreak + 1, length(newVals), 1)]

26
Identifying and correcting data integrity problems
Creating automated methods for detection and correction

0

20

40

60

80

100

all data

1

3

5

7

9

11

13

15

17

19

lower half
lower half
upper half

23

Foreground

Background

40
10

20

510 5 20 10
10 15 20
30

lower half

30

Frequency

15
40

50

5020 60

60

upper half

0

0
00

Frequency
Frequency Frequency

21

0.12

0.12
0.12 12

0.14

0.16

0.18

0.20

0.14
0.16
0.18
0.14 newVals[seq(1, maxBreak, 1)] 0.20 22
0.16
14
16
180.18 20 0.20
newVals[seq(1, maxBreak, 1)]
newVals[seq(1, maxBreak, 1)]
newVals[seq(maxBreak + 1, length(newVals), 1)]

© Copyright 2013 Pivotal. All rights reserved.

0.22

0.22
0.22
24

12

14

16

18

20

22

24

newVals[seq(maxBreak + 1, length(newVals), 1)]

27
Identifying and correcting data integrity problems
Creating automated methods for detection and correction

60

80

100

all data

5

7

9

11

13

15

17

19

lower half
lower half
upper half

23

0

40

12

20

510 5 20 10
10 15 20
30

lower half

30

Frequency

15
40

50

5020 60

60

20
20

upper half

12
12

14

14
14

16 16
16

18 18
18

20 20
20

22 22
22

24
24

10

c(loh, uph)

0

0
00

Frequency
Frequency Frequency

21

40
40

3

Frequency

1

60
60

0

20

8080

40

cleanedHistogram of c(loh, uph) = 100
histogram with multiplier

0.12

0.12
0.12 12

0.14

0.16

0.18

0.20

0.14
0.16
0.18
0.14 newVals[seq(1, maxBreak, 1)] 0.20 22
0.16
14
16
180.18 20 0.20
newVals[seq(1, maxBreak, 1)]
newVals[seq(1, maxBreak, 1)]
newVals[seq(maxBreak + 1, length(newVals), 1)]

© Copyright 2013 Pivotal. All rights reserved.

0.22

0.22
0.22
24

12

14

16

18

20

22

24

newVals[seq(maxBreak + 1, length(newVals), 1)]

28
Building models: First, start with the answer
How to build models that solve the right problem
Cell
expansion

Approach: Use historical data to build a model
predicting potency of a final product using data
from the manufacturing process
!  Model form, how do we pick the right one?

Virus
propagation

–  How do we deal with correlated features?
–  Accuracy or interpretability?

!  Available data
Pooling into
final product

© Copyright 2013 Pivotal. All rights reserved.

–  Thousands of features, without expert guidance how do we
choose the right ones?
–  What data do we want to use to predict? When is the right
time for an intervention?

29
Model generation and evaluation
Predicting vaccine potency using manufacturing data

13.5

!  Feature engineering and transformation

Test R2=0.742
Train R2=0.823

–  Enabled by rapid in-database processing

●
●
●

13.0

●

●

predTest[, i]

Predicted Potency

Total test 0.742003189411406

●

●

●
●
●

12.5

●
●

●
●
●
●

●
●

●
●
●
●
●●
●
●●
●
●
●
●
●
●
● ● ●●
●
● ●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●●
●
●● ●
●
●
●●

●
●

●
● ●
●
●
●
●●
●
●
● ● ●
●● ● ●
●
●
●
●●●
●●
●
●
● ●
● ●
●
●
● ●
●
●
●
●
●
●
● ●
●
●
●
● ●● ● ●
●
●
●
●
●
●
●● ● ● ●●● ●
●
●
●
●
●
● ● ●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●

●

12.0

●
●
●

●● ●
●

●

●

●
● ●

●
●

●
●

●
●●
●●
●
●● ● ●
● ●
● ●●
●
●
●
●● ●
● ● ●
●
●
●
●
●
●
●● ●
● ● ●
●● ●
●
●
●
●
●
●
●● ●
● ● ●
● ●●●
●
●●
●
●
●●
●
● ●●
●
●
● ●
● ●
●
●
●
●●
●
●●
●
●
●
●
●

●

●
●

● ●
●

●
●

●

●

●
●

●

●
●
●
●
●
●

●
●
●

●

●●

–  Partial least squares
–  Random forest
–  Regularized regression

●

●
●
●

!  Interpretation of model results for
insight generation

●
●

●

●

●

12.0

12.5

13.0

True Potency
allTest[, i]

© Copyright 2013 Pivotal. All rights reserved.

!  Experimentation with model forms

13.5

–  Use cross-validation framework to
assess variable importance
30
Sample model insights
Interpreting the utility of a measure obtained during manufacturing based
on model outcomes
13.0
12.8

13.0

Log of Potency

12.6

Potency

12.6

12.2

12.4

!  Features consistently absent
from models may be
uninformative for predicting
potency

12.4

12.8

Potency

12.0

12.2
12.0

Log of Potency

!  Some features may reveal
tunable parameters to alter
potency, others may simply
be markers

Correlation = 0.38

Correlation = -0.45

0.20

0.25

0.30

0.35

0.40

SP1 Total Viable Cells Harvested Per Sq. Cm

Assayed value

© Copyright 2013 Pivotal. All rights reserved.

0.45

12

12.5

13

13.5

14

14.5

15

15.5

SP2 Total Trypsinization Exposure Time of per CCS

Duration of a step

>=16

!  Opportunities to provide realtime feedback on data entry
errors and predicted potency
outcomes

31
Data-driven drugs
Opportunities for data mining across the
pharmaceutical industry

© Copyright 2013 Pivotal. All rights reserved.

32
Data driven drugs: From discovery to delivery
Drug discovery
+ development
Clinical
Trials
Distribution and
surveillance

Manufacturing
Marketing

© Copyright 2013 Pivotal. All rights reserved.

33
Data driven drugs: From discovery to delivery
Drug discovery
+ development
Clinical
Trials
Distribution and
surveillance

!  Data repurposing
New value exists in leveraging
historical data across drugs and stages
!  Data discovery
External and publicly available
datasets can augment proprietary
sources

Manufacturing !  Data collection
Marketing

© Copyright 2013 Pivotal. All rights reserved.

Obtaining new data from different
sources drives additional value

34
Data driven drugs: From discovery to delivery
Drug discovery
+ development
Clinical
Trials
Distribution and
surveillance

!  Data repurposing
New value exists in leveraging
historical data across drugs and stages
Adverse events for new clinical
indications
!  Data discovery
External and publicly available
datasets can augment proprietary
sources
Twitter data to forecast demand

Manufacturing !  Data collection
Marketing

© Copyright 2013 Pivotal. All rights reserved.

Obtaining new data from different
sources drives additional value
Mobile and sensor data to measure
patient adherence and outcomes
35
Leveraging Data to Improve Demand Forecasts
Hospitals
Doctor’s Offices
Supply Distr.

Surgery Centers

Sales Data

Pharmacies

Analyze orders from
customers

Patients

Laboratories

Self-Reporting

Publicly Available Resources
Monitoring Patient Populations

© Copyright 2013 Pivotal. All rights reserved.

36
Promising Advancements in Diabetes Studies
Use of telehealth to provide tight glucose control

Biochemical
Measurements

EMR
Genomics
Lifestyle

Intervention

© Copyright 2013 Pivotal. All rights reserved.

37
Launching a successful diabetes management program
Multiple potential points of failure, requires use of analytics at every step

Increase
Awareness

Patient
Enrollment

Comparative
Effectiveness

Remote
Patient
Monitoring

Design
Interventions

Measure
Impact on
Population

Best channel
per cohort

Best therapy for
Resource
each cohort:
allocation
Identify highest
•  Medication
decisions
impact channels
•  Delivery
Medication
Method
adherence
Stochastic •  Monitoring
Churn
Identify
entity
prediction
influencers
Method
Predict risk of
resolution
negative
Measure
Campaign
outcome for
engagement
optimization
A/B testing to design best
next 3 months
engagement platform
© Copyright 2013 Pivotal. All rights reserved.

Attribution
models

Careful design
of experiment to
quantify the
Impact

38
Launching a successful diabetes management program
Interdisciplinary collaboration of data scientists essential to success
Marketing

Increase
Awareness

Healthcare

Patient
Enrollment

Web Analytics

Comparative
Effectiveness

Remote
Patient
Monitoring

Optimization

Design
Interventions

General ML

Measure
Impact on
Population

Best channel
per cohort

Best therapy for
Resource
each cohort:
allocation
Identify highest
•  Medication
decisions
impact channels
•  Delivery
Medication
Method
adherence
Stochastic •  Monitoring
Churn
Identify
entity
prediction
influencers
Method
Predict risk of
resolution
negative
Measure
Campaign
outcome for
engagement
optimization
A/B testing to design best
next 3 months
engagement platform
© Copyright 2013 Pivotal. All rights reserved.

Attribution
models

Careful design
of experiment
to quantify the
Impact

39
Pivotal Labs rapid application development
!  Rheumatoid arthritis remote patient
monitoring system
–  Self-reporting
–  Intuitive user interface

https://itunes.apple.com/us/app/myra/id563338979?mt=8

© Copyright 2013 Pivotal. All rights reserved.

40
Pivotal One: Heritage
Application Fabric

Data Fabric

GemFire

Ingest & Query: very high-capacity &
in-memory
Scale-out storage: HDFS/Object

vFabric
Languages
&
Frameworks

Services

Analytics

Automation: App Provisioning & Life-cycle
Service Registry
Cloud Abstraction (portability)

Cloud Fabric

© Copyright 2013 Pivotal. All rights reserved.

41
A NEW PLATFORM FOR A NEW ERA

Más contenido relacionado

La actualidad más candente

Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...Saama
 
Big-Data in HealthCare _ Overview
Big-Data in HealthCare _ OverviewBig-Data in HealthCare _ Overview
Big-Data in HealthCare _ OverviewHamdaoui Younes
 
Big data analytics in healthcare
Big data analytics in healthcareBig data analytics in healthcare
Big data analytics in healthcareJoseph Thottungal
 
Big data in Healthcare & Life Sciences
Big data in Healthcare & Life SciencesBig data in Healthcare & Life Sciences
Big data in Healthcare & Life SciencesMatthias Vallaey
 
Building a Next Generation Clinical and Scientific Data Management Solution
Building a Next Generation Clinical and Scientific Data Management SolutionBuilding a Next Generation Clinical and Scientific Data Management Solution
Building a Next Generation Clinical and Scientific Data Management SolutionSaama
 
Big data in healthcare
Big data in healthcareBig data in healthcare
Big data in healthcareDeZyre
 
Future of RWE - Big Data and Analytics for Pharma 2017 presentation
Future of RWE - Big Data and Analytics for Pharma 2017 presentationFuture of RWE - Big Data and Analytics for Pharma 2017 presentation
Future of RWE - Big Data and Analytics for Pharma 2017 presentationSaama
 
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...Perficient, Inc.
 
Our Journey to Release a Patient-Centric AI App to Reduce Public Health Costs
Our Journey to Release a Patient-Centric AI App to Reduce Public Health CostsOur Journey to Release a Patient-Centric AI App to Reduce Public Health Costs
Our Journey to Release a Patient-Centric AI App to Reduce Public Health CostsDatabricks
 
BIG Data & Hadoop Applications in Healthcare
BIG Data & Hadoop Applications in HealthcareBIG Data & Hadoop Applications in Healthcare
BIG Data & Hadoop Applications in HealthcareSkillspeed
 
Big data and the Healthcare Sector
Big data and the Healthcare Sector Big data and the Healthcare Sector
Big data and the Healthcare Sector Chris Groves
 
Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data SAP Technology
 
Next Gen Clinical Data Sciences
Next Gen Clinical Data SciencesNext Gen Clinical Data Sciences
Next Gen Clinical Data SciencesSaama
 
UCSF Informatics Day 2014 - Doug Berman, "A Brief Tour of UCSF’s Clinical Dat...
UCSF Informatics Day 2014 - Doug Berman, "A Brief Tour of UCSF’s Clinical Dat...UCSF Informatics Day 2014 - Doug Berman, "A Brief Tour of UCSF’s Clinical Dat...
UCSF Informatics Day 2014 - Doug Berman, "A Brief Tour of UCSF’s Clinical Dat...CTSI at UCSF
 
How BrackenData Leverages Data on Over 250,000 Clinical Trials
How BrackenData Leverages Data on Over 250,000 Clinical TrialsHow BrackenData Leverages Data on Over 250,000 Clinical Trials
How BrackenData Leverages Data on Over 250,000 Clinical TrialsBracken
 
HealthCare and Big Data with Hadoop
HealthCare and Big Data with HadoopHealthCare and Big Data with Hadoop
HealthCare and Big Data with HadoopEdureka!
 
Big Data Solutions for Healthcare
Big Data Solutions for HealthcareBig Data Solutions for Healthcare
Big Data Solutions for HealthcareOdinot Stanislas
 
Late Binding in Data Warehouses: Desiging for Analytic Agility
Late Binding in Data Warehouses: Desiging for Analytic AgilityLate Binding in Data Warehouses: Desiging for Analytic Agility
Late Binding in Data Warehouses: Desiging for Analytic AgilityHealth Catalyst
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareDATA360US
 

La actualidad más candente (20)

Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
 
Big-Data in HealthCare _ Overview
Big-Data in HealthCare _ OverviewBig-Data in HealthCare _ Overview
Big-Data in HealthCare _ Overview
 
Big data analytics in healthcare
Big data analytics in healthcareBig data analytics in healthcare
Big data analytics in healthcare
 
Big data in Healthcare & Life Sciences
Big data in Healthcare & Life SciencesBig data in Healthcare & Life Sciences
Big data in Healthcare & Life Sciences
 
Building a Next Generation Clinical and Scientific Data Management Solution
Building a Next Generation Clinical and Scientific Data Management SolutionBuilding a Next Generation Clinical and Scientific Data Management Solution
Building a Next Generation Clinical and Scientific Data Management Solution
 
Big data in healthcare
Big data in healthcareBig data in healthcare
Big data in healthcare
 
Future of RWE - Big Data and Analytics for Pharma 2017 presentation
Future of RWE - Big Data and Analytics for Pharma 2017 presentationFuture of RWE - Big Data and Analytics for Pharma 2017 presentation
Future of RWE - Big Data and Analytics for Pharma 2017 presentation
 
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
 
Our Journey to Release a Patient-Centric AI App to Reduce Public Health Costs
Our Journey to Release a Patient-Centric AI App to Reduce Public Health CostsOur Journey to Release a Patient-Centric AI App to Reduce Public Health Costs
Our Journey to Release a Patient-Centric AI App to Reduce Public Health Costs
 
Life Science Analytics
Life Science AnalyticsLife Science Analytics
Life Science Analytics
 
BIG Data & Hadoop Applications in Healthcare
BIG Data & Hadoop Applications in HealthcareBIG Data & Hadoop Applications in Healthcare
BIG Data & Hadoop Applications in Healthcare
 
Big data and the Healthcare Sector
Big data and the Healthcare Sector Big data and the Healthcare Sector
Big data and the Healthcare Sector
 
Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data
 
Next Gen Clinical Data Sciences
Next Gen Clinical Data SciencesNext Gen Clinical Data Sciences
Next Gen Clinical Data Sciences
 
UCSF Informatics Day 2014 - Doug Berman, "A Brief Tour of UCSF’s Clinical Dat...
UCSF Informatics Day 2014 - Doug Berman, "A Brief Tour of UCSF’s Clinical Dat...UCSF Informatics Day 2014 - Doug Berman, "A Brief Tour of UCSF’s Clinical Dat...
UCSF Informatics Day 2014 - Doug Berman, "A Brief Tour of UCSF’s Clinical Dat...
 
How BrackenData Leverages Data on Over 250,000 Clinical Trials
How BrackenData Leverages Data on Over 250,000 Clinical TrialsHow BrackenData Leverages Data on Over 250,000 Clinical Trials
How BrackenData Leverages Data on Over 250,000 Clinical Trials
 
HealthCare and Big Data with Hadoop
HealthCare and Big Data with HadoopHealthCare and Big Data with Hadoop
HealthCare and Big Data with Hadoop
 
Big Data Solutions for Healthcare
Big Data Solutions for HealthcareBig Data Solutions for Healthcare
Big Data Solutions for Healthcare
 
Late Binding in Data Warehouses: Desiging for Analytic Agility
Late Binding in Data Warehouses: Desiging for Analytic AgilityLate Binding in Data Warehouses: Desiging for Analytic Agility
Late Binding in Data Warehouses: Desiging for Analytic Agility
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for Healthcare
 

Destacado

Drug Repurposing Against Infectious Diseases
Drug Repurposing Against Infectious Diseases Drug Repurposing Against Infectious Diseases
Drug Repurposing Against Infectious Diseases Philip Bourne
 
เรื่อง Facebook ม.5.1 ล่าสุด
เรื่อง Facebook ม.5.1 ล่าสุดเรื่อง Facebook ม.5.1 ล่าสุด
เรื่อง Facebook ม.5.1 ล่าสุดeinscream
 
Pedagogisk utprøving med ny vri
Pedagogisk utprøving med ny vriPedagogisk utprøving med ny vri
Pedagogisk utprøving med ny vrirhusem
 
ภาษาไทย
ภาษาไทยภาษาไทย
ภาษาไทยjojowhisky
 
EMC IT's Journey to Cloud : IT PRODUCTION SERVER VIRTUALIZATION
EMC IT's Journey to Cloud : IT PRODUCTION SERVER VIRTUALIZATION EMC IT's Journey to Cloud : IT PRODUCTION SERVER VIRTUALIZATION
EMC IT's Journey to Cloud : IT PRODUCTION SERVER VIRTUALIZATION EMC
 
Information security 2008
Information security 2008Information security 2008
Information security 2008xrayjamie79
 
знатокам литературы (к учебнику 6 класса)
знатокам литературы (к учебнику 6 класса)знатокам литературы (к учебнику 6 класса)
знатокам литературы (к учебнику 6 класса)irinaanissimova
 
Reinforcedslab 100917010457-phpapp02
Reinforcedslab 100917010457-phpapp02Reinforcedslab 100917010457-phpapp02
Reinforcedslab 100917010457-phpapp02Laquiao Ganagan
 

Destacado (13)

Drug Repurposing Against Infectious Diseases
Drug Repurposing Against Infectious Diseases Drug Repurposing Against Infectious Diseases
Drug Repurposing Against Infectious Diseases
 
เรื่อง Facebook ม.5.1 ล่าสุด
เรื่อง Facebook ม.5.1 ล่าสุดเรื่อง Facebook ม.5.1 ล่าสุด
เรื่อง Facebook ม.5.1 ล่าสุด
 
Atlassian Clover
Atlassian CloverAtlassian Clover
Atlassian Clover
 
Pedagogisk utprøving med ny vri
Pedagogisk utprøving med ny vriPedagogisk utprøving med ny vri
Pedagogisk utprøving med ny vri
 
50 states
50 states50 states
50 states
 
ภาษาไทย
ภาษาไทยภาษาไทย
ภาษาไทย
 
EMC IT's Journey to Cloud : IT PRODUCTION SERVER VIRTUALIZATION
EMC IT's Journey to Cloud : IT PRODUCTION SERVER VIRTUALIZATION EMC IT's Journey to Cloud : IT PRODUCTION SERVER VIRTUALIZATION
EMC IT's Journey to Cloud : IT PRODUCTION SERVER VIRTUALIZATION
 
Information security 2008
Information security 2008Information security 2008
Information security 2008
 
знатокам литературы (к учебнику 6 класса)
знатокам литературы (к учебнику 6 класса)знатокам литературы (к учебнику 6 класса)
знатокам литературы (к учебнику 6 класса)
 
Flip book
Flip bookFlip book
Flip book
 
Flip book
Flip bookFlip book
Flip book
 
Flip book
Flip bookFlip book
Flip book
 
Reinforcedslab 100917010457-phpapp02
Reinforcedslab 100917010457-phpapp02Reinforcedslab 100917010457-phpapp02
Reinforcedslab 100917010457-phpapp02
 

Similar a Data-Driven Insights to Improve Pharmaceutical Manufacturing

Data drift and machine learning
Data drift and machine learningData drift and machine learning
Data drift and machine learningSmita Agrawal
 
Keeping the Pulse of Your Data:  Why You Need Data Observability 
Keeping the Pulse of Your Data:  Why You Need Data Observability Keeping the Pulse of Your Data:  Why You Need Data Observability 
Keeping the Pulse of Your Data:  Why You Need Data Observability Precisely
 
Big data-analytics-for-smart-manufacturing-systems-report
Big data-analytics-for-smart-manufacturing-systems-reportBig data-analytics-for-smart-manufacturing-systems-report
Big data-analytics-for-smart-manufacturing-systems-reportAravindharamanan S
 
Data drift and machine learning
Data drift and machine learningData drift and machine learning
Data drift and machine learningSmita Agrawal
 
How to generate Synthetic Data for an effective App Testing strategy.pdf
How to generate Synthetic Data for an effective App Testing strategy.pdfHow to generate Synthetic Data for an effective App Testing strategy.pdf
How to generate Synthetic Data for an effective App Testing strategy.pdfpCloudy
 
Improving practitioner decision making capabilities with data and analytics v1
Improving practitioner decision making capabilities with data and analytics v1Improving practitioner decision making capabilities with data and analytics v1
Improving practitioner decision making capabilities with data and analytics v1Ali Khan
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsChase Hamilton
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced Analyticsd-Wise Technologies
 
BioSymetrics Bioconnect April 2018
BioSymetrics Bioconnect April 2018BioSymetrics Bioconnect April 2018
BioSymetrics Bioconnect April 2018Wendy Tsai
 
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...Bigfinite
 
RWE & Patient Analytics Leveraging Databricks – A Use Case
RWE & Patient Analytics Leveraging Databricks – A Use CaseRWE & Patient Analytics Leveraging Databricks – A Use Case
RWE & Patient Analytics Leveraging Databricks – A Use CaseDatabricks
 
Self Service BI for Healthcare
Self Service BI for HealthcareSelf Service BI for Healthcare
Self Service BI for HealthcareVeerendra Raju
 
Consumer Behavior: Factors Affecting Member Attrition and Retention
Consumer Behavior: Factors Affecting Member Attrition and RetentionConsumer Behavior: Factors Affecting Member Attrition and Retention
Consumer Behavior: Factors Affecting Member Attrition and RetentionAltegra Health
 
Healthcare Analytics Adoption Model
Healthcare Analytics Adoption ModelHealthcare Analytics Adoption Model
Healthcare Analytics Adoption ModelHealth Catalyst
 
Remote Urine Analysis for Cancer Patients At Home Using Machine Learning
Remote Urine Analysis for Cancer Patients At Home Using Machine LearningRemote Urine Analysis for Cancer Patients At Home Using Machine Learning
Remote Urine Analysis for Cancer Patients At Home Using Machine LearningGslab1
 
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...Value Amplify Consulting
 
Enterprise Information Architecture Using Data Mining
Enterprise Information Architecture Using Data MiningEnterprise Information Architecture Using Data Mining
Enterprise Information Architecture Using Data Miningcshamik
 
Transforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform StrategyTransforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform StrategyDatabricks
 

Similar a Data-Driven Insights to Improve Pharmaceutical Manufacturing (20)

Data drift and machine learning
Data drift and machine learningData drift and machine learning
Data drift and machine learning
 
Keeping the Pulse of Your Data:  Why You Need Data Observability 
Keeping the Pulse of Your Data:  Why You Need Data Observability Keeping the Pulse of Your Data:  Why You Need Data Observability 
Keeping the Pulse of Your Data:  Why You Need Data Observability 
 
Big data-analytics-for-smart-manufacturing-systems-report
Big data-analytics-for-smart-manufacturing-systems-reportBig data-analytics-for-smart-manufacturing-systems-report
Big data-analytics-for-smart-manufacturing-systems-report
 
Data drift and machine learning
Data drift and machine learningData drift and machine learning
Data drift and machine learning
 
How to generate Synthetic Data for an effective App Testing strategy.pdf
How to generate Synthetic Data for an effective App Testing strategy.pdfHow to generate Synthetic Data for an effective App Testing strategy.pdf
How to generate Synthetic Data for an effective App Testing strategy.pdf
 
Improving practitioner decision making capabilities with data and analytics v1
Improving practitioner decision making capabilities with data and analytics v1Improving practitioner decision making capabilities with data and analytics v1
Improving practitioner decision making capabilities with data and analytics v1
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced Analytics
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced Analytics
 
BioSymetrics Bioconnect April 2018
BioSymetrics Bioconnect April 2018BioSymetrics Bioconnect April 2018
BioSymetrics Bioconnect April 2018
 
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
 
RWE & Patient Analytics Leveraging Databricks – A Use Case
RWE & Patient Analytics Leveraging Databricks – A Use CaseRWE & Patient Analytics Leveraging Databricks – A Use Case
RWE & Patient Analytics Leveraging Databricks – A Use Case
 
Sgcp14dunlea
Sgcp14dunleaSgcp14dunlea
Sgcp14dunlea
 
Self Service BI for Healthcare
Self Service BI for HealthcareSelf Service BI for Healthcare
Self Service BI for Healthcare
 
Self Service BI for Healthcare
Self Service BI for HealthcareSelf Service BI for Healthcare
Self Service BI for Healthcare
 
Consumer Behavior: Factors Affecting Member Attrition and Retention
Consumer Behavior: Factors Affecting Member Attrition and RetentionConsumer Behavior: Factors Affecting Member Attrition and Retention
Consumer Behavior: Factors Affecting Member Attrition and Retention
 
Healthcare Analytics Adoption Model
Healthcare Analytics Adoption ModelHealthcare Analytics Adoption Model
Healthcare Analytics Adoption Model
 
Remote Urine Analysis for Cancer Patients At Home Using Machine Learning
Remote Urine Analysis for Cancer Patients At Home Using Machine LearningRemote Urine Analysis for Cancer Patients At Home Using Machine Learning
Remote Urine Analysis for Cancer Patients At Home Using Machine Learning
 
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
 
Enterprise Information Architecture Using Data Mining
Enterprise Information Architecture Using Data MiningEnterprise Information Architecture Using Data Mining
Enterprise Information Architecture Using Data Mining
 
Transforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform StrategyTransforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform Strategy
 

Más de EMC

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDEMC
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote EMC
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOEMC
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremioEMC
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lakeEMC
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereEMC
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History EMC
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewEMC
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeEMC
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic EMC
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityEMC
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeEMC
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015EMC
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesEMC
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsEMC
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookEMC
 

Más de EMC (20)

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremio
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis Openstack
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop Elsewhere
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical Review
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or Foe
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for Security
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure Age
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education Services
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere Environments
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBook
 

Último

Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 

Último (20)

Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 

Data-Driven Insights to Improve Pharmaceutical Manufacturing

  • 1. A NEW PLATFORM FOR A NEW ERA
  • 2. Data Driven Drugs: Predictive Models to Improve Product Quality in Pharmaceuticals Sarah Aerni, PhD Senior Data Scientist at Pivotal saerni@gopivotal.com Strata RX September 26, 2013 © Copyright 2013 Pivotal. All rights reserved. 2
  • 3. The Quantified Patient Medical History! Genetics! Family ! History! Imaging! Clinical! Narratives! Medications! Molecular! Diagnostics! Lab tests! Environment! © Copyright 2013 Pivotal. All rights reserved. Sensors! & Mobile! 3
  • 4. Data driven drugs: From discovery to delivery Drug discovery + development RICH DATA SOURCES Clinical Trials Distribution and surveillance !  Molecular data –  Cellular drug screens –  Animal models !  Clinical data including notes, images, markers (e.g. genomics, lab results) !  Sensor and assay data !  Internal and partner/purchased external data Manufacturing !  Contact center data Marketing © Copyright 2013 Pivotal. All rights reserved. !  Patient registries, public and federal data, clinical partnerships 4
  • 5. Data integration How Pivotal can enable industries to extract new value from data sources © Copyright 2013 Pivotal. All rights reserved. 5
  • 6. Successful transformation into a data-driven enterprise requires a paradigm shift !  Bring available data sources to a central location Integration of a variety of data leads to new insights DATA IS THE NEW CENTER OF GRAVITY !  Analyze large volumes of variable data for richer models Building models without data movement reduces time to insight !  Share data, insights and ideas Leveraging various expertise will lead to more relevant business insights © Copyright 2013 Pivotal. All rights reserved. Data > Application! 6
  • 7. Traditional Analytics Processes If you think databases are only good for storing data Time-to-Insights sample In-memory statistics tool In-memory optimization tool solution forecast © Copyright 2013 Pivotal. All rights reserved. 7
  • 8. Pivotal One: Heritage Application Fabric Data Fabric GemFire Ingest & Query: very high-capacity & in-memory Scale-out storage: HDFS/Object vFabric Languages & Frameworks Services Analytics Automation: App Provisioning & Life-cycle Service Registry Cloud Abstraction (portability) Cloud Fabric © Copyright 2013 Pivotal. All rights reserved. 8
  • 9. Performance Through Parallelism !  Automatic parallelization Database –  Load and query like any database –  Automatically distributed tables across nodes –  No need for manual partitioning or tuning !  Analytics Optimized: –  Analytics-oriented query optimization !  Extremely scalable MPP shared-nothing architecture Interconnect Compute Storage Loading –  All nodes can scan and process in parallel –  Linear scalability by adding nodes © Copyright 2013 Pivotal. All rights reserved. 9
  • 10. Performance Through Parallelism !  Automatic parallelization Database –  Load and query like any database –  Automatically distributed tables across nodes –  No need for manual partitioning or tuning !  Analytics Optimized: –  Analytics-oriented query optimization !  Extremely scalable MPP shared-nothing architecture –  All nodes can scan and process in parallel –  Linear scalability by adding nodes © Copyright 2013 Pivotal. All rights reserved. Interconnect Compute Storage ETL Loadin File g Systems External Sources: Loading, streaming, etc. 10
  • 11. Pivotal HD Architecture Pivotal HD Enterprise Resource Management & Workflow Pig, Hive, Mahout HBase Map Reduce Configure, Monitor, Manage Hadoop Virtualization (HVE) Yarn Command HDFS Zookeeper Center Sqoop Apache © Copyright 2013 Pivotal. All rights reserved. Deploy, Data Loader Flume Pivotal HD Enterprise 11
  • 12. Pivotal HD Architecture HAWQ– Advanced Database Services ANSI SQL + Analytics Pivotal HD Enterprise Resource Management & Workflow Xtension Framework HBase Query Optimizer Dynamic Pipelining Pig, Hive, Mahout Map Reduce Deploy, Configure, Monitor, Manage Hadoop Virtualization (HVE) Yarn Command HDFS Zookeeper Center Sqoop Apache © Copyright 2013 Pivotal. All rights reserved. Catalog Services Flume Data Loader Pivotal HD Enterprise HAWQ 12
  • 13. Leveraging healthcare data to drive predictive and precision care Clinical! Narratives! Medications! Decision support Imaging! Precision care Genetics! Environment! Labs test! Cohort identification Unified data supporting unified risk evaluation, decision-making, etc. ! Acting on full patient and medical profile! © Copyright 2013 Pivotal. All rights reserved. 13
  • 14. Traditional Analytics Processes If you think databases are only good for storing data Time-to-Insights sample In-memory statistics tool In-memory optimization tool solution forecast © Copyright 2013 Pivotal. All rights reserved. 14
  • 15. Analytics with Pivotal A single address for everything analytics Time-to-Insights Forecasting Clustering Regression Optimization Classification © Copyright 2013 Pivotal. All rights reserved. 15
  • 17. MADlib: Machine Learning at Scale Collaborators © Copyright 2013 Pivotal. All rights reserved. 17
  • 18. Data driven drugs: From discovery to delivery Drug discovery + development !  Molecular data Clinical Trials Distribution and surveillance Marketing © Copyright 2013 Pivotal. All rights reserved. –  Cellular drug screens –  Animal models !  Clinical data including notes, images, markers (e.g. genomics, lab results) !  Sensor and assay data !  Internal and partner/purchased external data Manufacturing !  Contact center data !  Patient registries, public and federal data, clinical partnerships 18
  • 19. Manufacturing Data-driven approaches to tuning a drug manufacturing process © Copyright 2013 Pivotal. All rights reserved. 19
  • 20. Predicting potency in vaccine manufacturing Customer Solution A major pharmaceutical company •  Introduced a new data model to make data accessible and enable analytics •  Built automated outlier detection/ correction methods to address manual data entry quality issues •  Devised imputation methods to deal with data completeness issues •  Built predictive models with high accuracy Business Problem Predict potency and antigen levels of live virus vaccines based on manufacturing sensor data and manual data collected throughout the process. Challenges •  Customer’s data model was not optimal for running analytical queries •  Manual data quality issues •  Data capture was performed with varying consistency due to high cost associated with manual data collection © Copyright 2013 Pivotal. All rights reserved. 20
  • 21. Building predictive models to improved outcomes in manufacturing of vaccines Temp Counts Future Looking Predictive Models Cell expansion Virus propagation Duration of step Time Warning! Entered value not in expected range © Copyright 2013 Pivotal. All rights reserved. Pooling into final product Backward Looking Models 21
  • 22. Enabling predictive models through rearchitecting Challenges •  Accessibility –  Certain parts of the data have never been used in any predictive modeling since it is extremely hard to query them Cell expansion •  Data Integrity –  Manual data entries are prone to errors. There is no immediate feedback to examine the validity of the values entered Virus propagation •  Data Completeness –  Manual data entry is time consuming. There is no feedback on what data is most useful in improving the efficiency and quality and hence no prioritization of what data should be collected © Copyright 2013 Pivotal. All rights reserved. Pooling into final product 22
  • 23. Enabling predictive models through rearchitecting Challenges •  Accessibility –  Certain parts of the data have never been used in any predictive modeling since it is extremely hard to query them Purpose-built data models for rapid data querying and exploration •  Data Integrity –  Manual data entries are prone to errors. There is no immediate feedback to examine the validity of the values entered Automated data cleansing techniques •  Data Completeness –  Manual data entry is time consuming. There is no feedback on what data is most useful in improving the efficiency and quality and hence no prioritization of what data should be collected © Copyright 2013 Pivotal. All rights reserved. Opportunities to eliminate collection of incomplete or non-predictive data 23
  • 24. Identifying and correcting data integrity problems Creating automated methods for detection and correction all data 60 80 100 !  Data integrity problems cause challenges in modeling 0 20 40 !  Sources of variation in entries of measurements 1 3 5 7 9 11 13 15 17 19 21 23 –  Variable units of measurement –  Manual data entry errors Approach: Detect the optimal threshold to separate two distributions © Copyright 2013 Pivotal. All rights reserved. 24
  • 25. Identifying and correcting data integrity problems Creating automated methods for detection and correction all data 60 80 100 !  Data integrity problems cause challenges in modeling 20 40 !  Sources of variation in entries of measurements –  Variable units of measurement –  Manual data entry errors 0 1 3 5 7 9 11 13 15 17 19 lower half lower half upper half 23 !  Approach: Detect the optimal threshold to separate two distributions 40 10 20 510 5 20 10 10 15 20 30 lower half 30 Frequency 15 40 50 5020 60 60 upper half 0 0 00 Frequency Frequency Frequency 21 0.12 0.12 0.12 12 0.14 0.16 0.18 0.20 0.14 0.16 0.18 0.14 newVals[seq(1, maxBreak, 1)] 0.20 22 0.16 14 16 180.18 20 0.20 newVals[seq(1, maxBreak, 1)] newVals[seq(1, maxBreak, 1)] newVals[seq(maxBreak + 1, length(newVals), 1)] © Copyright 2013 Pivotal. All rights reserved. 0.22 0.22 0.22 24 12 14 16 18 20 22 24 newVals[seq(maxBreak + 1, length(newVals), 1)] 25
  • 26. Identifying and correcting data integrity problems Creating automated methods for detection and correction 0 20 40 60 80 100 all data 1 3 5 7 9 11 13 15 17 19 lower half lower half upper half 23 Foreground Background 40 10 20 510 5 20 10 10 15 20 30 lower half 30 Frequency 15 40 50 5020 60 60 upper half 0 0 00 Frequency Frequency Frequency 21 0.12 0.12 0.12 12 0.14 0.16 0.18 0.20 0.14 0.16 0.18 0.14 newVals[seq(1, maxBreak, 1)] 0.20 22 0.16 14 16 180.18 20 0.20 newVals[seq(1, maxBreak, 1)] newVals[seq(1, maxBreak, 1)] newVals[seq(maxBreak + 1, length(newVals), 1)] © Copyright 2013 Pivotal. All rights reserved. 0.22 0.22 0.22 24 12 14 16 18 20 22 24 newVals[seq(maxBreak + 1, length(newVals), 1)] 26
  • 27. Identifying and correcting data integrity problems Creating automated methods for detection and correction 0 20 40 60 80 100 all data 1 3 5 7 9 11 13 15 17 19 lower half lower half upper half 23 Foreground Background 40 10 20 510 5 20 10 10 15 20 30 lower half 30 Frequency 15 40 50 5020 60 60 upper half 0 0 00 Frequency Frequency Frequency 21 0.12 0.12 0.12 12 0.14 0.16 0.18 0.20 0.14 0.16 0.18 0.14 newVals[seq(1, maxBreak, 1)] 0.20 22 0.16 14 16 180.18 20 0.20 newVals[seq(1, maxBreak, 1)] newVals[seq(1, maxBreak, 1)] newVals[seq(maxBreak + 1, length(newVals), 1)] © Copyright 2013 Pivotal. All rights reserved. 0.22 0.22 0.22 24 12 14 16 18 20 22 24 newVals[seq(maxBreak + 1, length(newVals), 1)] 27
  • 28. Identifying and correcting data integrity problems Creating automated methods for detection and correction 60 80 100 all data 5 7 9 11 13 15 17 19 lower half lower half upper half 23 0 40 12 20 510 5 20 10 10 15 20 30 lower half 30 Frequency 15 40 50 5020 60 60 20 20 upper half 12 12 14 14 14 16 16 16 18 18 18 20 20 20 22 22 22 24 24 10 c(loh, uph) 0 0 00 Frequency Frequency Frequency 21 40 40 3 Frequency 1 60 60 0 20 8080 40 cleanedHistogram of c(loh, uph) = 100 histogram with multiplier 0.12 0.12 0.12 12 0.14 0.16 0.18 0.20 0.14 0.16 0.18 0.14 newVals[seq(1, maxBreak, 1)] 0.20 22 0.16 14 16 180.18 20 0.20 newVals[seq(1, maxBreak, 1)] newVals[seq(1, maxBreak, 1)] newVals[seq(maxBreak + 1, length(newVals), 1)] © Copyright 2013 Pivotal. All rights reserved. 0.22 0.22 0.22 24 12 14 16 18 20 22 24 newVals[seq(maxBreak + 1, length(newVals), 1)] 28
  • 29. Building models: First, start with the answer How to build models that solve the right problem Cell expansion Approach: Use historical data to build a model predicting potency of a final product using data from the manufacturing process !  Model form, how do we pick the right one? Virus propagation –  How do we deal with correlated features? –  Accuracy or interpretability? !  Available data Pooling into final product © Copyright 2013 Pivotal. All rights reserved. –  Thousands of features, without expert guidance how do we choose the right ones? –  What data do we want to use to predict? When is the right time for an intervention? 29
  • 30. Model generation and evaluation Predicting vaccine potency using manufacturing data 13.5 !  Feature engineering and transformation Test R2=0.742 Train R2=0.823 –  Enabled by rapid in-database processing ● ● ● 13.0 ● ● predTest[, i] Predicted Potency Total test 0.742003189411406 ● ● ● ● ● 12.5 ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ●● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ●●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ●● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 12.0 ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ●● ● ● ● ● ● ●● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ● ●● ● ● ● ● ● ●●● ● ●● ● ● ●● ● ● ●● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● –  Partial least squares –  Random forest –  Regularized regression ● ● ● ● !  Interpretation of model results for insight generation ● ● ● ● ● 12.0 12.5 13.0 True Potency allTest[, i] © Copyright 2013 Pivotal. All rights reserved. !  Experimentation with model forms 13.5 –  Use cross-validation framework to assess variable importance 30
  • 31. Sample model insights Interpreting the utility of a measure obtained during manufacturing based on model outcomes 13.0 12.8 13.0 Log of Potency 12.6 Potency 12.6 12.2 12.4 !  Features consistently absent from models may be uninformative for predicting potency 12.4 12.8 Potency 12.0 12.2 12.0 Log of Potency !  Some features may reveal tunable parameters to alter potency, others may simply be markers Correlation = 0.38 Correlation = -0.45 0.20 0.25 0.30 0.35 0.40 SP1 Total Viable Cells Harvested Per Sq. Cm Assayed value © Copyright 2013 Pivotal. All rights reserved. 0.45 12 12.5 13 13.5 14 14.5 15 15.5 SP2 Total Trypsinization Exposure Time of per CCS Duration of a step >=16 !  Opportunities to provide realtime feedback on data entry errors and predicted potency outcomes 31
  • 32. Data-driven drugs Opportunities for data mining across the pharmaceutical industry © Copyright 2013 Pivotal. All rights reserved. 32
  • 33. Data driven drugs: From discovery to delivery Drug discovery + development Clinical Trials Distribution and surveillance Manufacturing Marketing © Copyright 2013 Pivotal. All rights reserved. 33
  • 34. Data driven drugs: From discovery to delivery Drug discovery + development Clinical Trials Distribution and surveillance !  Data repurposing New value exists in leveraging historical data across drugs and stages !  Data discovery External and publicly available datasets can augment proprietary sources Manufacturing !  Data collection Marketing © Copyright 2013 Pivotal. All rights reserved. Obtaining new data from different sources drives additional value 34
  • 35. Data driven drugs: From discovery to delivery Drug discovery + development Clinical Trials Distribution and surveillance !  Data repurposing New value exists in leveraging historical data across drugs and stages Adverse events for new clinical indications !  Data discovery External and publicly available datasets can augment proprietary sources Twitter data to forecast demand Manufacturing !  Data collection Marketing © Copyright 2013 Pivotal. All rights reserved. Obtaining new data from different sources drives additional value Mobile and sensor data to measure patient adherence and outcomes 35
  • 36. Leveraging Data to Improve Demand Forecasts Hospitals Doctor’s Offices Supply Distr. Surgery Centers Sales Data Pharmacies Analyze orders from customers Patients Laboratories Self-Reporting Publicly Available Resources Monitoring Patient Populations © Copyright 2013 Pivotal. All rights reserved. 36
  • 37. Promising Advancements in Diabetes Studies Use of telehealth to provide tight glucose control Biochemical Measurements EMR Genomics Lifestyle Intervention © Copyright 2013 Pivotal. All rights reserved. 37
  • 38. Launching a successful diabetes management program Multiple potential points of failure, requires use of analytics at every step Increase Awareness Patient Enrollment Comparative Effectiveness Remote Patient Monitoring Design Interventions Measure Impact on Population Best channel per cohort Best therapy for Resource each cohort: allocation Identify highest •  Medication decisions impact channels •  Delivery Medication Method adherence Stochastic •  Monitoring Churn Identify entity prediction influencers Method Predict risk of resolution negative Measure Campaign outcome for engagement optimization A/B testing to design best next 3 months engagement platform © Copyright 2013 Pivotal. All rights reserved. Attribution models Careful design of experiment to quantify the Impact 38
  • 39. Launching a successful diabetes management program Interdisciplinary collaboration of data scientists essential to success Marketing Increase Awareness Healthcare Patient Enrollment Web Analytics Comparative Effectiveness Remote Patient Monitoring Optimization Design Interventions General ML Measure Impact on Population Best channel per cohort Best therapy for Resource each cohort: allocation Identify highest •  Medication decisions impact channels •  Delivery Medication Method adherence Stochastic •  Monitoring Churn Identify entity prediction influencers Method Predict risk of resolution negative Measure Campaign outcome for engagement optimization A/B testing to design best next 3 months engagement platform © Copyright 2013 Pivotal. All rights reserved. Attribution models Careful design of experiment to quantify the Impact 39
  • 40. Pivotal Labs rapid application development !  Rheumatoid arthritis remote patient monitoring system –  Self-reporting –  Intuitive user interface https://itunes.apple.com/us/app/myra/id563338979?mt=8 © Copyright 2013 Pivotal. All rights reserved. 40
  • 41. Pivotal One: Heritage Application Fabric Data Fabric GemFire Ingest & Query: very high-capacity & in-memory Scale-out storage: HDFS/Object vFabric Languages & Frameworks Services Analytics Automation: App Provisioning & Life-cycle Service Registry Cloud Abstraction (portability) Cloud Fabric © Copyright 2013 Pivotal. All rights reserved. 41
  • 42. A NEW PLATFORM FOR A NEW ERA