SlideShare una empresa de Scribd logo
1 de 34
How Big Data Insights become Easily
Accessible with Workflow Tools
Session Overview
➢ Introduction To Workflows and Big Data For Data Scientists/Data
Citizen
➢ Examples Of Customers Benefiting From Using Workflows –
Reduction In Cost, Speed To Deploy
➢ Pulling It All Together - Introduction To Deploying Models To
External Systems
➢ Technical Overview Into Building Predictive Analytics Workflows For
Big Data (Tibco Statistica)
3
Introduction To Workflows and
Big Data For Data
Scientists/Data Citizen
Insight Action
Making sense of the dataPlatform
© Copyright 2000-2017 TIBCO Software Inc.
PREDICT
MODEL
WRANGLE
ANALYZE
ACCESS
Predictive
Analytics
Visual
Analytics
Learning Cycles
MODEL
ACCESS
ANALYZE
WRANGLE
Insight
RULES
MODELS
© Copyright 2000-2017 TIBCO Software Inc.
MONITOR
PREDICT
ACT
DECIDE
MODELPredictive
Analytics
Streaming
Analytics
Action
MONITOR
PREDICT
ACT
DECIDE
Operational Cycles
RULES
MODELS
© Copyright 2000-2017 TIBCO Software Inc.
Personas
Its all about empowering more people
10
Examples Of Customers
Benefiting From Using
Workflows
In Cost, Speed To Deploy
Reduction
Big data analytics transforms
the operating room
CASE STUDY
Company: University of Iowa (UIHC) | Industry: Healthcare | Country: USA | Web: www.uihealthcare.org
UIHC surgeons needed to know the susceptibility of patients to
infections in order to make critical treatment decisions in the
operating room. Infection rates have major implications to overall
patient health and cost savings.
UIHC used Big Data and Analytics and transformed outcomes as
different points on the patients care.
Reduced surgical site infection occurrence by 58 percent
Merged historical and live patient data to predict likelihood of infection
Personalized care based on patients’ own characteristics
Improved efficiency by enabling staff to run predictive models and access
results with a mobile application or web browser
BUSINESS CHALLENGE
SOLUTION
RESULTS
© Copyright 2000-2017 TIBCO Software Inc.
“Predictive analytics is allowing us to deal with the ever-increasing types of data that healthcare institutions need to deal with.”
Dr. John Cromwell, MD Director of Gastrointestinal Surgery
Bank speeds time to market
with advanced analytics
Business need
To deliver timely and accurate credit decisions and other customer
services in today’s 24/7 world, Danske Bank needed to be able to
quickly build and deploy advanced analytical models.
Benefits
● Slashed time to develop and deploy analytical models by 50
percent
● Improved decision-making with more advanced analytical models
● Delivered an easy-to-use, standardized toolbox that can quickly
be customized to meet users’ needs
● Ensured fast ROI by deploying easily and integrating smoothly with
existing systems
Solution
Enable customers to apply for products such as loans through Danske
Banks portal. Generate scoring models to determine whether customers
applications are accepted.
“We have reduced the time we spend
on models up to 50 percent with
Statistica. Our development process is
much leaner and smoother compared
to what it was before.”
Jens Christian Ipsen,
First Vice President, Danske Bank
13
➢ Pulling It All Together
Introduction To Deploying Models
To External Systems
PREDICT
MODEL
WRANGLE
ANALYZE
ACCESS
Predictive
Analytics
Visual
Analytics
Learning Cycles
MODEL
ACCESS
ANALYZE
WRANGLE
Insight
RULES
MODELS
© Copyright 2000-2017 TIBCO Software Inc.
Machine Learning in TIBCO Statistica
TIBCO StreamBase for real-time scoring and action
TIBCO Statistica Deploy To External Application
• Model built in TIBCO Statistica
• Score model in TIBCO StreamBase on live data
• Action: equipment intervention
MONITOR
PREDICT
ACT
DECIDE
MODELPredictive
Analytics
Streaming
Analytics
Action
MONITOR
PREDICT
ACT
DECIDE
Operational Cycles
RULES
MODELS
© Copyright 2000-2017 TIBCO Software Inc.
One stop shop for actionable insights
DATA SCIENCE STREAMING ANALYTICSBI & ANALYTICS
AI-driven visualization
to gain insight to
find actionable
insights
Create analytics that
can predict the future
based on history
Provide analytics and
take action on real
time streaming data
One stop shop for actionable insights
DATA SCIENCE STREAMING ANALYTICSBI & ANALYTICS
AI-driven visualization
to gain insight to
find actionable
insights
Create analytics that
can predict the future
based on history
Provide analytics and
take action on real
time streaming data
19
Technical Overview
Into Building
Predictive Analytics
Workflows For Big
Data (Tibco
Statistica)
Workflows
• 1000s of stats, machine and deep learning
• Supervised learning - models, ensambles
• Unsupervised learning - anomaly detection, clustering
• 100s native validated step nodes, workflows
Data Blending
• Traditional - SQL sources, Flat Files
• Big Data - HDFS in, out, data maps
Models and Rules Management
• Deployment Code generators
(C/C#/PMML/Java(POJO,MapReduce)/Teradata/SAS)
• Audit, validation, user and version control
Collaboration
• Scripted nodes (use/manage): R, Python, Scala, C#
• Algorithmic marketplaces plugins
Big Data
• In-database analytics
• H20, Spark Nodes
• Deep Learning (CNTK) © Copyright 2000-2017 TIBCO Software Inc.
Statistica - Data Science Workbench for Big Data Analytics
© Copyright 2000-2017 TIBCO Software Inc.
Simple workflow example - Predictive Modeling (Classic)
Statistica Enterprise Server
© Copyright 2000-2017 TIBCO Software Inc.
TIBCO Statistica Platform Architecture for Big Data
Model
Monitoring &
Process Control
Statistica Big
Data Analytics
Monitoring
Alerting
Server
Live Score
Server
Metadata
Repository
Document
Management
System
Spark, H2O,
In-DB, HDFS
wrappers
Change
management
& ComplianceAnalytics
Modelling
Deployment
Data
aggregation &
preparation
Web and API
access
Real time model
scoring
Enterprise Tools
Governance
Batch Jobs
Access Roles
Models / Rules
Management
© Copyright 2000-2017 TIBCO Software Inc.
Change control, access management and model versioning
Data Science Workbench
• Workflows - Package and maintain your predictive model building process steps
Data Blending
• HDFS data in,out,feed
H20 - Sparkling water
• Sparkling water nodes and workflow templates
Spark ML Nodes (Scala code)
• Data
• Feature Selection
• Decision Trees
• Regression
• Classification
• PCA …. plus workflow templates
Deep Learning (CNTK)
• Regression
• Classification
• Deployment
© Copyright 2000-2017 TIBCO Software Inc.
TIBCO Statistica for Big Data Analytic Pipelines
© Copyright 2000-2017 TIBCO Software Inc.
Big Data Analytics - In-Database Processing
Dedicated In-Database Steps
(nodes)
• Descriptives, Correlactions
• GLZ, Lasso Regression, ..
Supported SQL Databases
Support as of today for
• Microsoft SQL Server
• Oracle
• Apache Hive
• Teradata
• MySQL
(as of today, Statistica version 13.3)
Why
• Move compute to data
• Reduce data travel, use resourcesDatabase
SQL, ...In-Db
Step
Results,
Metadata
© Copyright 2000-2017 TIBCO Software Inc.
Statistica Collective Intelligence - App Market Connectors
Use models / code from marketplaces (*if good/trusted)
Example : Azure ML Model Consumer
Parameters of the Statistica Azure ML step:
• Model API key
• Connection
• Webservice in/out datasets
• Batch score in storage container (optional)
Why
• (Re)Use typical use cases models
Marketing Campaigns, Churn
Predictive Maintenance
etc
• Bring existing IP into platform and manage
• Data Prep /merge, clean, transform)
• Control / Schedule jobs for execution /
Monitor performance
Similar approach for other marketplaces
and existing code (R, Python, C#)
© Copyright 2000-2017 TIBCO Software Inc.
Open Source Options, Scripting
Any R package as a node
• Integrate any R package as
a process step
• Augment/customize core capabilities
Python scripts
SCALA code
Scripting in C#, VB
Native Custom (open code) Nodes
Shipped with many Examples
© Copyright 2000-2017 TIBCO Software Inc.
Deep Learning
CNTK based deep learning NNs
• Regression
• Classification
• Generic
• Deployment
H2O Deep Learning nodes
BTW - if you are not looking for deep learning
discovery of hidden intrinsic relationships in
with an unsupervised NN … we have the
“classical” Statistica Automated NNs
© Copyright 2000-2017 TIBCO Software Inc.
Statistica H2O nodes
H2O easy to use “wrappers”
• Example Workspaces provider
• All nodes described in Statistica Help
• See SparklingWaterBooklet.pdf from
h2o-release.s3.amazonaws.com/h2o/
© Copyright 2000-2017 TIBCO Software Inc.
Statistica Spark (Scala) nodes
© Copyright 2000-2017 TIBCO Software Inc.
Statistica Scala nodes - Architecture
Livy
Server
REST
Server
Managed Cluster
Spark at least ver. 2.0.2
(e.g. Spark nodes + YARN)
Statistica Desktop
Analytics Workbench
Statistica
Enterprise
Repository
© Copyright 2000-2017 TIBCO Software Inc.
Goals:
• Predict yield for semiconductor manufacturing process
• Detect potential quality issues early
Problems:
• Ultra-wide data: Thousands to Millions of variables
• Handling billion(s) of cases
• Mix of categorical and continuous predictors
• Sparse data
Solution:
• Spark parallel processing in big data platform
• Feature selection algorithm + Lasso regression
• Hadoop cluster (100s of cores)
• Statistica analytic workflows submit code to spark cluster
• Spotfire dashboards used for visualization
Performance:
• Analysis Running time reduced to minutes
Big Data – Practical Use case - Yield, Root Causes and
Quality Issues detection - Complex Manufacturing Process
© Copyright 2000-2017 TIBCO Software Inc.
Summary
Validated
Big Data
Options
36
Questions

Más contenido relacionado

La actualidad más candente

Open Source Data Management for Industry 4.0
Open Source Data Management for Industry 4.0Open Source Data Management for Industry 4.0
Open Source Data Management for Industry 4.0
DataWorks Summit
 
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address RequirementsGov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
DataWorks Summit
 
Stream Scaling in Pravega
Stream Scaling in PravegaStream Scaling in Pravega
Stream Scaling in Pravega
DataWorks Summit
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
DataWorks Summit
 

La actualidad más candente (20)

Driven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics PlatformDriven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics Platform
 
What is the future of data strategy?
What is the future of data strategy?What is the future of data strategy?
What is the future of data strategy?
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 
BIG DATA ANALYTICS MEANS “IN-DATABASE” ANALYTICS
BIG DATA ANALYTICS MEANS “IN-DATABASE” ANALYTICSBIG DATA ANALYTICS MEANS “IN-DATABASE” ANALYTICS
BIG DATA ANALYTICS MEANS “IN-DATABASE” ANALYTICS
 
Ibm big data
Ibm big dataIbm big data
Ibm big data
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
Modern Data Architecture
Modern Data Architecture Modern Data Architecture
Modern Data Architecture
 
Jan van der Vegt. Challenges faced with machine learning in practice
Jan van der Vegt. Challenges faced with machine learning in practiceJan van der Vegt. Challenges faced with machine learning in practice
Jan van der Vegt. Challenges faced with machine learning in practice
 
Benefits of Transferring Real-Time Data to Hadoop at Scale
Benefits of Transferring Real-Time Data to Hadoop at ScaleBenefits of Transferring Real-Time Data to Hadoop at Scale
Benefits of Transferring Real-Time Data to Hadoop at Scale
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
Big Data as a Service: A Neo-Metropolis Model Approach for Innovation
Big Data as a Service: A Neo-Metropolis Model Approach for InnovationBig Data as a Service: A Neo-Metropolis Model Approach for Innovation
Big Data as a Service: A Neo-Metropolis Model Approach for Innovation
 
Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...
 
ttec - ParStream
ttec - ParStreamttec - ParStream
ttec - ParStream
 
Open Source Data Management for Industry 4.0
Open Source Data Management for Industry 4.0Open Source Data Management for Industry 4.0
Open Source Data Management for Industry 4.0
 
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address RequirementsGov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
 Turning an idea into a Data-Driven Production System: An Energy Load Forecas... Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
 
Stream Scaling in Pravega
Stream Scaling in PravegaStream Scaling in Pravega
Stream Scaling in Pravega
 
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
 

Similar a Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workflow Tools

Apply Machine Learning to Microservices
Apply Machine Learning to MicroservicesApply Machine Learning to Microservices
Apply Machine Learning to Microservices
Kai Wähner
 

Similar a Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workflow Tools (20)

JASPERSOFT LIVE DEMO - NAM
JASPERSOFT LIVE DEMO - NAMJASPERSOFT LIVE DEMO - NAM
JASPERSOFT LIVE DEMO - NAM
 
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
 
Applying the R Language to BI and Real Time Applications
Applying the R Language to BI and Real Time ApplicationsApplying the R Language to BI and Real Time Applications
Applying the R Language to BI and Real Time Applications
 
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
 
Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?
 
R, Spark, Tensorflow, H20.ai Applied to Streaming Analytics
R, Spark, Tensorflow, H20.ai Applied to Streaming AnalyticsR, Spark, Tensorflow, H20.ai Applied to Streaming Analytics
R, Spark, Tensorflow, H20.ai Applied to Streaming Analytics
 
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
 
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
 
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
 
Apply Machine Learning to Microservices
Apply Machine Learning to MicroservicesApply Machine Learning to Microservices
Apply Machine Learning to Microservices
 
Data Science Salon: Applying Machine Learning to Modernize Business Processes
Data Science Salon: Applying Machine Learning to Modernize Business ProcessesData Science Salon: Applying Machine Learning to Modernize Business Processes
Data Science Salon: Applying Machine Learning to Modernize Business Processes
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
 
Integrating Advanced Analytics with Autodesk Solutions
Integrating Advanced Analytics with Autodesk SolutionsIntegrating Advanced Analytics with Autodesk Solutions
Integrating Advanced Analytics with Autodesk Solutions
 
Webinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDBWebinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDB
 
It Consulting & Services - Black Basil Technologies
It Consulting & Services  - Black Basil TechnologiesIt Consulting & Services  - Black Basil Technologies
It Consulting & Services - Black Basil Technologies
 
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
 
StreamCentral for the IT Professional
StreamCentral for the IT ProfessionalStreamCentral for the IT Professional
StreamCentral for the IT Professional
 

Más de Matt Stubbs

Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Matt Stubbs
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Matt Stubbs
 

Más de Matt Stubbs (20)

Blueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
Blueprint Series: Banking In The Cloud – Ultra-high Reliability ArchitecturesBlueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
Blueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
 
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
 
Blueprint Series: Expedia Partner Solutions, Data Platform
Blueprint Series: Expedia Partner Solutions, Data PlatformBlueprint Series: Expedia Partner Solutions, Data Platform
Blueprint Series: Expedia Partner Solutions, Data Platform
 
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
 
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
 
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCEBig Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
 
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQLBig Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
 
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTSBig Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
 
Big Data LDN 2018: AI VS. GDPR
Big Data LDN 2018: AI VS. GDPRBig Data LDN 2018: AI VS. GDPR
Big Data LDN 2018: AI VS. GDPR
 
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
 
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
 
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
 
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
 
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICSBig Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
 
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSEBig Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
 
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNINGBig Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
 
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
 
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
 
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATEBig Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
 

Último

Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
gajnagarg
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
gajnagarg
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
gajnagarg
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
gajnagarg
 

Último (20)

Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
 

Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workflow Tools

  • 1. How Big Data Insights become Easily Accessible with Workflow Tools
  • 2. Session Overview ➢ Introduction To Workflows and Big Data For Data Scientists/Data Citizen ➢ Examples Of Customers Benefiting From Using Workflows – Reduction In Cost, Speed To Deploy ➢ Pulling It All Together - Introduction To Deploying Models To External Systems ➢ Technical Overview Into Building Predictive Analytics Workflows For Big Data (Tibco Statistica)
  • 3. 3 Introduction To Workflows and Big Data For Data Scientists/Data Citizen
  • 4.
  • 5. Insight Action Making sense of the dataPlatform © Copyright 2000-2017 TIBCO Software Inc.
  • 9. Its all about empowering more people
  • 10. 10 Examples Of Customers Benefiting From Using Workflows In Cost, Speed To Deploy Reduction
  • 11. Big data analytics transforms the operating room CASE STUDY Company: University of Iowa (UIHC) | Industry: Healthcare | Country: USA | Web: www.uihealthcare.org UIHC surgeons needed to know the susceptibility of patients to infections in order to make critical treatment decisions in the operating room. Infection rates have major implications to overall patient health and cost savings. UIHC used Big Data and Analytics and transformed outcomes as different points on the patients care. Reduced surgical site infection occurrence by 58 percent Merged historical and live patient data to predict likelihood of infection Personalized care based on patients’ own characteristics Improved efficiency by enabling staff to run predictive models and access results with a mobile application or web browser BUSINESS CHALLENGE SOLUTION RESULTS © Copyright 2000-2017 TIBCO Software Inc. “Predictive analytics is allowing us to deal with the ever-increasing types of data that healthcare institutions need to deal with.” Dr. John Cromwell, MD Director of Gastrointestinal Surgery
  • 12. Bank speeds time to market with advanced analytics Business need To deliver timely and accurate credit decisions and other customer services in today’s 24/7 world, Danske Bank needed to be able to quickly build and deploy advanced analytical models. Benefits ● Slashed time to develop and deploy analytical models by 50 percent ● Improved decision-making with more advanced analytical models ● Delivered an easy-to-use, standardized toolbox that can quickly be customized to meet users’ needs ● Ensured fast ROI by deploying easily and integrating smoothly with existing systems Solution Enable customers to apply for products such as loans through Danske Banks portal. Generate scoring models to determine whether customers applications are accepted. “We have reduced the time we spend on models up to 50 percent with Statistica. Our development process is much leaner and smoother compared to what it was before.” Jens Christian Ipsen, First Vice President, Danske Bank
  • 13. 13 ➢ Pulling It All Together Introduction To Deploying Models To External Systems
  • 15. Machine Learning in TIBCO Statistica TIBCO StreamBase for real-time scoring and action TIBCO Statistica Deploy To External Application • Model built in TIBCO Statistica • Score model in TIBCO StreamBase on live data • Action: equipment intervention
  • 17. One stop shop for actionable insights DATA SCIENCE STREAMING ANALYTICSBI & ANALYTICS AI-driven visualization to gain insight to find actionable insights Create analytics that can predict the future based on history Provide analytics and take action on real time streaming data
  • 18. One stop shop for actionable insights DATA SCIENCE STREAMING ANALYTICSBI & ANALYTICS AI-driven visualization to gain insight to find actionable insights Create analytics that can predict the future based on history Provide analytics and take action on real time streaming data
  • 19. 19 Technical Overview Into Building Predictive Analytics Workflows For Big Data (Tibco Statistica)
  • 20. Workflows • 1000s of stats, machine and deep learning • Supervised learning - models, ensambles • Unsupervised learning - anomaly detection, clustering • 100s native validated step nodes, workflows Data Blending • Traditional - SQL sources, Flat Files • Big Data - HDFS in, out, data maps Models and Rules Management • Deployment Code generators (C/C#/PMML/Java(POJO,MapReduce)/Teradata/SAS) • Audit, validation, user and version control Collaboration • Scripted nodes (use/manage): R, Python, Scala, C# • Algorithmic marketplaces plugins Big Data • In-database analytics • H20, Spark Nodes • Deep Learning (CNTK) © Copyright 2000-2017 TIBCO Software Inc. Statistica - Data Science Workbench for Big Data Analytics
  • 21. © Copyright 2000-2017 TIBCO Software Inc. Simple workflow example - Predictive Modeling (Classic)
  • 22. Statistica Enterprise Server © Copyright 2000-2017 TIBCO Software Inc. TIBCO Statistica Platform Architecture for Big Data Model Monitoring & Process Control Statistica Big Data Analytics Monitoring Alerting Server Live Score Server Metadata Repository Document Management System Spark, H2O, In-DB, HDFS wrappers Change management & ComplianceAnalytics Modelling Deployment Data aggregation & preparation Web and API access Real time model scoring Enterprise Tools Governance Batch Jobs Access Roles Models / Rules Management
  • 23. © Copyright 2000-2017 TIBCO Software Inc. Change control, access management and model versioning
  • 24. Data Science Workbench • Workflows - Package and maintain your predictive model building process steps Data Blending • HDFS data in,out,feed H20 - Sparkling water • Sparkling water nodes and workflow templates Spark ML Nodes (Scala code) • Data • Feature Selection • Decision Trees • Regression • Classification • PCA …. plus workflow templates Deep Learning (CNTK) • Regression • Classification • Deployment © Copyright 2000-2017 TIBCO Software Inc. TIBCO Statistica for Big Data Analytic Pipelines
  • 25. © Copyright 2000-2017 TIBCO Software Inc. Big Data Analytics - In-Database Processing Dedicated In-Database Steps (nodes) • Descriptives, Correlactions • GLZ, Lasso Regression, .. Supported SQL Databases Support as of today for • Microsoft SQL Server • Oracle • Apache Hive • Teradata • MySQL (as of today, Statistica version 13.3) Why • Move compute to data • Reduce data travel, use resourcesDatabase SQL, ...In-Db Step Results, Metadata
  • 26. © Copyright 2000-2017 TIBCO Software Inc. Statistica Collective Intelligence - App Market Connectors Use models / code from marketplaces (*if good/trusted) Example : Azure ML Model Consumer Parameters of the Statistica Azure ML step: • Model API key • Connection • Webservice in/out datasets • Batch score in storage container (optional) Why • (Re)Use typical use cases models Marketing Campaigns, Churn Predictive Maintenance etc • Bring existing IP into platform and manage • Data Prep /merge, clean, transform) • Control / Schedule jobs for execution / Monitor performance Similar approach for other marketplaces and existing code (R, Python, C#)
  • 27. © Copyright 2000-2017 TIBCO Software Inc. Open Source Options, Scripting Any R package as a node • Integrate any R package as a process step • Augment/customize core capabilities Python scripts SCALA code Scripting in C#, VB Native Custom (open code) Nodes Shipped with many Examples
  • 28. © Copyright 2000-2017 TIBCO Software Inc. Deep Learning CNTK based deep learning NNs • Regression • Classification • Generic • Deployment H2O Deep Learning nodes BTW - if you are not looking for deep learning discovery of hidden intrinsic relationships in with an unsupervised NN … we have the “classical” Statistica Automated NNs
  • 29. © Copyright 2000-2017 TIBCO Software Inc. Statistica H2O nodes H2O easy to use “wrappers” • Example Workspaces provider • All nodes described in Statistica Help • See SparklingWaterBooklet.pdf from h2o-release.s3.amazonaws.com/h2o/
  • 30. © Copyright 2000-2017 TIBCO Software Inc. Statistica Spark (Scala) nodes
  • 31. © Copyright 2000-2017 TIBCO Software Inc. Statistica Scala nodes - Architecture Livy Server REST Server Managed Cluster Spark at least ver. 2.0.2 (e.g. Spark nodes + YARN) Statistica Desktop Analytics Workbench Statistica Enterprise Repository
  • 32. © Copyright 2000-2017 TIBCO Software Inc. Goals: • Predict yield for semiconductor manufacturing process • Detect potential quality issues early Problems: • Ultra-wide data: Thousands to Millions of variables • Handling billion(s) of cases • Mix of categorical and continuous predictors • Sparse data Solution: • Spark parallel processing in big data platform • Feature selection algorithm + Lasso regression • Hadoop cluster (100s of cores) • Statistica analytic workflows submit code to spark cluster • Spotfire dashboards used for visualization Performance: • Analysis Running time reduced to minutes Big Data – Practical Use case - Yield, Root Causes and Quality Issues detection - Complex Manufacturing Process
  • 33. © Copyright 2000-2017 TIBCO Software Inc. Summary Validated Big Data Options