SlideShare una empresa de Scribd logo
1 de 19
© 2018 IBM Corporation
AI Maturity Roadmap for Becoming a Data-
Driven Organization
David Solomon
IBM Executive Architect and Senior Cloud and
Cognitive Evangelist
April 19, 2018
About Me
Focus / Passion
• AI, Cognitive, Emerging Technology
• Analytics
• Data (Architecture, Modeling, Integration)
• Cloud Service Architecture
• Applying the above to real-world business problems
Education & Certification
• M.S. Software Engineering
• B.S, Physics
• Data Mgmt, AI, Cloud, Docker, DevOps, …
Proud Member of
the IBM WolfPack
David Solomon
Technical
Evangelist, IBM
dsdlsolomo
@dlsolomo
Team-wolfpack
What is AI? (my definition)
3© 2018 IBM Corporation
AI is the application of technology to re-produce and automate cognitive tasks
that are time-consuming and/or costly to perform. It is not a single technology
or system, but rather a collection of capabilities that are applied in specific
combinations to provide the automation of cognitive tasks for achieving a
specific business outcome.
• Examples Technologies- natural language understanding, visual recognition, machine learning,
and speech recognition
• Example Applications- insurance claims approvals, review medical imaging to narrow down
diagnosis and smarter logistics routing
• Effective AI requires a LOT of reliable data!
• Recent public examples (e.g., IBM Watson, Alexa, Siri) have put AI high in Business Leader
priorities
Tools & Infrastructure
• Need an environment
that enables a “fail
fast” approach
• Discrete tools present
barriers to productivity
Governance
• If the data isn’t secure,
self-service isn’t a
reality
• Challenge
understanding data
lineage and getting to
a system of truth
Skills
• Data Science skills are
in low supply and high
demand
• Nurturing new data
professionals is
challenging
Data
• Data resides in silos &
difficult to access
• Unstructured and
external data wasn’t
considered
4
Why are enterprises struggling to
capture the value of AI?
IBM Cloud / Watson and Cloud Platform / © 2018 IBM Corporation
The Essential Ingredients for Effective AI
5© 2018 IBM Corporation
Effective manage and
leverage both structured
and unstructured data
regardless of source or
location
Enablement of timely
business decisions
derived from accurate
and consumable insight
Effective application of
data science to improve
and drive business
outcomes
An organization’s readiness for AI is determined by
how effectively these ingredients are applied. How can
we assess this?
Get your Information Architecture in Order: Introducing the “AI
Ladder”
6© 2018 IBM Corporation
− AI should not be considered a stand-alone capability, but is a culmination of your
investments in other key disciplines, as shown in the IBM “AI Ladder” below.
− The entry-point for your AI initiative will depend on where your organization
currently sits on this ladder in the areas of data, analytics, and machine learning
”There is no AI without IA!”
7© 2018 IBM Corporation
• One of the biggest success factors in an AI initiative involves access to accurate, relevant, and timely data
• AI-driven conclusions are almost solely driven by the data, so if the data is not accurate, your business
could be adversely impacted
• The best way to address this is to ensure that your AI effort is supported by a solid Information Architecture
that supports the following disciplines,
• Data Management
• Data Integration and Governance
• Data Science and Analytics
• The degree to which you apply these disciplines determines where on the ladder you should start
Gain value
from your data,
without limits
Access your data
All sources and all types
Flexibility
Support all data
types, all workloads,
all consumption models
Machine Learning
Make better decisions,
provide smarter capabilities
Democratize access
Provide data-driven decisions
to everyone
Simplicity
A unified experience in
managing your data
landscape
Cloud journey
Support your data regardless
of location
Essential elements of a hybrid data management strategy
Use your data
Build a single source of truth to
drive a 360-degree view of your
data. Unleash insights and
deepen customer relationships.
9IBM Cloud / © 2018 IBM Corporation
Trust your data
Capture lineage, help ensure
quality of dynamic data and
stay on top of regulations.
Know your data
Discover, find, integrate,
classify and catalog all types
of data.
Essential elements of Data Integration and Governance
Essential elements of Analytics and Data Science
10
11
Introducing an AI Readiness Maturity Model
Insight
Hindered
Hindsight-
Driven
Data-
Driven
Insight-
Driven
AI-Driven
• Minimal data mgmt.
• Spreadsheets are
primary data tool
• Minimal standards
• Minimal Governance
• Centralized DBs for
critical data
• Some governance
• Siloed use of
unstructured data
• Data integration and
governance practice
• Organized use of
unstructured data
• Siloed Data Science
practices
• Data Science
practices in place
• Hybrid-data mgmt.
practice in place
• Leverage both cloud
and on-prem data
• Fully data-driven
business
• Access to all
required AI training
data
Data
Readiness
• Spreadsheet
analysis
• Desktop BI tools
• Minimal standards
• Soiled practices
• Focus on descriptive
analytics (What
Happened?)
• Standardized
reporting formats
• Diagnostic analytics
• Siloed use of
predictive analytics
• Siloed use of
Machine Learning
models
• Standard use of
Machine Learning
• Predictive analytics
• Siloed use of
prescriptive
analytics
• Prescriptive
analytics
• Fully insight-driven
business
Analytics
Readiness
Hindered
Business
Outcomes
Operational
Efficiency and
Cost Savings
Competitiveness Competitive
Advantage
Market Leader
• None
• Siloed
Experimentation
• Limited use for
siloed applications
• Initial production AI
applications
• Some alignment of
AI with business
strategy
• Standard AI practice
• Full alignment of AI
with business
strategy
AI
Capability
On February 14, 2011
made history
…and has grown to an entire portfolio of cognitive technologies
Retrieve and Rank
Language
• Conversation
• Document
Conversion
• Language
Translator
• Natural Language
Classifier
• Natural Language
Understanding
• Personality Insights
• Retrieve and Rank
• Tone Analyzer
Speech
• Speech to Text
• Text to Speech
Vision
• Visual Recognition
Data Insights
• Discovery
• Discovery News
• Watson Knowledge
Studio
Natural Language
Classifier
Tone Analyzer
Tools & Infrastructure
• Need an environment
that enables a “fail
fast” approach
• Discrete tools present
barriers to productivity
Governance
• If the data isn’t secure,
self-service isn’t a
reality
• Challenge
understanding data
lineage and getting to
a system of truth
Skills
• Data Science skills are
in low supply and high
demand
• Nurturing new data
professionals is
challenging
Data
• Data resides in silos &
difficult to access
• Unstructured and
external data wasn’t
considered
14
Why are enterprises struggling to
capture the value of AI?
How can these challenges be tackled in a timely manner?
Watson Studio
Supporting the end-to-end AI workflow
Prepare Data
for Analysis
Build and Train
ML/DL Models
Deploy Models
Monitor, Analyze
and Manage
Search and Find
Relevant Data
Connect &
Access Data
• Connect and
discover content
from multiple data
sources in the
cloud or on
premises.
• Bring structured
and unstructured
data to one toolkit.
• Clean and prepare
your data with Data
Refinery, a tool to
create data
preparation
pipelines visually.
• Use popular open
source libraries to
prepare
unstructured data.
• Democratize the
creation of ML and DL
models. Design your
AI models
programmatically or
visually with the most
popular open source
and IBM ML/DL
frameworks
• Leverage transfer
learning on pre-
trained models using
Watson tools to
adapt to your business
domain.
• Train at scale on
GPUs and
distributed compute
• Deploy your models
easily and have
them scale
automatically for
online, batch or
streaming use
cases
• Monitor the
performance of the
models in
production and
trigger automatic
retraining and
redeployment of
models.
• Find data
(structured,
unstructured) and
AI assets (e.g.,
ML/DL models,
notebooks, Watson
Data Kits) in the
Knowledge
Catalog
15
Her Job:
Builds AI application that meet the
requirements of the business.
What she does:
• Starts PoCs which includes
gathering content, dialog
building and model training
• Focus is on app building for the
team or company to use. Will
handle ML Ops as needed
Sometimes known as:
Front-end, back-end, full stack,
mobile or low-code developer
Tanya
Domain Expert
Her Job:
To transfer knowledge to Watson for
a successful user experience.
What she does:
• Range of domain knowledge and
uses that to teach Watson and
develop a custom models
• As Tanya gains more experience
she optimizes her knowledge to
teach Watson to design better
end-user experiences.
Sometimes known as:
Subject matter expert, content
strategist.
His Job:
Transform data into knowledge for
solving business problems.
What he does:
•Runs experiments to build custom
models that solve business problems.
•Use techniques such as Machine
Learning or Deep Learning and
works with Tanya to validate success
of trained models.
Watson Studio
Built for AI teams – enabling team productivity and collaboration
Sometimes known as:
ML/DL engineer, Modeler, Data Miner
Ed
Data Engineer
His Job:
Architects how data is organized
and ensures operability
What he does:
• Builds data infrastructure and ETL
pipelines. Works with Spark,
Hadoop, and HDFS.
• Works with data scientist to
transform research models into
production quality systems.
Sometimes known as:
Data infrastructure engineer
Mike
Data Scientist
Deb
The Developer
16
Watson Studio
Comprehensive set of tools for the end-to-end AI workflow
Model Lifecycle Management
Machine Learning Runtimes Deep Learning Runtimes
Authoring Tools
Cloud Infrastructure as a Service
Watson
API
Tools
Model
Builder
• Most popular open source frameworks
• IBM best-in-class frameworks
• Create, collaborate, deploy, and monitor
• Best of breed open source & IBM tools
• Code (R, Python or Scala) and no-code/visual
modeling tools
• Fully managed service
• Container-based resource management
• Elastic pay as you go CPU/GPU power
Data
Refinery
17
Watson Studio
Differentiating Capabilities
• Data Scientists, Subject Matter experts,
Business Analysts & Developers all in one
environment to accelerate innovation,
collaboration and productivity
• Built-in learning to get started or go the
distance with advanced tutorials
Integrated Collaboration Environment
• Best in-breed open source and IBM tools that
support the end-to-end AI lifecycle
• Choice of code or no-code tools to build and
train your own ML/DL models or easily train
and customize pre-trained Watson APIs
Choice of Tools for the full AI lifecycle
• Use Watson smarts and recommendations
for the best algorithms to use given your
data, OR
• Use the rich capabilities and controls to fine
tune your models
Support for all levels of expertise
• Monitor batch training experiments then
compare cross-model performance without
worrying about log transfers and scripts to
visualize results.
• You focus on designing your neural networks.
We’ll manage and track your assets.
Experiment centric DL workflow
• Deploy models into production then monitor
them to evaluate performance.
• Capture new data for continuous learning and
retrain models so they continually adapt to
changing conditions.
Model lifecycle & management
• Intelligent discovery of data and AI assets
that enables reuse & improves productivity
• Seamlessly integrated for productive use with
Machine Learning and Data science
• Powerful governance tools to control and
protect access to data
Integrated with Knowledge Catalog
18
© 2018 IBM Corporation
19

Más contenido relacionado

La actualidad más candente

Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in Production
DataWorks Summit
 
Solution Architecture And (Robotic) Process Automation Solutions
Solution Architecture And (Robotic) Process Automation SolutionsSolution Architecture And (Robotic) Process Automation Solutions
Solution Architecture And (Robotic) Process Automation Solutions
Alan McSweeney
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 

La actualidad más candente (20)

A Pragmatic AI Maturity Model
A Pragmatic AI Maturity ModelA Pragmatic AI Maturity Model
A Pragmatic AI Maturity Model
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Introduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AIIntroduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AI
 
Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
 
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYGENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
 
Becoming a Data-Driven Organization - Aligning Business & Data Strategy
Becoming a Data-Driven Organization - Aligning Business & Data StrategyBecoming a Data-Driven Organization - Aligning Business & Data Strategy
Becoming a Data-Driven Organization - Aligning Business & Data Strategy
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
 
Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in Production
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Rethinking Site Reliability Engineering for ITSM - SDI virtual event "New Way...
Rethinking Site Reliability Engineering for ITSM - SDI virtual event "New Way...Rethinking Site Reliability Engineering for ITSM - SDI virtual event "New Way...
Rethinking Site Reliability Engineering for ITSM - SDI virtual event "New Way...
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
Solution Architecture And (Robotic) Process Automation Solutions
Solution Architecture And (Robotic) Process Automation SolutionsSolution Architecture And (Robotic) Process Automation Solutions
Solution Architecture And (Robotic) Process Automation Solutions
 
Estimating the Total Costs of Your Cloud Analytics Platform 
Estimating the Total Costs of Your Cloud Analytics Platform Estimating the Total Costs of Your Cloud Analytics Platform 
Estimating the Total Costs of Your Cloud Analytics Platform 
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
 
Data-Ed Slides: Best Practices in Data Stewardship (Technical)
Data-Ed Slides: Best Practices in Data Stewardship (Technical)Data-Ed Slides: Best Practices in Data Stewardship (Technical)
Data-Ed Slides: Best Practices in Data Stewardship (Technical)
 
8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy
 

Similar a An AI Maturity Roadmap for Becoming a Data-Driven Organization

Getting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarGetting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide Webinar
Concept Searching, Inc
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
CCG
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
tsigitnist02
 

Similar a An AI Maturity Roadmap for Becoming a Data-Driven Organization (20)

06 summary
06 summary06 summary
06 summary
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...
 
Getting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarGetting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide Webinar
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
 
How to classify documents automatically using NLP
How to classify documents automatically using NLPHow to classify documents automatically using NLP
How to classify documents automatically using NLP
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
 
IBM i & Data Science in the AI era.
IBM i & Data Science in the AI era.  IBM i & Data Science in the AI era.
IBM i & Data Science in the AI era.
 
A journey to faster, repeatable data commercialization
A journey to faster, repeatable data commercializationA journey to faster, repeatable data commercialization
A journey to faster, repeatable data commercialization
 
Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI Applications
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBM
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBM
 
Machine Learning Everywhere
Machine Learning EverywhereMachine Learning Everywhere
Machine Learning Everywhere
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
 
Data Science at Speed. At Scale.
Data Science at Speed. At Scale.Data Science at Speed. At Scale.
Data Science at Speed. At Scale.
 
Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the Cloud
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Just ask Watson Seminar
Just ask Watson SeminarJust ask Watson Seminar
Just ask Watson Seminar
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
 

Último

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Último (20)

Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

An AI Maturity Roadmap for Becoming a Data-Driven Organization

  • 1. © 2018 IBM Corporation AI Maturity Roadmap for Becoming a Data- Driven Organization David Solomon IBM Executive Architect and Senior Cloud and Cognitive Evangelist April 19, 2018
  • 2. About Me Focus / Passion • AI, Cognitive, Emerging Technology • Analytics • Data (Architecture, Modeling, Integration) • Cloud Service Architecture • Applying the above to real-world business problems Education & Certification • M.S. Software Engineering • B.S, Physics • Data Mgmt, AI, Cloud, Docker, DevOps, … Proud Member of the IBM WolfPack David Solomon Technical Evangelist, IBM dsdlsolomo @dlsolomo Team-wolfpack
  • 3. What is AI? (my definition) 3© 2018 IBM Corporation AI is the application of technology to re-produce and automate cognitive tasks that are time-consuming and/or costly to perform. It is not a single technology or system, but rather a collection of capabilities that are applied in specific combinations to provide the automation of cognitive tasks for achieving a specific business outcome. • Examples Technologies- natural language understanding, visual recognition, machine learning, and speech recognition • Example Applications- insurance claims approvals, review medical imaging to narrow down diagnosis and smarter logistics routing • Effective AI requires a LOT of reliable data! • Recent public examples (e.g., IBM Watson, Alexa, Siri) have put AI high in Business Leader priorities
  • 4. Tools & Infrastructure • Need an environment that enables a “fail fast” approach • Discrete tools present barriers to productivity Governance • If the data isn’t secure, self-service isn’t a reality • Challenge understanding data lineage and getting to a system of truth Skills • Data Science skills are in low supply and high demand • Nurturing new data professionals is challenging Data • Data resides in silos & difficult to access • Unstructured and external data wasn’t considered 4 Why are enterprises struggling to capture the value of AI? IBM Cloud / Watson and Cloud Platform / © 2018 IBM Corporation
  • 5. The Essential Ingredients for Effective AI 5© 2018 IBM Corporation Effective manage and leverage both structured and unstructured data regardless of source or location Enablement of timely business decisions derived from accurate and consumable insight Effective application of data science to improve and drive business outcomes An organization’s readiness for AI is determined by how effectively these ingredients are applied. How can we assess this?
  • 6. Get your Information Architecture in Order: Introducing the “AI Ladder” 6© 2018 IBM Corporation − AI should not be considered a stand-alone capability, but is a culmination of your investments in other key disciplines, as shown in the IBM “AI Ladder” below. − The entry-point for your AI initiative will depend on where your organization currently sits on this ladder in the areas of data, analytics, and machine learning
  • 7. ”There is no AI without IA!” 7© 2018 IBM Corporation • One of the biggest success factors in an AI initiative involves access to accurate, relevant, and timely data • AI-driven conclusions are almost solely driven by the data, so if the data is not accurate, your business could be adversely impacted • The best way to address this is to ensure that your AI effort is supported by a solid Information Architecture that supports the following disciplines, • Data Management • Data Integration and Governance • Data Science and Analytics • The degree to which you apply these disciplines determines where on the ladder you should start
  • 8. Gain value from your data, without limits Access your data All sources and all types Flexibility Support all data types, all workloads, all consumption models Machine Learning Make better decisions, provide smarter capabilities Democratize access Provide data-driven decisions to everyone Simplicity A unified experience in managing your data landscape Cloud journey Support your data regardless of location Essential elements of a hybrid data management strategy
  • 9. Use your data Build a single source of truth to drive a 360-degree view of your data. Unleash insights and deepen customer relationships. 9IBM Cloud / © 2018 IBM Corporation Trust your data Capture lineage, help ensure quality of dynamic data and stay on top of regulations. Know your data Discover, find, integrate, classify and catalog all types of data. Essential elements of Data Integration and Governance
  • 10. Essential elements of Analytics and Data Science 10
  • 11. 11 Introducing an AI Readiness Maturity Model Insight Hindered Hindsight- Driven Data- Driven Insight- Driven AI-Driven • Minimal data mgmt. • Spreadsheets are primary data tool • Minimal standards • Minimal Governance • Centralized DBs for critical data • Some governance • Siloed use of unstructured data • Data integration and governance practice • Organized use of unstructured data • Siloed Data Science practices • Data Science practices in place • Hybrid-data mgmt. practice in place • Leverage both cloud and on-prem data • Fully data-driven business • Access to all required AI training data Data Readiness • Spreadsheet analysis • Desktop BI tools • Minimal standards • Soiled practices • Focus on descriptive analytics (What Happened?) • Standardized reporting formats • Diagnostic analytics • Siloed use of predictive analytics • Siloed use of Machine Learning models • Standard use of Machine Learning • Predictive analytics • Siloed use of prescriptive analytics • Prescriptive analytics • Fully insight-driven business Analytics Readiness Hindered Business Outcomes Operational Efficiency and Cost Savings Competitiveness Competitive Advantage Market Leader • None • Siloed Experimentation • Limited use for siloed applications • Initial production AI applications • Some alignment of AI with business strategy • Standard AI practice • Full alignment of AI with business strategy AI Capability
  • 12. On February 14, 2011 made history
  • 13. …and has grown to an entire portfolio of cognitive technologies Retrieve and Rank Language • Conversation • Document Conversion • Language Translator • Natural Language Classifier • Natural Language Understanding • Personality Insights • Retrieve and Rank • Tone Analyzer Speech • Speech to Text • Text to Speech Vision • Visual Recognition Data Insights • Discovery • Discovery News • Watson Knowledge Studio Natural Language Classifier Tone Analyzer
  • 14. Tools & Infrastructure • Need an environment that enables a “fail fast” approach • Discrete tools present barriers to productivity Governance • If the data isn’t secure, self-service isn’t a reality • Challenge understanding data lineage and getting to a system of truth Skills • Data Science skills are in low supply and high demand • Nurturing new data professionals is challenging Data • Data resides in silos & difficult to access • Unstructured and external data wasn’t considered 14 Why are enterprises struggling to capture the value of AI? How can these challenges be tackled in a timely manner?
  • 15. Watson Studio Supporting the end-to-end AI workflow Prepare Data for Analysis Build and Train ML/DL Models Deploy Models Monitor, Analyze and Manage Search and Find Relevant Data Connect & Access Data • Connect and discover content from multiple data sources in the cloud or on premises. • Bring structured and unstructured data to one toolkit. • Clean and prepare your data with Data Refinery, a tool to create data preparation pipelines visually. • Use popular open source libraries to prepare unstructured data. • Democratize the creation of ML and DL models. Design your AI models programmatically or visually with the most popular open source and IBM ML/DL frameworks • Leverage transfer learning on pre- trained models using Watson tools to adapt to your business domain. • Train at scale on GPUs and distributed compute • Deploy your models easily and have them scale automatically for online, batch or streaming use cases • Monitor the performance of the models in production and trigger automatic retraining and redeployment of models. • Find data (structured, unstructured) and AI assets (e.g., ML/DL models, notebooks, Watson Data Kits) in the Knowledge Catalog 15
  • 16. Her Job: Builds AI application that meet the requirements of the business. What she does: • Starts PoCs which includes gathering content, dialog building and model training • Focus is on app building for the team or company to use. Will handle ML Ops as needed Sometimes known as: Front-end, back-end, full stack, mobile or low-code developer Tanya Domain Expert Her Job: To transfer knowledge to Watson for a successful user experience. What she does: • Range of domain knowledge and uses that to teach Watson and develop a custom models • As Tanya gains more experience she optimizes her knowledge to teach Watson to design better end-user experiences. Sometimes known as: Subject matter expert, content strategist. His Job: Transform data into knowledge for solving business problems. What he does: •Runs experiments to build custom models that solve business problems. •Use techniques such as Machine Learning or Deep Learning and works with Tanya to validate success of trained models. Watson Studio Built for AI teams – enabling team productivity and collaboration Sometimes known as: ML/DL engineer, Modeler, Data Miner Ed Data Engineer His Job: Architects how data is organized and ensures operability What he does: • Builds data infrastructure and ETL pipelines. Works with Spark, Hadoop, and HDFS. • Works with data scientist to transform research models into production quality systems. Sometimes known as: Data infrastructure engineer Mike Data Scientist Deb The Developer 16
  • 17. Watson Studio Comprehensive set of tools for the end-to-end AI workflow Model Lifecycle Management Machine Learning Runtimes Deep Learning Runtimes Authoring Tools Cloud Infrastructure as a Service Watson API Tools Model Builder • Most popular open source frameworks • IBM best-in-class frameworks • Create, collaborate, deploy, and monitor • Best of breed open source & IBM tools • Code (R, Python or Scala) and no-code/visual modeling tools • Fully managed service • Container-based resource management • Elastic pay as you go CPU/GPU power Data Refinery 17
  • 18. Watson Studio Differentiating Capabilities • Data Scientists, Subject Matter experts, Business Analysts & Developers all in one environment to accelerate innovation, collaboration and productivity • Built-in learning to get started or go the distance with advanced tutorials Integrated Collaboration Environment • Best in-breed open source and IBM tools that support the end-to-end AI lifecycle • Choice of code or no-code tools to build and train your own ML/DL models or easily train and customize pre-trained Watson APIs Choice of Tools for the full AI lifecycle • Use Watson smarts and recommendations for the best algorithms to use given your data, OR • Use the rich capabilities and controls to fine tune your models Support for all levels of expertise • Monitor batch training experiments then compare cross-model performance without worrying about log transfers and scripts to visualize results. • You focus on designing your neural networks. We’ll manage and track your assets. Experiment centric DL workflow • Deploy models into production then monitor them to evaluate performance. • Capture new data for continuous learning and retrain models so they continually adapt to changing conditions. Model lifecycle & management • Intelligent discovery of data and AI assets that enables reuse & improves productivity • Seamlessly integrated for productive use with Machine Learning and Data science • Powerful governance tools to control and protect access to data Integrated with Knowledge Catalog 18
  • 19. © 2018 IBM Corporation 19

Notas del editor

  1. For too long – data has been held captive within our systems of record. Isolated by the rigidity of platform/application/workload choices, segregated by business line, business function, and data type or initial usage. The result is splintered views of segmented data that’s difficult to access on the whole, and impossible to attempt to gain true analytical insight from….. And even this only speaks to the snapshot today and current models. The challenges are compounded as businesses look to change, grow, iterate practices, innovate, or disrupt markets. Attempts at data science, machine learning, and deep learning are made moot by the fact that insights are only as good as the access to supporting data – which again is too fragmented to provide full value. We believe, that in order to change this paradigm, a hybrid data management strategy should contain the elements here: Access to all data regardless of source or type The flexibility to support changing workloads and consumption cases Possess intelligent analytics such as machine learning AT the data source And… Provide access to insights across the business, its functions, and to all users for better decision making
  2. # # # You need three essential elements on your journey to digital transformation. You need to know your data. Typically this means building a 360-degree view of your focus area—for example, a 360-degree view of your customer. You need to gather your internal data and may also need to include external data from social media, click stream, census or other relevant sources. This data must also be accessible by all users and/or applications that need it. This could mean making data globally accessible or running applications in the cloud. Consider that an application may need to access data from multiple data sources, so providing a common access layer is important to reduce application coding. 2. You need to be able to trust your data. Well-governed data provides confidence in not just the data itself, but the outcomes from analytics, reports and other tasks based on that data. There are two key points to data governance: First, you must have the ability to ensure the data is secure and adheres to compliance regulations. And second, you must have the ability to govern the data so your users can find and access information themselves, at the exact time they need it. 3. You must be able to use your data as a source for insights and intelligence. This means having not only the right skills and tools in place to surface insights, but also the right technology to learn from the data and improve accuracy each time that data is analyzed.
  3. Three years ago Watson made it’s debut on the US Quiz show, Jeopardy, in a very public proof point of radical new technology. Jeopardy was the result of an IBM Grand Challenge – putting top scientists to work on a seemingly impossible task. IBM undertakes Grand Challenges every decade or so. The last grand challenge was “Deep Blue” in 1997 - a chess-playing computer that won the second six-game match against world champion Garry Kasparov by two wins to one with three draws. Whether you attended one of the many IBM watch parties, watched the show at home, viewed it on YouTube later, or just read the newspapers, you witnessed history. Watson bested the 2 top champions, including Ken Jennings, who won 74 games and over $3M – the longest winning streak in J history. Not only did Watson win, but in doing so it ushered in a whole new era of computing. Additional Background: What fascinated the IBM researchers was how Jeopardy was the ultimate test of IT capabilities because it relied on many human cognitive abilities traditionally seen beyond the capability of computers, such as: The ability to discern double meanings of words, puns, rhymes, and inferred hints. Extremely rapid responses (sifting through 200 million pages of information - in the span of seconds) The ability to process vast amounts of information to make complex and subtle logical connections A team of 15 IBM researchers working in collaboration with a pool of top universities as a “Deep QA” project. For the Watson team, replicating the human capabilities was an enormous challenge, moving beyond keyword searches and queries of structured data to asking questions and accessing and assessing a vast amount of unstructured data to find the best answer. But IBM that knew the solution to this challenge had the potential to change the way businesses use information and make decisions.
  4. APIs in the cloud. FREE to play with today.
  5. 19