SlideShare una empresa de Scribd logo
1 de 30
New Analytical Architectures
          Why Classic Data Warehousing Approaches
                         Miss the Mark with Big Data
                                       March 21, 2013




                Casey Kiernan • casey.l.kiernan@gmail.com
                         Blog • www.the-data-platform.com
“The Future is Data”
“Hadoopis the kernel of a new Distributed Data OS”

                                     Doug Cutting
Data has Changed
 > Analytics has Changed

   Transactional
     > Trailing Indicators
                  Communities
                    > Reach/Influence
                             Personal
                               > Interactive
   Can the Data Warehouse Architecture adapt?
“Data” is the Platform




                         The World as I See it
Wink Eller




New Data
Clutch Analytics
My Mountain Bike as a Data Platform
                            Data Collection
                            Heart Rate


                                                           Data Collection
                                                           Altitude
   Data Collection                                         Temperature
   Speed / Trip Miles                                      Time                  Guidance

                                                                               Performance
                                                                               Rate of Climb
                                                                               Calories Burned
                                                                               Miles Obtained
                                                                               Total Climbed
                                                                               Elapsed Time

                                                                               Current,
                                                                               Average,
                                                                               Max Values

                             Data Collection
                             Cadence / RPM

             Data Architecture - on a Local Wireless Network (ANT+ Protocol)
“Personal” Ride Analytics




                            …is this a Data Warehouse?
New Data Behaviors (individual actions)   > Content > Time




                Behaviors




                              Content
New Data More is Better…


                                     Meaningful
     Guidance




                           Massive
                    Data

                                                  9
“Business”Analytics - Classic “DW”



    BUSINESS INTELLIGENCE
   OLAP / DATA WAREHOUSE
     OLTP / TRANSACTIONS
             DATA.
                                     Answers the question:
                      What are our most profitable Products?
Classic “Business” Analytics
Good for Reporting, Forecasting




            What did Happen?                    What will Happen?

            Operational Reporting              Tactical Trending    Strategic

                      Months      WeeksWeeks     Months     Years




                                                  Descriptive/Trending Analytics11
New“Personal”Analytics


           SELF-SERVICE
            GUIDANCE
           BEHAVIOURS
          DATA.                   Answers the question:
                         Show me a good movie to watch!
“Personal” Analytics
“Right Now” is a very important time-frame!


                               What is Happening
                                 RIGHT NOW!



            What did Happen?                   What will Happen?

            Operational Reporting             Tactical Trending   Strategic

                      Months    WeeksWeeks      Months    Years




                                              Predictive/Prescriptive Analytics
                                                                              13
14
15
“Business”Analytical Architecture
 Classic “DW” Data Flow - Uni-Directional, Latent,…




   Ordering App
                                Data
                    Staging                   OLAP / Reports      Business
   Financial App              Warehouse
                              OLTP to OLAP   Facts/Dimensions     Analyst
                                Mapping
                                             Business Metrics,
   Master Data                  Facts &      KPI, YTD Reporting
                              Dimensions




                              What are our most Profitable Products?
                                                                             16
“Personal” Analytical Architecture
“New” Data Flow - Iterative, Specialized, Extensible, plug & play Analytics, near real-time
[Some components are open-source]




                                    Application / UX
                                                                   Analytical Capabilities
                                                                   Scoring/Ranking,
                                                                   Recommendations,
                                                                   Natural Language
                                                                   Processing, Relevancy,
                                                                   Classification, Optimization,
                             Data                 Analytics        Collaborative Filtering,
                                                                   Personalization,
                                                                   Digital Attribution,…
                                        Data
                                       Analysts



                                             What movie should I watch tonight?                    17
“Personal Analytics” Data Architecture
 “New” Data Flow – Detailed View of Components


End-User Experience
    Browser, Tablet,                   Self-Service Application
          Mobile,…

                    Personalization,                              Personalized
                  Preferences, State                              Recommendations

                           App Persistence           Published Analytics
Persistence/Analytics     “State” Persistence        “Read” Performance
                                                                           Analytics Engines
                                                                              Pluggable
       Social Signals                 Mass Data Storage
      RSS/Facebook/…            Behaviors / “Write” Performance
                                                                              Data Scientists




                                                                                                18
Let’s get personal…

       SALLY LIKES TACOS




      HOW DO WE MODEL THIS DATA?
Classic “DW” Data Model
Modeling Social Data
“Triples” - Directed (Weighted) Acyclic Graph


                        Reach and Influence
               OBJECT       PREDICATE (Score)      SUBJECT
                SALLY          LIKES (143)          TACOS
                MARY           LIKES (200)          TACOS
           THE_TACO_SHOP      MENU_ITEM             TACOS
                SALLY          LIKES (125)      THE_TACO_SHOP
                SALLY             CITY           VENICE BEACH
           THE_TACO_SHOP          CITY           VENICE BEACH
                SALLY         FRIEND (187)          MARY




                                                 Collaborative Filtering
Reach and Influence
Analyzing Relationships
How important is Social?




                                          Shows you who is actively
                   Install ghostery.com   watching you surf the web!
                                          Lots of people!!!
Signals – The Core of New Data
Mixture of Proprietary and Public Data




                      Social
                     Personal
                     Content
                      Time
The New “Analytical Application” Architecture
“New” Data Flow – Specialized Technology Choices


 End-User Experience
     Browser, Tablet,              Self-Service Application
           Mobile,…
                             Personalization,        Personalized
                             Preferences, State      Recommendations

                           App Persistence         Published Analytics
 Persistence/Analytics    Cassandra, Riak,…              Hbase
 Data-Center or Cloud
                                                                            Analytics
                                                                         R, Mahout, Pig
                                      Mass Data Storage
                                          Hadoop




                                           Specialization of Data Technologies            26
Servicing Multiple Analytical Systems
Using Shared Analytical Mas- Storage


      Self-Service Application A        Self-Service Application B


                      Published                         Published
     Persistence                       Persistence
                      Analytics                         Analytics
        Riak                           Cassandra
                       HBase                             MySQL



                                                                          Data Scientists
                                                                       Analytics Engine
                                                                          Pluggable
                          Mass Data Storage                           Analytics Engine
                     Behaviors / “Write”Performance                      Pluggable
                                                                     Analytics Engine
                             Hadoop / AWS
                                                                       Pluggable



                                                                                          p.
                                                                                          27
Integrating the Architectures
“Personal” Analytics Stack + Classic “DW” Stack



                              Only Financial Events ($$$) cross the threshold
          App                 (and are recorded into) the Data Warehouse



                            Staging
           App
                                        Data Warehouse             OLAP /       Business
                                        OLTP to OLAP Mapping       Reports      Analyst


          App

                       “Local” Events stay Local
                       (they are analyzed locally)



                     Not all DATA Belongs in the Data Warehouse!                           28
Classic DW Vs. the New Analytics
The Shift from “Business” Analytics to “Personal” Analytics


                         Classic DW                New Analytics
Scope                    Enterprise                Application
Analytics                Trailing:OLAP             Predictive: Machine Learning
                                                   Sentiment Analysis, Recommendations,
                                                   Personalization,Natural Language
                                                   Processing, Classification, Clustering,
                                                   Optimization, Collaborative Filtering,
                                                   Digital Attribution,…
Actionable?              Loosely Coupled           Tightly Coupled
                                                   Analytics Embedded in Application
Data Structures          Facts/Dimensions          Semantic Data,Graph / Triples,
                         (Requires a DW)           Observations, Direct Signals
Knowledge Expert         Business Analyst          Data Scientist
Technology Stack         Vendor Driven ($$$)       Open-Source
Architecture             Scale-Up                  Scale-Out (or in the Cloud)
New Signals + New Analytics = NewScenarios



         Data                                        New
                                                  Analytics
        Signals               New             Recommendations,
         Social            Scenarios          Natural Language
        Location            Customer             Processing,
                         Engagement,             Relevancy,
       Personal
                      Customer Loyalty /        Classification,
       Behaviors     Attrition / Retention,     Optimization,
      Transactions   Fraud, Risk Analysis,      Collaborative
        Content        Intent, Customer           Filtering,
                        Personalization             Digital
         Time
                                                Attribution,…
Thank You!


              casey.l.kiernan@gmail.com
        blog: www.the-data-platform.com

Más contenido relacionado

La actualidad más candente

Analytical Revolution
Analytical RevolutionAnalytical Revolution
Analytical Revolution
NedODoherty
 
Linalis UK introduction
Linalis UK introductionLinalis UK introduction
Linalis UK introduction
Steve Adams
 

La actualidad más candente (16)

Using Big Data to create a data drive organization
Using Big Data to create a data drive organizationUsing Big Data to create a data drive organization
Using Big Data to create a data drive organization
 
No Time-Outs: How to Empower Round-the-Clock Analytics
No Time-Outs: How to Empower Round-the-Clock AnalyticsNo Time-Outs: How to Empower Round-the-Clock Analytics
No Time-Outs: How to Empower Round-the-Clock Analytics
 
Razorfish Multi-Channel Marketing: Better Customer Segmentation and Targeting
Razorfish Multi-Channel Marketing: Better Customer Segmentation and TargetingRazorfish Multi-Channel Marketing: Better Customer Segmentation and Targeting
Razorfish Multi-Channel Marketing: Better Customer Segmentation and Targeting
 
Introducing Splunk – The Big Data Engine
Introducing Splunk – The Big Data EngineIntroducing Splunk – The Big Data Engine
Introducing Splunk – The Big Data Engine
 
Retail Location Intelligence Predicting and positioning with location analytics
Retail Location Intelligence  Predicting and positioning with location analyticsRetail Location Intelligence  Predicting and positioning with location analytics
Retail Location Intelligence Predicting and positioning with location analytics
 
Future of Analytics is here
Future of Analytics is hereFuture of Analytics is here
Future of Analytics is here
 
Cognos Presentation Gartner BI
Cognos Presentation Gartner BICognos Presentation Gartner BI
Cognos Presentation Gartner BI
 
Break Through the Traditional Advertisement Services with Big Data and Apache...
Break Through the Traditional Advertisement Services with Big Data and Apache...Break Through the Traditional Advertisement Services with Big Data and Apache...
Break Through the Traditional Advertisement Services with Big Data and Apache...
 
Analytical Revolution
Analytical RevolutionAnalytical Revolution
Analytical Revolution
 
SAS Modernization architectures - Big Data Analytics
SAS Modernization architectures - Big Data AnalyticsSAS Modernization architectures - Big Data Analytics
SAS Modernization architectures - Big Data Analytics
 
Introducing the SAP high-performance analytic appliance (SAP HANA)
Introducing the SAP high-performance analytic appliance (SAP HANA)Introducing the SAP high-performance analytic appliance (SAP HANA)
Introducing the SAP high-performance analytic appliance (SAP HANA)
 
Mobile Analytics
Mobile AnalyticsMobile Analytics
Mobile Analytics
 
Linalis UK introduction
Linalis UK introductionLinalis UK introduction
Linalis UK introduction
 
Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...
Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...
Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...
 
Brotenlevinezhou friday
Brotenlevinezhou fridayBrotenlevinezhou friday
Brotenlevinezhou friday
 
Enterprise Location Intelligence
Enterprise Location IntelligenceEnterprise Location Intelligence
Enterprise Location Intelligence
 

Similar a New Analytical Architectures for Big Data

Streaming Hadoop for Enterprise Adoption
Streaming Hadoop for Enterprise AdoptionStreaming Hadoop for Enterprise Adoption
Streaming Hadoop for Enterprise Adoption
DATAVERSITY
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform Architecture
Odinot Stanislas
 
Big Data Needs Big Analytics
Big Data Needs Big AnalyticsBig Data Needs Big Analytics
Big Data Needs Big Analytics
Deepak Ramanathan
 
Big dataforcf os1_23_12_final
Big dataforcf os1_23_12_finalBig dataforcf os1_23_12_final
Big dataforcf os1_23_12_final
BurrPilgerMayer
 

Similar a New Analytical Architectures for Big Data (20)

2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics 2011 - TDWI Big Data Forum - The New Analytics
2011 - TDWI Big Data Forum - The New Analytics
 
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
 
Streaming Hadoop for Enterprise Adoption
Streaming Hadoop for Enterprise AdoptionStreaming Hadoop for Enterprise Adoption
Streaming Hadoop for Enterprise Adoption
 
Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform Architecture
 
IBM Cognos - Vad handlar egentligen prediktiv analys om?
IBM Cognos - Vad handlar egentligen prediktiv analys om?IBM Cognos - Vad handlar egentligen prediktiv analys om?
IBM Cognos - Vad handlar egentligen prediktiv analys om?
 
Cetas Analytics as a Service for Predictive Analytics
Cetas Analytics as a Service for Predictive AnalyticsCetas Analytics as a Service for Predictive Analytics
Cetas Analytics as a Service for Predictive Analytics
 
Cetas Predictive Analytics Prezo
Cetas Predictive Analytics PrezoCetas Predictive Analytics Prezo
Cetas Predictive Analytics Prezo
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicago
 
Big Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of SybaseBig Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
 
Farklı Ortamlarda Büyük Veri Kavramı -Big Data by Sybase
Farklı Ortamlarda Büyük Veri Kavramı -Big Data by Sybase Farklı Ortamlarda Büyük Veri Kavramı -Big Data by Sybase
Farklı Ortamlarda Büyük Veri Kavramı -Big Data by Sybase
 
Big Data Needs Big Analytics
Big Data Needs Big AnalyticsBig Data Needs Big Analytics
Big Data Needs Big Analytics
 
The New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front LinesThe New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front Lines
 
Big dataforcf os1_23_12_final
Big dataforcf os1_23_12_finalBig dataforcf os1_23_12_final
Big dataforcf os1_23_12_final
 
Information Management: Answering Today’s Enterprise Challenge
Information Management: Answering Today’s Enterprise ChallengeInformation Management: Answering Today’s Enterprise Challenge
Information Management: Answering Today’s Enterprise Challenge
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
Don't be Hadooped when looking for Big Data ROI
Don't be Hadooped when looking for Big Data ROIDon't be Hadooped when looking for Big Data ROI
Don't be Hadooped when looking for Big Data ROI
 
SAS Big Data Forum - Transforming Big Data into Corporate Gold
SAS Big Data Forum - Transforming Big Data into Corporate GoldSAS Big Data Forum - Transforming Big Data into Corporate Gold
SAS Big Data Forum - Transforming Big Data into Corporate Gold
 
Big data meets big analytics
Big data meets big analyticsBig data meets big analytics
Big data meets big analytics
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 

New Analytical Architectures for Big Data

  • 1. New Analytical Architectures Why Classic Data Warehousing Approaches Miss the Mark with Big Data March 21, 2013 Casey Kiernan • casey.l.kiernan@gmail.com Blog • www.the-data-platform.com
  • 2. “The Future is Data” “Hadoopis the kernel of a new Distributed Data OS” Doug Cutting
  • 3. Data has Changed > Analytics has Changed Transactional > Trailing Indicators Communities > Reach/Influence Personal > Interactive Can the Data Warehouse Architecture adapt?
  • 4. “Data” is the Platform The World as I See it
  • 6. My Mountain Bike as a Data Platform Data Collection Heart Rate Data Collection Altitude Data Collection Temperature Speed / Trip Miles Time Guidance Performance Rate of Climb Calories Burned Miles Obtained Total Climbed Elapsed Time Current, Average, Max Values Data Collection Cadence / RPM Data Architecture - on a Local Wireless Network (ANT+ Protocol)
  • 7. “Personal” Ride Analytics …is this a Data Warehouse?
  • 8. New Data Behaviors (individual actions) > Content > Time Behaviors Content
  • 9. New Data More is Better… Meaningful Guidance Massive Data 9
  • 10. “Business”Analytics - Classic “DW” BUSINESS INTELLIGENCE OLAP / DATA WAREHOUSE OLTP / TRANSACTIONS DATA. Answers the question: What are our most profitable Products?
  • 11. Classic “Business” Analytics Good for Reporting, Forecasting What did Happen? What will Happen? Operational Reporting Tactical Trending Strategic Months WeeksWeeks Months Years Descriptive/Trending Analytics11
  • 12. New“Personal”Analytics SELF-SERVICE GUIDANCE BEHAVIOURS DATA. Answers the question: Show me a good movie to watch!
  • 13. “Personal” Analytics “Right Now” is a very important time-frame! What is Happening RIGHT NOW! What did Happen? What will Happen? Operational Reporting Tactical Trending Strategic Months WeeksWeeks Months Years Predictive/Prescriptive Analytics 13
  • 14. 14
  • 15. 15
  • 16. “Business”Analytical Architecture Classic “DW” Data Flow - Uni-Directional, Latent,… Ordering App Data Staging OLAP / Reports Business Financial App Warehouse OLTP to OLAP Facts/Dimensions Analyst Mapping Business Metrics, Master Data Facts & KPI, YTD Reporting Dimensions What are our most Profitable Products? 16
  • 17. “Personal” Analytical Architecture “New” Data Flow - Iterative, Specialized, Extensible, plug & play Analytics, near real-time [Some components are open-source] Application / UX Analytical Capabilities Scoring/Ranking, Recommendations, Natural Language Processing, Relevancy, Classification, Optimization, Data Analytics Collaborative Filtering, Personalization, Digital Attribution,… Data Analysts What movie should I watch tonight? 17
  • 18. “Personal Analytics” Data Architecture “New” Data Flow – Detailed View of Components End-User Experience Browser, Tablet, Self-Service Application Mobile,… Personalization, Personalized Preferences, State Recommendations App Persistence Published Analytics Persistence/Analytics “State” Persistence “Read” Performance Analytics Engines Pluggable Social Signals Mass Data Storage RSS/Facebook/… Behaviors / “Write” Performance Data Scientists 18
  • 19. Let’s get personal… SALLY LIKES TACOS HOW DO WE MODEL THIS DATA?
  • 21. Modeling Social Data “Triples” - Directed (Weighted) Acyclic Graph Reach and Influence OBJECT PREDICATE (Score) SUBJECT SALLY LIKES (143) TACOS MARY LIKES (200) TACOS THE_TACO_SHOP MENU_ITEM TACOS SALLY LIKES (125) THE_TACO_SHOP SALLY CITY VENICE BEACH THE_TACO_SHOP CITY VENICE BEACH SALLY FRIEND (187) MARY Collaborative Filtering
  • 23. How important is Social? Shows you who is actively Install ghostery.com watching you surf the web! Lots of people!!!
  • 24. Signals – The Core of New Data Mixture of Proprietary and Public Data Social Personal Content Time
  • 25. The New “Analytical Application” Architecture “New” Data Flow – Specialized Technology Choices End-User Experience Browser, Tablet, Self-Service Application Mobile,… Personalization, Personalized Preferences, State Recommendations App Persistence Published Analytics Persistence/Analytics Cassandra, Riak,… Hbase Data-Center or Cloud Analytics R, Mahout, Pig Mass Data Storage Hadoop Specialization of Data Technologies 26
  • 26. Servicing Multiple Analytical Systems Using Shared Analytical Mas- Storage Self-Service Application A Self-Service Application B Published Published Persistence Persistence Analytics Analytics Riak Cassandra HBase MySQL Data Scientists Analytics Engine Pluggable Mass Data Storage Analytics Engine Behaviors / “Write”Performance Pluggable Analytics Engine Hadoop / AWS Pluggable p. 27
  • 27. Integrating the Architectures “Personal” Analytics Stack + Classic “DW” Stack Only Financial Events ($$$) cross the threshold App (and are recorded into) the Data Warehouse Staging App Data Warehouse OLAP / Business OLTP to OLAP Mapping Reports Analyst App “Local” Events stay Local (they are analyzed locally) Not all DATA Belongs in the Data Warehouse! 28
  • 28. Classic DW Vs. the New Analytics The Shift from “Business” Analytics to “Personal” Analytics Classic DW New Analytics Scope Enterprise Application Analytics Trailing:OLAP Predictive: Machine Learning Sentiment Analysis, Recommendations, Personalization,Natural Language Processing, Classification, Clustering, Optimization, Collaborative Filtering, Digital Attribution,… Actionable? Loosely Coupled Tightly Coupled Analytics Embedded in Application Data Structures Facts/Dimensions Semantic Data,Graph / Triples, (Requires a DW) Observations, Direct Signals Knowledge Expert Business Analyst Data Scientist Technology Stack Vendor Driven ($$$) Open-Source Architecture Scale-Up Scale-Out (or in the Cloud)
  • 29. New Signals + New Analytics = NewScenarios Data New Analytics Signals New Recommendations, Social Scenarios Natural Language Location Customer Processing, Engagement, Relevancy, Personal Customer Loyalty / Classification, Behaviors Attrition / Retention, Optimization, Transactions Fraud, Risk Analysis, Collaborative Content Intent, Customer Filtering, Personalization Digital Time Attribution,…
  • 30. Thank You! casey.l.kiernan@gmail.com blog: www.the-data-platform.com

Notas del editor

  1. Most people see data as a storage issue – persistence / serializatioAnd the associated technologies – oracle,mysql, sql server, and now Hadoop / riak / Hbasebut I don’t see data as a persistence problem – simply – “where do you put it?”I see data lattiss - theirs more data in the network than in databases… – it’s value is comes from how/where the data is used – not how it is stored. I had the opportunity to attend a number of training sessions help by Chris Date – we learned about RI, dead locks,… But one thing he said to me that really stuck – He said that the “data” is in the transaction log, not the database – the database only contains a current snapshot – What is happening is in the transaction log.