SlideShare una empresa de Scribd logo
1 de 26
Descargar para leer sin conexión
Data Mining
with Excel 2010
and PowerPivot

Mark Tabladillo Ph.D.
http://marktab.net
September 18, 2010
SQL Saturday 46 -- Raleigh NC
#sqlsat46




                                © 2010 Mark Tabladillo Ph.D.
                                    2
MarkTab & Data Mining




    © 2010 Mark Tabladillo Ph.D.
3
© 2010 Mark Tabladillo Ph.D.
4
© 2010 Mark Tabladillo Ph.D.
5
Outline




                                   © 2010 Mark Tabladillo Ph.D.
  What is       What is
                           Demos
Data Mining   PowerPivot




                                       6
Data Mining as a Service




    © 2010 Mark Tabladillo Ph.D.
7
Outline




                                   © 2010 Mark Tabladillo Ph.D.
  What is       What is
                           Demos
Data Mining   PowerPivot




                                       8
Data Mining Definitions
• Data mining
• Machine Learning
• Data mining algorithms -- typically use estimation or
  optimization to achieve results (as opposed to only
  calculations).




                                                          © 2010 Mark Tabladillo Ph.D.
                                                              9
Data Mining Tasks
• Supervised
  • Answer known, what is correlated?
• Unsupervised
  • Answer unknown (unspecified), what are the groups?
• Forecasting




                                                                 © 2010 Mark Tabladillo Ph.D.
  • Given a trend, what is next?



                                                         Value
                                                         Slide

                                                                 10
Data Mining Add-In for Excel
• Requires Analysis Services instance
• Version 10.00.2531.00 (April 2009)
• 32-Bit Add-In
• Microsoft .NET Framework 2.0 (32-bit)
• Office 2007 (Professional, Professional Plus, Ultimate,




                                                             © 2010 Mark Tabladillo Ph.D.
  Enterprise)
• SQL Server Enterprise or Standard (or Developer) 2008 or
  higher



                                                             11
The Analyze Tab




     © 2010 Mark Tabladillo Ph.D.
12
The Analyze Tab


  Menu Option                     Data Mining Algorithm
  Analyze Key Influencers         Naïve Bayes




                                                          © 2010 Mark Tabladillo Ph.D.
  Detect Categories               Clustering
  Fill from Example               Logistic Regression
  Forecast                        Time Series
  Highlight Exceptions            Clustering
  Scenario Analysis (Goal Seek)   Logistic Regression
  Scenario Analysis (What If)     Logistic Regression
  Prediction Calculator           Logistic Regression
                                                          13
  Shopping Basket Analysis        Association Rules
Data Mining Tab




     © 2010 Mark Tabladillo Ph.D.
14
Data Mining Tab




Many




       © 2010 Mark Tabladillo Ph.D.
15
Data Mining Capacities

SQL Server 2008 R2 Analysis Services
                                            Maximum sizes/numbers
Object
Maximum data mining models per
                                             2^31-1 = 2,147,483,647
structure
Maximum data mining structures per




                                                                          © 2010 Mark Tabladillo Ph.D.
                                             2^31-1 = 2,147,483,647
solution
Maximum data mining structures per
                                             2^31-1 = 2,147,483,647
Analysis Services database
Maximum data mining attributes
                                             2^31-1 = 2,147,483,647
(variables) per structure

     Reference:
     http://www.marktab.net/datamining/index.php/2010/08/01/sql-server-
     data-mining-capacities-2008-r2/                                      16
Data Mining Tab




     © 2010 Mark Tabladillo Ph.D.
17
Outline




                                   © 2010 Mark Tabladillo Ph.D.
  What is       What is
                           Demos
Data Mining   PowerPivot




                                   18
PowerPivot for Excel
• Take advantage of familiar Excel tools and
  features
• Process massive amounts of data in seconds
• Load even the largest data sets from virtually any




                                                       © 2010 Mark Tabladillo Ph.D.
  source
• Use powerful new analytical capabilities, such as
  Data Analysis Expressions (DAX)
• Make the most of multi-core processors and
  gigabytes of memory
                                                       19
PowerPivot for Excel Sources
• SQL Server
• SQL Azure
• Oracle, Teradata, Sybase, Informix, IBM DB2
• OLEDB/ODBC




                                                © 2010 Mark Tabladillo Ph.D.
• Analysis Services (SSAS)
• Reporting Services (SSRS)
• Excel, Text File

                                                20
PowerPivot Reference
• http://www.powerpivot.com (Product Site)
• http://www.powerpivotpro.com (Blog Site)




                                             © 2010 Mark Tabladillo Ph.D.
                                             21
Outline




                                   © 2010 Mark Tabladillo Ph.D.
  What is       What is
                           Demos
Data Mining   PowerPivot




                                   22
Resources
• MarkTab.NET
  Blog, links, video resources and information for
  data mining
• Blog: http://marktab.net/datamining




                                                     © 2010 Mark Tabladillo Ph.D.
• Twitter: @MarkTabNet




                                                     23
© 2010 Mark Tabladillo Ph.D.
24
Regroup and Conclusion
• Main Points from this Presentation




                                       © 2010 Mark Tabladillo Ph.D.
                                       25
Contact Information
• Mark Tabladillo
  http://marktab.net

• Also on:
  Twitter @marktabnet




                        © 2010 Mark Tabladillo Ph.D.
  Linked In




                        26

Más contenido relacionado

La actualidad más candente

SAS DATAFLUX DATA MANAGEMENT STUDIO TRAINING
SAS DATAFLUX DATA MANAGEMENT STUDIO TRAININGSAS DATAFLUX DATA MANAGEMENT STUDIO TRAINING
SAS DATAFLUX DATA MANAGEMENT STUDIO TRAININGbidwhm
 
Dataflux Training syllabus Dataflux management studio training syllabus ,Dat...
Dataflux Training  syllabus Dataflux management studio training syllabus ,Dat...Dataflux Training  syllabus Dataflux management studio training syllabus ,Dat...
Dataflux Training syllabus Dataflux management studio training syllabus ,Dat...bidwhm
 
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...Insight Technology, Inc.
 
Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Usedmurph4
 
Real World Business Intelligence and Data Warehousing
Real World Business Intelligence and Data WarehousingReal World Business Intelligence and Data Warehousing
Real World Business Intelligence and Data Warehousingukc4
 
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALADATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALASaikiran Panjala
 
An introduction to data virtualization in business intelligence
An introduction to data virtualization in business intelligenceAn introduction to data virtualization in business intelligence
An introduction to data virtualization in business intelligenceDavid Walker
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault ModelingKent Graziano
 
Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modelingvivekjv
 
Sas dataflux management studio Training ,data flux corporate trainig
Sas dataflux management studio Training ,data flux corporate trainig Sas dataflux management studio Training ,data flux corporate trainig
Sas dataflux management studio Training ,data flux corporate trainig bidwhm
 
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESBData Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESBDenodo
 
OpenDataSoft platform designed for big data issues
OpenDataSoft platform designed for big data issuesOpenDataSoft platform designed for big data issues
OpenDataSoft platform designed for big data issuesOpenDataSoft
 
Quality of Groundwater in Lingala Mandal of YSR Kadapa District, Andhraprades...
Quality of Groundwater in Lingala Mandal of YSR Kadapa District, Andhraprades...Quality of Groundwater in Lingala Mandal of YSR Kadapa District, Andhraprades...
Quality of Groundwater in Lingala Mandal of YSR Kadapa District, Andhraprades...IRJET Journal
 
ROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data StackROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data StackMartin Voigt
 
Enabling Cloud Data Integration (EMEA)
Enabling Cloud Data Integration (EMEA)Enabling Cloud Data Integration (EMEA)
Enabling Cloud Data Integration (EMEA)Denodo
 
Applied enterprise semantic mining
Applied enterprise semantic miningApplied enterprise semantic mining
Applied enterprise semantic miningMark Tabladillo
 
Why Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by DenodoWhy Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by DenodoJusto Hidalgo
 

La actualidad más candente (20)

SAS DATAFLUX DATA MANAGEMENT STUDIO TRAINING
SAS DATAFLUX DATA MANAGEMENT STUDIO TRAININGSAS DATAFLUX DATA MANAGEMENT STUDIO TRAINING
SAS DATAFLUX DATA MANAGEMENT STUDIO TRAINING
 
Dataflux Training syllabus Dataflux management studio training syllabus ,Dat...
Dataflux Training  syllabus Dataflux management studio training syllabus ,Dat...Dataflux Training  syllabus Dataflux management studio training syllabus ,Dat...
Dataflux Training syllabus Dataflux management studio training syllabus ,Dat...
 
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
 
Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Use
 
Real World Business Intelligence and Data Warehousing
Real World Business Intelligence and Data WarehousingReal World Business Intelligence and Data Warehousing
Real World Business Intelligence and Data Warehousing
 
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALADATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
 
An introduction to data virtualization in business intelligence
An introduction to data virtualization in business intelligenceAn introduction to data virtualization in business intelligence
An introduction to data virtualization in business intelligence
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
 
Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modeling
 
Why Data Vault?
Why Data Vault? Why Data Vault?
Why Data Vault?
 
Sas dataflux management studio Training ,data flux corporate trainig
Sas dataflux management studio Training ,data flux corporate trainig Sas dataflux management studio Training ,data flux corporate trainig
Sas dataflux management studio Training ,data flux corporate trainig
 
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESBData Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
 
OpenDataSoft platform designed for big data issues
OpenDataSoft platform designed for big data issuesOpenDataSoft platform designed for big data issues
OpenDataSoft platform designed for big data issues
 
Quality of Groundwater in Lingala Mandal of YSR Kadapa District, Andhraprades...
Quality of Groundwater in Lingala Mandal of YSR Kadapa District, Andhraprades...Quality of Groundwater in Lingala Mandal of YSR Kadapa District, Andhraprades...
Quality of Groundwater in Lingala Mandal of YSR Kadapa District, Andhraprades...
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
ROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data StackROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data Stack
 
Enabling Cloud Data Integration (EMEA)
Enabling Cloud Data Integration (EMEA)Enabling Cloud Data Integration (EMEA)
Enabling Cloud Data Integration (EMEA)
 
Applied enterprise semantic mining
Applied enterprise semantic miningApplied enterprise semantic mining
Applied enterprise semantic mining
 
Why Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by DenodoWhy Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by Denodo
 
Operational Data Vault
Operational Data VaultOperational Data Vault
Operational Data Vault
 

Destacado

Excel Datamining Addin Beginner
Excel Datamining Addin BeginnerExcel Datamining Addin Beginner
Excel Datamining Addin Beginnerexcel content
 
Excel 2010
Excel 2010Excel 2010
Excel 2010nhumar
 
Tutorial 11: Connecting to External Data
Tutorial 11: Connecting to External DataTutorial 11: Connecting to External Data
Tutorial 11: Connecting to External Datacios135
 
Tutorial 10: Performing What-IF Analyses
Tutorial 10: Performing What-IF AnalysesTutorial 10: Performing What-IF Analyses
Tutorial 10: Performing What-IF Analysescios135
 
MS Excel 2010 training module
MS Excel 2010 training moduleMS Excel 2010 training module
MS Excel 2010 training moduleAijaz Ali Mooro
 
Excel 2010 Unit A PPT
Excel 2010 Unit A PPTExcel 2010 Unit A PPT
Excel 2010 Unit A PPTokmomwalking
 
Tutorial 6: Multiple Worksheets and Workbooks
Tutorial 6: Multiple Worksheets and WorkbooksTutorial 6: Multiple Worksheets and Workbooks
Tutorial 6: Multiple Worksheets and Workbookscios135
 
Tutorial 2
Tutorial 2Tutorial 2
Tutorial 2cios135
 
Tutorial 4 Charts and Graphs
Tutorial 4 Charts and GraphsTutorial 4 Charts and Graphs
Tutorial 4 Charts and Graphscios135
 
Tutorial 5: Excel Tables, PivotTables, and Pivot Charts
Tutorial 5: Excel Tables, PivotTables, and Pivot ChartsTutorial 5: Excel Tables, PivotTables, and Pivot Charts
Tutorial 5: Excel Tables, PivotTables, and Pivot Chartscios135
 
Tutorial 3 Working with Formulas and Functions
Tutorial 3 Working with Formulas and FunctionsTutorial 3 Working with Formulas and Functions
Tutorial 3 Working with Formulas and Functionscios135
 
Tutorial 7: Advanced Functions and Conitional Formating
Tutorial 7: Advanced Functions and Conitional FormatingTutorial 7: Advanced Functions and Conitional Formating
Tutorial 7: Advanced Functions and Conitional Formatingcios135
 
Tutorial 8: Developing an Excel Application
Tutorial 8: Developing an Excel ApplicationTutorial 8: Developing an Excel Application
Tutorial 8: Developing an Excel Applicationcios135
 
Presentation Skills for Teachers version 3.0
Presentation Skills for Teachers  version 3.0Presentation Skills for Teachers  version 3.0
Presentation Skills for Teachers version 3.0Simon Jones
 
Teaching Excel
Teaching ExcelTeaching Excel
Teaching Excelsam ran
 
Introduction to Business Process Management
Introduction to Business Process ManagementIntroduction to Business Process Management
Introduction to Business Process ManagementAlan McSweeney
 

Destacado (17)

Excel Datamining Addin Beginner
Excel Datamining Addin BeginnerExcel Datamining Addin Beginner
Excel Datamining Addin Beginner
 
Matlab for marketing people
Matlab for marketing peopleMatlab for marketing people
Matlab for marketing people
 
Excel 2010
Excel 2010Excel 2010
Excel 2010
 
Tutorial 11: Connecting to External Data
Tutorial 11: Connecting to External DataTutorial 11: Connecting to External Data
Tutorial 11: Connecting to External Data
 
Tutorial 10: Performing What-IF Analyses
Tutorial 10: Performing What-IF AnalysesTutorial 10: Performing What-IF Analyses
Tutorial 10: Performing What-IF Analyses
 
MS Excel 2010 training module
MS Excel 2010 training moduleMS Excel 2010 training module
MS Excel 2010 training module
 
Excel 2010 Unit A PPT
Excel 2010 Unit A PPTExcel 2010 Unit A PPT
Excel 2010 Unit A PPT
 
Tutorial 6: Multiple Worksheets and Workbooks
Tutorial 6: Multiple Worksheets and WorkbooksTutorial 6: Multiple Worksheets and Workbooks
Tutorial 6: Multiple Worksheets and Workbooks
 
Tutorial 2
Tutorial 2Tutorial 2
Tutorial 2
 
Tutorial 4 Charts and Graphs
Tutorial 4 Charts and GraphsTutorial 4 Charts and Graphs
Tutorial 4 Charts and Graphs
 
Tutorial 5: Excel Tables, PivotTables, and Pivot Charts
Tutorial 5: Excel Tables, PivotTables, and Pivot ChartsTutorial 5: Excel Tables, PivotTables, and Pivot Charts
Tutorial 5: Excel Tables, PivotTables, and Pivot Charts
 
Tutorial 3 Working with Formulas and Functions
Tutorial 3 Working with Formulas and FunctionsTutorial 3 Working with Formulas and Functions
Tutorial 3 Working with Formulas and Functions
 
Tutorial 7: Advanced Functions and Conitional Formating
Tutorial 7: Advanced Functions and Conitional FormatingTutorial 7: Advanced Functions and Conitional Formating
Tutorial 7: Advanced Functions and Conitional Formating
 
Tutorial 8: Developing an Excel Application
Tutorial 8: Developing an Excel ApplicationTutorial 8: Developing an Excel Application
Tutorial 8: Developing an Excel Application
 
Presentation Skills for Teachers version 3.0
Presentation Skills for Teachers  version 3.0Presentation Skills for Teachers  version 3.0
Presentation Skills for Teachers version 3.0
 
Teaching Excel
Teaching ExcelTeaching Excel
Teaching Excel
 
Introduction to Business Process Management
Introduction to Business Process ManagementIntroduction to Business Process Management
Introduction to Business Process Management
 

Similar a Data Mining with Excel 2010 and PowerPivot

SQL Server 2008 Data Mining with PowerPivot and Excel 2010
SQL Server 2008 Data Mining with PowerPivot and Excel 2010SQL Server 2008 Data Mining with PowerPivot and Excel 2010
SQL Server 2008 Data Mining with PowerPivot and Excel 2010Mark Tabladillo
 
Document Classification using DMX in SQL Server Analysis Services
Document Classification using DMX in SQL Server Analysis ServicesDocument Classification using DMX in SQL Server Analysis Services
Document Classification using DMX in SQL Server Analysis ServicesMark Tabladillo
 
Data Mining With Excel 2007 And SQL Server 2008
Data Mining With Excel 2007 And SQL Server 2008Data Mining With Excel 2007 And SQL Server 2008
Data Mining With Excel 2007 And SQL Server 2008Mark Tabladillo
 
SQL Server Data Mining for SQL Server Professionals
SQL Server Data Mining for SQL Server Professionals SQL Server Data Mining for SQL Server Professionals
SQL Server Data Mining for SQL Server Professionals Mark Tabladillo
 
The Perfect Storm: The Impact of Analytics, Big Data and Analytics
The Perfect Storm: The Impact of Analytics, Big Data and AnalyticsThe Perfect Storm: The Impact of Analytics, Big Data and Analytics
The Perfect Storm: The Impact of Analytics, Big Data and AnalyticsInside Analysis
 
Enterprise Data Mining for SQL Server Professionals 20110319
Enterprise Data Mining for SQL Server Professionals 20110319Enterprise Data Mining for SQL Server Professionals 20110319
Enterprise Data Mining for SQL Server Professionals 20110319Mark Tabladillo
 
Getting Big Value from Big Data
Getting Big Value from Big DataGetting Big Value from Big Data
Getting Big Value from Big DataDataStax
 
Big Data Ecosystem- Impetus Technologies
Big Data Ecosystem-  Impetus TechnologiesBig Data Ecosystem-  Impetus Technologies
Big Data Ecosystem- Impetus TechnologiesImpetus Technologies
 
How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02Data Science London
 
Databricks for Dummies
Databricks for DummiesDatabricks for Dummies
Databricks for DummiesRodney Joyce
 
SQL Saturday 79 Enterprise Data Mining for SQL Server 2008 R2
SQL Saturday 79 Enterprise Data Mining for SQL Server 2008 R2SQL Saturday 79 Enterprise Data Mining for SQL Server 2008 R2
SQL Saturday 79 Enterprise Data Mining for SQL Server 2008 R2Mark Tabladillo
 
Rabobank - There is something about Data
Rabobank - There is something about DataRabobank - There is something about Data
Rabobank - There is something about DataBigDataExpo
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Denodo
 
Enteprise Data Mining with SQL Server by Mark Tabladillo
Enteprise Data Mining with SQL Server by Mark TabladilloEnteprise Data Mining with SQL Server by Mark Tabladillo
Enteprise Data Mining with SQL Server by Mark TabladilloFelipe Ferreira
 
Big Data Benchmarking Tutorial
Big Data Benchmarking TutorialBig Data Benchmarking Tutorial
Big Data Benchmarking TutorialTilmann Rabl
 
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News! ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News! Embarcadero Technologies
 
Oracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management PlatformaOracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management PlatformaMarketingArrowECS_CZ
 
Obiee 12c and the leap forward in lifecycle management
Obiee 12c and the leap forward in lifecycle managementObiee 12c and the leap forward in lifecycle management
Obiee 12c and the leap forward in lifecycle managementLauren Prezby
 
Obiee 12C and the Leap Forward in Lifecycle Management
Obiee 12C and the Leap Forward in Lifecycle ManagementObiee 12C and the Leap Forward in Lifecycle Management
Obiee 12C and the Leap Forward in Lifecycle ManagementStewart Bryson
 
NoSQL & Big Data Analytics: History, Hype, Opportunities
NoSQL & Big Data Analytics: History, Hype, OpportunitiesNoSQL & Big Data Analytics: History, Hype, Opportunities
NoSQL & Big Data Analytics: History, Hype, OpportunitiesVishy Poosala
 

Similar a Data Mining with Excel 2010 and PowerPivot (20)

SQL Server 2008 Data Mining with PowerPivot and Excel 2010
SQL Server 2008 Data Mining with PowerPivot and Excel 2010SQL Server 2008 Data Mining with PowerPivot and Excel 2010
SQL Server 2008 Data Mining with PowerPivot and Excel 2010
 
Document Classification using DMX in SQL Server Analysis Services
Document Classification using DMX in SQL Server Analysis ServicesDocument Classification using DMX in SQL Server Analysis Services
Document Classification using DMX in SQL Server Analysis Services
 
Data Mining With Excel 2007 And SQL Server 2008
Data Mining With Excel 2007 And SQL Server 2008Data Mining With Excel 2007 And SQL Server 2008
Data Mining With Excel 2007 And SQL Server 2008
 
SQL Server Data Mining for SQL Server Professionals
SQL Server Data Mining for SQL Server Professionals SQL Server Data Mining for SQL Server Professionals
SQL Server Data Mining for SQL Server Professionals
 
The Perfect Storm: The Impact of Analytics, Big Data and Analytics
The Perfect Storm: The Impact of Analytics, Big Data and AnalyticsThe Perfect Storm: The Impact of Analytics, Big Data and Analytics
The Perfect Storm: The Impact of Analytics, Big Data and Analytics
 
Enterprise Data Mining for SQL Server Professionals 20110319
Enterprise Data Mining for SQL Server Professionals 20110319Enterprise Data Mining for SQL Server Professionals 20110319
Enterprise Data Mining for SQL Server Professionals 20110319
 
Getting Big Value from Big Data
Getting Big Value from Big DataGetting Big Value from Big Data
Getting Big Value from Big Data
 
Big Data Ecosystem- Impetus Technologies
Big Data Ecosystem-  Impetus TechnologiesBig Data Ecosystem-  Impetus Technologies
Big Data Ecosystem- Impetus Technologies
 
How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02How to develop a data scientist – What business has requested v02
How to develop a data scientist – What business has requested v02
 
Databricks for Dummies
Databricks for DummiesDatabricks for Dummies
Databricks for Dummies
 
SQL Saturday 79 Enterprise Data Mining for SQL Server 2008 R2
SQL Saturday 79 Enterprise Data Mining for SQL Server 2008 R2SQL Saturday 79 Enterprise Data Mining for SQL Server 2008 R2
SQL Saturday 79 Enterprise Data Mining for SQL Server 2008 R2
 
Rabobank - There is something about Data
Rabobank - There is something about DataRabobank - There is something about Data
Rabobank - There is something about Data
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
 
Enteprise Data Mining with SQL Server by Mark Tabladillo
Enteprise Data Mining with SQL Server by Mark TabladilloEnteprise Data Mining with SQL Server by Mark Tabladillo
Enteprise Data Mining with SQL Server by Mark Tabladillo
 
Big Data Benchmarking Tutorial
Big Data Benchmarking TutorialBig Data Benchmarking Tutorial
Big Data Benchmarking Tutorial
 
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News! ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
ER/Studio and DB PowerStudio Launch Webinar: Big Data, Big Models, Big News!
 
Oracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management PlatformaOracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management Platforma
 
Obiee 12c and the leap forward in lifecycle management
Obiee 12c and the leap forward in lifecycle managementObiee 12c and the leap forward in lifecycle management
Obiee 12c and the leap forward in lifecycle management
 
Obiee 12C and the Leap Forward in Lifecycle Management
Obiee 12C and the Leap Forward in Lifecycle ManagementObiee 12C and the Leap Forward in Lifecycle Management
Obiee 12C and the Leap Forward in Lifecycle Management
 
NoSQL & Big Data Analytics: History, Hype, Opportunities
NoSQL & Big Data Analytics: History, Hype, OpportunitiesNoSQL & Big Data Analytics: History, Hype, Opportunities
NoSQL & Big Data Analytics: History, Hype, Opportunities
 

Más de Mark Tabladillo

How to find low-cost or free data science resources 202006
How to find low-cost or free data science resources 202006How to find low-cost or free data science resources 202006
How to find low-cost or free data science resources 202006Mark Tabladillo
 
Microsoft Build 2020: Data Science Recap
Microsoft Build 2020: Data Science RecapMicrosoft Build 2020: Data Science Recap
Microsoft Build 2020: Data Science RecapMark Tabladillo
 
201909 Automated ML for Developers
201909 Automated ML for Developers201909 Automated ML for Developers
201909 Automated ML for DevelopersMark Tabladillo
 
201908 Overview of Automated ML
201908 Overview of Automated ML201908 Overview of Automated ML
201908 Overview of Automated MLMark Tabladillo
 
201906 01 Introduction to ML.NET 1.0
201906 01 Introduction to ML.NET 1.0201906 01 Introduction to ML.NET 1.0
201906 01 Introduction to ML.NET 1.0Mark Tabladillo
 
201906 04 Overview of Automated ML June 2019
201906 04 Overview of Automated ML June 2019201906 04 Overview of Automated ML June 2019
201906 04 Overview of Automated ML June 2019Mark Tabladillo
 
201906 03 Introduction to NimbusML
201906 03 Introduction to NimbusML201906 03 Introduction to NimbusML
201906 03 Introduction to NimbusMLMark Tabladillo
 
201906 02 Introduction to AutoML with ML.NET 1.0
201906 02 Introduction to AutoML with ML.NET 1.0201906 02 Introduction to AutoML with ML.NET 1.0
201906 02 Introduction to AutoML with ML.NET 1.0Mark Tabladillo
 
201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine LearningMark Tabladillo
 
201905 Azure Certification DP-100: Designing and Implementing a Data Science ...
201905 Azure Certification DP-100: Designing and Implementing a Data Science ...201905 Azure Certification DP-100: Designing and Implementing a Data Science ...
201905 Azure Certification DP-100: Designing and Implementing a Data Science ...Mark Tabladillo
 
Big Data Advanced Analytics on Microsoft Azure 201904
Big Data Advanced Analytics on Microsoft Azure 201904Big Data Advanced Analytics on Microsoft Azure 201904
Big Data Advanced Analytics on Microsoft Azure 201904Mark Tabladillo
 
Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904Mark Tabladillo
 
Training of Python scikit-learn models on Azure
Training of Python scikit-learn models on AzureTraining of Python scikit-learn models on Azure
Training of Python scikit-learn models on AzureMark Tabladillo
 
Big Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft AzureBig Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft AzureMark Tabladillo
 
Advanced Analytics with Power BI 201808
Advanced Analytics with Power BI 201808Advanced Analytics with Power BI 201808
Advanced Analytics with Power BI 201808Mark Tabladillo
 
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)Mark Tabladillo
 
Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017Mark Tabladillo
 
Microsoft Technologies for Data Science 201612
Microsoft Technologies for Data Science 201612Microsoft Technologies for Data Science 201612
Microsoft Technologies for Data Science 201612Mark Tabladillo
 
How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610Mark Tabladillo
 
Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016Mark Tabladillo
 

Más de Mark Tabladillo (20)

How to find low-cost or free data science resources 202006
How to find low-cost or free data science resources 202006How to find low-cost or free data science resources 202006
How to find low-cost or free data science resources 202006
 
Microsoft Build 2020: Data Science Recap
Microsoft Build 2020: Data Science RecapMicrosoft Build 2020: Data Science Recap
Microsoft Build 2020: Data Science Recap
 
201909 Automated ML for Developers
201909 Automated ML for Developers201909 Automated ML for Developers
201909 Automated ML for Developers
 
201908 Overview of Automated ML
201908 Overview of Automated ML201908 Overview of Automated ML
201908 Overview of Automated ML
 
201906 01 Introduction to ML.NET 1.0
201906 01 Introduction to ML.NET 1.0201906 01 Introduction to ML.NET 1.0
201906 01 Introduction to ML.NET 1.0
 
201906 04 Overview of Automated ML June 2019
201906 04 Overview of Automated ML June 2019201906 04 Overview of Automated ML June 2019
201906 04 Overview of Automated ML June 2019
 
201906 03 Introduction to NimbusML
201906 03 Introduction to NimbusML201906 03 Introduction to NimbusML
201906 03 Introduction to NimbusML
 
201906 02 Introduction to AutoML with ML.NET 1.0
201906 02 Introduction to AutoML with ML.NET 1.0201906 02 Introduction to AutoML with ML.NET 1.0
201906 02 Introduction to AutoML with ML.NET 1.0
 
201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning
 
201905 Azure Certification DP-100: Designing and Implementing a Data Science ...
201905 Azure Certification DP-100: Designing and Implementing a Data Science ...201905 Azure Certification DP-100: Designing and Implementing a Data Science ...
201905 Azure Certification DP-100: Designing and Implementing a Data Science ...
 
Big Data Advanced Analytics on Microsoft Azure 201904
Big Data Advanced Analytics on Microsoft Azure 201904Big Data Advanced Analytics on Microsoft Azure 201904
Big Data Advanced Analytics on Microsoft Azure 201904
 
Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904
 
Training of Python scikit-learn models on Azure
Training of Python scikit-learn models on AzureTraining of Python scikit-learn models on Azure
Training of Python scikit-learn models on Azure
 
Big Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft AzureBig Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft Azure
 
Advanced Analytics with Power BI 201808
Advanced Analytics with Power BI 201808Advanced Analytics with Power BI 201808
Advanced Analytics with Power BI 201808
 
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
 
Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017
 
Microsoft Technologies for Data Science 201612
Microsoft Technologies for Data Science 201612Microsoft Technologies for Data Science 201612
Microsoft Technologies for Data Science 201612
 
How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610
 
Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016
 

Último

Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...Roland Driesen
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyThe Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyEthan lee
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...Paul Menig
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageMatteo Carbone
 
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...amitlee9823
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
 
Value Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsValue Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsP&CO
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communicationskarancommunications
 
RSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors DataRSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors DataExhibitors Data
 
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 DelhiCall Girls in Delhi
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Lviv Startup Club
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...anilsa9823
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.Aaiza Hassan
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
Call Girls In Holiday Inn Express Gurugram➥99902@11544 ( Best price)100% Genu...
Call Girls In Holiday Inn Express Gurugram➥99902@11544 ( Best price)100% Genu...Call Girls In Holiday Inn Express Gurugram➥99902@11544 ( Best price)100% Genu...
Call Girls In Holiday Inn Express Gurugram➥99902@11544 ( Best price)100% Genu...lizamodels9
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesDipal Arora
 

Último (20)

Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyThe Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usage
 
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
 
unwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabi
unwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabiunwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabi
unwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabi
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
 
Forklift Operations: Safety through Cartoons
Forklift Operations: Safety through CartoonsForklift Operations: Safety through Cartoons
Forklift Operations: Safety through Cartoons
 
Value Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsValue Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and pains
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
RSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors DataRSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors Data
 
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
9599632723 Top Call Girls in Delhi at your Door Step Available 24x7 Delhi
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
Call Girls In Holiday Inn Express Gurugram➥99902@11544 ( Best price)100% Genu...
Call Girls In Holiday Inn Express Gurugram➥99902@11544 ( Best price)100% Genu...Call Girls In Holiday Inn Express Gurugram➥99902@11544 ( Best price)100% Genu...
Call Girls In Holiday Inn Express Gurugram➥99902@11544 ( Best price)100% Genu...
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
 

Data Mining with Excel 2010 and PowerPivot

  • 1. Data Mining with Excel 2010 and PowerPivot Mark Tabladillo Ph.D. http://marktab.net September 18, 2010
  • 2. SQL Saturday 46 -- Raleigh NC #sqlsat46 © 2010 Mark Tabladillo Ph.D. 2
  • 3. MarkTab & Data Mining © 2010 Mark Tabladillo Ph.D. 3
  • 4. © 2010 Mark Tabladillo Ph.D. 4
  • 5. © 2010 Mark Tabladillo Ph.D. 5
  • 6. Outline © 2010 Mark Tabladillo Ph.D. What is What is Demos Data Mining PowerPivot 6
  • 7. Data Mining as a Service © 2010 Mark Tabladillo Ph.D. 7
  • 8. Outline © 2010 Mark Tabladillo Ph.D. What is What is Demos Data Mining PowerPivot 8
  • 9. Data Mining Definitions • Data mining • Machine Learning • Data mining algorithms -- typically use estimation or optimization to achieve results (as opposed to only calculations). © 2010 Mark Tabladillo Ph.D. 9
  • 10. Data Mining Tasks • Supervised • Answer known, what is correlated? • Unsupervised • Answer unknown (unspecified), what are the groups? • Forecasting © 2010 Mark Tabladillo Ph.D. • Given a trend, what is next? Value Slide 10
  • 11. Data Mining Add-In for Excel • Requires Analysis Services instance • Version 10.00.2531.00 (April 2009) • 32-Bit Add-In • Microsoft .NET Framework 2.0 (32-bit) • Office 2007 (Professional, Professional Plus, Ultimate, © 2010 Mark Tabladillo Ph.D. Enterprise) • SQL Server Enterprise or Standard (or Developer) 2008 or higher 11
  • 12. The Analyze Tab © 2010 Mark Tabladillo Ph.D. 12
  • 13. The Analyze Tab Menu Option Data Mining Algorithm Analyze Key Influencers Naïve Bayes © 2010 Mark Tabladillo Ph.D. Detect Categories Clustering Fill from Example Logistic Regression Forecast Time Series Highlight Exceptions Clustering Scenario Analysis (Goal Seek) Logistic Regression Scenario Analysis (What If) Logistic Regression Prediction Calculator Logistic Regression 13 Shopping Basket Analysis Association Rules
  • 14. Data Mining Tab © 2010 Mark Tabladillo Ph.D. 14
  • 15. Data Mining Tab Many © 2010 Mark Tabladillo Ph.D. 15
  • 16. Data Mining Capacities SQL Server 2008 R2 Analysis Services Maximum sizes/numbers Object Maximum data mining models per 2^31-1 = 2,147,483,647 structure Maximum data mining structures per © 2010 Mark Tabladillo Ph.D. 2^31-1 = 2,147,483,647 solution Maximum data mining structures per 2^31-1 = 2,147,483,647 Analysis Services database Maximum data mining attributes 2^31-1 = 2,147,483,647 (variables) per structure Reference: http://www.marktab.net/datamining/index.php/2010/08/01/sql-server- data-mining-capacities-2008-r2/ 16
  • 17. Data Mining Tab © 2010 Mark Tabladillo Ph.D. 17
  • 18. Outline © 2010 Mark Tabladillo Ph.D. What is What is Demos Data Mining PowerPivot 18
  • 19. PowerPivot for Excel • Take advantage of familiar Excel tools and features • Process massive amounts of data in seconds • Load even the largest data sets from virtually any © 2010 Mark Tabladillo Ph.D. source • Use powerful new analytical capabilities, such as Data Analysis Expressions (DAX) • Make the most of multi-core processors and gigabytes of memory 19
  • 20. PowerPivot for Excel Sources • SQL Server • SQL Azure • Oracle, Teradata, Sybase, Informix, IBM DB2 • OLEDB/ODBC © 2010 Mark Tabladillo Ph.D. • Analysis Services (SSAS) • Reporting Services (SSRS) • Excel, Text File 20
  • 21. PowerPivot Reference • http://www.powerpivot.com (Product Site) • http://www.powerpivotpro.com (Blog Site) © 2010 Mark Tabladillo Ph.D. 21
  • 22. Outline © 2010 Mark Tabladillo Ph.D. What is What is Demos Data Mining PowerPivot 22
  • 23. Resources • MarkTab.NET Blog, links, video resources and information for data mining • Blog: http://marktab.net/datamining © 2010 Mark Tabladillo Ph.D. • Twitter: @MarkTabNet 23
  • 24. © 2010 Mark Tabladillo Ph.D. 24
  • 25. Regroup and Conclusion • Main Points from this Presentation © 2010 Mark Tabladillo Ph.D. 25
  • 26. Contact Information • Mark Tabladillo http://marktab.net • Also on: Twitter @marktabnet © 2010 Mark Tabladillo Ph.D. Linked In 26