SlideShare una empresa de Scribd logo
1 de 20
Descargar para leer sin conexión
An Introduction to SQL Server 2005
       and 2008 Data Mining
                           Peter Myers
          Mentor – Solid Quality Mentors
 San Francisco SQL Server User Group – 8 July, 2009



        Copyright © 2009, Solid Quality Mentors. All rights reserved.
WHO WE ARE

• Industry experts:
  Growing, elite group of over 90 of the world’s best technical experts who, as
  reflected by the high concentration of Microsoft MVP’s and RD’s in our ranks,
  achieve excellence in their industry by maintaining the highest credentials.
• Published authors:
  Best technical reference books, Microsoft reference materials, industry white
  papers, technical magazine articles, and webcasts.
• Top technical speakers:
  PASS Community Summit, Microsoft TechEd, The Microsoft BI Conference,
  SQL Server DevConnections, countless user groups, international
  conferences and events.
• For more information visit www.solidq.com


                   Copyright © 2009, Solid Quality Mentors. All rights reserved.
WHAT WE DO

Provide advanced, world-class expertise across the entire
Microsoft relational data and development platforms and
              complimenting technologies.

 PRACTICE AREAS                                     SERVICES
 Relational Database Management                     Advanced, Public Training
 Business Intelligence                              Customized, Private Training
 Development Methodologies                          Solution Delivery & Tuning
 SharePoint Collaboration                           Enhanced, Mentoring Services



      For more information visit www.solidq.com
                  Copyright © 2009, Solid Quality Mentors. All rights reserved.
AGENDA

• Introducing Data Mining
• Describing the Data Mining Process
• SQL Server™ 2008 Data Mining
• Data Preparation
• Data Mining Visualization
• Demonstrations


             Copyright © 2009, Solid Quality Mentors. All rights reserved.
INTRODUCING
                                                                   DATA MINING
• Addresses the problem:
 “Too much data and not enough information”
• Enables data exploration, pattern discovery, and pattern
 prediction—which lead to knowledge discovery
• Forms a key part of a BI solution




              Copyright © 2009, Solid Quality Mentors. All rights reserved.
BUSINESS
                                                                         SCENARIOS
• Identifying responsive customers/unresponsive
 customers (also known as churn analysis)
• Targeting promotions
• Detecting and preventing fraud
• Correcting data during ETL
• Forecasting sales and inventory
• Cross-selling

             Copyright © 2009, Solid Quality Mentors. All rights reserved.
DESCRIBING THE
                                    DATA MINING PROCESS
                                                                       “Doing Data
                                                                         Mining”
  Business                                       Data
Understanding                                Understanding




                                                              Data
                                                           Preparation

                             Data
 Deployment

                                                             Modeling


                           Evaluation
 “Putting Data
Mining to Work”                                               www.crisp-dm.org

       Copyright © 2009, Solid Quality Mentors. All rights reserved.
DATA
                                                                        PREPARATION
• Often significant amounts of effort are required to prepare data
  for mining:
   • Transforming for cleaning and reformatting
   • Isolating and flagging abnormal data
   • Appropriately substituting missing values
   • Discretizing continuous values into ranges
   • Normalizing values between 0 and 1
• Of course, having the required data to begin with is important:
   • When designing systems, give consideration to attributes that may be
     required as inputs for classification
      o For example, demographic data: Age, Gender, Region, etc


                   Copyright © 2009, Solid Quality Mentors. All rights reserved.
MODELING


Design time
Process time
Query time                                                                     Mining Model




               Copyright © 2009, Solid Quality Mentors. All rights reserved.
MODELING


Design time
Process time
Query time                                                                     Mining Model




                                                    Data
                                                   Mining
                                                   Engine
                Training Data



               Copyright © 2009, Solid Quality Mentors. All rights reserved.
MODELING


Design time
Process time
Query time                                                                     Mining Model




                                                    Data
                                                   Mining
                                                   Engine


               Predicted Data                                                  Data to Predict

               Copyright © 2009, Solid Quality Mentors. All rights reserved.
MODEL
                                                                             VALIDATION
• It is important that the model makes sense
  •   Accuracy
      o Does it correlate and predict correctly?
  •   Reliability
      o Does it work similarly for different test data?
  •   Usefulness
      o Does it provide insight or only obvious trivialities?
• Commonly a holdout set of data is used to test model
 accuracy


                    Copyright © 2009, Solid Quality Mentors. All rights reserved.
SQL SERVER™ 2008
                                                              DATA MINING
• Hides the complexity of an advanced technology
• Includes full suite of algorithms to automatically extract
 information from data
• Handles large volumes of data and complex data
• Data can be sourced from relational and OLAP databases
• Uses standard programming interfaces:
   • XMLA
   • DMX
• Delivers a complete framework for building and deploying
  intelligent applications

                Copyright © 2009, Solid Quality Mentors. All rights reserved.
SQL SERVER™ 2008
                                               ALGORITHMS

     • Microsoft Naïve Bayes
         •    Quick and approachable algorithm
         •    Used for classification


     • Microsoft Decision Trees
         •    Popular data mining technique
         •    Used for classification, regression and association


     • Microsoft Linear Regression
         •    Finds the best possible straight line through a series of
              points
         •    Used for prediction analysis


Copyright © 2009, Solid Quality Mentors. All rights reserved.
SQL SERVER™ 2008
                                               ALGORITHMS

     • Microsoft Neural Network
         •    More sophisticated than Decision Trees and Naïve
              Bayes, this algorithm can explore extremely complex
              scenarios
         •    Used for classification and regression tasks


     • Microsoft Logistic Regression
         •    A particular case of the Neural Network algorithm


     • Microsoft Clustering
         •    Finds natural groupings inside data
         •    Supports segmentation and anomaly detection tasks


Copyright © 2009, Solid Quality Mentors. All rights reserved.
SQL SERVER™ 2008
                                               ALGORITHMS

     • Microsoft Sequence Clustering
         •    Groups a sequence of discrete events into natural
              groups based on similarity


     • Microsoft Time Series
         •    Used to predict future values from a time series
         •    Has been improved in SQL Server 2008 to produce
              more accurate long-term forecasts


     • Microsoft Association Rules
         •    Commonly supports market basket analysis to learn
              what products are purchased together


Copyright © 2009, Solid Quality Mentors. All rights reserved.
SQL SERVER™ 2008
                                                               ALGORITHMS


  Classify       Estimate                 Cluster                Forecast         Associate

• Decision     • Decision             • Clustering            • Time Series     • Association
  Trees          Trees                                                            Rules
• Logistic     • Linear                                                         • Decision
  Regression     Regression                                                       Trees
• Naïve        • Logistic
  Bayes          Regression
• Neural       • Neural
  Networks       Networks




                Copyright © 2009, Solid Quality Mentors. All rights reserved.
DATA MINING
                                                                  VISUALIZATION
• In contrast to OLTP and OLAP queries, data mining queries
 typically extract information that the user is not aware of
• Appreciate that end users do not typically query data mining
 models directly
• Visualizations can effectively present data discoveries
• SQL Server™ 2008 provides algorithm-specific visualizations that
  can:
   • Test and explore models in BIDS
   • Be embedded into Web and Windows Forms applications
• Developers can construct and plug-in custom data mining
  viewers
                Copyright © 2009, Solid Quality Mentors. All rights reserved.
DEMONSTRATIONS
1.   Introducing the Table Analysis Tools for Excel 2007
2.   Creating, Training, Testing and Querying Mining Models with BIDS
3.   Developing a Data Mining Report
4.   Enhancing an E-Commerce Site with Market Basket Analysis




                    Copyright © 2009, Solid Quality Mentors. All rights reserved.
                                2008,
RESOURCES

• www.microsoft.com/sql/technologies/dm
  •   Links to technical resources, case studies, news, and reviews
• www.sqlserverdatamining.com
  •   Site designed and maintained by the SQL Server Data Mining
      team
  •   Includes: Live samples, tutorials, webcasts, tips and tricks, and
      FAQ
• Data Mining for SQL Server 2008, by ZhaoHui Tang and
 Jamie MacLennan

                 Copyright © 2009, Solid Quality Mentors. All rights reserved.

Más contenido relacionado

La actualidad más candente

DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalDDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalIntelHealthcare
 
Hadoop World 2011: Big Data Analytics – Data Professionals: The New Enterpris...
Hadoop World 2011: Big Data Analytics – Data Professionals: The New Enterpris...Hadoop World 2011: Big Data Analytics – Data Professionals: The New Enterpris...
Hadoop World 2011: Big Data Analytics – Data Professionals: The New Enterpris...Cloudera, Inc.
 
Superb Hosting – Ahead of the Rest.®
Superb Hosting – Ahead of the Rest.®Superb Hosting – Ahead of the Rest.®
Superb Hosting – Ahead of the Rest.®emccleave
 
General Presentation
General PresentationGeneral Presentation
General PresentationNsid123456
 
SharePoint Performance - Tales from the Field
SharePoint Performance - Tales from the FieldSharePoint Performance - Tales from the Field
SharePoint Performance - Tales from the FieldChris McNulty
 
Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics
Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX DatamaticsWebinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics
Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX DatamaticsMongoDB
 
Radovan Janecek R E S Tor S O A Pv1
Radovan  Janecek    R E S Tor S O A Pv1Radovan  Janecek    R E S Tor S O A Pv1
Radovan Janecek R E S Tor S O A Pv1SOA Symposium
 
To_Infinity_and_Beyond_Internet_Scale_Workloads_Data_Center_Design_v6
To_Infinity_and_Beyond_Internet_Scale_Workloads_Data_Center_Design_v6To_Infinity_and_Beyond_Internet_Scale_Workloads_Data_Center_Design_v6
To_Infinity_and_Beyond_Internet_Scale_Workloads_Data_Center_Design_v6John Sing
 

La actualidad más candente (10)

DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalDDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
 
Hadoop World 2011: Big Data Analytics – Data Professionals: The New Enterpris...
Hadoop World 2011: Big Data Analytics – Data Professionals: The New Enterpris...Hadoop World 2011: Big Data Analytics – Data Professionals: The New Enterpris...
Hadoop World 2011: Big Data Analytics – Data Professionals: The New Enterpris...
 
Superb Hosting – Ahead of the Rest.®
Superb Hosting – Ahead of the Rest.®Superb Hosting – Ahead of the Rest.®
Superb Hosting – Ahead of the Rest.®
 
MDM - Oracle Site Hub 101
MDM - Oracle Site Hub 101MDM - Oracle Site Hub 101
MDM - Oracle Site Hub 101
 
General Presentation
General PresentationGeneral Presentation
General Presentation
 
SharePoint Performance - Tales from the Field
SharePoint Performance - Tales from the FieldSharePoint Performance - Tales from the Field
SharePoint Performance - Tales from the Field
 
Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics
Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX DatamaticsWebinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics
Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics
 
CP Overview
CP OverviewCP Overview
CP Overview
 
Radovan Janecek R E S Tor S O A Pv1
Radovan  Janecek    R E S Tor S O A Pv1Radovan  Janecek    R E S Tor S O A Pv1
Radovan Janecek R E S Tor S O A Pv1
 
To_Infinity_and_Beyond_Internet_Scale_Workloads_Data_Center_Design_v6
To_Infinity_and_Beyond_Internet_Scale_Workloads_Data_Center_Design_v6To_Infinity_and_Beyond_Internet_Scale_Workloads_Data_Center_Design_v6
To_Infinity_and_Beyond_Internet_Scale_Workloads_Data_Center_Design_v6
 

Destacado

2015 CIA Annual Meeting - Middle Market Gap
2015 CIA Annual Meeting - Middle Market Gap2015 CIA Annual Meeting - Middle Market Gap
2015 CIA Annual Meeting - Middle Market GapKevin Pledge
 
SOA Annual Meeting 2014 - EU Gender Neutral Pricing
SOA Annual Meeting 2014 - EU Gender Neutral PricingSOA Annual Meeting 2014 - EU Gender Neutral Pricing
SOA Annual Meeting 2014 - EU Gender Neutral PricingKevin Pledge
 
MOSS 2007 & Office 2007 Functionalities
MOSS 2007 & Office 2007 FunctionalitiesMOSS 2007 & Office 2007 Functionalities
MOSS 2007 & Office 2007 FunctionalitiesMark Ginnebaugh
 
Microsoft SQL Server Flash Storage
Microsoft SQL Server Flash StorageMicrosoft SQL Server Flash Storage
Microsoft SQL Server Flash StorageMark Ginnebaugh
 
Canadian Institute of Actuaries 2013 Annual Meeting - Social Media
Canadian Institute of Actuaries 2013 Annual Meeting - Social MediaCanadian Institute of Actuaries 2013 Annual Meeting - Social Media
Canadian Institute of Actuaries 2013 Annual Meeting - Social MediaKevin Pledge
 
PASS Summit 2009 Keynote Dave DeWitt
PASS Summit 2009 Keynote Dave DeWittPASS Summit 2009 Keynote Dave DeWitt
PASS Summit 2009 Keynote Dave DeWittMark Ginnebaugh
 
Microsoft SQL Server PASS News July 2010
Microsoft SQL Server PASS News July 2010Microsoft SQL Server PASS News July 2010
Microsoft SQL Server PASS News July 2010Mark Ginnebaugh
 
Actuarial Evidence Seminar 20120915
Actuarial Evidence Seminar 20120915Actuarial Evidence Seminar 20120915
Actuarial Evidence Seminar 20120915Kevin Pledge
 

Destacado (8)

2015 CIA Annual Meeting - Middle Market Gap
2015 CIA Annual Meeting - Middle Market Gap2015 CIA Annual Meeting - Middle Market Gap
2015 CIA Annual Meeting - Middle Market Gap
 
SOA Annual Meeting 2014 - EU Gender Neutral Pricing
SOA Annual Meeting 2014 - EU Gender Neutral PricingSOA Annual Meeting 2014 - EU Gender Neutral Pricing
SOA Annual Meeting 2014 - EU Gender Neutral Pricing
 
MOSS 2007 & Office 2007 Functionalities
MOSS 2007 & Office 2007 FunctionalitiesMOSS 2007 & Office 2007 Functionalities
MOSS 2007 & Office 2007 Functionalities
 
Microsoft SQL Server Flash Storage
Microsoft SQL Server Flash StorageMicrosoft SQL Server Flash Storage
Microsoft SQL Server Flash Storage
 
Canadian Institute of Actuaries 2013 Annual Meeting - Social Media
Canadian Institute of Actuaries 2013 Annual Meeting - Social MediaCanadian Institute of Actuaries 2013 Annual Meeting - Social Media
Canadian Institute of Actuaries 2013 Annual Meeting - Social Media
 
PASS Summit 2009 Keynote Dave DeWitt
PASS Summit 2009 Keynote Dave DeWittPASS Summit 2009 Keynote Dave DeWitt
PASS Summit 2009 Keynote Dave DeWitt
 
Microsoft SQL Server PASS News July 2010
Microsoft SQL Server PASS News July 2010Microsoft SQL Server PASS News July 2010
Microsoft SQL Server PASS News July 2010
 
Actuarial Evidence Seminar 20120915
Actuarial Evidence Seminar 20120915Actuarial Evidence Seminar 20120915
Actuarial Evidence Seminar 20120915
 

Similar a Introduction to Data Mining

SQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next LevelSQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next LevelMark Ginnebaugh
 
Zakipoint Introduction
Zakipoint IntroductionZakipoint Introduction
Zakipoint Introductionrameshkbudhani
 
2009/12 - Database Architechs - Presentation
2009/12 - Database Architechs - Presentation2009/12 - Database Architechs - Presentation
2009/12 - Database Architechs - PresentationDatabase Architechs
 
2009/12 Database Architechs Presentation
2009/12   Database Architechs Presentation2009/12   Database Architechs Presentation
2009/12 Database Architechs Presentationguest248edc
 
Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Usedmurph4
 
Metadata Use Cases
Metadata Use CasesMetadata Use Cases
Metadata Use Casesdmurph4
 
Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2David Linthicum
 
Improving Quality and Adoption: EIM SQL Server 2012
Improving Quality and Adoption: EIM SQL Server 2012Improving Quality and Adoption: EIM SQL Server 2012
Improving Quality and Adoption: EIM SQL Server 2012Perficient, Inc.
 
Rhapsody Technologies Introduction Deck 01 31 12
Rhapsody Technologies   Introduction Deck 01 31 12Rhapsody Technologies   Introduction Deck 01 31 12
Rhapsody Technologies Introduction Deck 01 31 12ebreger
 
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]Rhapsody Technologies, Inc.
 
Datameer Analytics Solution
Datameer Analytics SolutionDatameer Analytics Solution
Datameer Analytics Solutiontempledf
 
Dimension Data Cloud Demo
Dimension Data Cloud DemoDimension Data Cloud Demo
Dimension Data Cloud DemoKeao Caindec
 
Perfect Storm: HR in the Cloud
Perfect Storm: HR in the CloudPerfect Storm: HR in the Cloud
Perfect Storm: HR in the CloudStanton Jones
 
X duce corporate_overview
X duce corporate_overviewX duce corporate_overview
X duce corporate_overviewgcdelmar
 
Reed Technology Melbourne
Reed Technology MelbourneReed Technology Melbourne
Reed Technology MelbourneReedTechnology
 

Similar a Introduction to Data Mining (20)

SQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next LevelSQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next Level
 
Zakipoint Introduction
Zakipoint IntroductionZakipoint Introduction
Zakipoint Introduction
 
2009/12 - Database Architechs - Presentation
2009/12 - Database Architechs - Presentation2009/12 - Database Architechs - Presentation
2009/12 - Database Architechs - Presentation
 
2009/12 Database Architechs Presentation
2009/12   Database Architechs Presentation2009/12   Database Architechs Presentation
2009/12 Database Architechs Presentation
 
Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Use
 
Metadata Use Cases
Metadata Use CasesMetadata Use Cases
Metadata Use Cases
 
Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2
 
Improving Quality and Adoption: EIM SQL Server 2012
Improving Quality and Adoption: EIM SQL Server 2012Improving Quality and Adoption: EIM SQL Server 2012
Improving Quality and Adoption: EIM SQL Server 2012
 
Rhapsody Technologies Introduction Deck 01 31 12
Rhapsody Technologies   Introduction Deck 01 31 12Rhapsody Technologies   Introduction Deck 01 31 12
Rhapsody Technologies Introduction Deck 01 31 12
 
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
 
Acuma Introduction
Acuma IntroductionAcuma Introduction
Acuma Introduction
 
Datameer Analytics Solution
Datameer Analytics SolutionDatameer Analytics Solution
Datameer Analytics Solution
 
Dimension Data Cloud Demo
Dimension Data Cloud DemoDimension Data Cloud Demo
Dimension Data Cloud Demo
 
Sod Profile
Sod ProfileSod Profile
Sod Profile
 
Perfect Storm: HR in the Cloud
Perfect Storm: HR in the CloudPerfect Storm: HR in the Cloud
Perfect Storm: HR in the Cloud
 
Cogent overview
Cogent overviewCogent overview
Cogent overview
 
X duce corporate_overview
X duce corporate_overviewX duce corporate_overview
X duce corporate_overview
 
Reed Technology Melbourne
Reed Technology MelbourneReed Technology Melbourne
Reed Technology Melbourne
 
Reed Technology
Reed TechnologyReed Technology
Reed Technology
 
Reed Technology
Reed TechnologyReed Technology
Reed Technology
 

Más de Mark Ginnebaugh

Automating Microsoft Power BI Creations 2015
Automating Microsoft Power BI Creations 2015Automating Microsoft Power BI Creations 2015
Automating Microsoft Power BI Creations 2015Mark Ginnebaugh
 
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction Mark Ginnebaugh
 
Platfora - An Analytics Sandbox In A World Of Big Data
Platfora - An Analytics Sandbox In A World Of Big DataPlatfora - An Analytics Sandbox In A World Of Big Data
Platfora - An Analytics Sandbox In A World Of Big DataMark Ginnebaugh
 
Microsoft SQL Server Relational Databases and Primary Keys
Microsoft SQL Server Relational Databases and Primary KeysMicrosoft SQL Server Relational Databases and Primary Keys
Microsoft SQL Server Relational Databases and Primary KeysMark Ginnebaugh
 
DesignMind Microsoft Business Intelligence SQL Server
DesignMind Microsoft Business Intelligence SQL ServerDesignMind Microsoft Business Intelligence SQL Server
DesignMind Microsoft Business Intelligence SQL ServerMark Ginnebaugh
 
San Francisco Bay Area SQL Server July 2013 meetings
San Francisco Bay Area SQL Server July 2013 meetingsSan Francisco Bay Area SQL Server July 2013 meetings
San Francisco Bay Area SQL Server July 2013 meetingsMark Ginnebaugh
 
Silicon Valley SQL Server User Group June 2013
Silicon Valley SQL Server User Group June 2013Silicon Valley SQL Server User Group June 2013
Silicon Valley SQL Server User Group June 2013Mark Ginnebaugh
 
Microsoft SQL Server Continuous Integration
Microsoft SQL Server Continuous IntegrationMicrosoft SQL Server Continuous Integration
Microsoft SQL Server Continuous IntegrationMark Ginnebaugh
 
Hortonworks Big Data & Hadoop
Hortonworks Big Data & HadoopHortonworks Big Data & Hadoop
Hortonworks Big Data & HadoopMark Ginnebaugh
 
Microsoft SQL Server Physical Join Operators
Microsoft SQL Server Physical Join OperatorsMicrosoft SQL Server Physical Join Operators
Microsoft SQL Server Physical Join OperatorsMark Ginnebaugh
 
Microsoft PowerPivot & Power View in Excel 2013
Microsoft PowerPivot & Power View in Excel 2013Microsoft PowerPivot & Power View in Excel 2013
Microsoft PowerPivot & Power View in Excel 2013Mark Ginnebaugh
 
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball ApproachMicrosoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball ApproachMark Ginnebaugh
 
Fusion-io Memory Flash for Microsoft SQL Server 2012
Fusion-io Memory Flash for Microsoft SQL Server 2012Fusion-io Memory Flash for Microsoft SQL Server 2012
Fusion-io Memory Flash for Microsoft SQL Server 2012Mark Ginnebaugh
 
Microsoft SQL Server PASS News August 2012
Microsoft SQL Server PASS News August 2012Microsoft SQL Server PASS News August 2012
Microsoft SQL Server PASS News August 2012Mark Ginnebaugh
 
Business Intelligence Dashboard Design Best Practices
Business Intelligence Dashboard Design Best PracticesBusiness Intelligence Dashboard Design Best Practices
Business Intelligence Dashboard Design Best PracticesMark Ginnebaugh
 
Microsoft Mobile Business Intelligence
Microsoft Mobile Business Intelligence Microsoft Mobile Business Intelligence
Microsoft Mobile Business Intelligence Mark Ginnebaugh
 
Microsoft SQL Server 2012 Cloud Ready
Microsoft SQL Server 2012 Cloud ReadyMicrosoft SQL Server 2012 Cloud Ready
Microsoft SQL Server 2012 Cloud ReadyMark Ginnebaugh
 
Microsoft SQL Server 2012 Master Data Services
Microsoft SQL Server 2012 Master Data ServicesMicrosoft SQL Server 2012 Master Data Services
Microsoft SQL Server 2012 Master Data ServicesMark Ginnebaugh
 
Microsoft SQL Server PowerPivot
Microsoft SQL Server PowerPivotMicrosoft SQL Server PowerPivot
Microsoft SQL Server PowerPivotMark Ginnebaugh
 
Microsoft SQL Server Testing Frameworks
Microsoft SQL Server Testing FrameworksMicrosoft SQL Server Testing Frameworks
Microsoft SQL Server Testing FrameworksMark Ginnebaugh
 

Más de Mark Ginnebaugh (20)

Automating Microsoft Power BI Creations 2015
Automating Microsoft Power BI Creations 2015Automating Microsoft Power BI Creations 2015
Automating Microsoft Power BI Creations 2015
 
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
 
Platfora - An Analytics Sandbox In A World Of Big Data
Platfora - An Analytics Sandbox In A World Of Big DataPlatfora - An Analytics Sandbox In A World Of Big Data
Platfora - An Analytics Sandbox In A World Of Big Data
 
Microsoft SQL Server Relational Databases and Primary Keys
Microsoft SQL Server Relational Databases and Primary KeysMicrosoft SQL Server Relational Databases and Primary Keys
Microsoft SQL Server Relational Databases and Primary Keys
 
DesignMind Microsoft Business Intelligence SQL Server
DesignMind Microsoft Business Intelligence SQL ServerDesignMind Microsoft Business Intelligence SQL Server
DesignMind Microsoft Business Intelligence SQL Server
 
San Francisco Bay Area SQL Server July 2013 meetings
San Francisco Bay Area SQL Server July 2013 meetingsSan Francisco Bay Area SQL Server July 2013 meetings
San Francisco Bay Area SQL Server July 2013 meetings
 
Silicon Valley SQL Server User Group June 2013
Silicon Valley SQL Server User Group June 2013Silicon Valley SQL Server User Group June 2013
Silicon Valley SQL Server User Group June 2013
 
Microsoft SQL Server Continuous Integration
Microsoft SQL Server Continuous IntegrationMicrosoft SQL Server Continuous Integration
Microsoft SQL Server Continuous Integration
 
Hortonworks Big Data & Hadoop
Hortonworks Big Data & HadoopHortonworks Big Data & Hadoop
Hortonworks Big Data & Hadoop
 
Microsoft SQL Server Physical Join Operators
Microsoft SQL Server Physical Join OperatorsMicrosoft SQL Server Physical Join Operators
Microsoft SQL Server Physical Join Operators
 
Microsoft PowerPivot & Power View in Excel 2013
Microsoft PowerPivot & Power View in Excel 2013Microsoft PowerPivot & Power View in Excel 2013
Microsoft PowerPivot & Power View in Excel 2013
 
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball ApproachMicrosoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
 
Fusion-io Memory Flash for Microsoft SQL Server 2012
Fusion-io Memory Flash for Microsoft SQL Server 2012Fusion-io Memory Flash for Microsoft SQL Server 2012
Fusion-io Memory Flash for Microsoft SQL Server 2012
 
Microsoft SQL Server PASS News August 2012
Microsoft SQL Server PASS News August 2012Microsoft SQL Server PASS News August 2012
Microsoft SQL Server PASS News August 2012
 
Business Intelligence Dashboard Design Best Practices
Business Intelligence Dashboard Design Best PracticesBusiness Intelligence Dashboard Design Best Practices
Business Intelligence Dashboard Design Best Practices
 
Microsoft Mobile Business Intelligence
Microsoft Mobile Business Intelligence Microsoft Mobile Business Intelligence
Microsoft Mobile Business Intelligence
 
Microsoft SQL Server 2012 Cloud Ready
Microsoft SQL Server 2012 Cloud ReadyMicrosoft SQL Server 2012 Cloud Ready
Microsoft SQL Server 2012 Cloud Ready
 
Microsoft SQL Server 2012 Master Data Services
Microsoft SQL Server 2012 Master Data ServicesMicrosoft SQL Server 2012 Master Data Services
Microsoft SQL Server 2012 Master Data Services
 
Microsoft SQL Server PowerPivot
Microsoft SQL Server PowerPivotMicrosoft SQL Server PowerPivot
Microsoft SQL Server PowerPivot
 
Microsoft SQL Server Testing Frameworks
Microsoft SQL Server Testing FrameworksMicrosoft SQL Server Testing Frameworks
Microsoft SQL Server Testing Frameworks
 

Último

Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Último (20)

Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

Introduction to Data Mining

  • 1. An Introduction to SQL Server 2005 and 2008 Data Mining Peter Myers Mentor – Solid Quality Mentors San Francisco SQL Server User Group – 8 July, 2009 Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 2. WHO WE ARE • Industry experts: Growing, elite group of over 90 of the world’s best technical experts who, as reflected by the high concentration of Microsoft MVP’s and RD’s in our ranks, achieve excellence in their industry by maintaining the highest credentials. • Published authors: Best technical reference books, Microsoft reference materials, industry white papers, technical magazine articles, and webcasts. • Top technical speakers: PASS Community Summit, Microsoft TechEd, The Microsoft BI Conference, SQL Server DevConnections, countless user groups, international conferences and events. • For more information visit www.solidq.com Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 3. WHAT WE DO Provide advanced, world-class expertise across the entire Microsoft relational data and development platforms and complimenting technologies. PRACTICE AREAS SERVICES Relational Database Management Advanced, Public Training Business Intelligence Customized, Private Training Development Methodologies Solution Delivery & Tuning SharePoint Collaboration Enhanced, Mentoring Services For more information visit www.solidq.com Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 4. AGENDA • Introducing Data Mining • Describing the Data Mining Process • SQL Server™ 2008 Data Mining • Data Preparation • Data Mining Visualization • Demonstrations Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 5. INTRODUCING DATA MINING • Addresses the problem: “Too much data and not enough information” • Enables data exploration, pattern discovery, and pattern prediction—which lead to knowledge discovery • Forms a key part of a BI solution Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 6. BUSINESS SCENARIOS • Identifying responsive customers/unresponsive customers (also known as churn analysis) • Targeting promotions • Detecting and preventing fraud • Correcting data during ETL • Forecasting sales and inventory • Cross-selling Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 7. DESCRIBING THE DATA MINING PROCESS “Doing Data Mining” Business Data Understanding Understanding Data Preparation Data Deployment Modeling Evaluation “Putting Data Mining to Work” www.crisp-dm.org Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 8. DATA PREPARATION • Often significant amounts of effort are required to prepare data for mining: • Transforming for cleaning and reformatting • Isolating and flagging abnormal data • Appropriately substituting missing values • Discretizing continuous values into ranges • Normalizing values between 0 and 1 • Of course, having the required data to begin with is important: • When designing systems, give consideration to attributes that may be required as inputs for classification o For example, demographic data: Age, Gender, Region, etc Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 9. MODELING Design time Process time Query time Mining Model Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 10. MODELING Design time Process time Query time Mining Model Data Mining Engine Training Data Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 11. MODELING Design time Process time Query time Mining Model Data Mining Engine Predicted Data Data to Predict Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 12. MODEL VALIDATION • It is important that the model makes sense • Accuracy o Does it correlate and predict correctly? • Reliability o Does it work similarly for different test data? • Usefulness o Does it provide insight or only obvious trivialities? • Commonly a holdout set of data is used to test model accuracy Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 13. SQL SERVER™ 2008 DATA MINING • Hides the complexity of an advanced technology • Includes full suite of algorithms to automatically extract information from data • Handles large volumes of data and complex data • Data can be sourced from relational and OLAP databases • Uses standard programming interfaces: • XMLA • DMX • Delivers a complete framework for building and deploying intelligent applications Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 14. SQL SERVER™ 2008 ALGORITHMS • Microsoft Naïve Bayes • Quick and approachable algorithm • Used for classification • Microsoft Decision Trees • Popular data mining technique • Used for classification, regression and association • Microsoft Linear Regression • Finds the best possible straight line through a series of points • Used for prediction analysis Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 15. SQL SERVER™ 2008 ALGORITHMS • Microsoft Neural Network • More sophisticated than Decision Trees and Naïve Bayes, this algorithm can explore extremely complex scenarios • Used for classification and regression tasks • Microsoft Logistic Regression • A particular case of the Neural Network algorithm • Microsoft Clustering • Finds natural groupings inside data • Supports segmentation and anomaly detection tasks Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 16. SQL SERVER™ 2008 ALGORITHMS • Microsoft Sequence Clustering • Groups a sequence of discrete events into natural groups based on similarity • Microsoft Time Series • Used to predict future values from a time series • Has been improved in SQL Server 2008 to produce more accurate long-term forecasts • Microsoft Association Rules • Commonly supports market basket analysis to learn what products are purchased together Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 17. SQL SERVER™ 2008 ALGORITHMS Classify Estimate Cluster Forecast Associate • Decision • Decision • Clustering • Time Series • Association Trees Trees Rules • Logistic • Linear • Decision Regression Regression Trees • Naïve • Logistic Bayes Regression • Neural • Neural Networks Networks Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 18. DATA MINING VISUALIZATION • In contrast to OLTP and OLAP queries, data mining queries typically extract information that the user is not aware of • Appreciate that end users do not typically query data mining models directly • Visualizations can effectively present data discoveries • SQL Server™ 2008 provides algorithm-specific visualizations that can: • Test and explore models in BIDS • Be embedded into Web and Windows Forms applications • Developers can construct and plug-in custom data mining viewers Copyright © 2009, Solid Quality Mentors. All rights reserved.
  • 19. DEMONSTRATIONS 1. Introducing the Table Analysis Tools for Excel 2007 2. Creating, Training, Testing and Querying Mining Models with BIDS 3. Developing a Data Mining Report 4. Enhancing an E-Commerce Site with Market Basket Analysis Copyright © 2009, Solid Quality Mentors. All rights reserved. 2008,
  • 20. RESOURCES • www.microsoft.com/sql/technologies/dm • Links to technical resources, case studies, news, and reviews • www.sqlserverdatamining.com • Site designed and maintained by the SQL Server Data Mining team • Includes: Live samples, tutorials, webcasts, tips and tricks, and FAQ • Data Mining for SQL Server 2008, by ZhaoHui Tang and Jamie MacLennan Copyright © 2009, Solid Quality Mentors. All rights reserved.