Presentation Title

Adventures in Segmentation: Using Applied Data Mining to Add Business Value
Drew Minkin

Agenda
  The Value Add of Data Mining
  Segmentation 101
  Segmentation Tools in Analysis Services
  Methodology for Segmentation Analysis
  Building Confidence in your Model

The Value Add of Data Mining

Value Add - What is Data Mining?
  Statistics for the Computer Age
  Evolution, not revolution, with traditional statistics
  Statistics enriched with the brute-force capabilities of modern computing
  Associated with industrial-sized data sets

Value Add - Data Mining in the BI Spectrum
  [Chart; recoverable labels: Reports (Static), Reports (Ad hoc), OLAP, Data Mining; axes: Relative Business Value, Business Knowledge; scale: Easy to Difficult; SQL-Server 2008]

Value Add – Data Mining and Democracy
  VoterVault
  From mid-1990s
  Massive get-out-the-vote drive for those expected to vote Republican
  Demzilla
  Names typically have 200 to 400 information items

Value Add – The Promise of Data Mining
  “The quiet statisticians have changed our world; not by discovering new facts or technical developments, but by changing the ways that we reason, experiment and form our opinions.”
  -- Ian Hacking

Value Add – Spheres of Influence

Value Add – Operational Benefits
  Improved efficiency
  Inventory management
  Risk management

Value Add – Strategic Benefits
  The Bottom Line
  Increased agility
  Brand building
  Differentiate message
  “Relationship” building

Value Add – Tactical Benefits
  Reduction of costs
  Transactional leakage
  Outlier analysis

Value Add - Customer Attrition Analysis
  Identify a group of customers who are expected to attrite
  Conduct marketing campaigns to change their behavior in the desired direction and reduce the attrition rate

Value Add - Target Result
  Slow attriters: customers who slowly pay down their outstanding balance until they become inactive
  Fast attriters: customers who quickly pay down their balance and then either let the account lapse or close it by phone call or in writing
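As a rough illustration of the two target groups, the sketch below labels an account from its month-end balances. The function name, the input layout, and the three-month cutoff between "fast" and "slow" are all hypothetical; the slides do not define a threshold.

```python
def attrition_type(balances, fast_months=3):
    """Label an account from month-end balances, oldest first.

    ASSUMPTION: `fast_months` is an invented cutoff, not taken from the deck.
    """
    # An account that still carries a balance has not attrited.
    if not balances or balances[-1] > 0:
        return "active"
    # Number of months it took for the balance to first reach zero.
    months_to_zero = next(i for i, b in enumerate(balances) if b <= 0)
    return "fast attriter" if months_to_zero <= fast_months else "slow attriter"


print(attrition_type([900, 400, 0]))
print(attrition_type([900, 800, 700, 600, 500, 400, 300, 200, 100, 0]))
```

In practice the label would be the "target" field that a model is trained to predict before the balance reaches zero.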

Value Add - Sample Applications
  Credit models
  Retention models
  Elasticity models
  Cross-sell models
  Lifetime Value models
  Agent/agency monitoring
  Target marketing
  Fraud detection

Segmentation – Machine Learning
  Unsupervised learning
    Associations and patterns among many entities
    No target information
    Market basket analysis (“diapers and beer”)
  Supervised learning
    Predict the value of a target variable
    Well-defined predictive variables
    Credit / non-credit scoring engines
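The market basket idea can be sketched in a few lines: count how often pairs of items appear together, then report support (share of baskets containing the pair) and confidence (share of baskets with the first item that also contain the second). This is a minimal pair-counting sketch, not the Association Rules algorithm in Analysis Services; the baskets and the `min_support` value are invented for illustration.

```python
from collections import Counter
from itertools import combinations


def pair_rules(baskets, min_support=0.4):
    """Return (antecedent, consequent, support, confidence) for frequent pairs."""
    n = len(baskets)
    item_counts = Counter()
    pair_counts = Counter()
    for basket in baskets:
        items = sorted(set(basket))          # ignore duplicates within a basket
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))
    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n
        if support < min_support:
            continue                          # pair is too rare to report
        rules.append((a, b, support, count / item_counts[a]))
        rules.append((b, a, support, count / item_counts[b]))
    # Highest confidence first; names break ties so the order is stable.
    return sorted(rules, key=lambda r: (-r[3], r[0], r[1]))


# Hypothetical baskets, invented for illustration.
BASKETS = [
    ["diapers", "beer", "milk"],
    ["diapers", "beer"],
    ["diapers", "bread"],
    ["beer", "chips"],
    ["milk", "bread"],
]

for antecedent, consequent, support, confidence in pair_rules(BASKETS):
    print(f"{antecedent} -> {consequent}: support={support:.2f}, confidence={confidence:.2f}")
```

No target field is supplied anywhere, which is what makes this an unsupervised technique.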

Segmentation – Sample Data Sources
  Data Warehouse: Credit Card Data Warehouse containing about 200 product-specific fields
  Third Party Data: a set of account-related demographic and credit bureau information
  Segmentation files: set of account-related segmentation values based on our client's segmentation scheme, which combines Risk, Profitability and External potential
  Payment Database: database that stores all checks processed; it can categorize the source of checks

Methodology for Segmentation Analysis

Methodology – Distribution of Effort

Methodology – Segmentation Lifecycle

Methodology – Acquiring Raw Data
  Research/evaluate possible data sources
    Availability
    Hit rate
    Implementability
    Cost-effectiveness
  Extract/purchase data
  Check data for quality (QA)
  At this stage, data is still in a “raw” form
    Often start with voluminous transactional data
    Much of the data mining process is “messy”

Methodology – Goals of Refinement
  Reflects data changes over time
  Recognizes and removes statistically insignificant fields
  Defines and introduces the "target" field
  Allows for second-stage preprocessing and statistical analysis

Methodology - Scoring Engines
  Scoring engine
    Formula that classifies or separates policies (or risks, accounts, agents…) into
      Profitable vs. unprofitable
      Retaining vs. non-retaining…
  (Non-)linear equation f() of several predictive variables
  Produces continuous range of scores
    score = f(X1, X2, …, XN)
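One possible shape for score = f(X1, X2, …, XN) is a logistic function of a weighted sum, which yields a continuous score between 0 and 1 that a cutoff then turns into a class. The sketch below assumes that form; the field names, weights, intercept, and cutoff are hypothetical and not from the deck.

```python
import math


def score(features, weights, intercept):
    """Logistic score in (0, 1): one possible non-linear f(X1, ..., XN)."""
    z = intercept + sum(weights[name] * value for name, value in features.items())
    return 1.0 / (1.0 + math.exp(-z))


# ASSUMPTION: hypothetical predictive variables and weights, for illustration only.
WEIGHTS = {"tenure_years": 0.3, "late_payments": -0.8}
INTERCEPT = -0.5


def classify(features, cutoff=0.5):
    """Separate accounts into retaining vs. non-retaining using the score."""
    return "retaining" if score(features, WEIGHTS, INTERCEPT) >= cutoff else "non-retaining"


print(classify({"tenure_years": 5, "late_payments": 0}))
print(classify({"tenure_years": 1, "late_payments": 2}))
```

Because the score is continuous, the cutoff can be moved to trade off how many accounts are flagged against how confident each flag is.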

Methodology – Deployed Model
  [Diagram; recoverable labels: Training Data, Data To Predict, DB data, Client data, Application log, “Just one row”, New Entry, New Txion, DM Engine, Mining Model, Predicted Data]

Methodology - Testing
  Randomly divide data into 3 pieces
    Training data
    Test data
    Validation data
  Use Training data to fit models
  Score the Test data to create a lift curve
    Perform the train/test steps iteratively until you have a model you’re happy with
    During this iterative phase, validation data is set aside in a “lock box”
  Score the Validation data and produce a lift curve
    Unbiased estimate of future performance
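The testing steps above can be sketched as a random three-way split plus a simple lift calculation. The 60/20/20 proportions, the seed, and the five-bin lift curve are assumptions chosen for illustration; the slides do not specify them.

```python
import random


def three_way_split(rows, seed=0, fractions=(0.6, 0.2)):
    """Shuffle rows and cut them into training, test, and validation pieces.

    ASSUMPTION: 60% training, 20% test, remainder validation.
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * fractions[0])
    n_test = round(len(shuffled) * fractions[1])
    train = shuffled[:n_train]
    test = shuffled[n_train:n_train + n_test]
    validation = shuffled[n_train + n_test:]   # the "lock box"
    return train, test, validation


def lift_curve(scores, outcomes, bins=5):
    """Lift per score-ranked bin: bin response rate / overall response rate.

    Assumes at least one positive outcome; rows beyond a whole number
    of bins are ignored.
    """
    ranked = [y for _, y in sorted(zip(scores, outcomes), key=lambda p: -p[0])]
    overall = sum(ranked) / len(ranked)
    size = len(ranked) // bins
    return [sum(ranked[i * size:(i + 1) * size]) / size / overall for i in range(bins)]


train, test, validation = three_way_split(range(100))
print(len(train), len(test), len(validation))
```

A model that ranks well shows lift well above 1 in the first bins and below 1 in the last ones; the validation piece is scored only once, after the train/test iterations are finished.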

Methodology - Multivariate Analysis
  Examine correlations among the variables
  Weed out redundant, weak, poorly distributed variables
  Model design
  Build candidate models
    Regression/GLM
    Decision Trees/MARS
    Neural Networks
  Select final model
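A minimal sketch of the first two steps, assuming Pearson correlation as the measure and a 0.95 cutoff for "redundant" (both are assumptions; the deck names neither). Column names and values are invented, and every column is assumed to vary, since a constant column has no defined correlation.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def drop_redundant(columns, threshold=0.95):
    """Keep a variable only if it is not highly correlated with one already kept."""
    kept = []
    for name in columns:
        if all(abs(pearson(columns[name], columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical columns: the second is an exact multiple of the first.
COLUMNS = {
    "balance": [1, 2, 3, 4, 5],
    "balance_x2": [2, 4, 6, 8, 10],
    "age": [30, 25, 40, 22, 35],
}
print(drop_redundant(COLUMNS))
```

The order of the columns decides which member of a correlated pair survives, so it is worth listing the more interpretable variable first.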

Segmentation Tools in Analysis Services

Data Mining - Algorithm Matrix
  Tasks: Segmentation, Advanced Data Exploration, Classification, Forecasting, Association, Text Analysis, Estimation
  Algorithms: Association Rules, Clustering, Decision Trees, Linear Regression, Logistic Regression, Naïve Bayes, Neural Nets, Sequence Clustering, Time Series
  [Matrix cells not recoverable]

Data Mining - SQL-Server Algorithms
  Decision Trees
  Time Series
  Neural Net
  Clustering
  Sequence Clustering
  Association
  Naïve Bayes
  Linear and Logistic Regression

Data Mining - Blueprint for Toolset
  Offline and online modes
    Everything you do stays on the server
    Offline requires server admin privileges to deploy
  Define Mining Structure and Models
  Train (process) the Structures
  Regularly update and re-validate the Model

Data Mining - Cross-Validation
  SQL Server 2008
  X iterations of retraining and retesting the model
  Results from each test statistically collated
  Model deemed accurate (and perhaps reliable) when variance is low and results meet expectations
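The idea of X iterations of retraining and retesting, with the results collated, can be sketched generically as k-fold cross-validation. This is not the SQL Server 2008 implementation; the fold layout, the majority-class "model", and the accuracy measure are stand-ins chosen to keep the example short.

```python
import statistics


def cross_validate(rows, k, fit, evaluate):
    """Retrain and retest k times; return per-fold scores, their mean and variance."""
    folds = [rows[i::k] for i in range(k)]        # every k-th row goes to the same fold
    scores = []
    for i in range(k):
        test = folds[i]
        train = [r for j, fold in enumerate(folds) if j != i for r in fold]
        model = fit(train)
        scores.append(evaluate(model, test))
    return scores, statistics.mean(scores), statistics.pvariance(scores)


def fit_majority(train):
    """Stand-in model: always predict the most common label in the training rows."""
    labels = [label for _, label in train]
    return max(set(labels), key=labels.count)


def accuracy(model, test):
    """Share of test rows whose label matches the model's single prediction."""
    return sum(1 for _, label in test if label == model) / len(test)


# Hypothetical data: (id, label) pairs where one row in five has label 0.
ROWS = [(i, 1 if i % 5 else 0) for i in range(20)]
scores, mean_score, variance = cross_validate(ROWS, 4, fit_majority, accuracy)
print(scores, mean_score, variance)
```

Low variance across folds suggests the model behaves consistently on different slices of the data; a high mean with high variance is a warning sign rather than a success.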

Data Mining - Microsoft Decision Trees
  Use for:
    Classification: churn and risk analysis
    Regression: predict profit or income
    Association analysis based on multiple predictable variables
  Builds one tree for each predictable attribute
  Fast

Data Mining - Decision Tree Parameters
  COMPLEXITY_PENALTY
  FORCE_REGRESSOR
  MAXIMUM_INPUT_ATTRIBUTES
  MAXIMUM_OUTPUT_ATTRIBUTES
  MINIMUM_SUPPORT
  SCORE_METHOD
  SPLIT_METHOD

Data Mining - Microsoft Naïve Bayes
  Use for:
    Classification
    Association with multiple predictable attributes
  Assumes all inputs are independent
  Simple classification technique based on conditional probability
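To make "conditional probability with independent inputs" concrete, here is a small categorical Naïve Bayes written from scratch. It is a textbook sketch with add-one (Laplace) smoothing, not the Microsoft implementation; the attribute names and training rows are invented.

```python
from collections import Counter, defaultdict


def train_naive_bayes(rows, target):
    """Count classes and per-class attribute values from a list of dict rows."""
    class_counts = Counter(row[target] for row in rows)
    value_counts = defaultdict(Counter)   # (attribute, class) -> value counts
    values_seen = defaultdict(set)        # attribute -> distinct values
    for row in rows:
        for attr, value in row.items():
            if attr != target:
                value_counts[(attr, row[target])][value] += 1
                values_seen[attr].add(value)
    return class_counts, value_counts, values_seen


def predict_proba(model, case):
    """P(class | case), multiplying per-attribute likelihoods as if independent."""
    class_counts, value_counts, values_seen = model
    total = sum(class_counts.values())
    joint = {}
    for cls, n_cls in class_counts.items():
        p = n_cls / total                                  # prior P(class)
        for attr, value in case.items():
            seen = value_counts[(attr, cls)][value]
            # Add-one smoothing so an unseen value does not zero the product.
            p *= (seen + 1) / (n_cls + len(values_seen[attr]))
        joint[cls] = p
    norm = sum(joint.values())
    return {cls: p / norm for cls, p in joint.items()}


# Hypothetical training rows.
ROWS_NB = [
    {"owns_home": "yes", "region": "north", "churn": "no"},
    {"owns_home": "yes", "region": "south", "churn": "no"},
    {"owns_home": "no", "region": "north", "churn": "yes"},
    {"owns_home": "no", "region": "south", "churn": "yes"},
    {"owns_home": "yes", "region": "north", "churn": "no"},
]
MODEL_NB = train_naive_bayes(ROWS_NB, "churn")
print(predict_proba(MODEL_NB, {"owns_home": "no", "region": "north"}))
```

The independence assumption is rarely true in customer data, but the resulting ranking of cases is often still useful, which is why the technique is a common first model.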

Data Mining - Naïve Bayes Parameters
  MAXIMUM_INPUT_ATTRIBUTES
  MAXIMUM_OUTPUT_ATTRIBUTES
  MAXIMUM_STATES
  MINIMUM_DEPENDENCY_PROBABILITY

Data Mining - Clustering
  Applied to:
    Segmentation: customer grouping, mailing campaign
    Also: classification and regression
    Anomaly detection
  Discrete and continuous
  Note: “Predict Only” attributes not used for clustering

Data Mining - Clustering Parameters
  CLUSTER_COUNT
  CLUSTER_SEED
  CLUSTERING_METHOD
  MAXIMUM_INPUT_ATTRIBUTES
  MAXIMUM_STATES
  MINIMUM_SUPPORT
  MODELLING_CARDINALITY
  SAMPLE_SIZE
  STOPPING_TOLERANCE
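To give a feel for what a cluster count, a seed, and a stopping rule control, here is a plain k-means sketch. It is a generic textbook version, not the Microsoft Clustering algorithm; the arguments `k`, `seed`, and `tol` are only loosely analogous to CLUSTER_COUNT, CLUSTER_SEED, and STOPPING_TOLERANCE, whose exact semantics in Analysis Services are not covered here. The sample points are invented.

```python
import random


def k_means(points, k, seed=0, tol=1e-6, max_iter=100):
    """Group points (tuples of floats) into k clusters; return (centroids, clusters)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)             # seed fixes the starting centroids
    clusters = []
    for _ in range(max_iter):
        clusters = [[] for _ in range(k)]
        for p in points:
            # Assign each point to the centroid with the smallest squared distance.
            nearest = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            clusters[nearest].append(p)
        # Move each centroid to the mean of its members (keep it if the cluster is empty).
        new_centroids = [
            tuple(sum(dim) / len(members) for dim in zip(*members)) if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
        shift = max(
            sum((a - b) ** 2 for a, b in zip(old, new))
            for old, new in zip(centroids, new_centroids)
        )
        centroids = new_centroids
        if shift <= tol:                          # centroids stopped moving
            break
    return centroids, clusters


# Hypothetical one-dimensional "spend" values forming two obvious groups.
POINTS = [(1.0,), (2.0,), (3.0,), (10.0,), (11.0,), (12.0,)]
centroids, clusters = k_means(POINTS, k=2)
print(sorted(centroids))
```

Different seeds can produce different segments on less tidy data, which is one reason to rerun and compare clusterings before naming the segments.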

Data Mining - Neural Network
  Applied to:
    Classification
    Regression
  Great for finding complicated relationships among attributes
  Difficult to interpret results
  Gradient Descent method
  [Diagram; recoverable labels: Input Layer (Age, Education, Sex, Income), Hidden Layers, Output Layer (Loyalty)]

Data Mining - Neural Network Parameters
  HIDDEN_NODE_RATIO
  HOLDOUT_PERCENTAGE
  HOLDOUT_SEED
  MAXIMUM_INPUT_ATTRIBUTES
  MAXIMUM_OUTPUT_ATTRIBUTES
  MAXIMUM_STATES
  SAMPLE_SIZE

Data Mining - Sequence Clustering
  Analysis of:
    Customer behaviour
    Transaction patterns
    Click stream
  Customer segmentation
  Sequence prediction
  Mix of clustering and sequence technologies
  Groups individuals based on their profiles, including sequence data

Data Mining - What is a Sequence?
  To discover the most likely beginnings, paths, and ends of a customer’s journey through our domain, consider using:
    Association Rules
    Sequence Clustering
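Beginnings, paths, and ends can be illustrated by simply counting first states, state-to-state transitions, and last states across sessions. This counting sketch is not the Sequence Clustering algorithm itself, which also groups similar sequences; the page names and sessions are invented.

```python
from collections import Counter


def summarize_sequences(sequences):
    """Count where journeys start, which steps follow which, and where they end."""
    starts = Counter(seq[0] for seq in sequences if seq)
    ends = Counter(seq[-1] for seq in sequences if seq)
    transitions = Counter()
    for seq in sequences:
        transitions.update(zip(seq, seq[1:]))     # consecutive (current, next) pairs
    return starts, transitions, ends


def most_likely_next(transitions, state):
    """Most frequently observed successor of `state`, or None if it was never left."""
    candidates = {nxt: n for (cur, nxt), n in transitions.items() if cur == state}
    return max(candidates, key=candidates.get) if candidates else None


# Hypothetical click-stream sessions.
SESSIONS = [
    ["home", "search", "product", "checkout"],
    ["home", "product", "checkout"],
    ["search", "product", "exit"],
]
starts, transitions, ends = summarize_sequences(SESSIONS)
print(starts.most_common(1), most_likely_next(transitions, "product"), ends.most_common(1))
```

The transition counts are the raw material for sequence prediction: given the current step, the most frequent successor is the simplest possible forecast.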

Data Mining – Minor Introduction to DMX
  Your “if” statement will test the value returned from a prediction, typically the predicted probability or outcome
  Steps:
    1. Build a case (set of attributes) representing the transaction you are processing at the moment
       E.g. shopping basket of a customer plus their shipping info
    2. Execute a “SELECT ... PREDICTION JOIN” on the pre-loaded mining model
    3. Read the returned attributes, especially the case probability for some outcome
       E.g. probability > 50% that “TransactionOutcome=ShippingDeliveryFailure”
  Your application has just made an intelligent decision!
  Remember to refresh and retest the model regularly (daily?)
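The application-side flow described above can be sketched as follows. The prediction call is deliberately a stub: in a real deployment it would run the “SELECT ... PREDICTION JOIN” query against the mining model, which is not shown here. The function names, the case layout, the stub's rule, and the returned actions are all hypothetical.

```python
def decide(case, predict_probability, threshold=0.5):
    """Apply the "if" test to the probability returned by a prediction."""
    p_failure = predict_probability(case, "TransactionOutcome", "ShippingDeliveryFailure")
    if p_failure > threshold:
        return "hold_for_review"
    return "ship"


def stub_predict(case, attribute, outcome):
    """ASSUMPTION: stand-in for the model query; returns a made-up probability."""
    mismatch = case["shipping"]["country"] != case["billing_country"]
    return 0.8 if mismatch else 0.1


# Step 1: build a case for the transaction being processed right now.
case = {
    "basket": ["book", "headphones"],
    "shipping": {"country": "FR"},
    "billing_country": "US",
}
# Steps 2 and 3: get the predicted probability and act on it.
print(decide(case, stub_predict))
```

Passing the prediction function in as an argument keeps the decision logic testable without a server, and makes it easy to swap in the real model call later.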

Data Mining - Sequence Clustering Parameters
  CLUSTER_COUNT
  MAXIMUM_SEQUENCE_STATES
  MAXIMUM_STATES
  MINIMUM_SUPPORT

Data Mining – Detailed Workflow

Data Mining – Detailed Mining Model

Data Mining – Detailed Mining Model
