Tutorial on Scikit-Learn that I gave at the SF Data Mining meetup on May 1st, 2017. A review of the major parts of the Scikit-Learn API and a quick coding exercise on the Iris dataset.
2. Catalit LLC
BEFORE WE START
Download and install:
MINICONDA PYTHON 2.7
from here:
https://conda.io/miniconda.html
3. Catalit LLC
IN THIS WORKSHOP
• Recognize problems & choose the right ML technique
• Load and manipulate data with Pandas
• Build classification model with Scikit-Learn
• Evaluate model performance with Scikit-Learn
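As a minimal sketch of the "load and manipulate data with Pandas" step, the Iris dataset can be pulled from Scikit-Learn's bundled datasets into a DataFrame. The variable names (`iris`, `df`) and the `species` column are illustrative assumptions, not taken from the workshop materials.

```python
import pandas as pd
from sklearn.datasets import load_iris

# Load the Iris dataset that ships with Scikit-Learn.
iris = load_iris()

# Put the four measurements into a DataFrame (one row per flower).
df = pd.DataFrame(iris.data, columns=iris.feature_names)

# Add a readable label column; "species" is a name chosen for this sketch.
df["species"] = [str(iris.target_names[i]) for i in iris.target]

# Quick look at the data and a simple per-class summary.
print(df.head())
print(df.groupby("species").mean())
```

The per-species means give a first hint of which features separate the classes before any model is built.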
30. Catalit LLC
CONFUSION MATRIX
• Accuracy: Overall, how often is it correct?
• (TP + TN) / total

                     Test Negative                    Test Positive
Condition Negative   TRUE NEGATIVE                    FALSE POSITIVE (Type I error)
Condition Positive   FALSE NEGATIVE (Type II error)   TRUE POSITIVE
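The accuracy formula can be checked on a small example. This is a sketch using made-up binary labels (`y_true` and `y_pred` are hypothetical data, not workshop results). Scikit-Learn's `confusion_matrix` puts true conditions in rows and predictions in columns, which matches the layout of the slide.

```python
from sklearn.metrics import accuracy_score, confusion_matrix

# Hypothetical labels for illustration: 0 = negative, 1 = positive.
y_true = [0, 0, 0, 0, 1, 1, 1, 1, 1, 0]
y_pred = [0, 0, 1, 0, 1, 1, 0, 1, 1, 0]

# For binary labels the matrix is [[TN, FP], [FN, TP]].
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()

# Accuracy = (TP + TN) / total
accuracy = (tp + tn) / float(tp + tn + fp + fn)

print("TN=%d FP=%d FN=%d TP=%d" % (tn, fp, fn, tp))
print("accuracy:", accuracy)
print("accuracy_score agrees:", accuracy_score(y_true, y_pred))
```

Here one false positive and one false negative out of ten samples give an accuracy of 0.8.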
31. Catalit LLC
TRAIN-TEST SPLIT
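[Diagram: all available data is split into training data and testing data. The training data is used to train the model; the testing data is used to measure the model's performance.]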
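A train-test split on the Iris dataset might look like the sketch below. The 70/30 split, the fixed `random_state`, and the choice of `KNeighborsClassifier` are assumptions made for illustration; the slides do not say which classifier or split ratio the exercise used. The `sklearn.model_selection` module requires Scikit-Learn 0.18 or later.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# All available data: 150 flowers, 4 features each.
X, y = load_iris(return_X_y=True)

# Hold out 30% for testing (an assumed ratio); stratify keeps
# the three classes balanced in both parts.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

# Train the model on the training data only.
model = KNeighborsClassifier(n_neighbors=5)
model.fit(X_train, y_train)

# Measure performance on data the model has never seen.
y_pred = model.predict(X_test)
test_accuracy = accuracy_score(y_test, y_pred)
print("test accuracy:", test_accuracy)
```

Scoring on the held-out portion rather than on the training data gives an estimate of how the model would do on new samples.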
38. Catalit LLC
THANK YOU
Data Weekends
Next Data Weekends Dates:
2-day Machine Learning: May 6-7
2-day Intro Deep Learning: May 20 - 21
2-day Advanced Deep Learning: Jun 3 - 4
2-day Intro Deep Learning: Jun 17 - 18