SlideShare una empresa de Scribd logo
1 de 29
What we got from Red Hat competition
By Umaporn Kerdsaeng
DSTO Knowledge Sharing : 10/27/2016
Topics:
1. Overview: Red Hat Competition
2. Introduction to Decision Tree
3. R package for Decision Tree (xgboost)
Overview: Red Hat Competition
Red Had Competition
What is ROC?
• ROC : receiver operating characteristic
• The ROC curve was first developed by electrical engineers and radar
engineers during World War II for detecting enemy objects in battlefields.
• ROC curve is a graphical plot that illustrates the performance of a binary
classifier system as its discrimination threshold is varied.
• The curve is created by plotting the true positive rate (TPR) against the false
positive rate (FPR) at various threshold settings.
https://en.wikipedia.org/wiki/Receiver_operating_characteristic
Sensitivity and Specificity
https://www.youtube.com/watch?v=Z5TtopYX1Gc
• True Positive (tp) – Detection
• False Positive (fp) – False alarm
• True Negative (tn)
• False Negative (fn)
• Sensitivity = Probability of Detection
• Specificity = Probability of True Negative
• 1-Specificity = Probability of False alarm
Actual outcome distribution
AUC = 0.991725
receiver operating characteristic (ROC)
https://www.youtube.com/watch?v=gYIlKUP2hk0
the ROC curve can be generated by
plotting the cumulative distribution
function of the detection probability
in the y-axis versus the cumulative
distribution function of the false-
alarm probability in x-axis.
ROC Curve
• https://www.youtube.com/watch?v=OAl6eAyP-yo
Bad Good
https://www.youtube.com/watch?v=DiFL-i_zsFg
Red Hat Data:
Introduction to Decision Tree
Introduction to Decision Tree:
https://www.youtube.com/watch?v=eKD5gxPPeY0
Introduction to Decision Tree:
https://www.youtube.com/watch?v=eKD5gxPPeY0
Introduction to Decision Tree:
https://www.youtube.com/watch?v=eKD5gxPPeY0
Introduction to Decision Tree:
https://www.youtube.com/watch?v=eKD5gxPPeY0
Introduction to Decision Tree:
https://www.youtube.com/watch?v=eKD5gxPPeY0
Introduction to Decision Tree:
https://www.youtube.com/watch?v=eKD5gxPPeY0
Introduction to Decision Tree:
https://www.youtube.com/watch?v=AmCV4g7_-QM
Introduction to Decision Tree:
https://www.youtube.com/watch?v=AmCV4g7_-QM
Introduction to Decision Tree:
https://www.youtube.com/watch?v=nodQ2s0CUbI
Introduction to Decision Tree:
https://www.youtube.com/watch?v=AmCV4g7_-QM
Count All 14 5 4 5
Count Yes 9 2 4 3
Count No 5 3 0 2
P+ 0.64 0.40 1 0.60
P- 0.36 0.60 0 0.40
(P+)(log(P+,2) -0.41 -0.53 0 -0.44
(P-)(log(P-,2) -0.53 -0.44 #NUM! -0.53
H(S) 0.94 0.97 0 0.97
weigth 0.36 0.29 0.36
0.94 0.35 0.00 0.35
Gain(S,wind) 0.25
Count All 14 8 6
Count Yes 9 6 3
Count No 5 2 3
P+ 0.64 0.75 0.50
P- 0.36 0.25 0.50
(P+)(log(P+,2) -0.41 -0.31 -0.50
(P-)(log(P-,2) -0.53 -0.50 -0.50
H(S) 0.94 0.81 1.00
weigth 0.57 0.43
0.94 0.46 0.43
Gain(S,wind) 0.0481
https://www.youtube.com/watch?v=Q4NVG1IHQOU
Introduction to Decision Tree:
A Visual Introduction to Machine Learning
http://www.r2d3.us/visual-intro-to-machine-learning-part-1/
R package for Decision Tree
(xgboost)
XGBoost: Extreme Gradient Boosting
• An optimized distributed gradient boosting library
• XGBoost only works with numeric vectors. you need to convert all
other forms of data into numeric vectors.
• XGBoost provides a convenient function to do cross (an important
method to measure the model’s prediction power).
• XGBoost can handle missing values in the data
XGBoost: Extreme Gradient Boosting
https://www.youtube.com/watch?v=ufHo8vbk6g4
http://blog.nycdatascience.com/faculty/kaggle-winning-solution-xgboost-algorithm-let-us-learn-from-its-author-3/
The minimum information we need to provide is
XGBoost: Extreme Gradient Boosting
• Step 1 Load all the libraries
• Step 2 Load the dataset
• Step 4 Tune and Run the model
• Step 3 Data Cleaning & Feature Engineering
• Step 5 Score the Test Population
https://www.analyticsvidhya.com/blog/2016/01/xgboost-algorithm-easy-steps/
จิปาถะ
• เรียนรู้เรื่องเดิมๆ ซ้ำๆ รอบหลังๆ จะเข้ำใจมำกขึ้น
• English Knowledge Source
• ไอเดียจะมำแบบไม่เป็นระเบียบ แต่เรำต้องจัดระเบียบควำมคิดและกำรทำงำน
• ลองผิดลองถูกและเรียนรู้ไปพร้อมๆ กัน ต้องลงมือทำ
• จดทุกอย่ำงที่ทำ (พำยเรือวนในอ่ำง)

Más contenido relacionado

La actualidad más candente

hadoop introduce
hadoop introducehadoop introduce
hadoop introducejustlooks
 
Coq for ML users
Coq for ML usersCoq for ML users
Coq for ML userstmiya
 
Ece512 h1 20139_621386735458ece512_test2_solutions
Ece512 h1 20139_621386735458ece512_test2_solutionsEce512 h1 20139_621386735458ece512_test2_solutions
Ece512 h1 20139_621386735458ece512_test2_solutionsnadia abd
 
Implement a modified algorithm PF in a FPGA
Implement a modified algorithm PF in a FPGAImplement a modified algorithm PF in a FPGA
Implement a modified algorithm PF in a FPGABruno Martínez Bargiela
 
Logistic Regression in R-An Exmple.
Logistic Regression in R-An Exmple. Logistic Regression in R-An Exmple.
Logistic Regression in R-An Exmple. Dr. Volkan OBAN
 
แผนการจัดการเรียนรู้ที่ ๓
แผนการจัดการเรียนรู้ที่  ๓แผนการจัดการเรียนรู้ที่  ๓
แผนการจัดการเรียนรู้ที่ ๓mathawee wattana
 
Tim lucas-id2ox
Tim lucas-id2oxTim lucas-id2ox
Tim lucas-id2oxTim Lucas
 
The FE-I4 Pixel Readout System-on-Chip for ATLAS Experiment Upgrades
The FE-I4 Pixel Readout System-on-Chip  for ATLAS Experiment UpgradesThe FE-I4 Pixel Readout System-on-Chip  for ATLAS Experiment Upgrades
The FE-I4 Pixel Readout System-on-Chip for ATLAS Experiment Upgradesthemperek
 
Track Finding in LHCb's 2020 Trigger
Track Finding in LHCb's 2020 TriggerTrack Finding in LHCb's 2020 Trigger
Track Finding in LHCb's 2020 TriggerTimothy Head
 
Loader and Tester Swarming Drones for Cellular Phone Network Loading and Fiel...
Loader and Tester Swarming Drones for Cellular Phone Network Loading and Fiel...Loader and Tester Swarming Drones for Cellular Phone Network Loading and Fiel...
Loader and Tester Swarming Drones for Cellular Phone Network Loading and Fiel...Amir MirzaeiNia
 
Quantum Private Comparison based on Entanglement Swapping among EPR Pairs
Quantum Private Comparison based on Entanglement Swapping among EPR PairsQuantum Private Comparison based on Entanglement Swapping among EPR Pairs
Quantum Private Comparison based on Entanglement Swapping among EPR PairsShih-Hung Kao
 
All Pairs-Shortest Path (Fast Floyd-Warshall) Code
All Pairs-Shortest Path (Fast Floyd-Warshall) Code All Pairs-Shortest Path (Fast Floyd-Warshall) Code
All Pairs-Shortest Path (Fast Floyd-Warshall) Code Ehsan Sharifi
 
Wavelet Lifting on Application Specific Vector Processor
Wavelet Lifting on Application Specific Vector ProcessorWavelet Lifting on Application Specific Vector Processor
Wavelet Lifting on Application Specific Vector ProcessorDavid Bařina
 
Slids of intellegent assignment
Slids of intellegent assignmentSlids of intellegent assignment
Slids of intellegent assignmentengrasha
 
LetSwift 2017 - ARKit
LetSwift 2017 - ARKitLetSwift 2017 - ARKit
LetSwift 2017 - ARKitBongwon Lee
 
Cryptography : From Demaratus to RSA
Cryptography : From Demaratus to RSACryptography : From Demaratus to RSA
Cryptography : From Demaratus to RSAbenlamm
 

La actualidad más candente (20)

Av
AvAv
Av
 
hadoop introduce
hadoop introducehadoop introduce
hadoop introduce
 
Coq for ML users
Coq for ML usersCoq for ML users
Coq for ML users
 
Ece512 h1 20139_621386735458ece512_test2_solutions
Ece512 h1 20139_621386735458ece512_test2_solutionsEce512 h1 20139_621386735458ece512_test2_solutions
Ece512 h1 20139_621386735458ece512_test2_solutions
 
Implement a modified algorithm PF in a FPGA
Implement a modified algorithm PF in a FPGAImplement a modified algorithm PF in a FPGA
Implement a modified algorithm PF in a FPGA
 
Logistic Regression in R-An Exmple.
Logistic Regression in R-An Exmple. Logistic Regression in R-An Exmple.
Logistic Regression in R-An Exmple.
 
แผนการจัดการเรียนรู้ที่ ๓
แผนการจัดการเรียนรู้ที่  ๓แผนการจัดการเรียนรู้ที่  ๓
แผนการจัดการเรียนรู้ที่ ๓
 
Tim lucas-id2ox
Tim lucas-id2oxTim lucas-id2ox
Tim lucas-id2ox
 
The FE-I4 Pixel Readout System-on-Chip for ATLAS Experiment Upgrades
The FE-I4 Pixel Readout System-on-Chip  for ATLAS Experiment UpgradesThe FE-I4 Pixel Readout System-on-Chip  for ATLAS Experiment Upgrades
The FE-I4 Pixel Readout System-on-Chip for ATLAS Experiment Upgrades
 
RM FUNCIONAL
RM FUNCIONALRM FUNCIONAL
RM FUNCIONAL
 
Track Finding in LHCb's 2020 Trigger
Track Finding in LHCb's 2020 TriggerTrack Finding in LHCb's 2020 Trigger
Track Finding in LHCb's 2020 Trigger
 
Loader and Tester Swarming Drones for Cellular Phone Network Loading and Fiel...
Loader and Tester Swarming Drones for Cellular Phone Network Loading and Fiel...Loader and Tester Swarming Drones for Cellular Phone Network Loading and Fiel...
Loader and Tester Swarming Drones for Cellular Phone Network Loading and Fiel...
 
polar-cap-power
polar-cap-powerpolar-cap-power
polar-cap-power
 
Quantum Private Comparison based on Entanglement Swapping among EPR Pairs
Quantum Private Comparison based on Entanglement Swapping among EPR PairsQuantum Private Comparison based on Entanglement Swapping among EPR Pairs
Quantum Private Comparison based on Entanglement Swapping among EPR Pairs
 
All Pairs-Shortest Path (Fast Floyd-Warshall) Code
All Pairs-Shortest Path (Fast Floyd-Warshall) Code All Pairs-Shortest Path (Fast Floyd-Warshall) Code
All Pairs-Shortest Path (Fast Floyd-Warshall) Code
 
Wavelet Lifting on Application Specific Vector Processor
Wavelet Lifting on Application Specific Vector ProcessorWavelet Lifting on Application Specific Vector Processor
Wavelet Lifting on Application Specific Vector Processor
 
Img 0003
Img 0003Img 0003
Img 0003
 
Slids of intellegent assignment
Slids of intellegent assignmentSlids of intellegent assignment
Slids of intellegent assignment
 
LetSwift 2017 - ARKit
LetSwift 2017 - ARKitLetSwift 2017 - ARKit
LetSwift 2017 - ARKit
 
Cryptography : From Demaratus to RSA
Cryptography : From Demaratus to RSACryptography : From Demaratus to RSA
Cryptography : From Demaratus to RSA
 

Similar a What we got from the Predicting Red Hat Business Value competition

Developing fast low-rank tensor methods for solving PDEs with uncertain coef...
Developing fast  low-rank tensor methods for solving PDEs with uncertain coef...Developing fast  low-rank tensor methods for solving PDEs with uncertain coef...
Developing fast low-rank tensor methods for solving PDEs with uncertain coef...Alexander Litvinenko
 
My presentation at University of Nottingham "Fast low-rank methods for solvin...
My presentation at University of Nottingham "Fast low-rank methods for solvin...My presentation at University of Nottingham "Fast low-rank methods for solvin...
My presentation at University of Nottingham "Fast low-rank methods for solvin...Alexander Litvinenko
 
Class 18: Measuring Cost
Class 18: Measuring CostClass 18: Measuring Cost
Class 18: Measuring CostDavid Evans
 
材料科学とスーパーコンピュータ: 基礎編
材料科学とスーパーコンピュータ: 基礎編材料科学とスーパーコンピュータ: 基礎編
材料科学とスーパーコンピュータ: 基礎編Michio Katouda
 
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataDAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataMuhammad Saleem
 
機械学習によるデータ分析 実践編
機械学習によるデータ分析 実践編機械学習によるデータ分析 実践編
機械学習によるデータ分析 実践編Ryota Kamoshida
 
Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...
Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...
Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...Naoki (Neo) SATO
 
Optimal Multisine Probing Signal Design for Power System Electromechanical Mo...
Optimal Multisine Probing Signal Design for Power System Electromechanical Mo...Optimal Multisine Probing Signal Design for Power System Electromechanical Mo...
Optimal Multisine Probing Signal Design for Power System Electromechanical Mo...Luigi Vanfretti
 
Otter 2016-11-28-01-ss
Otter 2016-11-28-01-ssOtter 2016-11-28-01-ss
Otter 2016-11-28-01-ssRuo Ando
 
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language ModelsAnomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language ModelsCynthia Freeman
 
continious hmm.pdf
continious  hmm.pdfcontinious  hmm.pdf
continious hmm.pdfRahul Halder
 
pycon2018 "RL Adventure : DQN 부터 Rainbow DQN까지"
pycon2018 "RL Adventure : DQN 부터 Rainbow DQN까지"pycon2018 "RL Adventure : DQN 부터 Rainbow DQN까지"
pycon2018 "RL Adventure : DQN 부터 Rainbow DQN까지"YeChan(Paul) Kim
 
Data Structure: Algorithm and analysis
Data Structure: Algorithm and analysisData Structure: Algorithm and analysis
Data Structure: Algorithm and analysisDr. Rajdeep Chatterjee
 
材料科学とスーパーコンピュータ: 基礎編
材料科学とスーパーコンピュータ: 基礎編材料科学とスーパーコンピュータ: 基礎編
材料科学とスーパーコンピュータ: 基礎編Michio Katouda
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdfFrangoCamila
 
TensorFlow 2: New Era of Developing Deep Learning Models
TensorFlow 2: New Era of Developing Deep Learning ModelsTensorFlow 2: New Era of Developing Deep Learning Models
TensorFlow 2: New Era of Developing Deep Learning ModelsJeongkyu Shin
 

Similar a What we got from the Predicting Red Hat Business Value competition (20)

hw-sol.pdf
hw-sol.pdfhw-sol.pdf
hw-sol.pdf
 
Developing fast low-rank tensor methods for solving PDEs with uncertain coef...
Developing fast  low-rank tensor methods for solving PDEs with uncertain coef...Developing fast  low-rank tensor methods for solving PDEs with uncertain coef...
Developing fast low-rank tensor methods for solving PDEs with uncertain coef...
 
My presentation at University of Nottingham "Fast low-rank methods for solvin...
My presentation at University of Nottingham "Fast low-rank methods for solvin...My presentation at University of Nottingham "Fast low-rank methods for solvin...
My presentation at University of Nottingham "Fast low-rank methods for solvin...
 
Thesis_Presentation
Thesis_PresentationThesis_Presentation
Thesis_Presentation
 
Class 18: Measuring Cost
Class 18: Measuring CostClass 18: Measuring Cost
Class 18: Measuring Cost
 
材料科学とスーパーコンピュータ: 基礎編
材料科学とスーパーコンピュータ: 基礎編材料科学とスーパーコンピュータ: 基礎編
材料科学とスーパーコンピュータ: 基礎編
 
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataDAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
 
機械学習によるデータ分析 実践編
機械学習によるデータ分析 実践編機械学習によるデータ分析 実践編
機械学習によるデータ分析 実践編
 
Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...
Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...
Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...
 
Optimal Multisine Probing Signal Design for Power System Electromechanical Mo...
Optimal Multisine Probing Signal Design for Power System Electromechanical Mo...Optimal Multisine Probing Signal Design for Power System Electromechanical Mo...
Optimal Multisine Probing Signal Design for Power System Electromechanical Mo...
 
Codes and Isogenies
Codes and IsogeniesCodes and Isogenies
Codes and Isogenies
 
Otter 2016-11-28-01-ss
Otter 2016-11-28-01-ssOtter 2016-11-28-01-ss
Otter 2016-11-28-01-ss
 
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language ModelsAnomaly Detection in Sequences of Short Text Using Iterative Language Models
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
 
Graph Algebra
Graph AlgebraGraph Algebra
Graph Algebra
 
continious hmm.pdf
continious  hmm.pdfcontinious  hmm.pdf
continious hmm.pdf
 
pycon2018 "RL Adventure : DQN 부터 Rainbow DQN까지"
pycon2018 "RL Adventure : DQN 부터 Rainbow DQN까지"pycon2018 "RL Adventure : DQN 부터 Rainbow DQN까지"
pycon2018 "RL Adventure : DQN 부터 Rainbow DQN까지"
 
Data Structure: Algorithm and analysis
Data Structure: Algorithm and analysisData Structure: Algorithm and analysis
Data Structure: Algorithm and analysis
 
材料科学とスーパーコンピュータ: 基礎編
材料科学とスーパーコンピュータ: 基礎編材料科学とスーパーコンピュータ: 基礎編
材料科学とスーパーコンピュータ: 基礎編
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdf
 
TensorFlow 2: New Era of Developing Deep Learning Models
TensorFlow 2: New Era of Developing Deep Learning ModelsTensorFlow 2: New Era of Developing Deep Learning Models
TensorFlow 2: New Era of Developing Deep Learning Models
 

Último

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 

Último (20)

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 

What we got from the Predicting Red Hat Business Value competition

  • 1. What we got from Red Hat competition By Umaporn Kerdsaeng DSTO Knowledge Sharing : 10/27/2016
  • 2. Topics: 1. Overview: Red Hat Competition 2. Introduction to Decision Tree 3. R package for Decision Tree (xgboost)
  • 3. Overview: Red Hat Competition
  • 5. What is ROC? • ROC : receiver operating characteristic • The ROC curve was first developed by electrical engineers and radar engineers during World War II for detecting enemy objects in battlefields. • ROC curve is a graphical plot that illustrates the performance of a binary classifier system as its discrimination threshold is varied. • The curve is created by plotting the true positive rate (TPR) against the false positive rate (FPR) at various threshold settings. https://en.wikipedia.org/wiki/Receiver_operating_characteristic
  • 6. Sensitivity and Specificity https://www.youtube.com/watch?v=Z5TtopYX1Gc • True Positive (tp) – Detection • False Positive (fp) – False alarm • True Negative (tn) • False Negative (fn) • Sensitivity = Probability of Detection • Specificity = Probability of True Negative • 1-Specificity = Probability of False alarm
  • 8. receiver operating characteristic (ROC) https://www.youtube.com/watch?v=gYIlKUP2hk0 the ROC curve can be generated by plotting the cumulative distribution function of the detection probability in the y-axis versus the cumulative distribution function of the false- alarm probability in x-axis.
  • 13. Introduction to Decision Tree: https://www.youtube.com/watch?v=eKD5gxPPeY0
  • 14. Introduction to Decision Tree: https://www.youtube.com/watch?v=eKD5gxPPeY0
  • 15. Introduction to Decision Tree: https://www.youtube.com/watch?v=eKD5gxPPeY0
  • 16. Introduction to Decision Tree: https://www.youtube.com/watch?v=eKD5gxPPeY0
  • 17. Introduction to Decision Tree: https://www.youtube.com/watch?v=eKD5gxPPeY0
  • 18. Introduction to Decision Tree: https://www.youtube.com/watch?v=eKD5gxPPeY0
  • 19. Introduction to Decision Tree: https://www.youtube.com/watch?v=AmCV4g7_-QM
  • 20. Introduction to Decision Tree: https://www.youtube.com/watch?v=AmCV4g7_-QM
  • 21. Introduction to Decision Tree: https://www.youtube.com/watch?v=nodQ2s0CUbI
  • 22. Introduction to Decision Tree: https://www.youtube.com/watch?v=AmCV4g7_-QM Count All 14 5 4 5 Count Yes 9 2 4 3 Count No 5 3 0 2 P+ 0.64 0.40 1 0.60 P- 0.36 0.60 0 0.40 (P+)(log(P+,2) -0.41 -0.53 0 -0.44 (P-)(log(P-,2) -0.53 -0.44 #NUM! -0.53 H(S) 0.94 0.97 0 0.97 weigth 0.36 0.29 0.36 0.94 0.35 0.00 0.35 Gain(S,wind) 0.25 Count All 14 8 6 Count Yes 9 6 3 Count No 5 2 3 P+ 0.64 0.75 0.50 P- 0.36 0.25 0.50 (P+)(log(P+,2) -0.41 -0.31 -0.50 (P-)(log(P-,2) -0.53 -0.50 -0.50 H(S) 0.94 0.81 1.00 weigth 0.57 0.43 0.94 0.46 0.43 Gain(S,wind) 0.0481
  • 24. A Visual Introduction to Machine Learning http://www.r2d3.us/visual-intro-to-machine-learning-part-1/
  • 25. R package for Decision Tree (xgboost)
  • 26. XGBoost: Extreme Gradient Boosting • An optimized distributed gradient boosting library • XGBoost only works with numeric vectors. you need to convert all other forms of data into numeric vectors. • XGBoost provides a convenient function to do cross (an important method to measure the model’s prediction power). • XGBoost can handle missing values in the data
  • 27. XGBoost: Extreme Gradient Boosting https://www.youtube.com/watch?v=ufHo8vbk6g4 http://blog.nycdatascience.com/faculty/kaggle-winning-solution-xgboost-algorithm-let-us-learn-from-its-author-3/ The minimum information we need to provide is
  • 28. XGBoost: Extreme Gradient Boosting • Step 1 Load all the libraries • Step 2 Load the dataset • Step 4 Tune and Run the model • Step 3 Data Cleaning & Feature Engineering • Step 5 Score the Test Population https://www.analyticsvidhya.com/blog/2016/01/xgboost-algorithm-easy-steps/
  • 29. จิปาถะ • เรียนรู้เรื่องเดิมๆ ซ้ำๆ รอบหลังๆ จะเข้ำใจมำกขึ้น • English Knowledge Source • ไอเดียจะมำแบบไม่เป็นระเบียบ แต่เรำต้องจัดระเบียบควำมคิดและกำรทำงำน • ลองผิดลองถูกและเรียนรู้ไปพร้อมๆ กัน ต้องลงมือทำ • จดทุกอย่ำงที่ทำ (พำยเรือวนในอ่ำง)

Notas del editor

  1. Which customers have the most potential business value Prediction model Classification algorithm Data: Characteristics (People) Activities (act_train, act_test)
  2. True Positive (tp) – สัญญาณกันขโมยดัง เมือมีขโมยมาขโมยรถ False Positive (fp) – หมาฉี่ มอไซด์ผ่าน สัญญาณกันขโมยก็ดังแล้ว – คนขี้ระแวง (Type 1 error) True Negative (tn) –เหตุการณ์ทั่วไปไม่มีอะไรเกิดขึ้น สัญญาณกันขโมยไม่ดัง False Negative (fn) – ขโมยมาขโมยรถแล้วแต่สัญญาณกันขโมยไม่ดัง – คนชะล่าใจ - เสียหายแท้จริง (Type 2 error)