SlideShare una empresa de Scribd logo
1 de 27
Descargar para leer sin conexión
Introduction to Machine
       Learning
                     Lecture 4
     Slides based on Francisco Herrera course on Data Mining




                  Albert Orriols i Puig
                 aorriols@salle.url.edu
                     i l @ ll       ld

      Artificial Intelligence – Machine Learning
          Enginyeria i Arquitectura La Salle
              gy           q
                 Universitat Ramon Llull
Recap of Lecture 3

        Typically, techniques in ML have been divided in
        different paradigms
                Inductive learning
                Explanation-based learning
                  p                      g
                Analogy-based learning
                Evolutionary learning
                Connectionist Learning




                                                           Slide 2
Artificial Intelligence                 Machine Learning
Recap of Lecture 3

        Problems that we’ll study
                Data l
                D t classification: C4 5 kNN N ï B
                         ifi ti     C4.5, kNN, Naïve Bayes …
        1.

                Statistical learning: SVM
        2.

                Association analysis: A-priori
        3.

                Link mining: Page Rank
        4.

                Clustering: k-means
        5.

                Reinforcement learning: Q-learning, XCS
                                     g           g,
        6.

                Regression
        7.

                Genetic Fuzzy Systems
        8.
        8




                                                               Slide 3
Artificial Intelligence                     Machine Learning
Today’s Agenda

        Situation: Where Are We?
        Classification
        Prediction
        Clustering
        Association
        Data Mining Systems
        D t Mi i S t




                                                  Slide 4
Artificial Intelligence        Machine Learning
Situation: Where Are We?

       The input consists of examples featured by
       different characteristics




                                                           Slide 5
Artificial Intelligence                 Machine Learning
Situation: Where Are We?
        What can we do with a bunch of examples?
        Depend on the type of examples we may have
                Classification: Find the class to which a new instance belongs to
                                                                            g
                          E.g.: Find whether a new patient has cancer or not

                Numeric prediction: A variation of classification in which the output
                         p                                                        p
                consists of numeric classes
                          E.g.: Find the frequency of cancerous cell found

                Regression: Find a function that fits your examples
                          E.g.: Find a function that controls your chain process

                Association: Find association among your problem attributes or
                variables
                          E.g.: Find relations such as a patient with high-blood-pressure i
                          E     Fi d l ti         h        ti t ith hi h bl d             is
                          more likely to have heart-attack disease

                Clustering: Process to cluster/group the instances into classes
                          E.g.: Group clients whose purchases are similar
                                                                                           Slide 6
Artificial Intelligence                         Machine Learning
Data Classification

                                                                                                  Test set



                                                                                New instance

                           Information based                      Knowledge
                           on experience                          extraction
                                                                    t ti
                                                Learner                          Model
               Dataset


                                                                               Predicted Output




                            Training set




                                                                                                   Slide 7
Artificial Intelligence                        Machine Learning
Example of Data Classification

                  Data Set               Classification Model         How




       The classification model can be implemented in several ways:
               • Rules
               • Decision trees
               • Mathematical formulae




                                                                            Slide 8
Artificial Intelligence                    Machine Learning
Classification as a Two-Step Process

        Model usage: to classify future or unknown objects
                 g             y                     j
                Estimate the accuracy of the model
                          The known label of test samples is compared with the label
                          predicted by the system
                          The accuracy rate is the p p
                                       y            proportion of test examples that are
                                                                           p
                          correctly classified by the model
                          The test set is independent of the training set


                If the experts thing that the model is acceptable
                          Then, use to the model to predict unknown examples




                                                                                    Slide 9
Artificial Intelligence                       Machine Learning
Going to Real World
                                                      katydids


 Definition: Given a collection of
 a o a ed data (in s
 annotated da a ( this case katydids
                                a yd ds
 and grasshoppers), decide what type
 of insect in the following one



                                                    grasshoppers




                                                                   Slide 10
Artificial Intelligence          Machine Learning
Going to Real World
        How can I put a katydid or a g
                  p        y         grasshopper into my
                                            pp         y
        computer?




                                                           Slide 11
Artificial Intelligence          Machine Learning
Going to Real World
        Thus, the classification problem has been reduced to
            ,                    p

                          Insect   Abdomen           Antennae        Insect
                            ID      Length
                                    L     th          Length
                                                      L     th       Class
                                                                     Cl
                             1        2.7               5.5       Grasshopper
                             2        8.0               9.1         Katydid
                             3        0.9
                                      09                4.7
                                                        47        Grasshopper
                             4        1.1               3.1       Grasshopper
                             5        5.4               8.5         Katykid
                             6        2.9               1.9       Grasshopper
                             7        6.1               6.6         Katydid
                             8        0.5               1.0       Grasshopper
                             9        8.3               6.6         Katydid
                            10        8.1
                                      81                4.7
                                                        47          Katydid



                We have an observation with abdomen length 5 1 and
                                                           5.1
                antennae length 7?



                                                                                Slide 12
Artificial Intelligence                        Machine Learning
Going to Real World
        Actually, we could write that
               y,




        How do I classify this domain?
                                                    Slide 13
Artificial Intelligence          Machine Learning
How to Create Classification Models




        We will study some of this methods:
                The decision tree C4 5
                                  C4.5
                The instance based classifier kNN
                The probabilistic classifier Naïve Bayes
                                                            Slide 14
Artificial Intelligence                  Machine Learning
Regression or Prediction
        Prediction vs data classification
                Similarities: Both learn from a data set
                Difference:
                Diff
                          In classification, each example has a class associated
                          In
                          I prediction, each example has a numerical value
                               di ti       h      lh            ill
                          associated




                                                                                   Slide 15
Artificial Intelligence                      Machine Learning
How to Extract a Model?

        Prediction works analogously to data classification
                Use
                U an algorithm to b ild a model
                                  build
                      l ih                  dl
                Use this model to predict the new unknown example
        Types of regression
                          Linear and multiple regression
                          Non-linear regression

        Two of the most-used approaches to regression
                              pp             g
                Neural networks
                F       lb     d    t
                Fuzzy rule-based systems




                                                                    Slide 16
Artificial Intelligence                    Machine Learning
Clustering
        The clustering problem
                     gp
                Given a data base D={t1, t2, …, tn} of transactions and an
                integer value k, the c us e g p ob e refers to de e a
                   ege a ue , e clustering problem e e s o define
                mapping f: D {1,…, k} where each ti is assigned to one cluster
                kj, 1<=j<=k
        Main difference with classification
                In classification, each example is labeled with a class
                   classification
                In clustering, examples are not labeled
                                                          Examples of clustering
                                                                Segment customer data base based on
                                                                similar buying patterns
                                                                Group houses in a town into
                                                                G     h      i    t    it
                                                                neighborhoods based on similar features
                                                                Identify new plant species
                                                                Identify similar web usage patterns



                                                                                              Slide 17
Artificial Intelligence                Machine Learning
Example of Clustering
        Put these people in different clusters
                  pp


                                                  Which are the keys?
                                                    Define what’s similar
                                                    Group similar things in
                                                    different clusters
                                                       Size of the clusters?

                                                    Which type of clustering do I want?

                                                       Hierarchical clustering?

                                                       Partition-based clustering?




                                                                               Slide 18
Artificial Intelligence        Machine Learning
Are They Similar?




                                                   Slide 19
Artificial Intelligence         Machine Learning
How to Group the Elements?




                                             Slide 20
Artificial Intelligence   Machine Learning
Which Type of Clustering?
        Many types of clustering
           y yp                g
                Hierarchical: Nested set of clusters
                Partition-based: One set of clusters
                Incremental: Each element handled at one time
                Simultaneous: All elements h dl d t
                Si lt              l    t handled together
                                                      th
                Overlapping/non-overlapping

         Hierarchical Clustering                             Partition-based Clustering




                                                                                          Slide 21
Artificial Intelligence                   Machine Learning
Association Rules
        Given a set of items I={I1, I2, …, Im} and a database of
                               {, , , }
        transactions D={t1, t2, …, tn} where ti={Ii1, Ii2, …, Iik}
        and Iij Є I
        The association rule problem is to identify all the rules
        with form
                X         Y
        Rules ith minimum s pport
        R les with minim m support and confidence
                Support: Fraction of transactions which contain both X and Y
                Confidence: Measures of how often items in Y appear in
                transactions that contain X




                                                                          Slide 22
Artificial Intelligence               Machine Learning
Example Association Rules




        I = {Beer, Bread Jelly Milk PeanutButter}
            {Beer Bread, Jelly, Milk,
        Support of {Bread, PeanutButter} is 60%


                                                    Slide 23
Artificial Intelligence       Machine Learning
Example Association Rules




                                             Slide 24
Artificial Intelligence   Machine Learning
Before Finishing…
        Some environments that contain algorithms to perform
                                            g           p
        data classification, regression, clustering and
        association rule mining


                KEEL: http://www keel es
                      http://www.keel.es


                Weka: http://www.cs.waikato.ac.nz/ml/weka/


                Rapid Miner: http://rapid-i.com/content/blogcategory/38/69/




                                                                              Slide 25
Artificial Intelligence                Machine Learning
Next Class


        Start with data classification
                C4.5




                                                  Slide 26
Artificial Intelligence        Machine Learning
Introduction to Machine
       Learning
                     Lecture 4
     Slides based on Francisco Herrera course on Data Mining




                  Albert Orriols i Puig
                 aorriols@salle.url.edu
                     i l @ ll       ld

      Artificial Intelligence – Machine Learning
          Enginyeria i Arquitectura La Salle
              gy           q
                 Universitat Ramon Llull

Más contenido relacionado

La actualidad más candente

Learning Vector Quantization LVQ
Learning Vector Quantization LVQLearning Vector Quantization LVQ
Learning Vector Quantization LVQESCOM
 
Svm Presentation
Svm PresentationSvm Presentation
Svm Presentationshahparin
 
Support Vector Machines for Classification
Support Vector Machines for ClassificationSupport Vector Machines for Classification
Support Vector Machines for ClassificationPrakash Pimpale
 
Support vector machine
Support vector machineSupport vector machine
Support vector machineRishabh Gupta
 
Neural net and back propagation
Neural net and back propagationNeural net and back propagation
Neural net and back propagationMohit Shrivastava
 
Improved Trainings of Wasserstein GANs (WGAN-GP)
Improved Trainings of Wasserstein GANs (WGAN-GP)Improved Trainings of Wasserstein GANs (WGAN-GP)
Improved Trainings of Wasserstein GANs (WGAN-GP)Sangwoo Mo
 
Optimization for Neural Network Training - Veronica Vilaplana - UPC Barcelona...
Optimization for Neural Network Training - Veronica Vilaplana - UPC Barcelona...Optimization for Neural Network Training - Veronica Vilaplana - UPC Barcelona...
Optimization for Neural Network Training - Veronica Vilaplana - UPC Barcelona...Universitat Politècnica de Catalunya
 
Support Vector Machine ppt presentation
Support Vector Machine ppt presentationSupport Vector Machine ppt presentation
Support Vector Machine ppt presentationAyanaRukasar
 
Unsupervised learning (clustering)
Unsupervised learning (clustering)Unsupervised learning (clustering)
Unsupervised learning (clustering)Pravinkumar Landge
 
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
An Introduction to Supervised Machine Learning and Pattern Classification: Th...An Introduction to Supervised Machine Learning and Pattern Classification: Th...
An Introduction to Supervised Machine Learning and Pattern Classification: Th...Sebastian Raschka
 
Independent Component Analysis
Independent Component AnalysisIndependent Component Analysis
Independent Component AnalysisTatsuya Yokota
 
Feature selection concepts and methods
Feature selection concepts and methodsFeature selection concepts and methods
Feature selection concepts and methodsReza Ramezani
 

La actualidad más candente (20)

Isolation Forest
Isolation ForestIsolation Forest
Isolation Forest
 
Learning Vector Quantization LVQ
Learning Vector Quantization LVQLearning Vector Quantization LVQ
Learning Vector Quantization LVQ
 
Lecture6 - C4.5
Lecture6 - C4.5Lecture6 - C4.5
Lecture6 - C4.5
 
Svm Presentation
Svm PresentationSvm Presentation
Svm Presentation
 
Support Vector Machines for Classification
Support Vector Machines for ClassificationSupport Vector Machines for Classification
Support Vector Machines for Classification
 
03 Single layer Perception Classifier
03 Single layer Perception Classifier03 Single layer Perception Classifier
03 Single layer Perception Classifier
 
Support vector machine
Support vector machineSupport vector machine
Support vector machine
 
Perceptron & Neural Networks
Perceptron & Neural NetworksPerceptron & Neural Networks
Perceptron & Neural Networks
 
Neural net and back propagation
Neural net and back propagationNeural net and back propagation
Neural net and back propagation
 
Lecture 9 Perceptron
Lecture 9 PerceptronLecture 9 Perceptron
Lecture 9 Perceptron
 
Lecture12 - SVM
Lecture12 - SVMLecture12 - SVM
Lecture12 - SVM
 
Improved Trainings of Wasserstein GANs (WGAN-GP)
Improved Trainings of Wasserstein GANs (WGAN-GP)Improved Trainings of Wasserstein GANs (WGAN-GP)
Improved Trainings of Wasserstein GANs (WGAN-GP)
 
Regularization
RegularizationRegularization
Regularization
 
Optimization for Neural Network Training - Veronica Vilaplana - UPC Barcelona...
Optimization for Neural Network Training - Veronica Vilaplana - UPC Barcelona...Optimization for Neural Network Training - Veronica Vilaplana - UPC Barcelona...
Optimization for Neural Network Training - Veronica Vilaplana - UPC Barcelona...
 
Support Vector Machine ppt presentation
Support Vector Machine ppt presentationSupport Vector Machine ppt presentation
Support Vector Machine ppt presentation
 
Unsupervised learning (clustering)
Unsupervised learning (clustering)Unsupervised learning (clustering)
Unsupervised learning (clustering)
 
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
An Introduction to Supervised Machine Learning and Pattern Classification: Th...An Introduction to Supervised Machine Learning and Pattern Classification: Th...
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
 
Independent Component Analysis
Independent Component AnalysisIndependent Component Analysis
Independent Component Analysis
 
Feature selection concepts and methods
Feature selection concepts and methodsFeature selection concepts and methods
Feature selection concepts and methods
 
SVM
SVMSVM
SVM
 

Destacado

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningLior Rokach
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceAlbert Orriols-Puig
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learningbutest
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningLars Marius Garshol
 
Coding style of Linux Kernel
Coding style of Linux KernelCoding style of Linux Kernel
Coding style of Linux KernelPeter Chang
 
Machine learning
Machine learningMachine learning
Machine learningAmit Gupta
 
final year project_leaf recognition
final year project_leaf recognitionfinal year project_leaf recognition
final year project_leaf recognitionNupur Aggarwal
 
Matlab Image Enhancement Techniques
Matlab Image Enhancement TechniquesMatlab Image Enhancement Techniques
Matlab Image Enhancement Techniquesmatlab Content
 
An OCR System for recognition of Urdu text in Nastaliq Font
An OCR System for recognition of Urdu text in Nastaliq FontAn OCR System for recognition of Urdu text in Nastaliq Font
An OCR System for recognition of Urdu text in Nastaliq FontDr. Syed Hassan Amin
 
Matlab and Image Processing Workshop-SKERG
Matlab and Image Processing Workshop-SKERG Matlab and Image Processing Workshop-SKERG
Matlab and Image Processing Workshop-SKERG Sulaf Almagooshi
 
Fikrimuhal TRHUG 2016 Machine Learning
Fikrimuhal TRHUG 2016 Machine LearningFikrimuhal TRHUG 2016 Machine Learning
Fikrimuhal TRHUG 2016 Machine LearningSukru Hasdemir
 
ENHANCED SIGNATURE VERIFICATION AND RECOGNITION USING MATLAB
ENHANCED SIGNATURE VERIFICATION AND RECOGNITION USING MATLABENHANCED SIGNATURE VERIFICATION AND RECOGNITION USING MATLAB
ENHANCED SIGNATURE VERIFICATION AND RECOGNITION USING MATLABAM Publications
 

Destacado (20)

Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligence
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learning
 
Machine Learning for Dummies
Machine Learning for DummiesMachine Learning for Dummies
Machine Learning for Dummies
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine Learning
 
Coding style of Linux Kernel
Coding style of Linux KernelCoding style of Linux Kernel
Coding style of Linux Kernel
 
Machine Learning at Geeky Base
Machine Learning at Geeky BaseMachine Learning at Geeky Base
Machine Learning at Geeky Base
 
Lecture3 - Machine Learning
Lecture3 - Machine LearningLecture3 - Machine Learning
Lecture3 - Machine Learning
 
Lecture2 - Machine Learning
Lecture2 - Machine LearningLecture2 - Machine Learning
Lecture2 - Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Image processing
Image processingImage processing
Image processing
 
final year project_leaf recognition
final year project_leaf recognitionfinal year project_leaf recognition
final year project_leaf recognition
 
Matlab Image Enhancement Techniques
Matlab Image Enhancement TechniquesMatlab Image Enhancement Techniques
Matlab Image Enhancement Techniques
 
An OCR System for recognition of Urdu text in Nastaliq Font
An OCR System for recognition of Urdu text in Nastaliq FontAn OCR System for recognition of Urdu text in Nastaliq Font
An OCR System for recognition of Urdu text in Nastaliq Font
 
Matlab and Image Processing Workshop-SKERG
Matlab and Image Processing Workshop-SKERG Matlab and Image Processing Workshop-SKERG
Matlab and Image Processing Workshop-SKERG
 
Lecture1 - Machine Learning
Lecture1 - Machine LearningLecture1 - Machine Learning
Lecture1 - Machine Learning
 
Fikrimuhal TRHUG 2016 Machine Learning
Fikrimuhal TRHUG 2016 Machine LearningFikrimuhal TRHUG 2016 Machine Learning
Fikrimuhal TRHUG 2016 Machine Learning
 
ENHANCED SIGNATURE VERIFICATION AND RECOGNITION USING MATLAB
ENHANCED SIGNATURE VERIFICATION AND RECOGNITION USING MATLABENHANCED SIGNATURE VERIFICATION AND RECOGNITION USING MATLAB
ENHANCED SIGNATURE VERIFICATION AND RECOGNITION USING MATLAB
 

Similar a Lecture4 - Machine Learning

Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Miningbutest
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsAlbert Orriols-Puig
 
Auto-encoding variational bayes
Auto-encoding variational bayesAuto-encoding variational bayes
Auto-encoding variational bayesKyuri Kim
 
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...Albert Orriols-Puig
 
Deep Learning: concepts and use cases (October 2018)
Deep Learning: concepts and use cases (October 2018)Deep Learning: concepts and use cases (October 2018)
Deep Learning: concepts and use cases (October 2018)Julien SIMON
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachinePulse
 
FUNCTION APPROXIMATION
FUNCTION APPROXIMATIONFUNCTION APPROXIMATION
FUNCTION APPROXIMATIONankita pandey
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.butest
 
Machine learning and_neural_network_lecture_slide_ece_dku
Machine learning and_neural_network_lecture_slide_ece_dkuMachine learning and_neural_network_lecture_slide_ece_dku
Machine learning and_neural_network_lecture_slide_ece_dkuSeokhyun Yoon
 
Deep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and HypeDeep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and HypeSiby Jose Plathottam
 
Mis End Term Exam Theory Concepts
Mis End Term Exam Theory ConceptsMis End Term Exam Theory Concepts
Mis End Term Exam Theory ConceptsVidya sagar Sharma
 
Improving Classifier Accuracy using Unlabeled Data..doc
Improving Classifier Accuracy using Unlabeled Data..docImproving Classifier Accuracy using Unlabeled Data..doc
Improving Classifier Accuracy using Unlabeled Data..docbutest
 
Dive into Machine Learning Event MUGDSC.pptx
Dive into Machine Learning Event MUGDSC.pptxDive into Machine Learning Event MUGDSC.pptx
Dive into Machine Learning Event MUGDSC.pptxRakshaAgrawal21
 
Dive into Machine Learning Event--MUGDSC
Dive into Machine Learning Event--MUGDSCDive into Machine Learning Event--MUGDSC
Dive into Machine Learning Event--MUGDSCRakshaAgrawal21
 
Artificial Neural Networks for NIU
Artificial Neural Networks for NIUArtificial Neural Networks for NIU
Artificial Neural Networks for NIUProf. Neeta Awasthy
 
Deployment of ID3 decision tree algorithm for placement prediction
Deployment of ID3 decision tree algorithm for placement predictionDeployment of ID3 decision tree algorithm for placement prediction
Deployment of ID3 decision tree algorithm for placement predictionijtsrd
 

Similar a Lecture4 - Machine Learning (20)

Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data sets
 
Lecture8 - From CBR to IBk
Lecture8 - From CBR to IBkLecture8 - From CBR to IBk
Lecture8 - From CBR to IBk
 
Lecture17
Lecture17Lecture17
Lecture17
 
Dbm630 lecture06
Dbm630 lecture06Dbm630 lecture06
Dbm630 lecture06
 
Auto-encoding variational bayes
Auto-encoding variational bayesAuto-encoding variational bayes
Auto-encoding variational bayes
 
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
 
Deep Learning: concepts and use cases (October 2018)
Deep Learning: concepts and use cases (October 2018)Deep Learning: concepts and use cases (October 2018)
Deep Learning: concepts and use cases (October 2018)
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World Applications
 
FUNCTION APPROXIMATION
FUNCTION APPROXIMATIONFUNCTION APPROXIMATION
FUNCTION APPROXIMATION
 
Machine learning
 Machine learning Machine learning
Machine learning
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.
 
Machine learning and_neural_network_lecture_slide_ece_dku
Machine learning and_neural_network_lecture_slide_ece_dkuMachine learning and_neural_network_lecture_slide_ece_dku
Machine learning and_neural_network_lecture_slide_ece_dku
 
Deep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and HypeDeep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and Hype
 
Mis End Term Exam Theory Concepts
Mis End Term Exam Theory ConceptsMis End Term Exam Theory Concepts
Mis End Term Exam Theory Concepts
 
Improving Classifier Accuracy using Unlabeled Data..doc
Improving Classifier Accuracy using Unlabeled Data..docImproving Classifier Accuracy using Unlabeled Data..doc
Improving Classifier Accuracy using Unlabeled Data..doc
 
Dive into Machine Learning Event MUGDSC.pptx
Dive into Machine Learning Event MUGDSC.pptxDive into Machine Learning Event MUGDSC.pptx
Dive into Machine Learning Event MUGDSC.pptx
 
Dive into Machine Learning Event--MUGDSC
Dive into Machine Learning Event--MUGDSCDive into Machine Learning Event--MUGDSC
Dive into Machine Learning Event--MUGDSC
 
Artificial Neural Networks for NIU
Artificial Neural Networks for NIUArtificial Neural Networks for NIU
Artificial Neural Networks for NIU
 
Deployment of ID3 decision tree algorithm for placement prediction
Deployment of ID3 decision tree algorithm for placement predictionDeployment of ID3 decision tree algorithm for placement prediction
Deployment of ID3 decision tree algorithm for placement prediction
 

Más de Albert Orriols-Puig

HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsAlbert Orriols-Puig
 
Lecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIILecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIIAlbert Orriols-Puig
 
Lecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART IILecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART IIAlbert Orriols-Puig
 
Lecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesLecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesAlbert Orriols-Puig
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryAlbert Orriols-Puig
 
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...Albert Orriols-Puig
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...Albert Orriols-Puig
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSAlbert Orriols-Puig
 
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...Albert Orriols-Puig
 

Más de Albert Orriols-Puig (19)

HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasets
 
Lecture24
Lecture24Lecture24
Lecture24
 
Lecture23
Lecture23Lecture23
Lecture23
 
Lecture22
Lecture22Lecture22
Lecture22
 
Lecture21
Lecture21Lecture21
Lecture21
 
Lecture20
Lecture20Lecture20
Lecture20
 
Lecture19
Lecture19Lecture19
Lecture19
 
Lecture18
Lecture18Lecture18
Lecture18
 
Lecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIILecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART III
 
Lecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART IILecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART II
 
Lecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesLecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rules
 
Lecture13 - Association Rules
Lecture13 - Association RulesLecture13 - Association Rules
Lecture13 - Association Rules
 
Lecture10 - Naïve Bayes
Lecture10 - Naïve BayesLecture10 - Naïve Bayes
Lecture10 - Naïve Bayes
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-Theory
 
Lecture7 - IBk
Lecture7 - IBkLecture7 - IBk
Lecture7 - IBk
 
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
 
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
 

Último

UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxUmeshTimilsina1
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 

Último (20)

UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 

Lecture4 - Machine Learning

  • 1. Introduction to Machine Learning Lecture 4 Slides based on Francisco Herrera course on Data Mining Albert Orriols i Puig aorriols@salle.url.edu i l @ ll ld Artificial Intelligence – Machine Learning Enginyeria i Arquitectura La Salle gy q Universitat Ramon Llull
  • 2. Recap of Lecture 3 Typically, techniques in ML have been divided in different paradigms Inductive learning Explanation-based learning p g Analogy-based learning Evolutionary learning Connectionist Learning Slide 2 Artificial Intelligence Machine Learning
  • 3. Recap of Lecture 3 Problems that we’ll study Data l D t classification: C4 5 kNN N ï B ifi ti C4.5, kNN, Naïve Bayes … 1. Statistical learning: SVM 2. Association analysis: A-priori 3. Link mining: Page Rank 4. Clustering: k-means 5. Reinforcement learning: Q-learning, XCS g g, 6. Regression 7. Genetic Fuzzy Systems 8. 8 Slide 3 Artificial Intelligence Machine Learning
  • 4. Today’s Agenda Situation: Where Are We? Classification Prediction Clustering Association Data Mining Systems D t Mi i S t Slide 4 Artificial Intelligence Machine Learning
  • 5. Situation: Where Are We? The input consists of examples featured by different characteristics Slide 5 Artificial Intelligence Machine Learning
  • 6. Situation: Where Are We? What can we do with a bunch of examples? Depend on the type of examples we may have Classification: Find the class to which a new instance belongs to g E.g.: Find whether a new patient has cancer or not Numeric prediction: A variation of classification in which the output p p consists of numeric classes E.g.: Find the frequency of cancerous cell found Regression: Find a function that fits your examples E.g.: Find a function that controls your chain process Association: Find association among your problem attributes or variables E.g.: Find relations such as a patient with high-blood-pressure i E Fi d l ti h ti t ith hi h bl d is more likely to have heart-attack disease Clustering: Process to cluster/group the instances into classes E.g.: Group clients whose purchases are similar Slide 6 Artificial Intelligence Machine Learning
  • 7. Data Classification Test set New instance Information based Knowledge on experience extraction t ti Learner Model Dataset Predicted Output Training set Slide 7 Artificial Intelligence Machine Learning
  • 8. Example of Data Classification Data Set Classification Model How The classification model can be implemented in several ways: • Rules • Decision trees • Mathematical formulae Slide 8 Artificial Intelligence Machine Learning
  • 9. Classification as a Two-Step Process Model usage: to classify future or unknown objects g y j Estimate the accuracy of the model The known label of test samples is compared with the label predicted by the system The accuracy rate is the p p y proportion of test examples that are p correctly classified by the model The test set is independent of the training set If the experts thing that the model is acceptable Then, use to the model to predict unknown examples Slide 9 Artificial Intelligence Machine Learning
  • 10. Going to Real World katydids Definition: Given a collection of a o a ed data (in s annotated da a ( this case katydids a yd ds and grasshoppers), decide what type of insect in the following one grasshoppers Slide 10 Artificial Intelligence Machine Learning
  • 11. Going to Real World How can I put a katydid or a g p y grasshopper into my pp y computer? Slide 11 Artificial Intelligence Machine Learning
  • 12. Going to Real World Thus, the classification problem has been reduced to , p Insect Abdomen Antennae Insect ID Length L th Length L th Class Cl 1 2.7 5.5 Grasshopper 2 8.0 9.1 Katydid 3 0.9 09 4.7 47 Grasshopper 4 1.1 3.1 Grasshopper 5 5.4 8.5 Katykid 6 2.9 1.9 Grasshopper 7 6.1 6.6 Katydid 8 0.5 1.0 Grasshopper 9 8.3 6.6 Katydid 10 8.1 81 4.7 47 Katydid We have an observation with abdomen length 5 1 and 5.1 antennae length 7? Slide 12 Artificial Intelligence Machine Learning
  • 13. Going to Real World Actually, we could write that y, How do I classify this domain? Slide 13 Artificial Intelligence Machine Learning
  • 14. How to Create Classification Models We will study some of this methods: The decision tree C4 5 C4.5 The instance based classifier kNN The probabilistic classifier Naïve Bayes Slide 14 Artificial Intelligence Machine Learning
  • 15. Regression or Prediction Prediction vs data classification Similarities: Both learn from a data set Difference: Diff In classification, each example has a class associated In I prediction, each example has a numerical value di ti h lh ill associated Slide 15 Artificial Intelligence Machine Learning
  • 16. How to Extract a Model? Prediction works analogously to data classification Use U an algorithm to b ild a model build l ih dl Use this model to predict the new unknown example Types of regression Linear and multiple regression Non-linear regression Two of the most-used approaches to regression pp g Neural networks F lb d t Fuzzy rule-based systems Slide 16 Artificial Intelligence Machine Learning
  • 17. Clustering The clustering problem gp Given a data base D={t1, t2, …, tn} of transactions and an integer value k, the c us e g p ob e refers to de e a ege a ue , e clustering problem e e s o define mapping f: D {1,…, k} where each ti is assigned to one cluster kj, 1<=j<=k Main difference with classification In classification, each example is labeled with a class classification In clustering, examples are not labeled Examples of clustering Segment customer data base based on similar buying patterns Group houses in a town into G h i t it neighborhoods based on similar features Identify new plant species Identify similar web usage patterns Slide 17 Artificial Intelligence Machine Learning
  • 18. Example of Clustering Put these people in different clusters pp Which are the keys? Define what’s similar Group similar things in different clusters Size of the clusters? Which type of clustering do I want? Hierarchical clustering? Partition-based clustering? Slide 18 Artificial Intelligence Machine Learning
  • 19. Are They Similar? Slide 19 Artificial Intelligence Machine Learning
  • 20. How to Group the Elements? Slide 20 Artificial Intelligence Machine Learning
  • 21. Which Type of Clustering? Many types of clustering y yp g Hierarchical: Nested set of clusters Partition-based: One set of clusters Incremental: Each element handled at one time Simultaneous: All elements h dl d t Si lt l t handled together th Overlapping/non-overlapping Hierarchical Clustering Partition-based Clustering Slide 21 Artificial Intelligence Machine Learning
  • 22. Association Rules Given a set of items I={I1, I2, …, Im} and a database of {, , , } transactions D={t1, t2, …, tn} where ti={Ii1, Ii2, …, Iik} and Iij Є I The association rule problem is to identify all the rules with form X Y Rules ith minimum s pport R les with minim m support and confidence Support: Fraction of transactions which contain both X and Y Confidence: Measures of how often items in Y appear in transactions that contain X Slide 22 Artificial Intelligence Machine Learning
  • 23. Example Association Rules I = {Beer, Bread Jelly Milk PeanutButter} {Beer Bread, Jelly, Milk, Support of {Bread, PeanutButter} is 60% Slide 23 Artificial Intelligence Machine Learning
  • 24. Example Association Rules Slide 24 Artificial Intelligence Machine Learning
  • 25. Before Finishing… Some environments that contain algorithms to perform g p data classification, regression, clustering and association rule mining KEEL: http://www keel es http://www.keel.es Weka: http://www.cs.waikato.ac.nz/ml/weka/ Rapid Miner: http://rapid-i.com/content/blogcategory/38/69/ Slide 25 Artificial Intelligence Machine Learning
  • 26. Next Class Start with data classification C4.5 Slide 26 Artificial Intelligence Machine Learning
  • 27. Introduction to Machine Learning Lecture 4 Slides based on Francisco Herrera course on Data Mining Albert Orriols i Puig aorriols@salle.url.edu i l @ ll ld Artificial Intelligence – Machine Learning Enginyeria i Arquitectura La Salle gy q Universitat Ramon Llull