SlideShare una empresa de Scribd logo
1 de 39
600.325/425 Declarative Methods - J. Eisner 1
Machine Learning
A large and fascinating field;
there’s much more than what you’ll see
in this class!
600.325/425 Declarative Methods - J. Eisner 2
What should we try to learn, if we want to…
 make computer systems more efficient or secure?
 make money in the stock market?
 avoid losing money to fraud or scams?
 do science or medicine?
 win at games?
 make more entertaining games?
 improve user interfaces?
 even brain-computer interfaces …
 make traditional applications more useful?
 word processors, drawing programs, email, web search,
photo organizers, …
600.325/425 Declarative Methods - J. Eisner 3
What should we try to learn, if we want to…
 make computer systems more efficient or secure?
 make money in the stock market?
 avoid losing money to fraud or scams?
 do science or medicine?
 win at games?
 make more entertaining games?
 improve user interfaces?
 even brain-computer interfaces …
 make traditional applications more useful?
 word processors, drawing programs, email, web search,
photo organizers, …
This stuff has got to be an
important part of the future …
… beats trying to program all the
special cases directly
… and there are “intelligent”
behaviors you can’t imagine
programming directly. (Most of
the stuff now in your brain wasn’t
programmed in advance, either!)
600.325/425 Declarative Methods - J. Eisner 4
The simplest problem:
Supervised binary classification of vectors
 Training set:
(x1, y1), (x2, y2), … (xn, yn)
where x1, x2, … are in Rn
and y1, y2, … are in {0,1} or {-,+} or {-1,1}
 Test set:
(xn+1, ?), (xn+2, ?), … (xn+m, ?)
where these x’s were probably not seen in training
5
University of Texas at Austin
Machine Learning Group
slide thanks to Ray Mooney
Linear Separators
6
University of Texas at Austin
Machine Learning Group
Linear Separators
slide thanks to Ray Mooney
7
University of Texas at Austin
Machine Learning Group
slide thanks to Ray Mooney (modified)
Nonlinear Separators
600.325/425 Declarative Methods - J. Eisner
Page 8
x
y
Note: A more complex function
requires more data to generate
an accurate model
(sample complexity)
slide thanks to Kevin Small (modified)
Nonlinear Separators
600.325/425 Declarative Methods - J. Eisner 9
Encoding and decoding for learning
 Binary classification of vectors … but how do
we treat “real” learning problems in this
framework?
 We need to encode each input example as a
vector in Rn:
feature extraction
600.325/425 Declarative Methods - J. Eisner 10
Features for recognizing a chair?
600.325/425 Declarative Methods - J. Eisner 11
Features for recognizing childhood autism?
(from DSM IV, the Diagnostic and Statistical Manual)
A. A total of six (or more) items from (1), (2), and (3), with
at least two from (1), and one each from (2) and (3):
 (1) Qualitative impairment in social interaction, as manifested
by at least two of the following:
 marked impairment in the use of multiple nonverbal behaviors
such as eye-to-eye gaze, facial expression, body postures, and
gestures to regulate social interaction.
 failure to develop peer relationships appropriate to
developmental level
 a lack of spontaneous seeking to share enjoyment, interests, or
achievements with other people (e.g., by a lack of showing,
bringing, or pointing out objects of interest)
 lack of social or emotional reciprocity
 (2) Qualitative impairments in communication as manifested
by at least one of the following: …
600.325/425 Declarative Methods - J. Eisner 12
Features for recognizing childhood autism?
(from DSM IV, the Diagnostic and Statistical Manual)
B. Delays or abnormal functioning in at least one
of the following areas, with onset prior to age 3
years:
(1) social interaction
(2) language as used in social communication, or
(3) symbolic or imaginative play.
C. The disturbance is not better accounted for by
Rett's disorder or childhood disintegrative
disorder.
600.325/425 Declarative Methods - J. Eisner 13
Features for recognizing a prime number?
 (2,+) (3,+) (4,-) (5,+) (6,-) (7,+) (8,-) (9,-)
(10,-) (11,+) (12,-) (13,+) (14,-) (15,-) …
 Ouch!
 But what kinds of features might you try if you
didn’t know anything about primality?
 How well would they work?
 False positives vs. false negatives?
 Expected performance vs. worst-case
600.325/425 Declarative Methods - J. Eisner 14
Features for recognizing masculine vs. feminine
words in French?
 le fromage (cheese) la salade (salad, lettuce)
 le monument (monument) la fourchette (fork)
 le sentiment (feeling) la télévision (television)
 le couteau (knife) la culture (culture)
 le téléphone (telephone) la situation (situation)
 le microscope (microscope) la société (society)
 le romantisme (romanticism)la différence (difference)
la philosophie (philosophy)
600.325/425 Declarative Methods - J. Eisner 15
Features for recognizing when the user who’s
typing isn’t the usual user?
 (And how do you train this?)
600.325/425 Declarative Methods - J. Eisner 16
Measuring performance
 Simplest: Classification error (fraction of wrong answers)
 Better: Loss functions – different penalties for false positives vs.
false negatives
 If the learner gives a confidence or probability along with each of
its answers, give extra credit for being confidently right but extra
penalty for being confidently wrong
 What’s the formula?
 Correct answer is yi  {-1, +1}
 System predicts zi  [-1, +1] (perhaps fractional)
 Score is i yi * zi
600.325/425 Declarative Methods - J. Eisner 17
Encoding and decoding for learning
 Binary classification of vectors … but how do we
treat “real” learning problems in this framework?
 If the output is to be binary, we need to encode each
input example as a vector in Rn:
feature extraction
 If the output is to be more complicated,
we may need to obtain it as a sequence of binary
decisions, each on a different feature vector
600.325/425 Declarative Methods - J. Eisner
Page 18
Multiclass Classification
Many binary classifiers
(“one versus all”)
slide thanks to Kevin Small (modified)
One multiway classifier
600.325/425 Declarative Methods - J. Eisner 19
Regression: predict a number, not a class
 Don’t just predict whether stock will go up or
down in the present circumstance – predict
by how much!
 Better, predict probabilities that it will go up
and down by different amounts
600.325/425 Declarative Methods - J. Eisner 20
Inference: Predict a whole pattern
 Predict a whole object
(in the sense of object-oriented programming)
 Output is a vector, or a tree, or something
 Why useful?
 Or, return many possible trees with a different
probability on each one
 Some fancy machine learning methods can handle
this directly … but how would you do a simple
encoding?
600.325/425 Declarative Methods - J. Eisner
Page 21
Defining Learning Problems
 ML algorithms are mathematical formalisms and
problems must be modeled accordingly
 Feature Space – space used to describe each
instance; often Rd
, {0,1}d
, etc.
 Output Space – space of possible output labels
 Hypothesis Space – space of functions that can be
selected by the machine learning algorithm (depends on
the algorithm)
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 22
Context Sensitive Spelling
Did anybody (else) want too sleep for to more hours this morning?
 Output Space
 Could use the entire vocabulary; Y={a,aback,...,zucchini}
 Could also use a confusion set; Y={to, too, two}
 Model as (single label) multiclass classification
 Hypothesis space is provided by your learner
 Need to define the feature space
 
two
too
to
d
,
,


slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 23
Sentence Representation
S = I would like a piece of cake too!
 Define a set of features
 Features are relations that hold in the sentence.
 Two components to defining features
 Describe relations in the sentence: text, text ordering, properties
of the text (information sources)
 Define functions based upon these relations (more on this later)
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 24
Sentence Representation
S1 = I would like a piece of cake too!
S2 = This is not the way to achieve peace in Iraq.
 Examples of (simple) features
1. Does ‘ever’ appear within a window of 3 words?
2. Does ‘cake’ appear within a window of 3 words?
3. Is the preceding word a verb?
S1 = 0, 1, 0
S2 = 0, 0, 1
1
2
3
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 25
Embedding
 Requires some knowledge engineering
 Makes the discriminant function simpler (and learnable)
Peace
Piece
5
2
3
3
4
1
3
2
1 x
x
x
x
x
x
x
x
x 

5
4
1 y
y
y 

slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 26
Sparse Representation
 Between basic and complex features, the
dimensionality will be very high
 Most features will not be active in a given example
 Represent vectors with a list of active indices
S1 = 1, 0, 1, 0, 0, 0, 1, 0, 0, 1 becomes S1 = 1, 3, 7, 10
S2 = 0, 0, 0, 1, 0, 0, 1, 0, 0, 0 becomes S2 = 4, 7
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 27
Types of Sparsity
 Sparse Function Space
 High dimensional data where target function depends
on a few features (many irrelevant features)
 Sparse Example Space
 High dimensional data where only a few features are
active in each example
 In NLP, we typically have both types of sparsity.
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner 28
Training paradigms
 Supervised?
 Unsupervised?
 Partly supervised?
 Incomplete?
 Active learning, online learning
 Reinforcement learning
600.325/425 Declarative Methods - J. Eisner 29
Training and test sets
 How this relates to the midterm
 Want you to do well – proves I’m a good teacher (merit pay?)
 So I want to teach to the test …
 heck, just show you the test in advance!
 Or equivalently, test exactly what I taught …
 what was the title of slide 29?
 How should JHU prevent this?
 what would the title of slide 29 ½ have been?
 Development sets
 the market newsletter scam
 so, what if we have an army of robotic professors?
 some professor’s class will do well just by luck! she wins!
 JHU should only be able to send one prof to the professorial Olympics
 Olympic trials are like a development set
600.325/425 Declarative Methods - J. Eisner 30
Overfitting and underfitting
 Overfitting: Model the training data all too well
(autistic savants?). Do really well if we test on the
training data, but poorly if we test on new data.
 Underfitting: Try too hard to generalize. Ignore
relevant distinctions – try to find a simple linear
separator when the data are actually more
complicated than that.
 How does this relate to the # of parameters to
learn?
 Lord Kelvin: “And with 3 parameters, I can fit an
elephant …”
600.325/425 Declarative Methods - J. Eisner 31
“Feature Engineering” Workshop in 2005
CALL FOR PAPERS
Feature Engineering for Machine Learning in Natural Language
Processing
Workshop at the Annual Meeting of the Association of Computational
Linguistics (ACL 2005)
http://research.microsoft.com/~ringger/FeatureEngineeringWorkshop/
Submission Deadline: April 20, 2005
Ann Arbor, Michigan
June 29, 2005
600.325/425 Declarative Methods - J. Eisner 32
“Feature Engineering” Workshop in 2005
As experience with machine learning for solving natural language
processing tasks accumulates in the field, practitioners are finding
that feature engineering is as critical as the choice of machine
learning algorithm, if not more so.
Feature design, feature selection, and feature impact (through ablation
studies and the like) significantly affect the performance of systems
and deserve greater attention.
In the wake of the shift away from knowledge engineering and of the
successes of data-driven and statistical methods, researchers in the
field are likely to make further progress by incorporating additional,
sometimes familiar, sources of knowledge as features.
Although some experience in the area of feature engineering is to be
found in the theoretical machine learning community, the particular
demands of natural language processing leave much to be
discovered.
600.325/425 Declarative Methods - J. Eisner 33
“Feature Engineering” Workshop in 2005
Topics may include, but are not necessarily limited to:
 Novel methods for discovering or inducing features, such as mining the web for
closed classes, useful for indicator features.
 Comparative studies of different feature selection algorithms for NLP tasks.
 Interactive tools that help researchers to identify ambiguous cases that could be
disambiguated by the addition of features.
 Error analysis of various aspects of feature induction, selection, representation.
 Issues with representation, e.g., strategies for handling hierarchical
representations, including decomposing to atomic features or by employing
statistical relational learning.
 Techniques used in fields outside NLP that prove useful in NLP.
 The impact of feature selection and feature design on such practical
considerations as training time, experimental design, domain independence,
and evaluation.
 Analysis of feature engineering and its interaction with specific machine learning
methods commonly used in NLP.
 Combining classifiers that employ diverse types of features.
 Studies of methods for defining a feature set, for example by iteratively
expanding a base feature set.
 Issues with representing and combining real-valued and categorical features for
NLP tasks.
600.325/425 Declarative Methods - J. Eisner
Page 34
A Machine Learning System
Preprocessing
Raw
Text
Formatted
Text
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 35
Preprocessing Text
 Sentence splitting, Word Splitting, etc.
 Put data in a form usable for feature
extraction
They recently
recovered a small
piece of a live
Elvis concert
recording.
He was singing
gospel songs,
including “Peace
in the Valley.”
0 0 0 They
0 0 1 recently
0 0 2 recovered
0 0 3 a
0 0 4 small
piece 0 5 piece
0 0 6 of
:
0 1 6 including
0 1 7 QUOTE
peace 1 8 Peace
0 1 9 in
0 1 10 the
0 1 11 Valley
0 1 12 .
0 1 13 QUOTE
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 36
A Machine Learning System
Preprocessing
Feature
Extraction
Raw
Text
Formatted
Text
Feature
Vectors
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 37
Feature Extraction
 Converts formatted text
into feature vectors
 Lexicon file contains
feature descriptions
0 0 0 They
0 0 1 recently
0 0 2 recovered
0 0 3 a
0 0 4 small
piece 0 5 piece
0 0 6 of
:
0 1 6 including
0 1 7 QUOTE
peace 1 8 Peace
0 1 9 in
0 1 10 the
0 1 11 Valley
0 1 12 .
0 1 13 QUOTE
0, 1001, 1013, 1134, 1175, 1206
1, 1021, 1055, 1085, 1182, 1252
Lexicon
File
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 38
A Machine Learning System
Preprocessing
Feature
Extraction
Machine
Learner
Classifier
Postprocessing
Raw
Text
Formatted
Text
Testing
Examples
Function
Parameters
Labels
Annotated
Text
Feature
Vectors
Training
Examples
slide thanks to Kevin Small (modified)
600.325/425 Declarative Methods - J. Eisner
Page 39
A Machine Learning System
Preprocessing
Feature
Extraction
Machine
Learner
Classifier
Postprocessing
Raw
Text
Formatted
Text
Testing
Examples
Function
Parameters
Labels
Annotated
Text
Feature
Vectors
Training
Examples
slide thanks to Kevin Small (modified)

Más contenido relacionado

Similar a 09learning.ppt

AI and ML Skills for the Testing World Tutorial
AI and ML Skills for the Testing World TutorialAI and ML Skills for the Testing World Tutorial
AI and ML Skills for the Testing World TutorialTariq King
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdfHODIT12
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1MostafaHazemMostafaa
 
30237--KK30237--KK EMBED Equa.docx
30237--KK30237--KK EMBED Equa.docx30237--KK30237--KK EMBED Equa.docx
30237--KK30237--KK EMBED Equa.docxgilbertkpeters11344
 
Machine Learning & AI - 2022 intro for pre-college students.pdf
Machine Learning & AI - 2022 intro for pre-college students.pdfMachine Learning & AI - 2022 intro for pre-college students.pdf
Machine Learning & AI - 2022 intro for pre-college students.pdfEd Fernandez
 
Lecture 6: Watson and the Social Web (2014), Chris Welty
Lecture 6: Watson and the Social Web (2014), Chris WeltyLecture 6: Watson and the Social Web (2014), Chris Welty
Lecture 6: Watson and the Social Web (2014), Chris WeltyLora Aroyo
 
Bi model face recognition framework
Bi model face recognition frameworkBi model face recognition framework
Bi model face recognition frameworkSumit Agarwal
 
Solr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBM
Solr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBMSolr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBM
Solr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBMLucidworks
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningAI Summary
 
Knowledge representation Problem in AI.pptx
Knowledge representation Problem in AI.pptxKnowledge representation Problem in AI.pptx
Knowledge representation Problem in AI.pptxSandeepGupta229023
 
Computational Biology, Part 4 Protein Coding Regions
Computational Biology, Part 4 Protein Coding RegionsComputational Biology, Part 4 Protein Coding Regions
Computational Biology, Part 4 Protein Coding Regionsbutest
 
2013 - Andrei Zmievski: Machine learning para datos
2013 - Andrei Zmievski: Machine learning para datos2013 - Andrei Zmievski: Machine learning para datos
2013 - Andrei Zmievski: Machine learning para datosPHP Conference Argentina
 
Object recognition
Object recognitionObject recognition
Object recognitionsaniacorreya
 
Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...
Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...
Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...Chris Rackauckas
 
Facial Expression Recognition via Python
Facial Expression Recognition via PythonFacial Expression Recognition via Python
Facial Expression Recognition via PythonSaurav Gupta
 
Predicting Facial Expression using Neural Network
Predicting Facial Expression using Neural Network Predicting Facial Expression using Neural Network
Predicting Facial Expression using Neural Network Santanu Paul
 
Ashford 2 - Week 1 - Instructor GuidanceWeek OverviewThe f.docx
Ashford 2 - Week 1 - Instructor GuidanceWeek OverviewThe f.docxAshford 2 - Week 1 - Instructor GuidanceWeek OverviewThe f.docx
Ashford 2 - Week 1 - Instructor GuidanceWeek OverviewThe f.docxfredharris32
 

Similar a 09learning.ppt (20)

Bb32351355
Bb32351355Bb32351355
Bb32351355
 
AI and ML Skills for the Testing World Tutorial
AI and ML Skills for the Testing World TutorialAI and ML Skills for the Testing World Tutorial
AI and ML Skills for the Testing World Tutorial
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdf
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
 
30237--KK30237--KK EMBED Equa.docx
30237--KK30237--KK EMBED Equa.docx30237--KK30237--KK EMBED Equa.docx
30237--KK30237--KK EMBED Equa.docx
 
Machine Learning & AI - 2022 intro for pre-college students.pdf
Machine Learning & AI - 2022 intro for pre-college students.pdfMachine Learning & AI - 2022 intro for pre-college students.pdf
Machine Learning & AI - 2022 intro for pre-college students.pdf
 
Lecture 6: Watson and the Social Web (2014), Chris Welty
Lecture 6: Watson and the Social Web (2014), Chris WeltyLecture 6: Watson and the Social Web (2014), Chris Welty
Lecture 6: Watson and the Social Web (2014), Chris Welty
 
weak slot and filler
weak slot and fillerweak slot and filler
weak slot and filler
 
Bi model face recognition framework
Bi model face recognition frameworkBi model face recognition framework
Bi model face recognition framework
 
Solr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBM
Solr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBMSolr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBM
Solr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBM
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Knowledge representation Problem in AI.pptx
Knowledge representation Problem in AI.pptxKnowledge representation Problem in AI.pptx
Knowledge representation Problem in AI.pptx
 
Computational Biology, Part 4 Protein Coding Regions
Computational Biology, Part 4 Protein Coding RegionsComputational Biology, Part 4 Protein Coding Regions
Computational Biology, Part 4 Protein Coding Regions
 
2013 - Andrei Zmievski: Machine learning para datos
2013 - Andrei Zmievski: Machine learning para datos2013 - Andrei Zmievski: Machine learning para datos
2013 - Andrei Zmievski: Machine learning para datos
 
Object recognition
Object recognitionObject recognition
Object recognition
 
Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...
Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...
Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...
 
Facial Expression Recognition
Facial Expression RecognitionFacial Expression Recognition
Facial Expression Recognition
 
Facial Expression Recognition via Python
Facial Expression Recognition via PythonFacial Expression Recognition via Python
Facial Expression Recognition via Python
 
Predicting Facial Expression using Neural Network
Predicting Facial Expression using Neural Network Predicting Facial Expression using Neural Network
Predicting Facial Expression using Neural Network
 
Ashford 2 - Week 1 - Instructor GuidanceWeek OverviewThe f.docx
Ashford 2 - Week 1 - Instructor GuidanceWeek OverviewThe f.docxAshford 2 - Week 1 - Instructor GuidanceWeek OverviewThe f.docx
Ashford 2 - Week 1 - Instructor GuidanceWeek OverviewThe f.docx
 

Más de ABINASHPADHY6

Untitled (2).pptxghjfgjfgjfghjghghjghghh
Untitled (2).pptxghjfgjfgjfghjghghjghghhUntitled (2).pptxghjfgjfgjfghjghghjghghh
Untitled (2).pptxghjfgjfgjfghjghghjghghhABINASHPADHY6
 
Ankit Upadhyadfgdfgdfgy sdgdfgdfghdfgdfgdfg
Ankit Upadhyadfgdfgdfgy sdgdfgdfghdfgdfgdfgAnkit Upadhyadfgdfgdfgy sdgdfgdfghdfgdfgdfg
Ankit Upadhyadfgdfgdfgy sdgdfgdfghdfgdfgdfgABINASHPADHY6
 
Rapid Guard PVC Doors (Ritul Joshi).pptx
Rapid Guard PVC Doors (Ritul Joshi).pptxRapid Guard PVC Doors (Ritul Joshi).pptx
Rapid Guard PVC Doors (Ritul Joshi).pptxABINASHPADHY6
 
wqedfghj,hgfdgthjkjhgfdsdfgthujsdfrtghyu
wqedfghj,hgfdgthjkjhgfdsdfgthujsdfrtghyuwqedfghj,hgfdgthjkjhgfdsdfgthujsdfrtghyu
wqedfghj,hgfdgthjkjhgfdsdfgthujsdfrtghyuABINASHPADHY6
 
Blue Shark Tech zsxdfghsdfghjsdfghjkdfghjkdfghjk
Blue Shark Tech zsxdfghsdfghjsdfghjkdfghjkdfghjkBlue Shark Tech zsxdfghsdfghjsdfghjkdfghjkdfghjk
Blue Shark Tech zsxdfghsdfghjsdfghjkdfghjkdfghjkABINASHPADHY6
 
Types-of-Information-System.pptx
Types-of-Information-System.pptxTypes-of-Information-System.pptx
Types-of-Information-System.pptxABINASHPADHY6
 
Ferns and Petals.pdf
Ferns and Petals.pdfFerns and Petals.pdf
Ferns and Petals.pdfABINASHPADHY6
 
Coursera H5B26VDU9GSH.pdf
Coursera H5B26VDU9GSH.pdfCoursera H5B26VDU9GSH.pdf
Coursera H5B26VDU9GSH.pdfABINASHPADHY6
 
Bagging_and_Boosting.pptx
Bagging_and_Boosting.pptxBagging_and_Boosting.pptx
Bagging_and_Boosting.pptxABINASHPADHY6
 
shubhampresentation-180430060134.pptx
shubhampresentation-180430060134.pptxshubhampresentation-180430060134.pptx
shubhampresentation-180430060134.pptxABINASHPADHY6
 
decisiontree-110906040745-phpapp01.pptx
decisiontree-110906040745-phpapp01.pptxdecisiontree-110906040745-phpapp01.pptx
decisiontree-110906040745-phpapp01.pptxABINASHPADHY6
 
Module 7_ Use Cases_ Blockchain Certification Training Course.pptx
Module 7_ Use Cases_ Blockchain Certification Training Course.pptxModule 7_ Use Cases_ Blockchain Certification Training Course.pptx
Module 7_ Use Cases_ Blockchain Certification Training Course.pptxABINASHPADHY6
 
collaborativefiltering-150228122057-conversion-gate02.pptx
collaborativefiltering-150228122057-conversion-gate02.pptxcollaborativefiltering-150228122057-conversion-gate02.pptx
collaborativefiltering-150228122057-conversion-gate02.pptxABINASHPADHY6
 
Chernick.Michael.ppt
Chernick.Michael.pptChernick.Michael.ppt
Chernick.Michael.pptABINASHPADHY6
 
videorecommendationsystemfornewseducationandentertainment-170519183703.pptx
videorecommendationsystemfornewseducationandentertainment-170519183703.pptxvideorecommendationsystemfornewseducationandentertainment-170519183703.pptx
videorecommendationsystemfornewseducationandentertainment-170519183703.pptxABINASHPADHY6
 
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...ABINASHPADHY6
 
pcappt-140121072949-phpapp01.pptx
pcappt-140121072949-phpapp01.pptxpcappt-140121072949-phpapp01.pptx
pcappt-140121072949-phpapp01.pptxABINASHPADHY6
 

Más de ABINASHPADHY6 (20)

Untitled (2).pptxghjfgjfgjfghjghghjghghh
Untitled (2).pptxghjfgjfgjfghjghghjghghhUntitled (2).pptxghjfgjfgjfghjghghjghghh
Untitled (2).pptxghjfgjfgjfghjghghjghghh
 
Ankit Upadhyadfgdfgdfgy sdgdfgdfghdfgdfgdfg
Ankit Upadhyadfgdfgdfgy sdgdfgdfghdfgdfgdfgAnkit Upadhyadfgdfgdfgy sdgdfgdfghdfgdfgdfg
Ankit Upadhyadfgdfgdfgy sdgdfgdfghdfgdfgdfg
 
Rapid Guard PVC Doors (Ritul Joshi).pptx
Rapid Guard PVC Doors (Ritul Joshi).pptxRapid Guard PVC Doors (Ritul Joshi).pptx
Rapid Guard PVC Doors (Ritul Joshi).pptx
 
wqedfghj,hgfdgthjkjhgfdsdfgthujsdfrtghyu
wqedfghj,hgfdgthjkjhgfdsdfgthujsdfrtghyuwqedfghj,hgfdgthjkjhgfdsdfgthujsdfrtghyu
wqedfghj,hgfdgthjkjhgfdsdfgthujsdfrtghyu
 
Blue Shark Tech zsxdfghsdfghjsdfghjkdfghjkdfghjk
Blue Shark Tech zsxdfghsdfghjsdfghjkdfghjkdfghjkBlue Shark Tech zsxdfghsdfghjsdfghjkdfghjkdfghjk
Blue Shark Tech zsxdfghsdfghjsdfghjkdfghjkdfghjk
 
Types-of-Information-System.pptx
Types-of-Information-System.pptxTypes-of-Information-System.pptx
Types-of-Information-System.pptx
 
Ferns and Petals.pdf
Ferns and Petals.pdfFerns and Petals.pdf
Ferns and Petals.pdf
 
Coursera H5B26VDU9GSH.pdf
Coursera H5B26VDU9GSH.pdfCoursera H5B26VDU9GSH.pdf
Coursera H5B26VDU9GSH.pdf
 
Advertising &.pptx
Advertising &.pptxAdvertising &.pptx
Advertising &.pptx
 
Bagging_and_Boosting.pptx
Bagging_and_Boosting.pptxBagging_and_Boosting.pptx
Bagging_and_Boosting.pptx
 
shubhampresentation-180430060134.pptx
shubhampresentation-180430060134.pptxshubhampresentation-180430060134.pptx
shubhampresentation-180430060134.pptx
 
decisiontree-110906040745-phpapp01.pptx
decisiontree-110906040745-phpapp01.pptxdecisiontree-110906040745-phpapp01.pptx
decisiontree-110906040745-phpapp01.pptx
 
Module 7_ Use Cases_ Blockchain Certification Training Course.pptx
Module 7_ Use Cases_ Blockchain Certification Training Course.pptxModule 7_ Use Cases_ Blockchain Certification Training Course.pptx
Module 7_ Use Cases_ Blockchain Certification Training Course.pptx
 
collaborativefiltering-150228122057-conversion-gate02.pptx
collaborativefiltering-150228122057-conversion-gate02.pptxcollaborativefiltering-150228122057-conversion-gate02.pptx
collaborativefiltering-150228122057-conversion-gate02.pptx
 
Chernick.Michael.ppt
Chernick.Michael.pptChernick.Michael.ppt
Chernick.Michael.ppt
 
videorecommendationsystemfornewseducationandentertainment-170519183703.pptx
videorecommendationsystemfornewseducationandentertainment-170519183703.pptxvideorecommendationsystemfornewseducationandentertainment-170519183703.pptx
videorecommendationsystemfornewseducationandentertainment-170519183703.pptx
 
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
 
pcappt-140121072949-phpapp01.pptx
pcappt-140121072949-phpapp01.pptxpcappt-140121072949-phpapp01.pptx
pcappt-140121072949-phpapp01.pptx
 
Lecture1_jps.ppt
Lecture1_jps.pptLecture1_jps.ppt
Lecture1_jps.ppt
 
Culbert.ppt
Culbert.pptCulbert.ppt
Culbert.ppt
 

Último

COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxDr. Ravikiran H M Gowda
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17Celine George
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxPooja Bhuva
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 

Último (20)

COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 

09learning.ppt

  • 1. 600.325/425 Declarative Methods - J. Eisner 1 Machine Learning A large and fascinating field; there’s much more than what you’ll see in this class!
  • 2. 600.325/425 Declarative Methods - J. Eisner 2 What should we try to learn, if we want to…  make computer systems more efficient or secure?  make money in the stock market?  avoid losing money to fraud or scams?  do science or medicine?  win at games?  make more entertaining games?  improve user interfaces?  even brain-computer interfaces …  make traditional applications more useful?  word processors, drawing programs, email, web search, photo organizers, …
  • 3. 600.325/425 Declarative Methods - J. Eisner 3 What should we try to learn, if we want to…  make computer systems more efficient or secure?  make money in the stock market?  avoid losing money to fraud or scams?  do science or medicine?  win at games?  make more entertaining games?  improve user interfaces?  even brain-computer interfaces …  make traditional applications more useful?  word processors, drawing programs, email, web search, photo organizers, … This stuff has got to be an important part of the future … … beats trying to program all the special cases directly … and there are “intelligent” behaviors you can’t imagine programming directly. (Most of the stuff now in your brain wasn’t programmed in advance, either!)
  • 4. 600.325/425 Declarative Methods - J. Eisner 4 The simplest problem: Supervised binary classification of vectors  Training set: (x1, y1), (x2, y2), … (xn, yn) where x1, x2, … are in Rn and y1, y2, … are in {0,1} or {-,+} or {-1,1}  Test set: (xn+1, ?), (xn+2, ?), … (xn+m, ?) where these x’s were probably not seen in training
  • 5. 5 University of Texas at Austin Machine Learning Group slide thanks to Ray Mooney Linear Separators
  • 6. 6 University of Texas at Austin Machine Learning Group Linear Separators slide thanks to Ray Mooney
  • 7. 7 University of Texas at Austin Machine Learning Group slide thanks to Ray Mooney (modified) Nonlinear Separators
  • 8. 600.325/425 Declarative Methods - J. Eisner Page 8 x y Note: A more complex function requires more data to generate an accurate model (sample complexity) slide thanks to Kevin Small (modified) Nonlinear Separators
  • 9. 600.325/425 Declarative Methods - J. Eisner 9 Encoding and decoding for learning  Binary classification of vectors … but how do we treat “real” learning problems in this framework?  We need to encode each input example as a vector in Rn: feature extraction
  • 10. 600.325/425 Declarative Methods - J. Eisner 10 Features for recognizing a chair?
  • 11. 600.325/425 Declarative Methods - J. Eisner 11 Features for recognizing childhood autism? (from DSM IV, the Diagnostic and Statistical Manual) A. A total of six (or more) items from (1), (2), and (3), with at least two from (1), and one each from (2) and (3):  (1) Qualitative impairment in social interaction, as manifested by at least two of the following:  marked impairment in the use of multiple nonverbal behaviors such as eye-to-eye gaze, facial expression, body postures, and gestures to regulate social interaction.  failure to develop peer relationships appropriate to developmental level  a lack of spontaneous seeking to share enjoyment, interests, or achievements with other people (e.g., by a lack of showing, bringing, or pointing out objects of interest)  lack of social or emotional reciprocity  (2) Qualitative impairments in communication as manifested by at least one of the following: …
  • 12. 600.325/425 Declarative Methods - J. Eisner 12 Features for recognizing childhood autism? (from DSM IV, the Diagnostic and Statistical Manual) B. Delays or abnormal functioning in at least one of the following areas, with onset prior to age 3 years: (1) social interaction (2) language as used in social communication, or (3) symbolic or imaginative play. C. The disturbance is not better accounted for by Rett's disorder or childhood disintegrative disorder.
  • 13. 600.325/425 Declarative Methods - J. Eisner 13 Features for recognizing a prime number?  (2,+) (3,+) (4,-) (5,+) (6,-) (7,+) (8,-) (9,-) (10,-) (11,+) (12,-) (13,+) (14,-) (15,-) …  Ouch!  But what kinds of features might you try if you didn’t know anything about primality?  How well would they work?  False positives vs. false negatives?  Expected performance vs. worst-case
  • 14. 600.325/425 Declarative Methods - J. Eisner 14 Features for recognizing masculine vs. feminine words in French?  le fromage (cheese) la salade (salad, lettuce)  le monument (monument) la fourchette (fork)  le sentiment (feeling) la télévision (television)  le couteau (knife) la culture (culture)  le téléphone (telephone) la situation (situation)  le microscope (microscope) la société (society)  le romantisme (romanticism)la différence (difference) la philosophie (philosophy)
  • 15. 600.325/425 Declarative Methods - J. Eisner 15 Features for recognizing when the user who’s typing isn’t the usual user?  (And how do you train this?)
  • 16. 600.325/425 Declarative Methods - J. Eisner 16 Measuring performance  Simplest: Classification error (fraction of wrong answers)  Better: Loss functions – different penalties for false positives vs. false negatives  If the learner gives a confidence or probability along with each of its answers, give extra credit for being confidently right but extra penalty for being confidently wrong  What’s the formula?  Correct answer is yi  {-1, +1}  System predicts zi  [-1, +1] (perhaps fractional)  Score is i yi * zi
  • 17. 600.325/425 Declarative Methods - J. Eisner 17 Encoding and decoding for learning  Binary classification of vectors … but how do we treat “real” learning problems in this framework?  If the output is to be binary, we need to encode each input example as a vector in Rn: feature extraction  If the output is to be more complicated, we may need to obtain it as a sequence of binary decisions, each on a different feature vector
  • 18. 600.325/425 Declarative Methods - J. Eisner Page 18 Multiclass Classification Many binary classifiers (“one versus all”) slide thanks to Kevin Small (modified) One multiway classifier
  • 19. 600.325/425 Declarative Methods - J. Eisner 19 Regression: predict a number, not a class  Don’t just predict whether stock will go up or down in the present circumstance – predict by how much!  Better, predict probabilities that it will go up and down by different amounts
  • 20. 600.325/425 Declarative Methods - J. Eisner 20 Inference: Predict a whole pattern  Predict a whole object (in the sense of object-oriented programming)  Output is a vector, or a tree, or something  Why useful?  Or, return many possible trees with a different probability on each one  Some fancy machine learning methods can handle this directly … but how would you do a simple encoding?
  • 21. 600.325/425 Declarative Methods - J. Eisner Page 21 Defining Learning Problems  ML algorithms are mathematical formalisms and problems must be modeled accordingly  Feature Space – space used to describe each instance; often Rd , {0,1}d , etc.  Output Space – space of possible output labels  Hypothesis Space – space of functions that can be selected by the machine learning algorithm (depends on the algorithm) slide thanks to Kevin Small (modified)
  • 22. 600.325/425 Declarative Methods - J. Eisner Page 22 Context Sensitive Spelling Did anybody (else) want too sleep for to more hours this morning?  Output Space  Could use the entire vocabulary; Y={a,aback,...,zucchini}  Could also use a confusion set; Y={to, too, two}  Model as (single label) multiclass classification  Hypothesis space is provided by your learner  Need to define the feature space   two too to d , ,   slide thanks to Kevin Small (modified)
  • 23. 600.325/425 Declarative Methods - J. Eisner Page 23 Sentence Representation S = I would like a piece of cake too!  Define a set of features  Features are relations that hold in the sentence.  Two components to defining features  Describe relations in the sentence: text, text ordering, properties of the text (information sources)  Define functions based upon these relations (more on this later) slide thanks to Kevin Small (modified)
  • 24. 600.325/425 Declarative Methods - J. Eisner Page 24 Sentence Representation S1 = I would like a piece of cake too! S2 = This is not the way to achieve peace in Iraq.  Examples of (simple) features 1. Does ‘ever’ appear within a window of 3 words? 2. Does ‘cake’ appear within a window of 3 words? 3. Is the preceding word a verb? S1 = 0, 1, 0 S2 = 0, 0, 1 1 2 3 slide thanks to Kevin Small (modified)
  • 25. 600.325/425 Declarative Methods - J. Eisner Page 25 Embedding  Requires some knowledge engineering  Makes the discriminant function simpler (and learnable) Peace Piece 5 2 3 3 4 1 3 2 1 x x x x x x x x x   5 4 1 y y y   slide thanks to Kevin Small (modified)
  • 26. 600.325/425 Declarative Methods - J. Eisner Page 26 Sparse Representation  Between basic and complex features, the dimensionality will be very high  Most features will not be active in a given example  Represent vectors with a list of active indices S1 = 1, 0, 1, 0, 0, 0, 1, 0, 0, 1 becomes S1 = 1, 3, 7, 10 S2 = 0, 0, 0, 1, 0, 0, 1, 0, 0, 0 becomes S2 = 4, 7 slide thanks to Kevin Small (modified)
  • 27. 600.325/425 Declarative Methods - J. Eisner Page 27 Types of Sparsity  Sparse Function Space  High dimensional data where target function depends on a few features (many irrelevant features)  Sparse Example Space  High dimensional data where only a few features are active in each example  In NLP, we typically have both types of sparsity. slide thanks to Kevin Small (modified)
  • 28. 600.325/425 Declarative Methods - J. Eisner 28 Training paradigms  Supervised?  Unsupervised?  Partly supervised?  Incomplete?  Active learning, online learning  Reinforcement learning
  • 29. 600.325/425 Declarative Methods - J. Eisner 29 Training and test sets  How this relates to the midterm  Want you to do well – proves I’m a good teacher (merit pay?)  So I want to teach to the test …  heck, just show you the test in advance!  Or equivalently, test exactly what I taught …  what was the title of slide 29?  How should JHU prevent this?  what would the title of slide 29 ½ have been?  Development sets  the market newsletter scam  so, what if we have an army of robotic professors?  some professor’s class will do well just by luck! she wins!  JHU should only be able to send one prof to the professorial Olympics  Olympic trials are like a development set
  • 30. 600.325/425 Declarative Methods - J. Eisner 30 Overfitting and underfitting  Overfitting: Model the training data all too well (autistic savants?). Do really well if we test on the training data, but poorly if we test on new data.  Underfitting: Try too hard to generalize. Ignore relevant distinctions – try to find a simple linear separator when the data are actually more complicated than that.  How does this relate to the # of parameters to learn?  Lord Kelvin: “And with 3 parameters, I can fit an elephant …”
  • 31. 600.325/425 Declarative Methods - J. Eisner 31 “Feature Engineering” Workshop in 2005 CALL FOR PAPERS Feature Engineering for Machine Learning in Natural Language Processing Workshop at the Annual Meeting of the Association of Computational Linguistics (ACL 2005) http://research.microsoft.com/~ringger/FeatureEngineeringWorkshop/ Submission Deadline: April 20, 2005 Ann Arbor, Michigan June 29, 2005
  • 32. 600.325/425 Declarative Methods - J. Eisner 32 “Feature Engineering” Workshop in 2005 As experience with machine learning for solving natural language processing tasks accumulates in the field, practitioners are finding that feature engineering is as critical as the choice of machine learning algorithm, if not more so. Feature design, feature selection, and feature impact (through ablation studies and the like) significantly affect the performance of systems and deserve greater attention. In the wake of the shift away from knowledge engineering and of the successes of data-driven and statistical methods, researchers in the field are likely to make further progress by incorporating additional, sometimes familiar, sources of knowledge as features. Although some experience in the area of feature engineering is to be found in the theoretical machine learning community, the particular demands of natural language processing leave much to be discovered.
  • 33. 600.325/425 Declarative Methods - J. Eisner 33 “Feature Engineering” Workshop in 2005 Topics may include, but are not necessarily limited to:  Novel methods for discovering or inducing features, such as mining the web for closed classes, useful for indicator features.  Comparative studies of different feature selection algorithms for NLP tasks.  Interactive tools that help researchers to identify ambiguous cases that could be disambiguated by the addition of features.  Error analysis of various aspects of feature induction, selection, representation.  Issues with representation, e.g., strategies for handling hierarchical representations, including decomposing to atomic features or by employing statistical relational learning.  Techniques used in fields outside NLP that prove useful in NLP.  The impact of feature selection and feature design on such practical considerations as training time, experimental design, domain independence, and evaluation.  Analysis of feature engineering and its interaction with specific machine learning methods commonly used in NLP.  Combining classifiers that employ diverse types of features.  Studies of methods for defining a feature set, for example by iteratively expanding a base feature set.  Issues with representing and combining real-valued and categorical features for NLP tasks.
  • 34. 600.325/425 Declarative Methods - J. Eisner Page 34 A Machine Learning System Preprocessing Raw Text Formatted Text slide thanks to Kevin Small (modified)
  • 35. 600.325/425 Declarative Methods - J. Eisner Page 35 Preprocessing Text  Sentence splitting, Word Splitting, etc.  Put data in a form usable for feature extraction They recently recovered a small piece of a live Elvis concert recording. He was singing gospel songs, including “Peace in the Valley.” 0 0 0 They 0 0 1 recently 0 0 2 recovered 0 0 3 a 0 0 4 small piece 0 5 piece 0 0 6 of : 0 1 6 including 0 1 7 QUOTE peace 1 8 Peace 0 1 9 in 0 1 10 the 0 1 11 Valley 0 1 12 . 0 1 13 QUOTE slide thanks to Kevin Small (modified)
  • 36. 600.325/425 Declarative Methods - J. Eisner Page 36 A Machine Learning System Preprocessing Feature Extraction Raw Text Formatted Text Feature Vectors slide thanks to Kevin Small (modified)
  • 37. 600.325/425 Declarative Methods - J. Eisner Page 37 Feature Extraction  Converts formatted text into feature vectors  Lexicon file contains feature descriptions 0 0 0 They 0 0 1 recently 0 0 2 recovered 0 0 3 a 0 0 4 small piece 0 5 piece 0 0 6 of : 0 1 6 including 0 1 7 QUOTE peace 1 8 Peace 0 1 9 in 0 1 10 the 0 1 11 Valley 0 1 12 . 0 1 13 QUOTE 0, 1001, 1013, 1134, 1175, 1206 1, 1021, 1055, 1085, 1182, 1252 Lexicon File slide thanks to Kevin Small (modified)
  • 38. 600.325/425 Declarative Methods - J. Eisner Page 38 A Machine Learning System Preprocessing Feature Extraction Machine Learner Classifier Postprocessing Raw Text Formatted Text Testing Examples Function Parameters Labels Annotated Text Feature Vectors Training Examples slide thanks to Kevin Small (modified)
  • 39. 600.325/425 Declarative Methods - J. Eisner Page 39 A Machine Learning System Preprocessing Feature Extraction Machine Learner Classifier Postprocessing Raw Text Formatted Text Testing Examples Function Parameters Labels Annotated Text Feature Vectors Training Examples slide thanks to Kevin Small (modified)