2. Supervised Learning Supervised learning is the typical machine learning setting, where labeled examples are used as training examples. A learner (decision trees, neural networks, support vector machines, etc.) is trained on the labeled training data, and the trained model is then used to predict the label of unseen data, e.g., (Jeff, Professor, 7, ?) whose label is unknown.
3. Labeled vs. Unlabeled In many practical applications, unlabeled training examples are readily available, but labeled ones are fairly expensive to obtain, because labeling the unlabeled examples requires human effort. For example, there is an (almost) infinite number of web pages on the Internet, but only a few carry a label such as class = "war".
5. SSL: Why can unlabeled data be helpful? [D.J. Miller & H.S. Uyar, NIPS'96] Suppose the data is well-modeled by a mixture density: f(x | θ) = Σ_{l=1}^{L} α_l f(x | θ_l), where θ = {α_l, θ_l}. The class labels are viewed as random quantities and are assumed chosen conditioned on the selected mixture component m_i ∈ {1, 2, …, L} and possibly on the feature value, i.e., according to the probabilities P[c_i | x_i, m_i]. Thus, the optimal classification rule for this model is the MAP rule: S(x) = argmax_k Σ_j P[c_i = k | m_i = j, x_i] P[m_i = j | x_i]. Unlabeled examples can be used to help estimate the mixture term P[m_i = j | x_i].
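A minimal numerical sketch of the MAP rule above, using a made-up two-component 1-D Gaussian mixture with made-up component-conditional class probabilities P[c | m] (all parameter values are illustrative, not from the paper):

```python
import numpy as np

# Toy instance of the mixture setup: two 1-D Gaussian components, each with
# its own class-membership probabilities P[c | m].  In practice the mixture
# parameters below are exactly what unlabeled data helps to estimate.
alphas = np.array([0.5, 0.5])          # mixing proportions alpha_l
means  = np.array([-2.0, 2.0])         # component means
sigma  = 1.0                           # shared standard deviation
p_c_given_m = np.array([[0.9, 0.1],    # P[c | m = 0]
                        [0.2, 0.8]])   # P[c | m = 1]

def gaussian(x, mu, s):
    return np.exp(-0.5 * ((x - mu) / s) ** 2) / (s * np.sqrt(2 * np.pi))

def map_classify(x):
    # Posterior over mixture components: P[m | x]
    joint = alphas * gaussian(x, means, sigma)
    p_m_given_x = joint / joint.sum()
    # MAP rule: argmax_c  sum_m  P[c | m] * P[m | x]
    p_c_given_x = p_m_given_x @ p_c_given_m
    return int(np.argmax(p_c_given_x))
```

A point deep in component 0's region, e.g. x = -3, is assigned class 0, while x = 3 is assigned class 1.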
6. Transductive SVM A transductive SVM takes a particular test set into account and tries to minimize misclassifications of just those particular examples [T. Joachims, ICML'99]. Concretely, it uses the unlabeled examples to help identify the maximum-margin hyperplane. (Figure reprinted from [T. Joachims, ICML'99].)
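The effect can be illustrated with a toy 1-D sketch (this is not Joachims' optimization procedure; the data and the brute-force threshold search are purely illustrative): the labeled points alone leave the boundary location ambiguous, while the low-density gap in the test points pins it down.

```python
import numpy as np

# Hypothetical 1-D data: two labeled points far apart, and unlabeled test
# points forming two clusters with a gap between them.
X_lab = np.array([-3.0, 3.0])
y_lab = np.array([-1, 1])
X_unl = np.array([-2.5, -2.0, 1.0, 1.5])       # unlabeled test points

def margin(theta, pts):
    # distance from threshold theta to the nearest point
    return np.min(np.abs(pts - theta))

thresholds = np.linspace(-2.0, 1.0, 301)

# Inductive choice: maximize the margin over labeled points only.
ind = max(thresholds, key=lambda t: margin(t, X_lab))

# Transductive choice: maximize the margin over labeled AND test points,
# i.e., route the boundary through the low-density gap of the test set.
trn = max(thresholds, key=lambda t: margin(t, np.concatenate([X_lab, X_unl])))
```

Here the inductive boundary lands at 0.0 (midway between the labeled points), while the transductive boundary shifts to -0.5, the middle of the gap in the test data.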
7. Active Learning: Getting More from Queries The labels of the training examples are obtained by querying the oracle. Thus, for the same number of queries, more helpful information can be obtained by actively selecting which unlabeled examples to query. Key: select the unlabeled examples whose labels will convey the most helpful information for the learner.
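One common selection criterion is uncertainty sampling; the sketch below (toy model and pool, assumed for illustration) queries the unlabeled example whose predicted class probability is closest to 0.5 under the current model:

```python
import numpy as np

# Uncertainty sampling: ask the oracle about the example the current model
# is least sure about.  The logistic model and the pool are made up.
w, b = 1.0, 0.0                               # current (toy) model weights
X_pool = np.array([-4.0, -0.1, 0.05, 3.0])    # unlabeled pool

def p_pos(x):
    # predicted probability of the positive class
    return 1.0 / (1.0 + np.exp(-(w * x + b)))

uncertainty = np.abs(p_pos(X_pool) - 0.5)     # 0 = maximally uncertain
query_idx = int(np.argmin(uncertainty))       # index of the example to query
```

In this pool, the point at 0.05 sits closest to the decision boundary, so it is the one selected for labeling.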
12. Co-training (con't) [A. Blum & T. Mitchell, COLT'98] Two learners are trained from the labeled training examples: learner 1 on the X1 view and learner 2 on the X2 view. Each learner then labels some unlabeled training examples, and these newly labeled examples are added to the pool of labeled training examples.
20. The Yarowsky Algorithm (Yarowsky 1995) Iteration 0: train a classifier by supervised learning on the initial labeled data. Each subsequent iteration (1, 2, …): choose the instances labeled with high confidence, add them to the pool of current labeled training data, and retrain.
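The loop above can be sketched as follows, with a nearest-centroid classifier standing in for the supervised learner and a made-up distance-margin confidence threshold (data and threshold are illustrative, not from the paper):

```python
import numpy as np

# Yarowsky-style self-training: repeatedly move confidently-labeled
# instances from the unlabeled pool into the labeled training set.
X_lab = np.array([-3.0, 3.0])                 # seed labeled data
y_lab = np.array([0, 1])
X_unl = [-2.6, -2.2, -0.3, 2.1, 2.4]          # unlabeled pool

CONF = 1.0    # confidence threshold on the centroid-distance margin

for it in range(3):
    # "classifier" = nearest centroid per class, retrained each iteration
    c0 = X_lab[y_lab == 0].mean()
    c1 = X_lab[y_lab == 1].mean()
    still_unlabeled = []
    for x in X_unl:
        d0, d1 = abs(x - c0), abs(x - c1)
        if abs(d0 - d1) >= CONF:              # labeled with high confidence?
            X_lab = np.append(X_lab, x)
            y_lab = np.append(y_lab, 0 if d0 < d1 else 1)
        else:
            still_unlabeled.append(x)         # stays in the pool for now
    X_unl = still_unlabeled
```

The four points near the class centroids are absorbed into the labeled pool, while the ambiguous point at -0.3 never clears the confidence threshold and remains unlabeled.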
25. Co-Training C1: a classifier trained on view 1. C2: a classifier trained on view 2. At each iteration t: allow C1 to label some instances and allow C2 to label some instances; add the self-labeled instances to the pool of training data, then retrain for iteration t+1, and so on.
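A minimal sketch of this loop, where the two coordinates of each point serve as the two views and each view trains its own nearest-centroid classifier (the data, the views, and the one-instance-per-round labeling schedule are all illustrative):

```python
import numpy as np

# Co-training with two views: view 0 and view 1 are the two coordinates.
# Each view's classifier labels one instance per round for the shared pool.
X = np.array([[-2.0, -1.8], [2.0, 1.9],                      # labeled seeds
              [-1.5, -1.6], [1.4, 1.7], [-1.2, -1.1], [1.1, 1.3]])
y = np.array([0, 1, -1, -1, -1, -1])                         # -1 = unlabeled

def centroid_clf(view):
    # train a 1-D nearest-centroid classifier on one view of the labeled data
    lab = y != -1
    c0 = X[lab & (y == 0), view].mean()
    c1 = X[lab & (y == 1), view].mean()
    return lambda v: 0 if abs(v - c0) <= abs(v - c1) else 1

for t in range(2):                    # a few co-training rounds
    for view in (0, 1):               # C1 (view 0), then C2 (view 1)
        clf = centroid_clf(view)
        unl = np.where(y == -1)[0]
        if unl.size == 0:
            break
        # label the instance this view is most confident about: here,
        # the unlabeled point farthest from the origin in this view
        i = unl[np.argmax(np.abs(X[unl, view]))]
        y[i] = clf(X[i, view])        # self-labeled, added to the pool
```

After two rounds every instance has been labeled, alternating between the two classifiers, and each newly labeled point improves the centroids both views train on next.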