Multilabel classification is an extension of conventional classification in which a single instance can be associated with multiple labels. Recent research has shown that, just as for standard classification, instance-based learning algorithms relying on the nearest neighbor estimation
principle can be used quite successfully in this context. In this paper, we propose a new instance-based approach to multilabel classification, which is based on calibrated label ranking, a recently proposed
framework that unifies multilabel classification and label ranking. Within
this framework, instance-based prediction is realized in the form of MAP
estimation, assuming a statistical distribution called the Mallows model.
Visitor Hotel Preference Learning Using Label Ranking
1. Weiwei Cheng & Eyke Hüllermeier
Knowledge Engineering & Bioinformatics Lab
Department of Mathematics and Computer Science
University of Marburg, Germany
2. Label Ranking (an example)
Learning visitor’s preferences on hotels
visitor 1: Golf ≻ Park ≻ Krim
visitor 2: Krim ≻ Golf ≻ Park
visitor 3: Krim ≻ Park ≻ Golf
visitor 4: Park ≻ Golf ≻ Krim
new visitor: ??? (the label ranking to be predicted)
where each visitor is described by a feature vector,
e.g., (gender, age, place of birth, is a professor, …)
3. Label Ranking (an example)
Learning visitor’s preferences on hotels
            Golf  Park  Krim
visitor 1     1     2     3
visitor 2     2     3     1
visitor 3     3     2     1
visitor 4     2     1     3
new visitor   ?     ?     ?
π(i) = position of the i-th label in the ranking
1: Golf 2: Park 3: Krim
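The position notation π(i) above can be illustrated with a short Python sketch (label names are taken from the slides; the helper name `positions` is ours):

```python
# A minimal sketch of the table above: positions(ranking)[i] is pi(i),
# the position of the i-th label in a visitor's ranking.
LABELS = ["Golf", "Park", "Krim"]

def positions(ranking):
    """Map a ranking (labels ordered best to worst) to its position vector."""
    return [ranking.index(label) + 1 for label in LABELS]

# visitor 2 prefers Krim over Golf over Park:
print(positions(["Krim", "Golf", "Park"]))  # -> [2, 3, 1]
```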
4. Label Ranking (more formally)
Given:
a set of training instances {x_1, …, x_n} ⊆ X
a set of labels L = {λ_1, …, λ_m}
for each training instance x_k: a set of pairwise preferences
of the form λ_i ≻ λ_j (for some of the labels)
Find:
A ranking function (an X → Ω mapping) that maps each x ∈ X
to a ranking ≻_x of L (a permutation π_x) and generalizes well
in terms of a loss function on rankings (e.g., Kendall's tau)
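Kendall's tau, used here as the loss on rankings, counts the label pairs on which two rankings disagree; a minimal sketch (the function name is ours):

```python
# Kendall distance between two position vectors: the number of label
# pairs (i, j) ranked in opposite order by pi and sigma.
from itertools import combinations

def kendall_distance(pi, sigma):
    """Count discordant pairs between two position vectors."""
    return sum(1 for i, j in combinations(range(len(pi)), 2)
               if (pi[i] - pi[j]) * (sigma[i] - sigma[j]) < 0)

# visitor 1 (Golf > Park > Krim) vs. visitor 2 (Krim > Golf > Park):
print(kendall_distance([1, 2, 3], [2, 3, 1]))  # -> 2
```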
6. Instance-based Label Ranking
Nearest neighbor estimation:
Target function is estimated (on demand) in a local way.
Distribution of rankings is (approx.) constant in a local region.
Core part is to estimate the locally constant model.
7. Probabilistic Model for Ranking
Mallows model (Mallows, Biometrika, 1957):

P(σ | θ, π) = exp(−θ · D(σ, π)) / φ(θ)

with
center ranking π
spread parameter θ ≥ 0
and D a metric on permutations
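For a small label set, the Mallows model can be sketched by brute force, with the normalizing constant φ(θ) computed as a sum over all permutations (function names are ours; D is taken to be the Kendall distance):

```python
# A brute-force sketch of the Mallows model: P(sigma | theta, pi) is
# proportional to exp(-theta * D(sigma, pi)), with D the Kendall
# distance and phi(theta) obtained by summing over all permutations.
from itertools import combinations, permutations
from math import exp

def kendall(pi, sigma):
    """Kendall distance: number of discordant label pairs."""
    return sum(1 for i, j in combinations(range(len(pi)), 2)
               if (pi[i] - pi[j]) * (sigma[i] - sigma[j]) < 0)

def mallows_prob(sigma, pi, theta):
    """P(sigma | theta, pi) for position vectors sigma and pi."""
    phi = sum(exp(-theta * kendall(tau, pi))
              for tau in permutations(range(1, len(pi) + 1)))
    return exp(-theta * kendall(sigma, pi)) / phi

# the center ranking is the mode; larger theta concentrates mass on it
print(round(mallows_prob((1, 2, 3), (1, 2, 3), 1.0), 3))
```

For θ = 0 the distribution is uniform over all m! rankings, which matches the role of θ as a spread parameter.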
8. Inference
An incomplete observation is consistent with several complete
rankings (its extensions); its probability is the sum of the
probabilities of all these extensions.
An example: the partial ranking Golf ≻ Krim has the extensions
Golf ≻ Park ≻ Krim, Golf ≻ Krim ≻ Park, and Park ≻ Golf ≻ Krim.
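The extension-based inference can be sketched as follows; the snippet is self-contained and all names are illustrative, with pairwise preferences encoded as index pairs:

```python
# Probability of an incomplete observation under a Mallows model:
# sum the probabilities of all complete rankings consistent with it.
from itertools import combinations, permutations
from math import exp

def kendall(pi, sigma):
    return sum(1 for i, j in combinations(range(len(pi)), 2)
               if (pi[i] - pi[j]) * (sigma[i] - sigma[j]) < 0)

def mallows_prob(sigma, pi, theta):
    phi = sum(exp(-theta * kendall(tau, pi))
              for tau in permutations(range(1, len(pi) + 1)))
    return exp(-theta * kendall(sigma, pi)) / phi

def extensions(prefs, m):
    """All position vectors over m labels consistent with pairwise
    preferences given as (i, j) pairs, meaning label i is preferred."""
    return [sigma for sigma in permutations(range(1, m + 1))
            if all(sigma[i] < sigma[j] for i, j in prefs)]

def prob_incomplete(prefs, pi, theta):
    return sum(mallows_prob(sigma, pi, theta)
               for sigma in extensions(prefs, len(pi)))

# Golf > Krim (labels 0 and 2) has three extensions over three labels:
print(len(extensions([(0, 2)], 3)))  # -> 3
```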
9. Inference (Cont.)
For a set of observed rankings σ_1, …, σ_k:
Maximum Likelihood Estimation (MLE) becomes a difficult,
non-convex optimization problem!
10. Special Structure of Observations
In the multilabel classification context, the observation is
a special type of partial ranking. It contains:
two tie groups (i.e., the relevant and the irrelevant label sets)
all labels (i.e., no label is missing)
By exploiting this special structure, the MLE can be computed
much more efficiently!
11. First Theorem
Theorem: the MLE of the center ranking is obtained by sorting
the labels according to their frequency of occurrence in the
neighborhood.
Problem: what about ties?
Solution: replace MLE with Bayes estimation.
12. Second Theorem
A simple prediction procedure:
1. Sort the labels by their frequency of occurrence in the neighborhood.
2. Break ties by their frequency outside the neighborhood.
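The two-step procedure can be sketched in Python (all names are illustrative; each label set stands for the relevant labels of a training instance inside or outside the neighborhood):

```python
# Rank labels by how often they are relevant among the nearest
# neighbors, breaking ties by their frequency outside the neighborhood.

def rank_labels(neighbor_labelsets, other_labelsets, labels):
    """Sort labels by (neighborhood frequency, outside frequency),
    most relevant first."""
    def freq(labelsets, label):
        return sum(label in s for s in labelsets)
    return sorted(labels,
                  key=lambda lb: (freq(neighbor_labelsets, lb),
                                  freq(other_labelsets, lb)),
                  reverse=True)

neighbors = [{"Golf", "Park"}, {"Golf"}, {"Park", "Krim"}]
others = [{"Krim"}, {"Krim", "Park"}]
print(rank_labels(neighbors, others, ["Golf", "Park", "Krim"]))
# -> ['Park', 'Golf', 'Krim']
```

To turn this ranking into a multilabel prediction, calibrated label ranking additionally inserts a virtual calibration label separating the relevant from the irrelevant labels; that step is omitted in this sketch.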
16. Our Contributions
A new instance-based multilabel classifier with state-of-the-art
predictive accuracy;
computationally very efficient;
a very simple prediction procedure, justified by an underlying
probabilistic ranking model.
The End