SlideShare una empresa de Scribd logo
1 de 1
IMPROVING IMAGE TAG RECOMMENDATION
                                                                     USING FAVORITE IMAGE CONTEXT
                                                              Wonyong Eom, Sihyoung Lee, Wesley De Neve, and Yong Man Ro
                                                                                      Image and Video Systems Lab
                                                                         Korea Advanced Institute of Science and Technology (KAIST)
                                                                                           Daejeon, South Korea
                                                           e-mail: ymro@ee.kaist.ac.kr                                                  website: http://ivylab.kaist.ac.kr

I. INTRODUCTION                                                                                          III. EXPERIMENTS
- Observation                                                                                            1. Experimental setup
   - the number of images shared on online social network services                                          - We collected images from Flickr users meeting the following
     keeps growing at a fast rate                                                                             requirements: 1) uploaded at least 100 images, 2) assigned at least
- Problem                                                                                                     500 tags, and 3) bookmarked at least 500 favorite images
   - manual tagging of images is labor intensive and time consuming,                                        - Consequently, using the Flickr API, we retrieved a total of 387,397
     making it difficult to facilitate effective image retrieval                                              images from 27 users (on September 30, 2010)
- Novel solution                                                                                               - the images retrieved are either favorite images or images owned by
   - personalized tag recommendation using favorite image context                                                the 27 users, and are annotated with 4,657,288 tags by 46,686 users
      • source of collective knowledge that consists of images and                                          - To calculate visual distance, we used global and local image features
        associated tags that have been bookmarked by a particular user
   - assumptions made                                                                                     2. Effectiveness of using favorite image context
      • favorite images and their associated tags are indicative of the                                     - Recommending tags using tag statistics: Rtag(t, q)
                                                                                                                   Context                     P@5                        S@5                      P@1
         visual and topical interests of a user                                                                   Personal                     0.158                      0.609                    0.318
      • people actively bookmark favorite images                                                                  Collective                   0.208                      0.612                    0.373
                                                                                                                   Favorite                    0.247                      0.729                    0.457
II. PROPOSED TAG RECOMMENDATION METHOD                                                                     - Recommending tags using visual similarity: SCD-based Rimg(t, q)
1. Number of favorite images on Flickr for users of MIRFLICKR-25000                                                Context                     P@5                        S@5                      P@1
                                                                                                                  Personal                     0.187                      0.629                    0.384
                                             1000000
                                                                                                                  Collective                   0.208                      0.611                    0.324
                                                                                                                   Favorite                    0.294                      0.813                    0.446
                 Number of favorite images




                                              100000
                                                                                                           - Recommending tags using visual similarity: BoVW-based Rimg(t, q)
                                               10000
                                                                                                                   Context                     P@5                        S@5                      P@1
                                                1000                                                              Personal                     0.206                      0.697                    0.367
                                                                                                                  Collective                   0.309                      0.767                    0.523
                                                 100                                                               Favorite                    0.317                      0.813                    0.513

                                                  10
                                                                                                          3. Influence of linear fusion and bookmarking activity
                                                   1
                                                       1                                        9861                          0.4
                                                       187        2538              6872   8855 9861
                                                                           User

                                                                                                                              0.3
              Fig. 1. Number of favorite images per MIRFLICKR-25000 user

2. Personal, collective, and favorite image context
                                                                                                                        P@5




                                                                                                                              0.2
                                                                                                                                                   tag statistics
                                                                                                                                                   visual similarity (SCD)
                                                                                                                              0.1                  visual similarity (BoVW)
                                                                                                                                                   tag statistics + visual similarity (SCD)
                                                                                                                                                   tag statistics + visual similarity (BoVW)
                personal context
                                                                                                                               0
                                                                                                                              Level 1             Level 2             Level 3                  Level 4
                                                                                                                                                      Type of user group
                favorite image context
                                                                                                                  Fig. 3. P@5 for users with different levels of bookmarking activity
                                                                            ...
                                                                                                          4. Example query images
              collective context
                                                                           ...                                                                                                                  tag statistics +
                                                                                                             query image                  tag statistics          visual similarity
                                                                                                                                                                                               visual similarity
 Fig. 2. Relation between personal, collective, and favorite image context, visualized                                              nature, africa, photo, nature, wildlife,
 from the point-of-view of a user who uploaded a new image that is to be annotated                                                                                                nature, wildlife,
                                                                                                                                           image,         macro, birds, africa,
                                                                                                                                                                                macro, birds, africa,
                                                                                                                                    moth, bird, wildlife,  flower, animal,
 3. Mathematical modeling                                                                                                                                                       animal, bird, flower,
                                                                                                                                            birds,          flowers, safari,
-The relevance of a set of tags Tq to the content of a query image q                                                                                                               safari, flowers
                                                                
                                                                                                                                      macro, Australia         butterfly
Tq  t t  T and R(t , q)   tag
-R(t, q) is modeled by linearly fusing the output of two relevance functions                                                          sardegna, mare,          italy, bw, red,     italy, bw, red, green,
                                                                                                                                    donna, fitness, street, milano, green, street,     street, milano,
R(t , q)    Rtag (t , q)  (1   )  Rimg (t , q)                                                                               red, luce, bw, green, silhouette, people,         silhouette, light,
-Rtag(t, q) is modeled by making use of tag statistics                                                                                      light               paris, canon         sardegna, shadow
                                                       P(t | v), if P(t | v)  0
R tag (t , q)  P(t )                          
                                               vV
                                                       
                                                        ,       otherwise                                  Fig. 4. Example images with tags recommended using favorite image context

-Rimg(t, q) is modeled using a MAP-based method                                                          IV. CONCLUSIONS
                               P(q | t , Q ) P(t | Q )                                                   - Tag recommendation using favorite image context is, for the users
R img (t , q)  P(t | q, Q )                          ,                                                   selected, more effective than the use of personal and collective context
                                    P(q | Q )                                                            - Linearly fusing tag statistics and visual similarity allows for a higher
                                                                                                           effectiveness in terms of P@5, compared to their separate usage

                                                           IEEE International Conference on Image Processing (ICIP), September 2011, Brussels (Belgium)

Más contenido relacionado

Más de Wesley De Neve

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Wesley De Neve
 
Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Wesley De Neve
 
Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Wesley De Neve
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Wesley De Neve
 
The 5th Aslla Symposium
The 5th Aslla SymposiumThe 5th Aslla Symposium
The 5th Aslla SymposiumWesley De Neve
 
Ghent University Global Campus 101
Ghent University Global Campus 101Ghent University Global Campus 101
Ghent University Global Campus 101Wesley De Neve
 
Booklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumBooklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumWesley De Neve
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusWesley De Neve
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusWesley De Neve
 
Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Wesley De Neve
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesWesley De Neve
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Wesley De Neve
 
GUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsGUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsWesley De Neve
 
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Wesley De Neve
 
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Wesley De Neve
 
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 Exploring Deep Machine Learning for Automatic Right Whale Recognition and No... Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...Wesley De Neve
 
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Wesley De Neve
 
Towards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingTowards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingWesley De Neve
 
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Wesley De Neve
 
Towards Twitter hashtag recommendation using distributed word representations...
Towards Twitter hashtag recommendation using distributed word representations...Towards Twitter hashtag recommendation using distributed word representations...
Towards Twitter hashtag recommendation using distributed word representations...Wesley De Neve
 

Más de Wesley De Neve (20)

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
 
Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...
 
Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
 
The 5th Aslla Symposium
The 5th Aslla SymposiumThe 5th Aslla Symposium
The 5th Aslla Symposium
 
Ghent University Global Campus 101
Ghent University Global Campus 101Ghent University Global Campus 101
Ghent University Global Campus 101
 
Booklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumBooklet for the First GUGC Research Symposium
Booklet for the First GUGC Research Symposium
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniques
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
 
GUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsGUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and Bioinformatics
 
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
 
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
 
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 Exploring Deep Machine Learning for Automatic Right Whale Recognition and No... Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
 
Towards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingTowards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processing
 
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
 
Towards Twitter hashtag recommendation using distributed word representations...
Towards Twitter hashtag recommendation using distributed word representations...Towards Twitter hashtag recommendation using distributed word representations...
Towards Twitter hashtag recommendation using distributed word representations...
 

Último

Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGDSC PJATK
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfUK Journal
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...FIDO Alliance
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101vincent683379
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekCzechDreamin
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftshyamraj55
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIES VE
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfFIDO Alliance
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeCzechDreamin
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024Stephanie Beckett
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceSamy Fodil
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyJohn Staveley
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FIDO Alliance
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireExakis Nelite
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...CzechDreamin
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfFIDO Alliance
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...CzechDreamin
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...panagenda
 

Último (20)

Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 

Improving Image Tag Recommendation Using Favorite Image Context

  • 1. IMPROVING IMAGE TAG RECOMMENDATION USING FAVORITE IMAGE CONTEXT Wonyong Eom, Sihyoung Lee, Wesley De Neve, and Yong Man Ro Image and Video Systems Lab Korea Advanced Institute of Science and Technology (KAIST) Daejeon, South Korea e-mail: ymro@ee.kaist.ac.kr website: http://ivylab.kaist.ac.kr I. INTRODUCTION III. EXPERIMENTS - Observation 1. Experimental setup - the number of images shared on online social network services - We collected images from Flickr users meeting the following keeps growing at a fast rate requirements: 1) uploaded at least 100 images, 2) assigned at least - Problem 500 tags, and 3) bookmarked at least 500 favorite images - manual tagging of images is labor intensive and time consuming, - Consequently, using the Flickr API, we retrieved a total of 387,397 making it difficult to facilitate effective image retrieval images from 27 users (on September 30, 2010) - Novel solution - the images retrieved are either favorite images or images owned by - personalized tag recommendation using favorite image context the 27 users, and are annotated with 4,657,288 tags by 46,686 users • source of collective knowledge that consists of images and - To calculate visual distance, we used global and local image features associated tags that have been bookmarked by a particular user - assumptions made 2. Effectiveness of using favorite image context • favorite images and their associated tags are indicative of the - Recommending tags using tag statistics: Rtag(t, q) Context P@5 S@5 P@1 visual and topical interests of a user Personal 0.158 0.609 0.318 • people actively bookmark favorite images Collective 0.208 0.612 0.373 Favorite 0.247 0.729 0.457 II. PROPOSED TAG RECOMMENDATION METHOD - Recommending tags using visual similarity: SCD-based Rimg(t, q) 1. Number of favorite images on Flickr for users of MIRFLICKR-25000 Context P@5 S@5 P@1 Personal 0.187 0.629 0.384 1000000 Collective 0.208 0.611 0.324 Favorite 0.294 0.813 0.446 Number of favorite images 100000 - Recommending tags using visual similarity: BoVW-based Rimg(t, q) 10000 Context P@5 S@5 P@1 1000 Personal 0.206 0.697 0.367 Collective 0.309 0.767 0.523 100 Favorite 0.317 0.813 0.513 10 3. Influence of linear fusion and bookmarking activity 1 1 9861 0.4 187 2538 6872 8855 9861 User 0.3 Fig. 1. Number of favorite images per MIRFLICKR-25000 user 2. Personal, collective, and favorite image context P@5 0.2 tag statistics visual similarity (SCD) 0.1 visual similarity (BoVW) tag statistics + visual similarity (SCD) tag statistics + visual similarity (BoVW) personal context 0 Level 1 Level 2 Level 3 Level 4 Type of user group favorite image context Fig. 3. P@5 for users with different levels of bookmarking activity ... 4. Example query images collective context ... tag statistics + query image tag statistics visual similarity visual similarity Fig. 2. Relation between personal, collective, and favorite image context, visualized nature, africa, photo, nature, wildlife, from the point-of-view of a user who uploaded a new image that is to be annotated nature, wildlife, image, macro, birds, africa, macro, birds, africa, moth, bird, wildlife, flower, animal, 3. Mathematical modeling animal, bird, flower, birds, flowers, safari, -The relevance of a set of tags Tq to the content of a query image q safari, flowers   macro, Australia butterfly Tq  t t  T and R(t , q)   tag -R(t, q) is modeled by linearly fusing the output of two relevance functions sardegna, mare, italy, bw, red, italy, bw, red, green, donna, fitness, street, milano, green, street, street, milano, R(t , q)    Rtag (t , q)  (1   )  Rimg (t , q) red, luce, bw, green, silhouette, people, silhouette, light, -Rtag(t, q) is modeled by making use of tag statistics light paris, canon sardegna, shadow P(t | v), if P(t | v)  0 R tag (t , q)  P(t )  vV   , otherwise Fig. 4. Example images with tags recommended using favorite image context -Rimg(t, q) is modeled using a MAP-based method IV. CONCLUSIONS P(q | t , Q ) P(t | Q ) - Tag recommendation using favorite image context is, for the users R img (t , q)  P(t | q, Q )  , selected, more effective than the use of personal and collective context P(q | Q ) - Linearly fusing tag statistics and visual similarity allows for a higher effectiveness in terms of P@5, compared to their separate usage IEEE International Conference on Image Processing (ICIP), September 2011, Brussels (Belgium)