SlideShare una empresa de Scribd logo
1 de 20
Descargar para leer sin conexión
WP	
  3
                              User	
  profiling	
  &	
  
                          Recommenda6on	
  (Part	
  3)
                               BBC,	
  Pro-­‐ne+cs,	
  VUA
                                                             1

Wednesday, March 28, 12
Contents
         Overview
         User profiling
                 General goal & approach
                 From activity streams to profile
                 Issues
                 Analytics
                 Beancounter

         Recommendations
                 General goal & approach
                 Semantic recommendation
                 Statistical recommendation
                 Hybrid recommendation

         Exploitation
         Conclusions

                26-27 March 2012               NoTube 3rd Review   2

Wednesday, March 28, 12
Overview



                                              Semantic Content                   Semantic
                                                 Patterns for                   Pattern-based
                                                TV Programs                    Recommendation
              EPG Metadata     TV Program
                                                                                   Strategy
                 (BBC)         Enrichment
                                            RDF Graph                            Statistical
                                                TV            Recommendation    Similarity-based
                                             Programs             Service      Recommendation
                                                                                   Strategy
              User Ratings &
              Demographics     User Data         Similarity
                (BBC EPG       Analysis          Clusters                          Hybrid
                  Data)                        of Programs                     Recommendation
                                                                                   Strategy




                                                                                                   End End-Users
                                                                                                       Users




                26-27 March 2012                                   NoTube 3rd Review                         3

Wednesday, March 28, 12
Overview



                                               Semantic Content                   Semantic
                                                  Patterns for                   Pattern-based
                                                 TV Programs                    Recommendation
              EPG Metadata      TV Program
                                                                                    Strategy
                 (BBC)          Enrichment
                                             RDF Graph                            Statistical
                                                 TV            Recommendation    Similarity-based
                                              Programs             Service      Recommendation
                                                                                    Strategy
              User Ratings &
              Demographics      User Data         Similarity
                (BBC EPG        Analysis          Clusters                          Hybrid
                  Data)                         of Programs                     Recommendation
                                                                                    Strategy




                     BEA
                               NCO
                                  UNT
                                     E         R
                                                                                                    End End-Users
                                                                                                        Users




                26-27 March 2012                                    NoTube 3rd Review                         3

Wednesday, March 28, 12
Statistical recommendations


        • We had privileged access to two bulk user ratings datasets
          from BBC
        • From these, used Apache Mahout toolkit to derive "item to
          item" similarity measures between each pair of items
        • With larger (20k users) this worked well; with a smaller (1k)
          dataset, less well
        • With BBC, investigating publication of these behaviour-
          derived similarity measures




                26-27 March 2012   NoTube 3rd Review      4

Wednesday, March 28, 12
Hybrid models:
 factual paths and statistical similarity




(and not to mention ‘@wossy’ is on Twitter with 1 million followers...)
                                                                          31
Wednesday, March 28, 12
Statistical recommendation

                                                                           12k


                          9        8     5                 2   0       9



                          0        0     8                 8   8       6



                          3        2     7                 9   9       8

        20k
                26-27 March 2012       NoTube 3rd Review           6

Wednesday, March 28, 12
Statistical recommendation




                          9        0     0                 0   0       9



                          0        0     8                 0   8       0



                          0        0     7                 0   9       8

                26-27 March 2012       NoTube 3rd Review           7

Wednesday, March 28, 12
9
Wednesday, March 28, 12
10
Wednesday, March 28, 12
11
Wednesday, March 28, 12
12
Wednesday, March 28, 12
TV Preference Data is very sparse


        • Even for a single service (e.g. Netflix), data is
          ‘overwhelmingly sparse’
        • For NoTube’s open systems, challenges multiply:
            – often no global view, only per-user data
            – many ways of identifying the same content item
            – many ways of identifying the same user
            – never mind other entities (actors, directors, ...)
        • Q: Can we tell a story about how organizations with such
          privileged overviews can contribute in a privacy respecting
          way to the public commons of linked data? (A: yes! see WP4)
                26-27 March 2012        NoTube 3rd Review          12

Wednesday, March 28, 12
Fragmentation by site




                26-27 March 2012     NoTube 3rd Review   13

Wednesday, March 28, 12
29
Wednesday, March 28, 12
30
Wednesday, March 28, 12
Statistical recommendation:
                              Process

        • Build on best-in-class opensource code, rather than re-
          invent
        • Big-data ready (Hadoop-based)
        • Of various options, LogLikelihoodSimilarity generally gave
          best results (standard 'withold some ratings' evaluation
          strategy)
        • Other explorations: including large scale (1/2 billion tweet)
          Twitter analysis, Spectral Clustering, using
          demographics, ...



                26-27 March 2012   NoTube 3rd Review    16

Wednesday, March 28, 12
Exploitation & Further
                             Development

         Beancounter:
         •Pronetics’ user profiling SaaS
         •integration in the e-commerce technological solution
         • making it more general purpose
         • making it capable of big data management a SaaS
         playground for Semantic Web researcher

         •open source licensing
         •community extensions


                26-27 March 2012     NoTube 3rd Review   17

Wednesday, March 28, 12
Exploitation & Further
                             Development

         Recommendations:
         •explore further the combination of demographic
         stereotypes & semantics in a hybrid approach to learn a
         prediction model for the shows a user is most likely
         interested in
         •integrate in personalized semantic search frameworks
         •extend with additional LOD sources
         •test further the measures for diversity, serendipity and
         predictability

         •open source licensing
         •community extensions
                26-27 March 2012     NoTube 3rd Review   18

Wednesday, March 28, 12
Acknowledgements




                26-27 March 2012   NoTube 3rd Review   19

Wednesday, March 28, 12

Más contenido relacionado

Destacado (6)

Keynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV ContentKeynote at SMAP2012: Personalized Access to TV Content
Keynote at SMAP2012: Personalized Access to TV Content
 
Lecture 5: Personalization on the Social Web (2014)
Lecture 5: Personalization on the Social Web (2014)Lecture 5: Personalization on the Social Web (2014)
Lecture 5: Personalization on the Social Web (2014)
 
Semantic Digital Humanities Workshop 2015 @Oxford
Semantic Digital Humanities Workshop 2015 @OxfordSemantic Digital Humanities Workshop 2015 @Oxford
Semantic Digital Humanities Workshop 2015 @Oxford
 
Lecture 5: Personalization on the Social Web (2013)
Lecture 5: Personalization on the Social Web (2013)Lecture 5: Personalization on the Social Web (2013)
Lecture 5: Personalization on the Social Web (2013)
 
Future TV is Now: Personalized & Social
Future TV is Now: Personalized & SocialFuture TV is Now: Personalized & Social
Future TV is Now: Personalized & Social
 
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchSXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
 

Similar a NoTube: Recommendations (Collaborative)

Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172
Editor IJARCET
 
Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172
Editor IJARCET
 

Similar a NoTube: Recommendations (Collaborative) (18)

IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...
IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...
IRJET - Support Vector Machine versus Naive Bayes Classifier:A Juxtaposition ...
 
A Low Rank Mechanism to Detect and Achieve Partially Completed Image Tags
A Low Rank Mechanism to Detect and Achieve Partially Completed Image TagsA Low Rank Mechanism to Detect and Achieve Partially Completed Image Tags
A Low Rank Mechanism to Detect and Achieve Partially Completed Image Tags
 
Survey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction TechniquesSurvey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction Techniques
 
IRJET - Enhanced Movie Recommendation Engine using Content Filtering, Collabo...
IRJET - Enhanced Movie Recommendation Engine using Content Filtering, Collabo...IRJET - Enhanced Movie Recommendation Engine using Content Filtering, Collabo...
IRJET - Enhanced Movie Recommendation Engine using Content Filtering, Collabo...
 
Product Recommendation Systems based on Hybrid Approach Technology
Product Recommendation Systems based on Hybrid Approach TechnologyProduct Recommendation Systems based on Hybrid Approach Technology
Product Recommendation Systems based on Hybrid Approach Technology
 
IRJET - Finger Vein Extraction and Authentication System for ATM
IRJET -  	  Finger Vein Extraction and Authentication System for ATMIRJET -  	  Finger Vein Extraction and Authentication System for ATM
IRJET - Finger Vein Extraction and Authentication System for ATM
 
NoTube: Architecture
NoTube: ArchitectureNoTube: Architecture
NoTube: Architecture
 
Building a business case and institutional policy on a 10Y research data mana...
Building a business case and institutional policy on a 10Y research data mana...Building a business case and institutional policy on a 10Y research data mana...
Building a business case and institutional policy on a 10Y research data mana...
 
IRJET- Semantic Analysis of Online Customer Queries
IRJET-  	  Semantic Analysis of Online Customer QueriesIRJET-  	  Semantic Analysis of Online Customer Queries
IRJET- Semantic Analysis of Online Customer Queries
 
IRJET- Analysis of Music Recommendation System using Machine Learning Alg...
IRJET-  	  Analysis of Music Recommendation System using Machine Learning Alg...IRJET-  	  Analysis of Music Recommendation System using Machine Learning Alg...
IRJET- Analysis of Music Recommendation System using Machine Learning Alg...
 
Partitioning of Query Processing in Distributed Database System to Improve Th...
Partitioning of Query Processing in Distributed Database System to Improve Th...Partitioning of Query Processing in Distributed Database System to Improve Th...
Partitioning of Query Processing in Distributed Database System to Improve Th...
 
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
 
A2 annotation approach
A2 annotation approachA2 annotation approach
A2 annotation approach
 
IRJET- Searching an Optimal Algorithm for Movie Recommendation System
IRJET- Searching an Optimal Algorithm for Movie Recommendation SystemIRJET- Searching an Optimal Algorithm for Movie Recommendation System
IRJET- Searching an Optimal Algorithm for Movie Recommendation System
 
Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172
 
Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172Volume 2-issue-6-2165-2172
Volume 2-issue-6-2165-2172
 
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
 
Aug 23, 2010 entrips evaluation presentation.ppt
Aug 23, 2010 entrips evaluation presentation.pptAug 23, 2010 entrips evaluation presentation.ppt
Aug 23, 2010 entrips evaluation presentation.ppt
 

Más de MODUL Technology GmbH

14 no tube dissemination and showcases [compatibility mode]
14 no tube dissemination and showcases [compatibility mode]14 no tube dissemination and showcases [compatibility mode]
14 no tube dissemination and showcases [compatibility mode]
MODUL Technology GmbH
 
NoTube: Ad Insertion [compatibility mode]
NoTube: Ad Insertion [compatibility mode]NoTube: Ad Insertion [compatibility mode]
NoTube: Ad Insertion [compatibility mode]
MODUL Technology GmbH
 

Más de MODUL Technology GmbH (20)

How distinct and aligned with UGC is European capitals’ DMO branding on Insta...
How distinct and aligned with UGC is European capitals’ DMO branding on Insta...How distinct and aligned with UGC is European capitals’ DMO branding on Insta...
How distinct and aligned with UGC is European capitals’ DMO branding on Insta...
 
Framing Few Shot Knowledge Graph Completion with Large Language Models
Framing Few Shot Knowledge Graph Completion with Large Language ModelsFraming Few Shot Knowledge Graph Completion with Large Language Models
Framing Few Shot Knowledge Graph Completion with Large Language Models
 
Unsupervised Topic Modeling with BERTopic for Coarse and Fine-Grained News Cl...
Unsupervised Topic Modeling with BERTopic for Coarse and Fine-Grained News Cl...Unsupervised Topic Modeling with BERTopic for Coarse and Fine-Grained News Cl...
Unsupervised Topic Modeling with BERTopic for Coarse and Fine-Grained News Cl...
 
Breaking New Ground with EPOCH: AI and Web Intelligence Transform Price Forec...
Breaking New Ground with EPOCH: AI and Web Intelligence Transform Price Forec...Breaking New Ground with EPOCH: AI and Web Intelligence Transform Price Forec...
Breaking New Ground with EPOCH: AI and Web Intelligence Transform Price Forec...
 
New Opportunities for Understanding Tourist Photography.pptx
New Opportunities for Understanding Tourist Photography.pptxNew Opportunities for Understanding Tourist Photography.pptx
New Opportunities for Understanding Tourist Photography.pptx
 
How do destinations relate to one another? A study of visual destination bran...
How do destinations relate to one another? A study of visual destination bran...How do destinations relate to one another? A study of visual destination bran...
How do destinations relate to one another? A study of visual destination bran...
 
Do DMOs promote the right aspects of the destination? A study of Instagram ph...
Do DMOs promote the right aspects of the destination? A study of Instagram ph...Do DMOs promote the right aspects of the destination? A study of Instagram ph...
Do DMOs promote the right aspects of the destination? A study of Instagram ph...
 
The Impact of Social Media on perceived Destination Image: case of Mexico Ci...
The Impact of Social Media on perceived Destination Image:  case of Mexico Ci...The Impact of Social Media on perceived Destination Image:  case of Mexico Ci...
The Impact of Social Media on perceived Destination Image: case of Mexico Ci...
 
The Impact of Social Media on perceived Destination Image: the case of Mexico...
The Impact of Social Media on perceived Destination Image:the case of Mexico...The Impact of Social Media on perceived Destination Image:the case of Mexico...
The Impact of Social Media on perceived Destination Image: the case of Mexico...
 
How Instagram influences Visual Destination Image - a case study of Jordan an...
How Instagram influences Visual Destination Image - a case study of Jordan an...How Instagram influences Visual Destination Image - a case study of Jordan an...
How Instagram influences Visual Destination Image - a case study of Jordan an...
 
Media mining for smarter tourism
Media mining for smarter tourismMedia mining for smarter tourism
Media mining for smarter tourism
 
14 no tube dissemination and showcases [compatibility mode]
14 no tube dissemination and showcases [compatibility mode]14 no tube dissemination and showcases [compatibility mode]
14 no tube dissemination and showcases [compatibility mode]
 
NoTube: BBC show case
NoTube: BBC show caseNoTube: BBC show case
NoTube: BBC show case
 
NoTube: Stoneroos show case
NoTube: Stoneroos show caseNoTube: Stoneroos show case
NoTube: Stoneroos show case
 
NoTube: RAI Show Case
NoTube: RAI Show CaseNoTube: RAI Show Case
NoTube: RAI Show Case
 
NoTube: Loudness Normalisation
NoTube: Loudness NormalisationNoTube: Loudness Normalisation
NoTube: Loudness Normalisation
 
NoTube: Ad Insertion [compatibility mode]
NoTube: Ad Insertion [compatibility mode]NoTube: Ad Insertion [compatibility mode]
NoTube: Ad Insertion [compatibility mode]
 
NoTube: Metadata Enrichment
NoTube: Metadata EnrichmentNoTube: Metadata Enrichment
NoTube: Metadata Enrichment
 
NoTube: Metadata Interoperability
NoTube: Metadata InteroperabilityNoTube: Metadata Interoperability
NoTube: Metadata Interoperability
 
NoTube: Models & Semantics
NoTube: Models & SemanticsNoTube: Models & Semantics
NoTube: Models & Semantics
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

NoTube: Recommendations (Collaborative)

  • 1. WP  3 User  profiling  &   Recommenda6on  (Part  3) BBC,  Pro-­‐ne+cs,  VUA 1 Wednesday, March 28, 12
  • 2. Contents Overview User profiling General goal & approach From activity streams to profile Issues Analytics Beancounter Recommendations General goal & approach Semantic recommendation Statistical recommendation Hybrid recommendation Exploitation Conclusions 26-27 March 2012 NoTube 3rd Review 2 Wednesday, March 28, 12
  • 3. Overview Semantic Content Semantic Patterns for Pattern-based TV Programs Recommendation EPG Metadata TV Program Strategy (BBC) Enrichment RDF Graph Statistical TV Recommendation Similarity-based Programs Service Recommendation Strategy User Ratings & Demographics User Data Similarity (BBC EPG Analysis Clusters Hybrid Data) of Programs Recommendation Strategy End End-Users Users 26-27 March 2012 NoTube 3rd Review 3 Wednesday, March 28, 12
  • 4. Overview Semantic Content Semantic Patterns for Pattern-based TV Programs Recommendation EPG Metadata TV Program Strategy (BBC) Enrichment RDF Graph Statistical TV Recommendation Similarity-based Programs Service Recommendation Strategy User Ratings & Demographics User Data Similarity (BBC EPG Analysis Clusters Hybrid Data) of Programs Recommendation Strategy BEA NCO UNT E R End End-Users Users 26-27 March 2012 NoTube 3rd Review 3 Wednesday, March 28, 12
  • 5. Statistical recommendations • We had privileged access to two bulk user ratings datasets from BBC • From these, used Apache Mahout toolkit to derive "item to item" similarity measures between each pair of items • With larger (20k users) this worked well; with a smaller (1k) dataset, less well • With BBC, investigating publication of these behaviour- derived similarity measures 26-27 March 2012 NoTube 3rd Review 4 Wednesday, March 28, 12
  • 6. Hybrid models: factual paths and statistical similarity (and not to mention ‘@wossy’ is on Twitter with 1 million followers...) 31 Wednesday, March 28, 12
  • 7. Statistical recommendation 12k 9 8 5 2 0 9 0 0 8 8 8 6 3 2 7 9 9 8 20k 26-27 March 2012 NoTube 3rd Review 6 Wednesday, March 28, 12
  • 8. Statistical recommendation 9 0 0 0 0 9 0 0 8 0 8 0 0 0 7 0 9 8 26-27 March 2012 NoTube 3rd Review 7 Wednesday, March 28, 12
  • 13. TV Preference Data is very sparse • Even for a single service (e.g. Netflix), data is ‘overwhelmingly sparse’ • For NoTube’s open systems, challenges multiply: – often no global view, only per-user data – many ways of identifying the same content item – many ways of identifying the same user – never mind other entities (actors, directors, ...) • Q: Can we tell a story about how organizations with such privileged overviews can contribute in a privacy respecting way to the public commons of linked data? (A: yes! see WP4) 26-27 March 2012 NoTube 3rd Review 12 Wednesday, March 28, 12
  • 14. Fragmentation by site 26-27 March 2012 NoTube 3rd Review 13 Wednesday, March 28, 12
  • 17. Statistical recommendation: Process • Build on best-in-class opensource code, rather than re- invent • Big-data ready (Hadoop-based) • Of various options, LogLikelihoodSimilarity generally gave best results (standard 'withold some ratings' evaluation strategy) • Other explorations: including large scale (1/2 billion tweet) Twitter analysis, Spectral Clustering, using demographics, ... 26-27 March 2012 NoTube 3rd Review 16 Wednesday, March 28, 12
  • 18. Exploitation & Further Development Beancounter: •Pronetics’ user profiling SaaS •integration in the e-commerce technological solution • making it more general purpose • making it capable of big data management a SaaS playground for Semantic Web researcher •open source licensing •community extensions 26-27 March 2012 NoTube 3rd Review 17 Wednesday, March 28, 12
  • 19. Exploitation & Further Development Recommendations: •explore further the combination of demographic stereotypes & semantics in a hybrid approach to learn a prediction model for the shows a user is most likely interested in •integrate in personalized semantic search frameworks •extend with additional LOD sources •test further the measures for diversity, serendipity and predictability •open source licensing •community extensions 26-27 March 2012 NoTube 3rd Review 18 Wednesday, March 28, 12
  • 20. Acknowledgements 26-27 March 2012 NoTube 3rd Review 19 Wednesday, March 28, 12