SlideShare una empresa de Scribd logo
1 de 39
Descargar para leer sin conexión
Mining Big Data and Open
Knowledge Sources to develop
transparent and serendipitous
content-based adaptive systems
Cataldo Musto, Giovanni Semeraro, Fedelucio Narducci
state of the art.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
our research: personalization
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
Recommender Systems
Relevant items (movies, news, books, etc.) are pushed to the
user according to her preferences or her needs.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
Amazon.com
Recommendations
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
current recommendation technologies share three
important drawbacks.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
(1) training is a bottleneck.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
need for
explicit
information
about
user interests.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
(2) recsys are black boxes.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
(3) suggestions are not surprising.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
exploiting big data to build a novel generation
of content-based adaptive systems
solution
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
current work.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
near future work.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
big data.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
Information
Overload
we can handle 126 bits of information
we deal with 393 bits of information
ratio: more than 3x(Source: Adrian C.Ott,The 24-hour customer)
consequence:
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
Information Overload
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
Big Data: obstacle or
opportunity?
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
cornestone 1
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
exploit social media to
model user
preferences.
social media are an opportunity
provide information about user preferences
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
example
user preferences in music from Facebook
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
implicit preferences
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
example
Play.me
playlist
Most popular songs of the artists extracted from Last.fm (as well as
those added through the enrichment) are proposed to the user.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
Myusic
recommendations
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
cornestone 2
exploit entity linking algorithms
to make user profiles more
transparent and LOD-aware
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
MyFeeds
RSS recommendations
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
MyFeeds
transparent user preferences
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
extracted from Facebook.
MyFeeds
transparent user preferences
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
further processing
MyFeeds
entity linking algorithms
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
• They map free text with structured
information
• Wikipedia pages or DBpedia nodes
• examples
• Tag.me ,Wikipedia Miner, DBpedia
Spotlight, etc.
Tag.me
extracts the Wikipedia pages the content refers to.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
Linked Open Data Cloud
Structured
(RDF)
representation
of the information
stored in Wikipedia.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
Linked Open Data Cloud
Profiles based
on Tag.me are
LOD-aware
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
cornestone 3
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
exploit open knowledge sources
to make recommendation
techniques more serendipitous.
‘in vitro’ experiments
Watchmi plug-in
developed by Aprico.tv
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
From BOW to eBOW
Given a description of a TV show, we exploit ESA to
obtain an enhanced representation
The original set of features is enriched with the set of
Wikipedia articles related the most with theTV show
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
TV SHOW
Rad an Rad
Die besten Duelle der MotoGP
(Wheel to wheel
The best duels in the MotoGP)
Wikipedia(Articles(
großer&preis&von&italien&
(motorrad)&
großer&preis&von&malaysia&
(motorrad)&
großer&preis&von&tschechien&
(motorrad)&
scuderia&ferrari&
valen8no&rossi&
motorrad9wm9saison&2005&
motorrad9wm9saison&2006&
max&biaggi&
großer&preis&der&usa&(motorrad)&
motorrad9wm9saison&2008&
rad&(heraldik)&
loris&capirossi&
shin’ya&nakano&
motogp&
example
From BOW to eBOW
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
challenges.
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
issues.
recommendations.
Challenges and Issues
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
• Main challenge and issue:
• data representation and data filtering
• How to exploit these novel data sylos?
• What information is relevant for personalization?
• What kind of processing do data need?
• Which one is the best representation?
• Do reasoning techniques improve profiles transparency and
personalization accuracy?
• Do people accept the exploitation of these data?
• How to model the context?
Recommendations
C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous
content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
• Cornerstones
• Social media-based user profiling
• LOD-aware user profiles
• Open Knowledge Sources for Serendipitous Encounters
• Recommendations
• Promote the LOD initiative, to publish data in a structured
form, to enable reasoning on the information
• Make data sylos interconnected
• To design applications able to properly model, manage and
exploit the big amount of data coming from social media.
questions?
Cataldo Musto, Ph.D. - cataldo.musto@uniba.it

Más contenido relacionado

Similar a Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems

Digital Sustainability in the IT Supply Chain
Digital Sustainability in the IT Supply ChainDigital Sustainability in the IT Supply Chain
Digital Sustainability in the IT Supply ChainMatthias Stürmer
 
Public data ecosystems in and for smart cities: how to make open / Big / smar...
Public data ecosystems in and for smart cities: how to make open / Big / smar...Public data ecosystems in and for smart cities: how to make open / Big / smar...
Public data ecosystems in and for smart cities: how to make open / Big / smar...Anastasija Nikiforova
 
Foresight conversation
Foresight conversationForesight conversation
Foresight conversationsuresh sood
 
Research collaboration between Spain and Switzerland
Research collaboration between Spain and  Switzerland Research collaboration between Spain and  Switzerland
Research collaboration between Spain and Switzerland shengjing 孙胜晶
 
Digital cultural heritage spring 2015 day 2
Digital cultural heritage spring 2015 day 2Digital cultural heritage spring 2015 day 2
Digital cultural heritage spring 2015 day 2Stefano A Gazziano
 
Digital preservation through Digital Sustainability
Digital preservation through Digital SustainabilityDigital preservation through Digital Sustainability
Digital preservation through Digital SustainabilityMatthias Stürmer
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhurymaredata
 
Linked Open Data and data-driven journalism
Linked Open Data and data-driven journalismLinked Open Data and data-driven journalism
Linked Open Data and data-driven journalismPia Jøsendal
 
EUDAT 3rd Conference: Bringing Data e-Infrastructures to Horizon2020 - Carl-C...
EUDAT 3rd Conference: Bringing Data e-Infrastructures to Horizon2020 - Carl-C...EUDAT 3rd Conference: Bringing Data e-Infrastructures to Horizon2020 - Carl-C...
EUDAT 3rd Conference: Bringing Data e-Infrastructures to Horizon2020 - Carl-C...EUDAT
 
Co-Design in Data Science
Co-Design in Data ScienceCo-Design in Data Science
Co-Design in Data ScienceSam Pottinger
 
Susanna Sansone - OpenCon Oxford, 1st Dec 2017
Susanna Sansone - OpenCon Oxford, 1st Dec 2017Susanna Sansone - OpenCon Oxford, 1st Dec 2017
Susanna Sansone - OpenCon Oxford, 1st Dec 2017Crossref
 
Educating Data Scientists: the SoBigData master experience
Educating Data Scientists: the SoBigData master experienceEducating Data Scientists: the SoBigData master experience
Educating Data Scientists: the SoBigData master experienceResearch Data Alliance
 
Open Digital Science & e-infrastructures
Open Digital Science & e-infrastructuresOpen Digital Science & e-infrastructures
Open Digital Science & e-infrastructuresCarl-Christian Buhr
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Geoffrey Fox
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...Geoffrey Fox
 
The FAIR movement - Oxford Open Data Week
The FAIR movement - Oxford Open Data WeekThe FAIR movement - Oxford Open Data Week
The FAIR movement - Oxford Open Data WeekSusanna-Assunta Sansone
 
Machine Learning and Social Participation
Machine Learning and Social ParticipationMachine Learning and Social Participation
Machine Learning and Social ParticipationYasodara Cordova
 
People in the Machine: Human-centric Software Engineering for Smart Systems
People in the Machine: Human-centric Software Engineering for Smart SystemsPeople in the Machine: Human-centric Software Engineering for Smart Systems
People in the Machine: Human-centric Software Engineering for Smart SystemsArosha Bandara
 

Similar a Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems (20)

Digital Sustainability in the IT Supply Chain
Digital Sustainability in the IT Supply ChainDigital Sustainability in the IT Supply Chain
Digital Sustainability in the IT Supply Chain
 
Rdaeu russia_fg_1_july2014_final
Rdaeu  russia_fg_1_july2014_finalRdaeu  russia_fg_1_july2014_final
Rdaeu russia_fg_1_july2014_final
 
Public data ecosystems in and for smart cities: how to make open / Big / smar...
Public data ecosystems in and for smart cities: how to make open / Big / smar...Public data ecosystems in and for smart cities: how to make open / Big / smar...
Public data ecosystems in and for smart cities: how to make open / Big / smar...
 
Foresight conversation
Foresight conversationForesight conversation
Foresight conversation
 
Research collaboration between Spain and Switzerland
Research collaboration between Spain and  Switzerland Research collaboration between Spain and  Switzerland
Research collaboration between Spain and Switzerland
 
Digital cultural heritage spring 2015 day 2
Digital cultural heritage spring 2015 day 2Digital cultural heritage spring 2015 day 2
Digital cultural heritage spring 2015 day 2
 
Digital preservation through Digital Sustainability
Digital preservation through Digital SustainabilityDigital preservation through Digital Sustainability
Digital preservation through Digital Sustainability
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhury
 
Linked Open Data and data-driven journalism
Linked Open Data and data-driven journalismLinked Open Data and data-driven journalism
Linked Open Data and data-driven journalism
 
EUDAT 3rd Conference: Bringing Data e-Infrastructures to Horizon2020 - Carl-C...
EUDAT 3rd Conference: Bringing Data e-Infrastructures to Horizon2020 - Carl-C...EUDAT 3rd Conference: Bringing Data e-Infrastructures to Horizon2020 - Carl-C...
EUDAT 3rd Conference: Bringing Data e-Infrastructures to Horizon2020 - Carl-C...
 
Co-Design in Data Science
Co-Design in Data ScienceCo-Design in Data Science
Co-Design in Data Science
 
Susanna Sansone - OpenCon Oxford, 1st Dec 2017
Susanna Sansone - OpenCon Oxford, 1st Dec 2017Susanna Sansone - OpenCon Oxford, 1st Dec 2017
Susanna Sansone - OpenCon Oxford, 1st Dec 2017
 
Big Data: Big Issues for IP
Big Data: Big Issues for IPBig Data: Big Issues for IP
Big Data: Big Issues for IP
 
Educating Data Scientists: the SoBigData master experience
Educating Data Scientists: the SoBigData master experienceEducating Data Scientists: the SoBigData master experience
Educating Data Scientists: the SoBigData master experience
 
Open Digital Science & e-infrastructures
Open Digital Science & e-infrastructuresOpen Digital Science & e-infrastructures
Open Digital Science & e-infrastructures
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
 
The FAIR movement - Oxford Open Data Week
The FAIR movement - Oxford Open Data WeekThe FAIR movement - Oxford Open Data Week
The FAIR movement - Oxford Open Data Week
 
Machine Learning and Social Participation
Machine Learning and Social ParticipationMachine Learning and Social Participation
Machine Learning and Social Participation
 
People in the Machine: Human-centric Software Engineering for Smart Systems
People in the Machine: Human-centric Software Engineering for Smart SystemsPeople in the Machine: Human-centric Software Engineering for Smart Systems
People in the Machine: Human-centric Software Engineering for Smart Systems
 

Más de Cataldo Musto

MyrrorBot: a Digital Assistant Based on Holistic User Models for Personalize...
MyrrorBot: a Digital Assistant Based on Holistic User Models forPersonalize...MyrrorBot: a Digital Assistant Based on Holistic User Models forPersonalize...
MyrrorBot: a Digital Assistant Based on Holistic User Models for Personalize...Cataldo Musto
 
Fairness and Popularity Bias in Recommender Systems: an Empirical Evaluation
Fairness and Popularity Bias in Recommender Systems: an Empirical EvaluationFairness and Popularity Bias in Recommender Systems: an Empirical Evaluation
Fairness and Popularity Bias in Recommender Systems: an Empirical EvaluationCataldo Musto
 
Intelligenza Artificiale e Social Media - Monitoraggio della Farnesina e La M...
Intelligenza Artificiale e Social Media - Monitoraggio della Farnesina e La M...Intelligenza Artificiale e Social Media - Monitoraggio della Farnesina e La M...
Intelligenza Artificiale e Social Media - Monitoraggio della Farnesina e La M...Cataldo Musto
 
Exploring the Effects of Natural Language Justifications in Food Recommender ...
Exploring the Effects of Natural Language Justifications in Food Recommender ...Exploring the Effects of Natural Language Justifications in Food Recommender ...
Exploring the Effects of Natural Language Justifications in Food Recommender ...Cataldo Musto
 
Exploiting Distributional Semantics Models for Natural Language Context-aware...
Exploiting Distributional Semantics Models for Natural Language Context-aware...Exploiting Distributional Semantics Models for Natural Language Context-aware...
Exploiting Distributional Semantics Models for Natural Language Context-aware...Cataldo Musto
 
Towards a Knowledge-aware Food Recommender System Exploiting Holistic User Mo...
Towards a Knowledge-aware Food Recommender System Exploiting Holistic User Mo...Towards a Knowledge-aware Food Recommender System Exploiting Holistic User Mo...
Towards a Knowledge-aware Food Recommender System Exploiting Holistic User Mo...Cataldo Musto
 
Towards Queryable User Profiles: Introducing Conversational Agents in a Platf...
Towards Queryable User Profiles: Introducing Conversational Agents in a Platf...Towards Queryable User Profiles: Introducing Conversational Agents in a Platf...
Towards Queryable User Profiles: Introducing Conversational Agents in a Platf...Cataldo Musto
 
Hybrid Semantics aware Recommendations Exploiting Knowledge Graph Embeddings
Hybrid Semantics aware Recommendations Exploiting Knowledge Graph EmbeddingsHybrid Semantics aware Recommendations Exploiting Knowledge Graph Embeddings
Hybrid Semantics aware Recommendations Exploiting Knowledge Graph EmbeddingsCataldo Musto
 
Natural Language Justifications for Recommender Systems Exploiting Text Summa...
Natural Language Justifications for Recommender Systems Exploiting Text Summa...Natural Language Justifications for Recommender Systems Exploiting Text Summa...
Natural Language Justifications for Recommender Systems Exploiting Text Summa...Cataldo Musto
 
L'IA per l'Empowerment del Cittadino: Hate Map, Myrror, PA Risponde
L'IA per l'Empowerment del Cittadino: Hate Map, Myrror, PA RispondeL'IA per l'Empowerment del Cittadino: Hate Map, Myrror, PA Risponde
L'IA per l'Empowerment del Cittadino: Hate Map, Myrror, PA RispondeCataldo Musto
 
Explanation Strategies - Advances in Content-based Recommender System
Explanation Strategies - Advances in Content-based Recommender SystemExplanation Strategies - Advances in Content-based Recommender System
Explanation Strategies - Advances in Content-based Recommender SystemCataldo Musto
 
Justifying Recommendations through Aspect-based Sentiment Analysis of Users R...
Justifying Recommendations through Aspect-based Sentiment Analysis of Users R...Justifying Recommendations through Aspect-based Sentiment Analysis of Users R...
Justifying Recommendations through Aspect-based Sentiment Analysis of Users R...Cataldo Musto
 
ExpLOD: un framework per la generazione di spiegazioni per recommender system...
ExpLOD: un framework per la generazione di spiegazioni per recommender system...ExpLOD: un framework per la generazione di spiegazioni per recommender system...
ExpLOD: un framework per la generazione di spiegazioni per recommender system...Cataldo Musto
 
Myrror: una piattaforma per Holistic User Modeling e Quantified Self
Myrror: una piattaforma per Holistic User Modeling e Quantified SelfMyrror: una piattaforma per Holistic User Modeling e Quantified Self
Myrror: una piattaforma per Holistic User Modeling e Quantified SelfCataldo Musto
 
Semantic Holistic User Modeling for Personalized Access to Digital Content an...
Semantic Holistic User Modeling for Personalized Access to Digital Content an...Semantic Holistic User Modeling for Personalized Access to Digital Content an...
Semantic Holistic User Modeling for Personalized Access to Digital Content an...Cataldo Musto
 
Holistic User Modeling for Personalized Services in Smart Cities
Holistic User Modeling for Personalized Services in Smart CitiesHolistic User Modeling for Personalized Services in Smart Cities
Holistic User Modeling for Personalized Services in Smart CitiesCataldo Musto
 
A Framework for Holistic User Modeling Merging Heterogeneous Digital Footprints
A Framework for Holistic User Modeling Merging Heterogeneous Digital FootprintsA Framework for Holistic User Modeling Merging Heterogeneous Digital Footprints
A Framework for Holistic User Modeling Merging Heterogeneous Digital FootprintsCataldo Musto
 
eHealth, mHealth in Otorinolaringoiatria: innovazioni dirompenti o disastrose?
eHealth, mHealth in Otorinolaringoiatria: innovazioni dirompenti o disastrose?eHealth, mHealth in Otorinolaringoiatria: innovazioni dirompenti o disastrose?
eHealth, mHealth in Otorinolaringoiatria: innovazioni dirompenti o disastrose?Cataldo Musto
 
Semantics-aware Recommender Systems Exploiting Linked Open Data and Graph-bas...
Semantics-aware Recommender Systems Exploiting Linked Open Data and Graph-bas...Semantics-aware Recommender Systems Exploiting Linked Open Data and Graph-bas...
Semantics-aware Recommender Systems Exploiting Linked Open Data and Graph-bas...Cataldo Musto
 
Il Linguaggio dell'Odio sui Social Network
Il Linguaggio dell'Odio sui Social NetworkIl Linguaggio dell'Odio sui Social Network
Il Linguaggio dell'Odio sui Social NetworkCataldo Musto
 

Más de Cataldo Musto (20)

MyrrorBot: a Digital Assistant Based on Holistic User Models for Personalize...
MyrrorBot: a Digital Assistant Based on Holistic User Models forPersonalize...MyrrorBot: a Digital Assistant Based on Holistic User Models forPersonalize...
MyrrorBot: a Digital Assistant Based on Holistic User Models for Personalize...
 
Fairness and Popularity Bias in Recommender Systems: an Empirical Evaluation
Fairness and Popularity Bias in Recommender Systems: an Empirical EvaluationFairness and Popularity Bias in Recommender Systems: an Empirical Evaluation
Fairness and Popularity Bias in Recommender Systems: an Empirical Evaluation
 
Intelligenza Artificiale e Social Media - Monitoraggio della Farnesina e La M...
Intelligenza Artificiale e Social Media - Monitoraggio della Farnesina e La M...Intelligenza Artificiale e Social Media - Monitoraggio della Farnesina e La M...
Intelligenza Artificiale e Social Media - Monitoraggio della Farnesina e La M...
 
Exploring the Effects of Natural Language Justifications in Food Recommender ...
Exploring the Effects of Natural Language Justifications in Food Recommender ...Exploring the Effects of Natural Language Justifications in Food Recommender ...
Exploring the Effects of Natural Language Justifications in Food Recommender ...
 
Exploiting Distributional Semantics Models for Natural Language Context-aware...
Exploiting Distributional Semantics Models for Natural Language Context-aware...Exploiting Distributional Semantics Models for Natural Language Context-aware...
Exploiting Distributional Semantics Models for Natural Language Context-aware...
 
Towards a Knowledge-aware Food Recommender System Exploiting Holistic User Mo...
Towards a Knowledge-aware Food Recommender System Exploiting Holistic User Mo...Towards a Knowledge-aware Food Recommender System Exploiting Holistic User Mo...
Towards a Knowledge-aware Food Recommender System Exploiting Holistic User Mo...
 
Towards Queryable User Profiles: Introducing Conversational Agents in a Platf...
Towards Queryable User Profiles: Introducing Conversational Agents in a Platf...Towards Queryable User Profiles: Introducing Conversational Agents in a Platf...
Towards Queryable User Profiles: Introducing Conversational Agents in a Platf...
 
Hybrid Semantics aware Recommendations Exploiting Knowledge Graph Embeddings
Hybrid Semantics aware Recommendations Exploiting Knowledge Graph EmbeddingsHybrid Semantics aware Recommendations Exploiting Knowledge Graph Embeddings
Hybrid Semantics aware Recommendations Exploiting Knowledge Graph Embeddings
 
Natural Language Justifications for Recommender Systems Exploiting Text Summa...
Natural Language Justifications for Recommender Systems Exploiting Text Summa...Natural Language Justifications for Recommender Systems Exploiting Text Summa...
Natural Language Justifications for Recommender Systems Exploiting Text Summa...
 
L'IA per l'Empowerment del Cittadino: Hate Map, Myrror, PA Risponde
L'IA per l'Empowerment del Cittadino: Hate Map, Myrror, PA RispondeL'IA per l'Empowerment del Cittadino: Hate Map, Myrror, PA Risponde
L'IA per l'Empowerment del Cittadino: Hate Map, Myrror, PA Risponde
 
Explanation Strategies - Advances in Content-based Recommender System
Explanation Strategies - Advances in Content-based Recommender SystemExplanation Strategies - Advances in Content-based Recommender System
Explanation Strategies - Advances in Content-based Recommender System
 
Justifying Recommendations through Aspect-based Sentiment Analysis of Users R...
Justifying Recommendations through Aspect-based Sentiment Analysis of Users R...Justifying Recommendations through Aspect-based Sentiment Analysis of Users R...
Justifying Recommendations through Aspect-based Sentiment Analysis of Users R...
 
ExpLOD: un framework per la generazione di spiegazioni per recommender system...
ExpLOD: un framework per la generazione di spiegazioni per recommender system...ExpLOD: un framework per la generazione di spiegazioni per recommender system...
ExpLOD: un framework per la generazione di spiegazioni per recommender system...
 
Myrror: una piattaforma per Holistic User Modeling e Quantified Self
Myrror: una piattaforma per Holistic User Modeling e Quantified SelfMyrror: una piattaforma per Holistic User Modeling e Quantified Self
Myrror: una piattaforma per Holistic User Modeling e Quantified Self
 
Semantic Holistic User Modeling for Personalized Access to Digital Content an...
Semantic Holistic User Modeling for Personalized Access to Digital Content an...Semantic Holistic User Modeling for Personalized Access to Digital Content an...
Semantic Holistic User Modeling for Personalized Access to Digital Content an...
 
Holistic User Modeling for Personalized Services in Smart Cities
Holistic User Modeling for Personalized Services in Smart CitiesHolistic User Modeling for Personalized Services in Smart Cities
Holistic User Modeling for Personalized Services in Smart Cities
 
A Framework for Holistic User Modeling Merging Heterogeneous Digital Footprints
A Framework for Holistic User Modeling Merging Heterogeneous Digital FootprintsA Framework for Holistic User Modeling Merging Heterogeneous Digital Footprints
A Framework for Holistic User Modeling Merging Heterogeneous Digital Footprints
 
eHealth, mHealth in Otorinolaringoiatria: innovazioni dirompenti o disastrose?
eHealth, mHealth in Otorinolaringoiatria: innovazioni dirompenti o disastrose?eHealth, mHealth in Otorinolaringoiatria: innovazioni dirompenti o disastrose?
eHealth, mHealth in Otorinolaringoiatria: innovazioni dirompenti o disastrose?
 
Semantics-aware Recommender Systems Exploiting Linked Open Data and Graph-bas...
Semantics-aware Recommender Systems Exploiting Linked Open Data and Graph-bas...Semantics-aware Recommender Systems Exploiting Linked Open Data and Graph-bas...
Semantics-aware Recommender Systems Exploiting Linked Open Data and Graph-bas...
 
Il Linguaggio dell'Odio sui Social Network
Il Linguaggio dell'Odio sui Social NetworkIl Linguaggio dell'Odio sui Social Network
Il Linguaggio dell'Odio sui Social Network
 

Último

A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 

Último (20)

A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 

Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems

  • 1. Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems Cataldo Musto, Giovanni Semeraro, Fedelucio Narducci
  • 2. state of the art. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 3. our research: personalization C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 4. Recommender Systems Relevant items (movies, news, books, etc.) are pushed to the user according to her preferences or her needs. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 5. Amazon.com Recommendations C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 6. current recommendation technologies share three important drawbacks. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 7. (1) training is a bottleneck. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 8. need for explicit information about user interests. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 9. (2) recsys are black boxes. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 10. (3) suggestions are not surprising. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 11. exploiting big data to build a novel generation of content-based adaptive systems solution C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 12. current work. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 near future work.
  • 13. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 14. big data. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 15. Information Overload we can handle 126 bits of information we deal with 393 bits of information ratio: more than 3x(Source: Adrian C.Ott,The 24-hour customer) consequence: C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 16. Information Overload C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 17. Big Data: obstacle or opportunity? C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 18. cornestone 1 C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 exploit social media to model user preferences.
  • 19. social media are an opportunity provide information about user preferences C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 20. example user preferences in music from Facebook C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 21. implicit preferences C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 example
  • 22. Play.me playlist Most popular songs of the artists extracted from Last.fm (as well as those added through the enrichment) are proposed to the user. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 23. Myusic recommendations C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 24. cornestone 2 exploit entity linking algorithms to make user profiles more transparent and LOD-aware C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 25. MyFeeds RSS recommendations C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 26. MyFeeds transparent user preferences C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 extracted from Facebook.
  • 27. MyFeeds transparent user preferences C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 further processing
  • 28. MyFeeds entity linking algorithms C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 • They map free text with structured information • Wikipedia pages or DBpedia nodes • examples • Tag.me ,Wikipedia Miner, DBpedia Spotlight, etc.
  • 29. Tag.me extracts the Wikipedia pages the content refers to. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 30. Linked Open Data Cloud Structured (RDF) representation of the information stored in Wikipedia. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 31. Linked Open Data Cloud Profiles based on Tag.me are LOD-aware C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 32. cornestone 3 C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 exploit open knowledge sources to make recommendation techniques more serendipitous.
  • 33. ‘in vitro’ experiments Watchmi plug-in developed by Aprico.tv C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 34. From BOW to eBOW Given a description of a TV show, we exploit ESA to obtain an enhanced representation The original set of features is enriched with the set of Wikipedia articles related the most with theTV show C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 35. TV SHOW Rad an Rad Die besten Duelle der MotoGP (Wheel to wheel The best duels in the MotoGP) Wikipedia(Articles( großer&preis&von&italien& (motorrad)& großer&preis&von&malaysia& (motorrad)& großer&preis&von&tschechien& (motorrad)& scuderia&ferrari& valen8no&rossi& motorrad9wm9saison&2005& motorrad9wm9saison&2006& max&biaggi& großer&preis&der&usa&(motorrad)& motorrad9wm9saison&2008& rad&(heraldik)& loris&capirossi& shin’ya&nakano& motogp& example From BOW to eBOW C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013
  • 36. challenges. C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 issues. recommendations.
  • 37. Challenges and Issues C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 • Main challenge and issue: • data representation and data filtering • How to exploit these novel data sylos? • What information is relevant for personalization? • What kind of processing do data need? • Which one is the best representation? • Do reasoning techniques improve profiles transparency and personalization accuracy? • Do people accept the exploitation of these data? • How to model the context?
  • 38. Recommendations C.Musto, G.Semeraro - Mining Big Data and Open Knowledge Sources to develop transparent and serendipitous content-based adaptive systems - World Summit on Big Data and Organization Design, Paris, 16-17 May 2013 • Cornerstones • Social media-based user profiling • LOD-aware user profiles • Open Knowledge Sources for Serendipitous Encounters • Recommendations • Promote the LOD initiative, to publish data in a structured form, to enable reasoning on the information • Make data sylos interconnected • To design applications able to properly model, manage and exploit the big amount of data coming from social media.
  • 39. questions? Cataldo Musto, Ph.D. - cataldo.musto@uniba.it