SlideShare una empresa de Scribd logo
@twitter Mining #MicroblogsUsing #SemanticTechnologies Selver Softic, Martin Ebner, Herbert Mühlburger , Thomas Altmann, Behnam Taraghi
Web 2.0 -  well knownstory Web 2.0 technologiesbroughtuserscloserto Web … Wikis, Blogs, Forums … Podcasts, RSS, XML … … thenusersstarted togeneratecontent  … Source: http:mediabistro.com
From Web toSocial Web Result = a vastofinformation Text, Pictures, Audio, Videos …. Communication, networking, exchangeofdata Web becamemore personal Cultural, geographicalandsocialbordersdisappeared Source: http://www.ignitesocialmedia.com
Social Media Boom!
Socialsitesaredatasilos source: www.pidgintech.com
But still disconnected ? source: www.pidgintech.com
Data is still captured in Walled Garden!
Statements Social Web relies on usersandcommunicationamongthem Whilecommunicatingusersproduceorconsumecontent Socialsitesaredatasilosrich on varietyofinformation Thisinformationcouldbeinterestingfor: monitoring of trends, advertising, statistics, reputation, news broadcasting , tagging … Thisdataiscaptured in Walledgarden !!!
Questions Howtousethisdatatogainmoreusefulinsights Whataretheadvantagesof online (offline) search on such dataandhowtoreachit in an uniform way Is itpossibletostructurize, connectandexposethedata in order tobeusedbyhumansandmachinesmoreefficiently Whatwould an architecturelooklikeforthisissue
Social Web Trends Microblogging SocialBookmarking Social Networking Social Marketing Sharing Photos, Videos … Source: http://socialwebresearch.com
Microblogs Microblogs Usedforcommunication,publishingandinformationexchange Simple forprocessing Information  generatedbymany different users Socialuserrelations Tripartitecommunicationstructure Varietyofinformations Noboundariesbyculture,locationortechnology (mobile users) Twitter Most Popular Large amountoddata But limited According: http://an.kaist.ac.kr/traces/WWW2010.html 41.7 million user profiles, 1.47 billion social relations, 4,262 trending topics, and 106 million tweets
SemanticaspectsandTwitter Twitter User realtions Tweetsasshortinformationartefacts Communication withtripartitepattern Time relatedinformation Vocabularies SIOC, FOAF, Dublin Core
Linked Data andTwitter Twittercontainsinfos on: People, Organisations, Locations, Trends … LOD Cloudcontains Billionsoftriplesabout: Geolocations , dataaboutscience, government, commonknowledge, persons, news … Vocabularies MOAT, CommmonTag
Architecture model
Acquisition - Grabeeter
Grabeeter Search in your Tweets Filter your Tweets by date Search in your Tweets offline using the Grabeeter Client Filter your tweets offline using the Grabeeter Client Grabeeter provides an API
Triplification Module  Author Date Content Reciever <tweet url="http://grabeeter.tugraz.at/tweet/199272" text="Sitting in Prater #vienna, launch party. Nice" screen_name="selvers" created="2010-08-19" twitterUrl="http://twitter.com/selvers/status/21606926237"/> RDF  Store Triplifier
Triplification Module @prefix foaf: <http://xmlns.com/foaf/0.1/> . @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix sioc: <http://rdfs.org/sioc/ns#> . @prefix sioct: <http://rdfs.org/sioc/types#> . @prefix dcterms: <http://purl.org/dc/terms/#> . <http://twitter.com/selvers/status/21606926237>  rdf:typesioct:MicroblogPost ; sioc:content "Sitting in Prater #vienna, launch party. Nice" ; sioc:has_creator  <http://twitter.com/selvers/>  ; foaf:maker <http://grabeteer.tugraz.at/foaf/selvers/> ; dcterms:created  “2010-08-19” ; rdfs:sameAs  <http://grabeeter.tugraz.at/tweet/199272> . <http://twitter.com/selvers/>  rdf:typefoaf:Person ; foaf:name  "SelverSoftic" ; foaf:depiction <http://a0.twimg.com/profile_images/905118560/f9e4b6eba.13070201_3_normal.jpg> ; foaf:knows <http://twitter.com/hmuehlburger/> ; foaf:knows <http://twitter.com/mhausenblas/> ; foaf:knows <http://twitter.com/mebner/> .  …
Interlinking Module Hashtags (People, Organisation, Locations) MOAT, CommonTag Later NLP processedcontent, SILK Framework SELECT ?post ?content ?maker ?name WHERE { ?post rdf:typesioct:MicroblogPost; foaf:maker ?maker;       ?makerfoaf:name ?name; sioc:content ?content. FILTER(regex(?content,#vienna)) }  Classifier tag: tagName "vienna" ; moat: tagMeaning <http://dbpedia .org/resource/Vienna> tag: taggedResource <http://twitter.com/selvers/status/2160692623>
Analysis
Conclusions & Outlook Currentstateofthearttechnologiessufficetorealisetheproposedarchitectureparadigm Interlinkingwith LOD Cloud (Tweet-O-Sphere) Involving NLP Methods Sentiment classification (Re)TaggingofTweets Providing SPARQL Endpoint + Lookup Serviceasresearchinterface SocialSemantic Web Apps
Questions?

Más contenido relacionado

La actualidad más candente

Cyber security lifting the veil of hacking webinar
Cyber security   lifting the veil of hacking webinarCyber security   lifting the veil of hacking webinar
Cyber security lifting the veil of hacking webinar
Association for Project Management
 

La actualidad más candente (15)

Webinar: Personal Online Privacy - Sucuri Security
Webinar: Personal Online Privacy - Sucuri SecurityWebinar: Personal Online Privacy - Sucuri Security
Webinar: Personal Online Privacy - Sucuri Security
 
Sucuri Webinar: How to clean hacked WordPress sites
Sucuri Webinar: How to clean hacked WordPress sitesSucuri Webinar: How to clean hacked WordPress sites
Sucuri Webinar: How to clean hacked WordPress sites
 
Sucuri Webinar: Impacts of a website compromise
Sucuri Webinar: Impacts of a website compromiseSucuri Webinar: Impacts of a website compromise
Sucuri Webinar: Impacts of a website compromise
 
Sucuri Webinar: How Websites Get Hacked
Sucuri Webinar: How Websites Get HackedSucuri Webinar: How Websites Get Hacked
Sucuri Webinar: How Websites Get Hacked
 
obtain additional security
obtain additional security 
obtain additional security
obtain additional security
 
Sucuri Webinar: What is SEO Spam and How to Fight It
Sucuri Webinar: What is SEO Spam and How to Fight ItSucuri Webinar: What is SEO Spam and How to Fight It
Sucuri Webinar: What is SEO Spam and How to Fight It
 
Webinar: eCommerce Compliance - PCI meets GDPR
Webinar: eCommerce Compliance - PCI meets GDPRWebinar: eCommerce Compliance - PCI meets GDPR
Webinar: eCommerce Compliance - PCI meets GDPR
 
Why Do Hackers Hack?
Why Do Hackers Hack?Why Do Hackers Hack?
Why Do Hackers Hack?
 
Logs: Understanding Them to Better Manage Your WordPress Site
Logs: Understanding Them to Better Manage Your WordPress SiteLogs: Understanding Them to Better Manage Your WordPress Site
Logs: Understanding Them to Better Manage Your WordPress Site
 
Steps to Keep Your Site Clean
Steps to Keep Your Site CleanSteps to Keep Your Site Clean
Steps to Keep Your Site Clean
 
Sucuri Webinar: Defending Your Google Brand Reputation and Analytics Reports
Sucuri Webinar: Defending Your Google Brand Reputation and Analytics ReportsSucuri Webinar: Defending Your Google Brand Reputation and Analytics Reports
Sucuri Webinar: Defending Your Google Brand Reputation and Analytics Reports
 
Getting the word out: How to implement your online branding strategy
Getting the word out: How to implement your online branding strategyGetting the word out: How to implement your online branding strategy
Getting the word out: How to implement your online branding strategy
 
Website Security
Website SecurityWebsite Security
Website Security
 
Cyber security lifting the veil of hacking webinar
Cyber security   lifting the veil of hacking webinarCyber security   lifting the veil of hacking webinar
Cyber security lifting the veil of hacking webinar
 
Sucuri Webinar: How Caching Options Can Impact Your Website Speed
Sucuri Webinar: How Caching Options Can Impact Your Website SpeedSucuri Webinar: How Caching Options Can Impact Your Website Speed
Sucuri Webinar: How Caching Options Can Impact Your Website Speed
 

Similar a Swap2010 twitter minining using semantic web technologies and linked data

Bills Pr 2.0 Presentation
Bills Pr 2.0 PresentationBills Pr 2.0 Presentation
Bills Pr 2.0 Presentation
InBlackandWhite
 
Social Media Web Marketing Nov 2009 Wk2
Social Media Web Marketing Nov 2009 Wk2Social Media Web Marketing Nov 2009 Wk2
Social Media Web Marketing Nov 2009 Wk2
PCM creative
 
Social Developers London - Twitter Cards Update
Social Developers London - Twitter Cards UpdateSocial Developers London - Twitter Cards Update
Social Developers London - Twitter Cards Update
Angus Fox
 

Similar a Swap2010 twitter minining using semantic web technologies and linked data (20)

Bills Pr 2.0 Presentation
Bills Pr 2.0 PresentationBills Pr 2.0 Presentation
Bills Pr 2.0 Presentation
 
Semantic Microblogging
Semantic MicrobloggingSemantic Microblogging
Semantic Microblogging
 
Geeks History of the Internet - how we arrived at Web 2.0
Geeks History of the Internet - how we arrived at Web 2.0Geeks History of the Internet - how we arrived at Web 2.0
Geeks History of the Internet - how we arrived at Web 2.0
 
Social Media Web Marketing Nov 2009 Wk2
Social Media Web Marketing Nov 2009 Wk2Social Media Web Marketing Nov 2009 Wk2
Social Media Web Marketing Nov 2009 Wk2
 
Web 3 0
Web 3 0 Web 3 0
Web 3 0
 
Social Semantic Web on Facebook Open Graph protocol and Twitter Annotations
Social Semantic Web on Facebook Open Graph protocol and Twitter AnnotationsSocial Semantic Web on Facebook Open Graph protocol and Twitter Annotations
Social Semantic Web on Facebook Open Graph protocol and Twitter Annotations
 
BotCommons: Metadata for Bots - Devoxx 2017
BotCommons: Metadata for Bots - Devoxx 2017BotCommons: Metadata for Bots - Devoxx 2017
BotCommons: Metadata for Bots - Devoxx 2017
 
Privacy on the internet presentation_kf_final
Privacy on the internet presentation_kf_finalPrivacy on the internet presentation_kf_final
Privacy on the internet presentation_kf_final
 
Professor Hendrik Speck - Social Conduct. Privacy and Social Networks.
Professor Hendrik Speck - Social Conduct. Privacy and Social Networks.Professor Hendrik Speck - Social Conduct. Privacy and Social Networks.
Professor Hendrik Speck - Social Conduct. Privacy and Social Networks.
 
Information Management Trends 2009
Information Management Trends 2009Information Management Trends 2009
Information Management Trends 2009
 
Espiando redes de microblogging Navaja Negra 2017
Espiando redes de microblogging Navaja Negra 2017Espiando redes de microblogging Navaja Negra 2017
Espiando redes de microblogging Navaja Negra 2017
 
Microformats
MicroformatsMicroformats
Microformats
 
Webware Webinar
Webware WebinarWebware Webinar
Webware Webinar
 
Web3.0 or The semantic web
Web3.0 or The semantic webWeb3.0 or The semantic web
Web3.0 or The semantic web
 
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
 
Social Developers London - Twitter Cards Update
Social Developers London - Twitter Cards UpdateSocial Developers London - Twitter Cards Update
Social Developers London - Twitter Cards Update
 
Alfonso Muñoz y Miguel Hernandez - Playing with mastodon for fun and profit [...
Alfonso Muñoz y Miguel Hernandez - Playing with mastodon for fun and profit [...Alfonso Muñoz y Miguel Hernandez - Playing with mastodon for fun and profit [...
Alfonso Muñoz y Miguel Hernandez - Playing with mastodon for fun and profit [...
 
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
 
IRJET- Tweet Segmentation and its Application to Named Entity Recognition
IRJET- Tweet Segmentation and its Application to Named Entity RecognitionIRJET- Tweet Segmentation and its Application to Named Entity Recognition
IRJET- Tweet Segmentation and its Application to Named Entity Recognition
 
News Media Metadata - The Current Landscape
News Media Metadata - The Current LandscapeNews Media Metadata - The Current Landscape
News Media Metadata - The Current Landscape
 

Último

Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
UXDXConf
 

Último (20)

Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
The architecture of Generative AI for enterprises.pdf
The architecture of Generative AI for enterprises.pdfThe architecture of Generative AI for enterprises.pdf
The architecture of Generative AI for enterprises.pdf
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Designing for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at ComcastDesigning for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at Comcast
 
Enterprise Security Monitoring, And Log Management.
Enterprise Security Monitoring, And Log Management.Enterprise Security Monitoring, And Log Management.
Enterprise Security Monitoring, And Log Management.
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty Secure
 
Transforming The New York Times: Empowering Evolution through UX
Transforming The New York Times: Empowering Evolution through UXTransforming The New York Times: Empowering Evolution through UX
Transforming The New York Times: Empowering Evolution through UX
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
Server-Driven User Interface (SDUI) at Priceline
Server-Driven User Interface (SDUI) at PricelineServer-Driven User Interface (SDUI) at Priceline
Server-Driven User Interface (SDUI) at Priceline
 

Swap2010 twitter minining using semantic web technologies and linked data

  • 1. @twitter Mining #MicroblogsUsing #SemanticTechnologies Selver Softic, Martin Ebner, Herbert Mühlburger , Thomas Altmann, Behnam Taraghi
  • 2. Web 2.0 - well knownstory Web 2.0 technologiesbroughtuserscloserto Web … Wikis, Blogs, Forums … Podcasts, RSS, XML … … thenusersstarted togeneratecontent … Source: http:mediabistro.com
  • 3. From Web toSocial Web Result = a vastofinformation Text, Pictures, Audio, Videos …. Communication, networking, exchangeofdata Web becamemore personal Cultural, geographicalandsocialbordersdisappeared Source: http://www.ignitesocialmedia.com
  • 5.
  • 7. But still disconnected ? source: www.pidgintech.com
  • 8. Data is still captured in Walled Garden!
  • 9. Statements Social Web relies on usersandcommunicationamongthem Whilecommunicatingusersproduceorconsumecontent Socialsitesaredatasilosrich on varietyofinformation Thisinformationcouldbeinterestingfor: monitoring of trends, advertising, statistics, reputation, news broadcasting , tagging … Thisdataiscaptured in Walledgarden !!!
  • 10. Questions Howtousethisdatatogainmoreusefulinsights Whataretheadvantagesof online (offline) search on such dataandhowtoreachit in an uniform way Is itpossibletostructurize, connectandexposethedata in order tobeusedbyhumansandmachinesmoreefficiently Whatwould an architecturelooklikeforthisissue
  • 11. Social Web Trends Microblogging SocialBookmarking Social Networking Social Marketing Sharing Photos, Videos … Source: http://socialwebresearch.com
  • 12. Microblogs Microblogs Usedforcommunication,publishingandinformationexchange Simple forprocessing Information generatedbymany different users Socialuserrelations Tripartitecommunicationstructure Varietyofinformations Noboundariesbyculture,locationortechnology (mobile users) Twitter Most Popular Large amountoddata But limited According: http://an.kaist.ac.kr/traces/WWW2010.html 41.7 million user profiles, 1.47 billion social relations, 4,262 trending topics, and 106 million tweets
  • 13. SemanticaspectsandTwitter Twitter User realtions Tweetsasshortinformationartefacts Communication withtripartitepattern Time relatedinformation Vocabularies SIOC, FOAF, Dublin Core
  • 14. Linked Data andTwitter Twittercontainsinfos on: People, Organisations, Locations, Trends … LOD Cloudcontains Billionsoftriplesabout: Geolocations , dataaboutscience, government, commonknowledge, persons, news … Vocabularies MOAT, CommmonTag
  • 17. Grabeeter Search in your Tweets Filter your Tweets by date Search in your Tweets offline using the Grabeeter Client Filter your tweets offline using the Grabeeter Client Grabeeter provides an API
  • 18. Triplification Module Author Date Content Reciever <tweet url="http://grabeeter.tugraz.at/tweet/199272" text="Sitting in Prater #vienna, launch party. Nice" screen_name="selvers" created="2010-08-19" twitterUrl="http://twitter.com/selvers/status/21606926237"/> RDF Store Triplifier
  • 19. Triplification Module @prefix foaf: <http://xmlns.com/foaf/0.1/> . @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix sioc: <http://rdfs.org/sioc/ns#> . @prefix sioct: <http://rdfs.org/sioc/types#> . @prefix dcterms: <http://purl.org/dc/terms/#> . <http://twitter.com/selvers/status/21606926237> rdf:typesioct:MicroblogPost ; sioc:content "Sitting in Prater #vienna, launch party. Nice" ; sioc:has_creator <http://twitter.com/selvers/> ; foaf:maker <http://grabeteer.tugraz.at/foaf/selvers/> ; dcterms:created “2010-08-19” ; rdfs:sameAs <http://grabeeter.tugraz.at/tweet/199272> . <http://twitter.com/selvers/> rdf:typefoaf:Person ; foaf:name "SelverSoftic" ; foaf:depiction <http://a0.twimg.com/profile_images/905118560/f9e4b6eba.13070201_3_normal.jpg> ; foaf:knows <http://twitter.com/hmuehlburger/> ; foaf:knows <http://twitter.com/mhausenblas/> ; foaf:knows <http://twitter.com/mebner/> . …
  • 20. Interlinking Module Hashtags (People, Organisation, Locations) MOAT, CommonTag Later NLP processedcontent, SILK Framework SELECT ?post ?content ?maker ?name WHERE { ?post rdf:typesioct:MicroblogPost; foaf:maker ?maker; ?makerfoaf:name ?name; sioc:content ?content. FILTER(regex(?content,#vienna)) } Classifier tag: tagName "vienna" ; moat: tagMeaning <http://dbpedia .org/resource/Vienna> tag: taggedResource <http://twitter.com/selvers/status/2160692623>
  • 22. Conclusions & Outlook Currentstateofthearttechnologiessufficetorealisetheproposedarchitectureparadigm Interlinkingwith LOD Cloud (Tweet-O-Sphere) Involving NLP Methods Sentiment classification (Re)TaggingofTweets Providing SPARQL Endpoint + Lookup Serviceasresearchinterface SocialSemantic Web Apps