SlideShare una empresa de Scribd logo
1 de 40
Researcher Profiling based on
Semantic Analysis in Social Networks
                Laurens De Vocht




Supervisors                        Promotors
Gonzalo Parra                        Erik Duval
Selver Softic                      Martin Ebner
                   July 1, 2011
Agenda
 ‣Background
 ‣Framework
 ‣Client Application
 ‣Deployment
 ‣Evaluation
 ‣Conclusion
                 2
Background

 ‣Definitions
 ‣Problem Statement
 ‣The Social Semantic Web
 ‣Scope & Value of Study

                3
Definitions
 Profiling
 “Inferring unobser vable information about users from
 observable information about them, that is their actions or their
 utterances.” (Zukerman and Albrecht, 2001)


 Semantic Analysis
 “A technique using semantic-based tools and ontologies in
 order to gain a deeper understanding of the information being
 stored and manipulated in an existing system” (McComb, 2004)



                                 4
Problem Statement
Web users generate a massive
unstructured information flow




                                            ?


                         Who has scientific information
                                      relevant for me?
                          5
Problem Statement
Connecting researchers based on shared scientific events
(conferences)
                         Scientific Profiling


                                                        Scientific
                  User Model              Event Model   Conferences
                                                        Resource


   Researchers




                               Profiler/
                               Analyzer


   Researcher
     (User)


                                  6
The Social Semantic Web

         Community of      (micro)blogging,
        researchers with       sharing,
           conference          tagging,
           experience        discussion
                                              semi-structured
                                                information
  Larger population of                             system
   people interested in
                           (faceted) search
  scientific conferences
                                engine




                           recommendation       clustered and
                               engine           analyzed data




                                                  (Gruber, 2007)
                             7
The Social Semantic Web

         Community of      (micro)blogging,
        researchers with       sharing,
           conference          tagging,
           experience        discussion
                                              semi-structured
                                                information
  Larger population of                             system
   people interested in
                           (faceted) search
  scientific conferences
                                engine




                           recommendation       clustered and
                               engine           analyzed data




 Human process
                                                  (Gruber, 2007)
                             7
The Social Semantic Web

         Community of      (micro)blogging,
        researchers with       sharing,
           conference          tagging,
           experience        discussion
                                                semi-structured
                                                  information
  Larger population of                               system
   people interested in
                           (faceted) search
  scientific conferences
                                engine




                           recommendation         clustered and
                               engine             analyzed data




 Human process                                Machine process
                                                    (Gruber, 2007)
                             7
The Social Semantic Web
                     Social Web
         Community of         (micro)blogging,
        researchers with          sharing,
           conference             tagging,
           experience           discussion
                                                   semi-structured
                                                     information
  Larger population of                                  system
   people interested in
                              (faceted) search
  scientific conferences
                                   engine




                             recommendation          clustered and
                                 engine              analyzed data




 Human process                                   Machine process
                                                       (Gruber, 2007)
                                  7
The Social Semantic Web
                     Social Web                  Semantic Web
         Community of         (micro)blogging,
        researchers with          sharing,
           conference             tagging,
           experience           discussion
                                                   semi-structured
                                                     information
  Larger population of                                  system
   people interested in
                              (faceted) search
  scientific conferences
                                   engine




                             recommendation          clustered and
                                 engine              analyzed data




 Human process                                   Machine process
                                                       (Gruber, 2007)
                                  7
The Social semantic Web
 ‣Hashtags as Identifiers
  ‣not always strong or consistent enough
  ‣properties of good hashtags formalized
  ‣helpful in assessment of valuable identifiers
                                    (Laniado and Mika, 2007)


 ‣Expert Search/Profiling with Linked Data
  ‣aggregate and analyze certain types of data
  ‣need to surpass limits of closed data sets
  ‣LOD delivers multi-purpose data
                                     (Stankovic et al., 2010)

                            8
Scope & Value of the Study
‣Bridging research areas
Human Computer-Interaction & Semantic Analysis
‣Mining usable data
out of social networks (microblogs)
‣Integration
Social network data and linked open data
‣Framework driven methodology
based upon current state-of-the-art semantic tools
‣Evaluation
proof-of-concept Research 2.0 application

                           9
Solution
Annotate Data from Social Networks


                      Community approved
                     ontologies: FOAF, SIOC


 Linked Open Data                     Applications




                              Scientific Profiling Framework



                       Connect People and Resources
                        that share Scientific Affinities
                        10
Framework


 ‣Overview
 ‣Grabeeter
 ‣Architecture
 ‣Web Service

                 11
Framework: Overview
          Social                      Linked Open
                                                                    Output Format
         Networks                      Data Cloud



   Framework   Aggregate                      Interlink                    Publish


     Archived/Cached                                                   Scientific
                                      Linked Data                    Information
           Data            Annotate                       Analyse




                                         12
Framework: Overview
          Social                       Linked Open
                                                                       Output Format
         Networks                       Data Cloud



   Framework   Aggregate                       Interlink                      Publish


     Archived/Cached                                                      Scientific
                                       Linked Data                      Information
           Data            Annotate                        Analyse




                                         DBPedia                            JSON
           Twitter                       Colinda                          RDF (XML)
                                        GeoNames


               Aggregate                        Interlink                       Publish


                                          Semantic                         Scientific
          Grabeeter                                                      Profiling API
                           Annotate   Profiling Network       Analyse


                                          12
Framework: Overview
          Social                       Linked Open
                                                                       Output Format
         Networks                       Data Cloud



   Framework   Aggregate                       Interlink                      Publish


     Archived/Cached                                                      Scientific
                                       Linked Data                      Information
           Data            Annotate                        Analyse




                                         DBPedia                            JSON
           Twitter                       Colinda                          RDF (XML)
                                        GeoNames


               Aggregate                        Interlink                       Publish


                                          Semantic                         Scientific
          Grabeeter                                                      Profiling API
                           Annotate   Profiling Network       Analyse


                                          13
Framework: Grabeeter
= Twitter aggregation & archiving tool
(developed at TUGraz)




                            14
Framework: Grabeeter
= Twitter aggregation & archiving tool
(developed at TUGraz)




                            14
Framework: Overview
          Social                       Linked Open
                                                                       Output Format
         Networks                       Data Cloud



   Framework   Aggregate                       Interlink                      Publish


     Archived/Cached                                                      Scientific
                                       Linked Data                      Information
           Data            Annotate                        Analyse




                                         DBPedia                            JSON
           Twitter                       Colinda                          RDF (XML)
                                        GeoNames


               Aggregate                        Interlink                       Publish


                                          Semantic                         Scientific
          Grabeeter                                                      Profiling API
                           Annotate   Profiling Network       Analyse


                                          15
Framework: Architecture
                            Applications




                                    Analysis




         Extraction               Interlinking




      Grabeeter       RDF Store




                      16
Framework: Web Service
‣ User Profile
  http://api.semanticprofiling.net/profile.php?user=screen_name

‣ Discover people, events...
  http:///discovery.php?

  find=persons|events|popular_friends|popular_mentions|popular_events
  user=screen_name

‣ Register new Twitter user
  http://api.semanticprofiling.net/register.php?user=twitter_user

‣ Event Details
  http://api.semanticprofiling.net/event.php?name=event_name

                                                 17
Framework: Web Service




              18
Framework: Web Service




              19
Framework: Web Service




              20
Framework: Web Service




              21
Framework: Web Service




              21
Deployment
                           <device>                                                  <device>
                       Grabeeter Server                                  Researcher Affinity Browser Server



   <execution environment>       <execution environment>
     Application Server                   RDMS                               <execution environment>
                                                                               Application Server

         crawling                   MySQL Database
          scripts
                                                                           AffinityBrowser.war




                                  <device>
                      Semantic Profiling Network Server



          <execution environment>
               PHP Server


      provider.php


      interlink.php

      profile.php
                                               <execution environment>
                                                        RDMS
      discovery.php


      event.php
                                                  MySQL Database
      allusers.php



                                                             22
Evaluation


 ‣Approach
 ‣Usability
 ‣Usefulness
 ‣Discussion

               23
Evaluation: Approach



‣Test usability & usefulness
‣Web application: “Researcher Affinity Browser”
‣Using explicit evaluation questionnaire



                      24
Evaluation: Usability

               ‣Definitely useful application
               ‣Use of the map view makes sense
               ‣People - Event split confusing
               ‣View of own profile
                ‣not a suitable starting point
                ‣only useful in comparison
                ‣shouldn’t be always visible
               ‣Person-specific affinities
                ‣too much hidden

                  25
Evaluation: Usability




                  26
Evaluation: Usability




                  27
Evaluation: Usefulness

 ‣Relevance
  Test users rate their search results
 ‣Satisfaction questionnaire
  Targeted questions about usefulness
  Allow comments on user interface



                      28
Evaluation: Usefulness
Relevant user percentage
                                   Number of users
                 0% (None)

             1-20% (A few)

 21-40% (Less than one half)

   41-60% (About one half)

61-80% (More than one half)

        81-99% (Almost all)

                 100% (All)

                               0       1             2   3   4



                                       29
Evaluation: Usefulness                                        Usefulness Questionnaire Results
                                   Concept Affinity

           Clear view of affinities between people

             Map & Plot combination understood

                     Deactivating filer fast enough

                       Activating filer fast enough

                           Never usability glitches

           Convention between views understood

Information display not overwhelming (confusing)

                     Relevant detailed person info

Shown details correspond with ‘real life’ activities

                   Enough relevant (new) persons

            Daily updating of information obvious

   Twitter data made more useful for researchers
                                                       1            2          3           4       5

                                                           30
Evaluation: Discussion
‣ Affinities exposed in an engaging way
‣ Relevant users rating
  OR Many common entities trigger positive rating
  OR Common entities start deeper investigation
‣ Reliability of person details hard to verify
‣ UI satisfaction user dependent
  ‣ What does the user expect from “Affinity Browser”?
  ‣ Test different scenarios to identify usage types?
                             31
Future work

‣ Rank tags
 by importance, not just frequency of use

‣ Visualization
 improve viewing of links between users and entities

‣ Multiple Resources
 better reliability and more verification of data


                              32
Conclusion

‣ Framework could support many social semantic-based applications
‣ Realized with current state-of-the-art technologies
‣ Interlinking with Linked Open Data Cloud enriches social network
  data
‣ Researcher Affinity Browser
  ‣ Exposes affinities between users
  ‣ User feedback affirms positively new view on social data
  ‣ Hash tags identified as conferences provide consistent links

                                 33

Más contenido relacionado

Destacado

Petroleum exploration_developement_production_process
Petroleum exploration_developement_production_processPetroleum exploration_developement_production_process
Petroleum exploration_developement_production_processVaratha Pandian
 
MSc thesis’2011
MSc thesis’2011MSc thesis’2011
MSc thesis’2011knyazevaig
 
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...Laurens De Vocht
 
Discovering Meaningful Connections between Resources in the Web of Data
Discovering Meaningful Connections between Resources in the Web of DataDiscovering Meaningful Connections between Resources in the Web of Data
Discovering Meaningful Connections between Resources in the Web of DataLaurens De Vocht
 
A Framework Concept for Profiling Researchers on Twitter using the Web of Data
A Framework Concept for Profiling Researchers on Twitter using the Web of DataA Framework Concept for Profiling Researchers on Twitter using the Web of Data
A Framework Concept for Profiling Researchers on Twitter using the Web of DataLaurens De Vocht
 
Aligning Web Collaboration Tools with Research Data for Scholars
Aligning Web Collaboration Tools with Research Data for ScholarsAligning Web Collaboration Tools with Research Data for Scholars
Aligning Web Collaboration Tools with Research Data for ScholarsLaurens De Vocht
 
Providing Interchangeable Open Data to Accelerate Development of Sustainable ...
Providing Interchangeable Open Data to Accelerate Development of Sustainable ...Providing Interchangeable Open Data to Accelerate Development of Sustainable ...
Providing Interchangeable Open Data to Accelerate Development of Sustainable ...Laurens De Vocht
 
The DataTank, RML and Domain Modelling
The DataTank, RML and Domain ModellingThe DataTank, RML and Domain Modelling
The DataTank, RML and Domain ModellingLaurens De Vocht
 
Big Linked Data ETL Benchmark on Cloud Commodity Hardware
Big Linked Data ETL Benchmark on Cloud Commodity HardwareBig Linked Data ETL Benchmark on Cloud Commodity Hardware
Big Linked Data ETL Benchmark on Cloud Commodity HardwareLaurens De Vocht
 
Examples of how Blueback tools can expand and enhance your Petrel workflow
Examples of how Blueback tools can expand and enhance your Petrel workflowExamples of how Blueback tools can expand and enhance your Petrel workflow
Examples of how Blueback tools can expand and enhance your Petrel workflowMitch Sutherland
 
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked DataEffect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked DataLaurens De Vocht
 
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...Laurens De Vocht
 
OSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked OrganizationsOSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked OrganizationsLaurens De Vocht
 
Oil 101: Introduction to Oil and Gas - Upstream
Oil 101:  Introduction to Oil and Gas - UpstreamOil 101:  Introduction to Oil and Gas - Upstream
Oil 101: Introduction to Oil and Gas - UpstreamEKT Interactive
 
FUNDAMENTAL OF PETROLEUM EXPLORATION
FUNDAMENTAL OF PETROLEUM EXPLORATIONFUNDAMENTAL OF PETROLEUM EXPLORATION
FUNDAMENTAL OF PETROLEUM EXPLORATIONajlyotki
 
Introduction into Oil and Gas Industry. OIL: Part 1
Introduction into Oil and Gas Industry. OIL: Part 1Introduction into Oil and Gas Industry. OIL: Part 1
Introduction into Oil and Gas Industry. OIL: Part 1Fidan Aliyeva
 
Reservoir Modeling with Petrel
Reservoir Modeling with PetrelReservoir Modeling with Petrel
Reservoir Modeling with Petreladeeba setu
 

Destacado (20)

Petroleum exploration_developement_production_process
Petroleum exploration_developement_production_processPetroleum exploration_developement_production_process
Petroleum exploration_developement_production_process
 
MSc thesis’2011
MSc thesis’2011MSc thesis’2011
MSc thesis’2011
 
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
 
Discovering Meaningful Connections between Resources in the Web of Data
Discovering Meaningful Connections between Resources in the Web of DataDiscovering Meaningful Connections between Resources in the Web of Data
Discovering Meaningful Connections between Resources in the Web of Data
 
A Framework Concept for Profiling Researchers on Twitter using the Web of Data
A Framework Concept for Profiling Researchers on Twitter using the Web of DataA Framework Concept for Profiling Researchers on Twitter using the Web of Data
A Framework Concept for Profiling Researchers on Twitter using the Web of Data
 
Aligning Web Collaboration Tools with Research Data for Scholars
Aligning Web Collaboration Tools with Research Data for ScholarsAligning Web Collaboration Tools with Research Data for Scholars
Aligning Web Collaboration Tools with Research Data for Scholars
 
Providing Interchangeable Open Data to Accelerate Development of Sustainable ...
Providing Interchangeable Open Data to Accelerate Development of Sustainable ...Providing Interchangeable Open Data to Accelerate Development of Sustainable ...
Providing Interchangeable Open Data to Accelerate Development of Sustainable ...
 
The DataTank, RML and Domain Modelling
The DataTank, RML and Domain ModellingThe DataTank, RML and Domain Modelling
The DataTank, RML and Domain Modelling
 
Big Linked Data ETL Benchmark on Cloud Commodity Hardware
Big Linked Data ETL Benchmark on Cloud Commodity HardwareBig Linked Data ETL Benchmark on Cloud Commodity Hardware
Big Linked Data ETL Benchmark on Cloud Commodity Hardware
 
Examples of how Blueback tools can expand and enhance your Petrel workflow
Examples of how Blueback tools can expand and enhance your Petrel workflowExamples of how Blueback tools can expand and enhance your Petrel workflow
Examples of how Blueback tools can expand and enhance your Petrel workflow
 
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked DataEffect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
 
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
 
OSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked OrganizationsOSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked Organizations
 
Basics of petroleum geology
Basics of petroleum geologyBasics of petroleum geology
Basics of petroleum geology
 
Oil 101: Introduction to Oil and Gas - Upstream
Oil 101:  Introduction to Oil and Gas - UpstreamOil 101:  Introduction to Oil and Gas - Upstream
Oil 101: Introduction to Oil and Gas - Upstream
 
FUNDAMENTAL OF PETROLEUM EXPLORATION
FUNDAMENTAL OF PETROLEUM EXPLORATIONFUNDAMENTAL OF PETROLEUM EXPLORATION
FUNDAMENTAL OF PETROLEUM EXPLORATION
 
Introduction to Oil and Gas Industry - Upstream Midstream Downstream
Introduction to Oil and Gas Industry - Upstream Midstream DownstreamIntroduction to Oil and Gas Industry - Upstream Midstream Downstream
Introduction to Oil and Gas Industry - Upstream Midstream Downstream
 
Introduction into Oil and Gas Industry. OIL: Part 1
Introduction into Oil and Gas Industry. OIL: Part 1Introduction into Oil and Gas Industry. OIL: Part 1
Introduction into Oil and Gas Industry. OIL: Part 1
 
Reservoir Modeling with Petrel
Reservoir Modeling with PetrelReservoir Modeling with Petrel
Reservoir Modeling with Petrel
 
Oil and gas industry ppt
Oil and gas industry pptOil and gas industry ppt
Oil and gas industry ppt
 

Similar a Researcher Profiling based on Semantic Analysis in Social Networks

Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceRobert H. McDonald
 
ryssiouk_anastasia_arestyposter
ryssiouk_anastasia_arestyposterryssiouk_anastasia_arestyposter
ryssiouk_anastasia_arestyposterAnastasia Ryssiouk
 
Vellino presentationtocisti
Vellino presentationtocistiVellino presentationtocisti
Vellino presentationtocistiAndre Vellino
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleCarole Goble
 
THE Jisc Supplement 25 Nov 2009
THE Jisc Supplement 25 Nov 2009THE Jisc Supplement 25 Nov 2009
THE Jisc Supplement 25 Nov 2009Fiona Salvage
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?myGrid team
 
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
20120718 linkedopendataandnextgenerationsciencemcguinnessesip finalDeborah McGuinness
 
An Embeddable Dashboard for Widget-Based Visual Analytics on Scientific Commu...
An Embeddable Dashboard for Widget-Based Visual Analytics on Scientific Commu...An Embeddable Dashboard for Widget-Based Visual Analytics on Scientific Commu...
An Embeddable Dashboard for Widget-Based Visual Analytics on Scientific Commu...Michael Derntl
 
Trends in Human-Computer Interaction in Information Seeking
Trends in Human-Computer Interaction in Information SeekingTrends in Human-Computer Interaction in Information Seeking
Trends in Human-Computer Interaction in Information SeekingRich Miller
 
Riding The Semantic Wave
Riding The Semantic WaveRiding The Semantic Wave
Riding The Semantic WaveKaniska Mandal
 
Embracing Social Software And Semantic Web In Digital Libraries
Embracing Social Software And Semantic Web In Digital LibrariesEmbracing Social Software And Semantic Web In Digital Libraries
Embracing Social Software And Semantic Web In Digital LibrariesAkhmad Riza Faizal
 
Aiding Computer Aided Drug Design
Aiding Computer Aided Drug DesignAiding Computer Aided Drug Design
Aiding Computer Aided Drug DesignShahir Shamsir
 
Oral exam presentation
Oral exam presentationOral exam presentation
Oral exam presentationcurlybecca
 
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference Kaitlin Thaney
 
Knowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysisKnowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysisCOST action BM1006
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madnesssemanticsconference
 

Similar a Researcher Profiling based on Semantic Analysis in Social Networks (20)

Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability Science
 
ryssiouk_anastasia_arestyposter
ryssiouk_anastasia_arestyposterryssiouk_anastasia_arestyposter
ryssiouk_anastasia_arestyposter
 
Making Conversations Visible
Making Conversations VisibleMaking Conversations Visible
Making Conversations Visible
 
Vellino presentationtocisti
Vellino presentationtocistiVellino presentationtocisti
Vellino presentationtocisti
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote Goble
 
THE Jisc Supplement 25 Nov 2009
THE Jisc Supplement 25 Nov 2009THE Jisc Supplement 25 Nov 2009
THE Jisc Supplement 25 Nov 2009
 
119 128
119 128119 128
119 128
 
119 128
119 128119 128
119 128
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?
 
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
20120718 linkedopendataandnextgenerationsciencemcguinnessesip final
 
An Embeddable Dashboard for Widget-Based Visual Analytics on Scientific Commu...
An Embeddable Dashboard for Widget-Based Visual Analytics on Scientific Commu...An Embeddable Dashboard for Widget-Based Visual Analytics on Scientific Commu...
An Embeddable Dashboard for Widget-Based Visual Analytics on Scientific Commu...
 
Trends in Human-Computer Interaction in Information Seeking
Trends in Human-Computer Interaction in Information SeekingTrends in Human-Computer Interaction in Information Seeking
Trends in Human-Computer Interaction in Information Seeking
 
Riding The Semantic Wave
Riding The Semantic WaveRiding The Semantic Wave
Riding The Semantic Wave
 
Embracing Social Software And Semantic Web In Digital Libraries
Embracing Social Software And Semantic Web In Digital LibrariesEmbracing Social Software And Semantic Web In Digital Libraries
Embracing Social Software And Semantic Web In Digital Libraries
 
Aiding Computer Aided Drug Design
Aiding Computer Aided Drug DesignAiding Computer Aided Drug Design
Aiding Computer Aided Drug Design
 
An overview of JPSearch
An overview of JPSearchAn overview of JPSearch
An overview of JPSearch
 
Oral exam presentation
Oral exam presentationOral exam presentation
Oral exam presentation
 
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
 
Knowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysisKnowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysis
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madness
 

Último

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 

Último (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 

Researcher Profiling based on Semantic Analysis in Social Networks

  • 1. Researcher Profiling based on Semantic Analysis in Social Networks Laurens De Vocht Supervisors Promotors Gonzalo Parra Erik Duval Selver Softic Martin Ebner July 1, 2011
  • 2. Agenda ‣Background ‣Framework ‣Client Application ‣Deployment ‣Evaluation ‣Conclusion 2
  • 3. Background ‣Definitions ‣Problem Statement ‣The Social Semantic Web ‣Scope & Value of Study 3
  • 4. Definitions Profiling “Inferring unobser vable information about users from observable information about them, that is their actions or their utterances.” (Zukerman and Albrecht, 2001) Semantic Analysis “A technique using semantic-based tools and ontologies in order to gain a deeper understanding of the information being stored and manipulated in an existing system” (McComb, 2004) 4
  • 5. Problem Statement Web users generate a massive unstructured information flow ? Who has scientific information relevant for me? 5
  • 6. Problem Statement Connecting researchers based on shared scientific events (conferences) Scientific Profiling Scientific User Model Event Model Conferences Resource Researchers Profiler/ Analyzer Researcher (User) 6
  • 7. The Social Semantic Web Community of (micro)blogging, researchers with sharing, conference tagging, experience discussion semi-structured information Larger population of system people interested in (faceted) search scientific conferences engine recommendation clustered and engine analyzed data (Gruber, 2007) 7
  • 8. The Social Semantic Web Community of (micro)blogging, researchers with sharing, conference tagging, experience discussion semi-structured information Larger population of system people interested in (faceted) search scientific conferences engine recommendation clustered and engine analyzed data Human process (Gruber, 2007) 7
  • 9. The Social Semantic Web Community of (micro)blogging, researchers with sharing, conference tagging, experience discussion semi-structured information Larger population of system people interested in (faceted) search scientific conferences engine recommendation clustered and engine analyzed data Human process Machine process (Gruber, 2007) 7
  • 10. The Social Semantic Web Social Web Community of (micro)blogging, researchers with sharing, conference tagging, experience discussion semi-structured information Larger population of system people interested in (faceted) search scientific conferences engine recommendation clustered and engine analyzed data Human process Machine process (Gruber, 2007) 7
  • 11. The Social Semantic Web Social Web Semantic Web Community of (micro)blogging, researchers with sharing, conference tagging, experience discussion semi-structured information Larger population of system people interested in (faceted) search scientific conferences engine recommendation clustered and engine analyzed data Human process Machine process (Gruber, 2007) 7
  • 12. The Social semantic Web ‣Hashtags as Identifiers ‣not always strong or consistent enough ‣properties of good hashtags formalized ‣helpful in assessment of valuable identifiers (Laniado and Mika, 2007) ‣Expert Search/Profiling with Linked Data ‣aggregate and analyze certain types of data ‣need to surpass limits of closed data sets ‣LOD delivers multi-purpose data (Stankovic et al., 2010) 8
  • 13. Scope & Value of the Study ‣Bridging research areas Human Computer-Interaction & Semantic Analysis ‣Mining usable data out of social networks (microblogs) ‣Integration Social network data and linked open data ‣Framework driven methodology based upon current state-of-the-art semantic tools ‣Evaluation proof-of-concept Research 2.0 application 9
  • 14. Solution Annotate Data from Social Networks Community approved ontologies: FOAF, SIOC Linked Open Data Applications Scientific Profiling Framework Connect People and Resources that share Scientific Affinities 10
  • 15. Framework ‣Overview ‣Grabeeter ‣Architecture ‣Web Service 11
  • 16. Framework: Overview Social Linked Open Output Format Networks Data Cloud Framework Aggregate Interlink Publish Archived/Cached Scientific Linked Data Information Data Annotate Analyse 12
  • 17. Framework: Overview Social Linked Open Output Format Networks Data Cloud Framework Aggregate Interlink Publish Archived/Cached Scientific Linked Data Information Data Annotate Analyse DBPedia JSON Twitter Colinda RDF (XML) GeoNames Aggregate Interlink Publish Semantic Scientific Grabeeter Profiling API Annotate Profiling Network Analyse 12
  • 18. Framework: Overview Social Linked Open Output Format Networks Data Cloud Framework Aggregate Interlink Publish Archived/Cached Scientific Linked Data Information Data Annotate Analyse DBPedia JSON Twitter Colinda RDF (XML) GeoNames Aggregate Interlink Publish Semantic Scientific Grabeeter Profiling API Annotate Profiling Network Analyse 13
  • 19. Framework: Grabeeter = Twitter aggregation & archiving tool (developed at TUGraz) 14
  • 20. Framework: Grabeeter = Twitter aggregation & archiving tool (developed at TUGraz) 14
  • 21. Framework: Overview Social Linked Open Output Format Networks Data Cloud Framework Aggregate Interlink Publish Archived/Cached Scientific Linked Data Information Data Annotate Analyse DBPedia JSON Twitter Colinda RDF (XML) GeoNames Aggregate Interlink Publish Semantic Scientific Grabeeter Profiling API Annotate Profiling Network Analyse 15
  • 22. Framework: Architecture Applications Analysis Extraction Interlinking Grabeeter RDF Store 16
  • 23. Framework: Web Service ‣ User Profile http://api.semanticprofiling.net/profile.php?user=screen_name ‣ Discover people, events... http:///discovery.php? find=persons|events|popular_friends|popular_mentions|popular_events user=screen_name ‣ Register new Twitter user http://api.semanticprofiling.net/register.php?user=twitter_user ‣ Event Details http://api.semanticprofiling.net/event.php?name=event_name 17
  • 29. Deployment <device> <device> Grabeeter Server Researcher Affinity Browser Server <execution environment> <execution environment> Application Server RDMS <execution environment> Application Server crawling MySQL Database scripts AffinityBrowser.war <device> Semantic Profiling Network Server <execution environment> PHP Server provider.php interlink.php profile.php <execution environment> RDMS discovery.php event.php MySQL Database allusers.php 22
  • 30. Evaluation ‣Approach ‣Usability ‣Usefulness ‣Discussion 23
  • 31. Evaluation: Approach ‣Test usability & usefulness ‣Web application: “Researcher Affinity Browser” ‣Using explicit evaluation questionnaire 24
  • 32. Evaluation: Usability ‣Definitely useful application ‣Use of the map view makes sense ‣People - Event split confusing ‣View of own profile ‣not a suitable starting point ‣only useful in comparison ‣shouldn’t be always visible ‣Person-specific affinities ‣too much hidden 25
  • 35. Evaluation: Usefulness ‣Relevance Test users rate their search results ‣Satisfaction questionnaire Targeted questions about usefulness Allow comments on user interface 28
  • 36. Evaluation: Usefulness Relevant user percentage Number of users 0% (None) 1-20% (A few) 21-40% (Less than one half) 41-60% (About one half) 61-80% (More than one half) 81-99% (Almost all) 100% (All) 0 1 2 3 4 29
  • 37. Evaluation: Usefulness Usefulness Questionnaire Results Concept Affinity Clear view of affinities between people Map & Plot combination understood Deactivating filer fast enough Activating filer fast enough Never usability glitches Convention between views understood Information display not overwhelming (confusing) Relevant detailed person info Shown details correspond with ‘real life’ activities Enough relevant (new) persons Daily updating of information obvious Twitter data made more useful for researchers 1 2 3 4 5 30
  • 38. Evaluation: Discussion ‣ Affinities exposed in an engaging way ‣ Relevant users rating OR Many common entities trigger positive rating OR Common entities start deeper investigation ‣ Reliability of person details hard to verify ‣ UI satisfaction user dependent ‣ What does the user expect from “Affinity Browser”? ‣ Test different scenarios to identify usage types? 31
  • 39. Future work ‣ Rank tags by importance, not just frequency of use ‣ Visualization improve viewing of links between users and entities ‣ Multiple Resources better reliability and more verification of data 32
  • 40. Conclusion ‣ Framework could support many social semantic-based applications ‣ Realized with current state-of-the-art technologies ‣ Interlinking with Linked Open Data Cloud enriches social network data ‣ Researcher Affinity Browser ‣ Exposes affinities between users ‣ User feedback affirms positively new view on social data ‣ Hash tags identified as conferences provide consistent links 33

Notas del editor

  1. \n
  2. \n
  3. Results: so far\n
  4. \n
  5. To make progress in research it is important to get in touch and share ideas with people who share affinities.\nOne of the most visible trends on the internet is the emergence of &amp;#x201C;Social Web&amp;#x201D; sites.\nCurrent online community sites are isolated fromone another. The main reason for this\nlack of interoperability is the fact that common standards for data interchange still have\nto arise.\nWe propose a framework to address an important issue in the context of the ongoing\nadoption of the &amp;#x201C;Web 2.0&amp;#x201D; in science and research, often referred to as &amp;#x201C;Science 2.0&amp;#x201D; or\n&amp;#x201C;Research 2.0&amp;#x201D;. A growing number of people are linked via acquaintances and online\nsocial networks such as Twitter allow indirect access to a huge amount of ideas. These\nideas are contained in amassive human information &amp;#xFB02;ow. That users of these networks\nproduce relevant data is being shown in many studies. The problem however lies in\ndiscovering and verifying such a stream of unstructured data items. Another related\nproblem is locating an expert that could provide an answer to a very speci&amp;#xFB01;c research\nquestion.\n\n
  6. The goal is to build a semantic pro&amp;#xFB01;ling framework that can support applications and\nservices that try to improve the connecting of researchers.\nThemain use case and application that the framework has to support is illustrated by\nwhat could be called: &amp;#x201C;the conference case&amp;#x201D;. Scientists and researchers are interested in\nvery speci&amp;#xFB01;c topics, this is best veri&amp;#xFB01;ed by the conferences they are attending. Another\ntrend is that they all blog and tweet about these events[14][10]. This creates huge opportunities for pro&amp;#xFB01;ling. The attendees tweet about what they notice, what they remark\nas interesting for their own projects. What if we could connect these users using this\ninformation? We could call an application that does just that &amp;#x201C;Scienti&amp;#xFB01;c Pro&amp;#xFB01;ling&amp;#x201D;. This\napproach comes from the concept that the data produced in social networks can have\ntrue value if properly annotated and interlinked [5]. A second requirement is to create\na suitable context in which this information can get meaning. This is very important to\nidentify which ontologies should be used\n
  7. Social Semantic Web Application - A Collective Knowledge System.\nThe essential difference between the classic Web and the Semantic Web is that structured data is exposed in a structured way.&amp;#xA0; For example, the classic Web might have a document that mentions a place, &quot;Paris&quot;.&amp;#xA0; The conventional way to find this document on the Web is to search for the term &quot;Paris&quot; in a search engine.&amp;#xA0; Similarly, to find out more about the place one would plow through the search results on the term &quot;Paris&quot; and manually pick out the pages that seem to have something to do with the place.&amp;#xA0; The heuristics employed by today&apos;s search engines for inferring what one means by the string &quot;Paris&quot; are biased by popularity, which means that one will encounter many pages about a celebrity heiress en route to the French capital.\nThe Semantic Web vision is to point to a representation of the entity, in this case a city, rather than its surface manifestation. Thus to find the city Paris, one would search for things known to be cities for entities whose names match &quot;Paris&quot;, possibly limiting the results to cities of a certain size or in a particular country. Then one might look for information of the desired type about the city, such as maps, travel guides, restaurants, or famous people who lived in Paris during some period of history.&amp;#xA0; The heuristics for searching the Semantic Web depend on conventions about how to represent things like cities (such as those specified in ontologies), and the availability of data which use these conventions.&amp;#xA0; Such data is not available for most user contributions in the Social Web. To move to the next level of collective knowledge systems, it would be nice to get the benefits of structured data from the systems that give rise to the Social Web.\nGruber argues that the Social Web and the Semantic Web should be combined, and that collective knowledge systems are the &quot;killer applications&quot; of this integration.&amp;#xA0; The keys to getting the most from collective knowledge systems, toward true collective intelligence, are tightly integrating user-contributed content and machine-gathered data, and harvesting the knowledge from this combination of unstructured and structured information.\n
  8. Social Semantic Web Application - A Collective Knowledge System.\nThe essential difference between the classic Web and the Semantic Web is that structured data is exposed in a structured way.&amp;#xA0; For example, the classic Web might have a document that mentions a place, &quot;Paris&quot;.&amp;#xA0; The conventional way to find this document on the Web is to search for the term &quot;Paris&quot; in a search engine.&amp;#xA0; Similarly, to find out more about the place one would plow through the search results on the term &quot;Paris&quot; and manually pick out the pages that seem to have something to do with the place.&amp;#xA0; The heuristics employed by today&apos;s search engines for inferring what one means by the string &quot;Paris&quot; are biased by popularity, which means that one will encounter many pages about a celebrity heiress en route to the French capital.\nThe Semantic Web vision is to point to a representation of the entity, in this case a city, rather than its surface manifestation. Thus to find the city Paris, one would search for things known to be cities for entities whose names match &quot;Paris&quot;, possibly limiting the results to cities of a certain size or in a particular country. Then one might look for information of the desired type about the city, such as maps, travel guides, restaurants, or famous people who lived in Paris during some period of history.&amp;#xA0; The heuristics for searching the Semantic Web depend on conventions about how to represent things like cities (such as those specified in ontologies), and the availability of data which use these conventions.&amp;#xA0; Such data is not available for most user contributions in the Social Web. To move to the next level of collective knowledge systems, it would be nice to get the benefits of structured data from the systems that give rise to the Social Web.\nGruber argues that the Social Web and the Semantic Web should be combined, and that collective knowledge systems are the &quot;killer applications&quot; of this integration.&amp;#xA0; The keys to getting the most from collective knowledge systems, toward true collective intelligence, are tightly integrating user-contributed content and machine-gathered data, and harvesting the knowledge from this combination of unstructured and structured information.\n
  9. Social Semantic Web Application - A Collective Knowledge System.\nThe essential difference between the classic Web and the Semantic Web is that structured data is exposed in a structured way.&amp;#xA0; For example, the classic Web might have a document that mentions a place, &quot;Paris&quot;.&amp;#xA0; The conventional way to find this document on the Web is to search for the term &quot;Paris&quot; in a search engine.&amp;#xA0; Similarly, to find out more about the place one would plow through the search results on the term &quot;Paris&quot; and manually pick out the pages that seem to have something to do with the place.&amp;#xA0; The heuristics employed by today&apos;s search engines for inferring what one means by the string &quot;Paris&quot; are biased by popularity, which means that one will encounter many pages about a celebrity heiress en route to the French capital.\nThe Semantic Web vision is to point to a representation of the entity, in this case a city, rather than its surface manifestation. Thus to find the city Paris, one would search for things known to be cities for entities whose names match &quot;Paris&quot;, possibly limiting the results to cities of a certain size or in a particular country. Then one might look for information of the desired type about the city, such as maps, travel guides, restaurants, or famous people who lived in Paris during some period of history.&amp;#xA0; The heuristics for searching the Semantic Web depend on conventions about how to represent things like cities (such as those specified in ontologies), and the availability of data which use these conventions.&amp;#xA0; Such data is not available for most user contributions in the Social Web. To move to the next level of collective knowledge systems, it would be nice to get the benefits of structured data from the systems that give rise to the Social Web.\nGruber argues that the Social Web and the Semantic Web should be combined, and that collective knowledge systems are the &quot;killer applications&quot; of this integration.&amp;#xA0; The keys to getting the most from collective knowledge systems, toward true collective intelligence, are tightly integrating user-contributed content and machine-gathered data, and harvesting the knowledge from this combination of unstructured and structured information.\n
  10. Social Semantic Web Application - A Collective Knowledge System.\nThe essential difference between the classic Web and the Semantic Web is that structured data is exposed in a structured way.&amp;#xA0; For example, the classic Web might have a document that mentions a place, &quot;Paris&quot;.&amp;#xA0; The conventional way to find this document on the Web is to search for the term &quot;Paris&quot; in a search engine.&amp;#xA0; Similarly, to find out more about the place one would plow through the search results on the term &quot;Paris&quot; and manually pick out the pages that seem to have something to do with the place.&amp;#xA0; The heuristics employed by today&apos;s search engines for inferring what one means by the string &quot;Paris&quot; are biased by popularity, which means that one will encounter many pages about a celebrity heiress en route to the French capital.\nThe Semantic Web vision is to point to a representation of the entity, in this case a city, rather than its surface manifestation. Thus to find the city Paris, one would search for things known to be cities for entities whose names match &quot;Paris&quot;, possibly limiting the results to cities of a certain size or in a particular country. Then one might look for information of the desired type about the city, such as maps, travel guides, restaurants, or famous people who lived in Paris during some period of history.&amp;#xA0; The heuristics for searching the Semantic Web depend on conventions about how to represent things like cities (such as those specified in ontologies), and the availability of data which use these conventions.&amp;#xA0; Such data is not available for most user contributions in the Social Web. To move to the next level of collective knowledge systems, it would be nice to get the benefits of structured data from the systems that give rise to the Social Web.\nGruber argues that the Social Web and the Semantic Web should be combined, and that collective knowledge systems are the &quot;killer applications&quot; of this integration.&amp;#xA0; The keys to getting the most from collective knowledge systems, toward true collective intelligence, are tightly integrating user-contributed content and machine-gathered data, and harvesting the knowledge from this combination of unstructured and structured information.\n
  11. Laniado and Mika found that not all hashtags are used in the same way, not all of them aggregate messages around a community or a topic, not all of them endure in time, and not all of them have an actual meaning. In this work they had addressed the issue of evaluating Twitter hashtags as strong identifiers, as a first step in order to bridge the gap between Twitter and the Semantic Web. The first contribution of this paper stands in the formalization of the problem, and in the elaboration of a number of desired properties for a good hashtag to serve as a URI. Frequency, specificity, consistency in usage and stability over time. Based on these data, they had tested the results obtained with the algorithms described in their paper, showing how a combination of the proposed measures can help in the task of assessing which tags are more likely to represent valuable identifiers. These results are promising, with respect to the perspective of anchoring Twitter hashtags to Semantic Web URIs, and to detect concepts and entities valuable to be treated as new identifiers.\n The authors concluded that expert search and profiling systems aggregate and analyze certain types of data depending on the types of expertise hypotheses they use. Traditional approaches tend to retrieve their data from closed or limited data corpuses. LOD on the other hand allows querying the whole Web like a huge database, thus surpassing the limits of closed data sets, and closed online communities. They believe that this opens new possibilities for traditional expert search and profiling systems which usually only rely on data from their local and limited databases or on unstructured data gathered from the Web. LOD also stands up for a great promise to deliver mutli purpose data that can be used to find experts in many domains and with many different expertise hypotheses. In this paper they have explored the potentials and drawbacks of LOD in comparison to traditional datasources used for expert search. They haven&amp;#x2019;t only asked the question what LOD can do, but also what one can do for LOD to make it an even better source of expertise evidence.\n
  12. The study spans two main areas - semantic analysis and usability. The current state of the art Semantic Web standards and processes are used as a foundation for this study. Researcher profiling applications integrate human computer interaction (HCI) and expert finding. Everybody who is interested in the semantic web, microblogging and profiling might find some parts of this thesis relevant.\nThe approach presented aims at gaining more knowledge and mining usable data out of social networks, especially microblogs, with a framework driven methodology based upon Semantic Web standards and tools. Introducing the interesting aspects about microblogs, this thesis tries to answer how far they correspond with ideas from other research areas like Science 2.0, Research 2.0, Semantic Web or Linked Data and to outline the importance and relevance of such or similar efforts by examples and arguments from current research and with examples from current work.\nIt is to be noted that neither the literature study nor the software architecture want to give a broad overview of the current semantic web and microblogging services. It is targeted as a carefully considered selection of articles that allows the development of researcher profiling applications. The architecture of the framework is being designed only with the problem statement in mind. At this time it is not part of the research to find out how this could be extended to other resources or targets (e.g. mobile applications). It focuses on the integration of user data from a microblogging service and domain knowledge from scientific conferences.\n
  13. The SemanticWeb Technology stack is well de&amp;#xFB01;ned and applying frameworks\nsuch as SIOC (Semantically Interlinked Online Communities) [4] and FOAF (Friend-Of-A-Friend) [2] can lead to a an interlinked and semantically rich knowledge source. This\nknowledge source will be built with user pro&amp;#xFB01;les and the content they produce on various\nsocial networks as a basis.\nTwitter contains infos on:\nPeople, Organisations, Locations, Trends &amp;#x2026;\nLOD Cloud contains\nBillions of triples about:\nGeolocations , data about science, government, common knowledge , persons, news &amp;#x2026;\n
  14. Results: so far\n
  15. The idea is to design, develop and implement a framework that collects data from social networks and uses community approved ontologies and linked open data to analyze and verify the data.\n
  16. The idea is to design, develop and implement a framework that collects data from social networks and uses community approved ontologies and linked open data to analyze and verify the data.\n
  17. Aggregate your Tweets, Search in your Tweets offline using the Grabeeter Client. Grabeeter [45] is an application that allows you to search tweets of a single Twitter user online and offline. In contrast to the Twitter API, Grabeeter provides all stored tweets and makes no restriction over time.The Grabeeter web application uses the Twitter API to retrieve tweets of predefined users. Tweets are stored in the Grabeeter database and on the file system as Apache Lucene[2] index. In order to ensure an efficient search tweets must be indexed.\n\n
  18. The idea is to design, develop and implement a framework that collects data from social networks and uses community approved ontologies and linked open data to analyze and verify the data.\n
  19. The semantic pro&amp;#xFB01;ling framework has to support a Scienti&amp;#xFB01;c Pro&amp;#xFB01;ling application as\nwas explained in the problem statement in chapter 1. The framework architecture still\nconsists of three layers:\n1. Extraction layer: Extracts data fromvarious resources and annotates it using relevant ontologies for that speci&amp;#xFB01;c data context.\n2. Interlinking layer: Is feeded with annotated data (triples) and creates a SPARQL\nendpoint for it. It is responsible for requesting more data if needed for a certain\ninformation query. It parsers high level queries and translates them tot SPARQL\nQueries. The results are then being returned.\n3. Analysis layer: Here a user information needs are translated into high level queries\nthat the interlinking layer understands. It also contains somemetrics to rank and\nevaluate the returned results.\n
  20. \n
  21. The user profile\n
  22. Related entities for a user\n
  23. Suggested conferences for a user\n
  24. Suggested users &amp; info for a specific event\n
  25. The test application is deployed on the Google App Engine server. This makes the maintenance straightforward and the deployment simple.\n\nGrabeeter consists of several scripts that are crawling the registered users Twitter accounts. Everything is stored in a MySQL database. Requests from the Semantic Profiling network are querying the Grabeeter MySQL database directly.\n\nThe semantic profiling server has several scripts that maintain the high level functionality. Two scripts are run periodically to keep the linked data network up to date. Other scripts realize the API functionality.\nThe &amp;#x201C;provider&amp;#x201D; script checks the Grabeeter database for new users. If there are new users their data is passed on to the Extraction module for annotation and triplification. For existing users, new tweets are fetched and triplified.\nThe &amp;#x201C;interlinking&amp;#x201D; script goes through all tags and first compares them against the Colinda repository. Any found conference tags are annotated appropriately. Secondly the script checks if tags represent a location or a common knowledge entity.\nThe scripts &amp;#x201C;person&amp;#x201D;, &amp;#x201C;event&amp;#x201D; and&amp;#x201C;discovery&amp;#x201D; implement the API functionality. They use the arguments given by the REST call. The return is every time a JSON Object containing the result of the call. The script &amp;#x201C;allusers&amp;#x201D; returns a JSON Array that contains all users currently in the system.\n
  26. Results: so far\n
  27. Avoid user interface is an issue (hide it)\nFocus should be on the data\nFixed data: users are presented a role and have to find a good matching conference and expert.\n\n
  28. \n
  29. Affinities as a starting point. Affinities are now facets to filter the result list of people. Instead of popup windows, tabs with details about each user appear in the bottom.\n
  30. Affinities as a starting point. Affinities are now facets to filter the result list of people. Instead of popup windows, tabs with details about each user appear in the bottom.\n
  31. \n
  32. \n
  33. Positive agreement among users\n1: Concept Affinity\n3: Understandable combination with affinity plot\n7: Convention between views understood\n13: Twitter data is made more useful for researchers\n\nNo agreement among users\n2: Clear view of affinities between people\n4,5: Filter (de)activation\n6: Never usability glitches\n8: Information display not overwhelming/confusing\n12: Daily updates of information obvious\n\n
  34. \n
  35. The more resources, the more types of entities can be interlinked to improve the verifiability of the results. The framework can easily be enriched easily with additional RDF resources, a new handle in the Interlinking module suffices. Some more effort has to be done to add data from another source that is not yet available as RDF. In that case it is necessary to write an additional Model class for the Extraction module and a handle in the Annotator class that includes data from that module by annotating it appropriately. This process is completely comparable to the extraction of Twitter data presented in this thesis. On the high level, new functionality can easily be added by proper translation into SPARQL queries. As more different data models and resources become available it might be of interest to extend the API as such. Again the same approach can be used as for the discovery and presentation of persons and scientific events.\n
  36. The framework serves as a powerful backend for a web service. In the requirements of the current framework we focused on the ability to extract, annotate and interlink data from Twitter and make the linked data available as a SPARQL Endpoint and a Web Service that allows high level requests. The architecture is based on state-of-the-art technologies and brings in a novel approach of usage and dissemination of knowledge cumulated in social networks. It uses semantic tools and techniques for the domains of appliance like Research 2.0.\nThe web service behaves as a REST API and can support applications that want to propose interesting people or interesting scientific events to their users. It is possible to create an application that connects people who attend or mention the same scientific conference, as soon as they both have made their social data available to the system. We have shown that the enrichment of social network data with linked data leads to a verifiable user profile that allows comparison with others alike.\nThe demonstration application introduces the concept &amp;#x201C;affinity&amp;#x201D;. The concept has only been used a few times before, but for a similar purpose: to expose an otherwise hidden proximity to or liking for specific aspects. The usefulness of this approach and its presentation has been reviewed positively by test users from the target group, researchers. They appreciated the use of affinities. Their feedback exposed what we learned in theory from the literature study: the use of linked data shapes a whole new view on existing social data. By interlinking tags to scientific conferences we are able to display verified entities. We noted for example that the choice for hash tags lead to enough identified\n\n