A Hybrid Semantic Approach to Building 
Dynamic Maps of Research Communities 
Francesco Osborne, Beppe Scavo, Enrico Motta 
KMi, The Open University, United Kingdom 
November 27th 2014
Research communities 
The engine of research.
We need to understand how scientific communities 
adapt and cooperate to implement visions into 
concrete technologies.
Research communities 
Communities of academic authors are usually identified by using 
standard community detection algorithms, which typically 
exploit co-authorship or citation graphs.
Temporal topic-based communities (TTC) 
A different type of community we investigated is formed by the 
set of researchers who, at a given time, are following a shared 
research trajectory, i.e. they are working on the same topics at 
the same time. 
Osborne, F., Scavo, G., & Motta, E. (2014). Identifying diachronic topic-based research 
communities by clustering shared research trajectories. In The Semantic Web: Trends and 
Challenges (pp. 114-129). Springer International Publishing.
Research Communities Map Builder 
• RCMB is able to automatically link diachronic topic-based 
communities over subsequent time intervals to 
identify significant events. 
• These include topic shifts within a community; the 
appearance and fading of a community; communities 
splitting, merging, spawning other communities; etc. 
• The output of RCMB is a map of research 
communities, annotated with the detected events, 
which provides a concise visual representation of the 
dynamics of a research area.
RCMB steps: 
1. Applies the Temporal Semantic Topic-Based 
Clustering (TST) algorithm to find Temporal topic-based 
communities in different time intervals; 
2. Detects Topic Shifts; 
3. Links Communities in different years; 
4. Detects Key Events.
Temporal Semantic Topic-Based Clustering 
Osborne, F., Scavo, G., & Motta, E. (2014). Identifying diachronic topic-based 
research communities by clustering shared research trajectories. In The 
Semantic Web: Trends and Challenges (pp. 114-129). Springer International 
Publishing.
TST in short 
1. It augments the topic semantically using an automatically 
generated OWL ontology and represents each author as a 
semantic topic distribution over subsequent years. 
2. It weighs each topic according to its relationship with the 
main topic, to highlight the communities strongly 
related to the main topic. 
3. It clusters authors using the ATTS (Adjusted Temporal 
Topic Similarity), which is computed by averaging the 
cosine similarities of the topic vectors over progressively 
smaller intervals of time.
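The third step can be illustrated with a minimal sketch. It assumes one plausible reading of "progressively smaller intervals": every contiguous window of years, from the whole period down to single years, with topic vectors summed inside each window. The function names are hypothetical, and the semantic augmentation and topic weighting from steps 1 and 2 are not modelled, so this is not the ATTS formula itself.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def window_sum(yearly_vectors, start, length):
    """Sum the yearly topic vectors inside one window of consecutive years."""
    return [sum(column) for column in zip(*yearly_vectors[start:start + length])]

def temporal_topic_similarity(author_a, author_b):
    """Average cosine similarity over windows of decreasing length.

    author_a and author_b are lists of yearly topic vectors covering the
    same years. Assumption: windows are all contiguous runs of years.
    """
    n = len(author_a)
    if n == 0 or n != len(author_b):
        raise ValueError("both authors need the same, non-empty list of years")
    per_length = []
    for length in range(n, 0, -1):  # whole period first, single years last
        sims = [
            cosine(window_sum(author_a, s, length), window_sum(author_b, s, length))
            for s in range(n - length + 1)
        ]
        per_length.append(sum(sims) / len(sims))
    return sum(per_length) / len(per_length)

# Two authors with the same overall topics, but in opposite years.
a = [[1, 0], [0, 1]]
b = [[0, 1], [1, 0]]
print(temporal_topic_similarity(a, a))
print(temporal_topic_similarity(a, b))
```

In this toy case the two authors look identical over the whole period but share nothing in any single year, so the averaged score falls to about 0.5 instead of 1. This is the effect a temporal similarity is meant to capture.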
Detecting Topic Shifts 
We use a sliding window algorithm that checks for a topic shift 
by comparing the initial topic distribution at time t with the topic 
distributions at times t+1, t+2, …, t+n. 
Information Extraction/Semantic Annotation community 
2002 
Information Extraction: 26% 
Natural Language: 17% 
Named Entity: 12% 
Machine Learning: 9% 
Knowledge Base: 9% 
2006 
Semantic Annotation: 25% 
Knowledge Base: 15% 
Semantic Wiki: 11% 
Information Extraction: 10% 
Semantic Information: 8% 
Natural Language: 6% 
Information Retrieval: 6% 
2010 
Linked Data: 16% 
Natural Language: 15% 
Semantic Annotation: 15% 
SW Technology: 10% 
Information Retrieval: 10% 
Knowledge Base: 9% 
Semantic Wiki: 9%
Detecting Topic Shifts 
We define a topic shift as a statistically significant change (detected 
via a chi-square test) in the topic distribution of a community 
which occurred in a certain time interval. 
To detect which topics were the main protagonists of this shift, 
we apply the same test excluding a different topic each time, 
and select the topic whose absence yields the largest 
increase in the p-value.
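The sliding-window check and the leave-one-topic-out step can be sketched as follows. This is a minimal sketch under stated assumptions: topic distributions are given as raw counts (the slides show only percentages), the significance level is 0.05, and the chi-square p-value is computed by a small hand-written helper. All function names and the sample counts are invented for illustration.

```python
import math

def chi2_sf(x, df):
    """P(X > x) for a chi-square variable with integer df >= 1."""
    z = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-z)
    else:
        a, q = 0.5, math.erfc(math.sqrt(z))
    while a < df / 2.0:  # recurrence Q(a+1, z) = Q(a, z) + z^a e^-z / Gamma(a+1)
        q += z ** a * math.exp(-z) / math.gamma(a + 1.0)
        a += 1.0
    return q

def shift_p_value(before, after):
    """Chi-square test of homogeneity on two {topic: count} dicts."""
    topics = sorted(t for t in set(before) | set(after)
                    if before.get(t, 0) + after.get(t, 0) > 0)
    n1 = sum(before.get(t, 0) for t in topics)
    n2 = sum(after.get(t, 0) for t in topics)
    if len(topics) < 2 or n1 == 0 or n2 == 0:
        return 1.0  # nothing to compare
    stat = 0.0
    for t in topics:
        column = before.get(t, 0) + after.get(t, 0)
        for observed, row in ((before.get(t, 0), n1), (after.get(t, 0), n2)):
            expected = row * column / (n1 + n2)
            stat += (observed - expected) ** 2 / expected
    return chi2_sf(stat, len(topics) - 1)

def first_shift(counts_by_year, start, n, alpha=0.05):
    """Compare year `start` with start+1 .. start+n; return the first shifted year."""
    for year in range(start + 1, start + n + 1):
        if year in counts_by_year and \
                shift_p_value(counts_by_year[start], counts_by_year[year]) < alpha:
            return year
    return None

def main_topic(before, after):
    """Topic whose removal raises the p-value the most."""
    def p_without(topic):
        return shift_p_value({t: c for t, c in before.items() if t != topic},
                             {t: c for t, c in after.items() if t != topic})
    return max(sorted(set(before) | set(after)), key=p_without)

# Invented counts, for illustration only.
counts = {
    2002: {"information extraction": 30, "natural language": 20, "semantic annotation": 2},
    2003: {"information extraction": 28, "natural language": 21, "semantic annotation": 3},
    2004: {"information extraction": 12, "natural language": 18, "semantic annotation": 25},
}
shift_year = first_shift(counts, 2002, n=2)
print(shift_year, main_topic(counts[2002], counts[shift_year]))
```

Because every leave-one-out p-value is compared against the same baseline, picking the largest p-value is the same as picking the largest increase.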
Community linking 
We are interested in two different types of link between communities: 
• The strong link is defined as a link that connects the same 
community in subsequent timeframes. 
• The weak link is defined as the link that connects community 
C1 with community C2 in a subsequent timeframe, if C1 has an 
impact over C2 in terms of migrating authors and/or topics.
Community linking 
We take the minimum values of ts 
and tw that minimize the MEF using 
the Nelder-Mead algorithm.
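As a rough illustration of how two thresholds can separate strong from weak links, here is a minimal sketch. It assumes that ts and tw are cut-offs on a link score, and it uses a stand-in score: the share of C1's authors who reappear in C2. The measure actually used by RCMB also takes topics into account and is not given on these slides. The threshold values, function names and author names are hypothetical, and the MEF optimisation is not shown.

```python
def link_kind(c1_authors, c2_authors, ts=0.5, tw=0.1):
    """Classify the link from C1 to C2 as "strong", "weak" or None.

    Stand-in score (an assumption): fraction of C1's authors found in C2.
    """
    if not c1_authors:
        return None
    share = len(c1_authors & c2_authors) / len(c1_authors)
    if share >= ts:
        return "strong"
    if share >= tw:
        return "weak"
    return None

def link_intervals(previous, current, ts=0.5, tw=0.1):
    """Return (source, target, kind) for every linked pair of communities.

    previous and current map a community label to its set of authors.
    """
    links = []
    for source, source_authors in previous.items():
        for target, target_authors in current.items():
            kind = link_kind(source_authors, target_authors, ts, tw)
            if kind is not None:
                links.append((source, target, kind))
    return links

# Invented communities, for illustration only.
communities_2006 = {"C1@2006": {"ann", "bob", "cat", "dan"}}
communities_2007 = {
    "C1@2007": {"ann", "bob", "cat", "eve"},
    "C2@2007": {"dan", "fay"},
}
links = link_intervals(communities_2006, communities_2007)
print(links)
```

Here three of the four 2006 authors stay together, which yields a strong link, while the single author who moves to C2 yields a weak one.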
Key Events detection 
If a community has no strong links with any community in the 
preceding interval, we detect the appearance of a 
community. 
[Diagram: communities C1, C2 and C3 across the 2006 and 2007 intervals]
Key Events detection 
If a community has no strong links with any community in the 
subsequent interval, we detect the fading of a community. 
[Diagram: communities C1, C2 and C3 across the 2006 and 2007 intervals]
Key Events detection 
If a community is linked to more than one community in the 
subsequent interval and one of the links is a strong one, we 
detect the forking of one or more communities out of the 
community characterized by the strong link. 
[Diagram: communities C1 and C2 across the 2006 and 2007 intervals]
Key Events detection 
If a community is linked to more than one community in the 
subsequent interval and none of the links is a strong one, we 
detect the splitting of a community into multiple communities. 
[Diagram: communities C1, C2 and C3 across the 2006 and 2007 intervals]
Key Events detection 
If two or more communities are linked to one community in 
the subsequent interval and one of the inlinks is a strong link, 
we detect the assimilation of one or more communities into 
the community C characterized by the strong link. 
[Diagram: communities C1 and C2 across the 2006 and 2007 intervals] 
If the communities fade after the event, they are labelled 
as absorbed into C.
Key Events detection 
If two or more communities are linked to one community in 
the subsequent interval and none of the inlinks is a strong 
link, we detect the merging of two or more communities into a 
new community C. 
[Diagram: communities C1, C2 and C3 across the 2006 and 2007 intervals] 
If the communities fade after the event, they are labelled 
as merged into C.
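The six key-event rules can be sketched as one pass over the links between two consecutive intervals. This is a minimal sketch, assuming links are already available as (source, target, kind) triples with kind "strong" or "weak". The function name and community labels are hypothetical, and the "absorbed" and "merged" labels for communities that fade afterwards are not modelled.

```python
from collections import defaultdict

def detect_key_events(previous, current, links):
    """Return a list of (event, community) pairs for one pair of intervals.

    previous / current: community labels in the earlier / later interval.
    links: iterable of (source, target, kind), kind in {"strong", "weak"}.
    """
    out_kinds = defaultdict(list)  # kinds of links leaving each earlier community
    in_kinds = defaultdict(list)   # kinds of links entering each later community
    for source, target, kind in links:
        out_kinds[source].append(kind)
        in_kinds[target].append(kind)

    events = []
    for community in current:
        kinds = in_kinds[community]
        if "strong" not in kinds:
            events.append(("appearance", community))
        if len(kinds) >= 2:  # two or more communities flow into this one
            events.append(("assimilation" if "strong" in kinds else "merging", community))
    for community in previous:
        kinds = out_kinds[community]
        if "strong" not in kinds:
            events.append(("fading", community))
        if len(kinds) >= 2:  # this community flows into more than one
            events.append(("forking" if "strong" in kinds else "splitting", community))
    return events

# Invented example: C1 continues and a new community C2 branches off it.
events = detect_key_events(
    previous=["C1@2006"],
    current=["C1@2007", "C2@2007"],
    links=[("C1@2006", "C1@2007", "strong"), ("C1@2006", "C2@2007", "weak")],
)
print(events)
```

Several events can apply to the same community. A split, for example, also counts as a fading under these rules, because the source community has no strong link forward.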
Evaluation: Cluster Compactness
Case study 
We applied RCMB to two research areas: World Wide Web 
(WWW) and Semantic Web (SW). 
Our study was based on a dataset built from data retrieved by 
means of the API provided by Microsoft Academic Search. 
We first retrieved authors and papers labelled with WWW 
and SW or with their first 150 co-occurring topics. We then 
ran RCMB on WWW and SW in the 2000-2010 time interval 
with a granularity of 3. The average number of authors 
selected in each year was 932 for WWW and 646 for SW.
Semantic Web
WWW
Future Work 
• Automatically generate comprehensive explanations for 
the identified dynamics. 
• Forecast topic shifts and key events, e.g., estimating 
the probability that a new topic will emerge in a certain 
community or that two communities will merge in the 
coming years.
Questions? 
Interested in scholarly data? 
SAVE-SD 2015 
Semantics, Analytics, Visualisation: Enhancing Scholarly Data 
Workshop at 24th International World Wide Web Conference 
May 19, 2015 - Florence, Italy 
Site: cs.unibo.it/save-sd

M.Pharm - Question Bank - Drug Delivery SystemsM.Pharm - Question Bank - Drug Delivery Systems
M.Pharm - Question Bank - Drug Delivery SystemsSumathi Arumugam
 
Basic Concepts in Pharmacology in molecular .pptx
Basic Concepts in Pharmacology in molecular  .pptxBasic Concepts in Pharmacology in molecular  .pptx
Basic Concepts in Pharmacology in molecular .pptxVijayaKumarR28
 

Último (20)

Human brain.. It's parts and function.
Human brain.. It's parts and function. Human brain.. It's parts and function.
Human brain.. It's parts and function.
 
Application of Foraminiferal Ecology- Rahul.pptx
Application of Foraminiferal Ecology- Rahul.pptxApplication of Foraminiferal Ecology- Rahul.pptx
Application of Foraminiferal Ecology- Rahul.pptx
 
Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.
 
Digitized Continuous Magnetic Recordings for the August/September 1859 Storms...
Digitized Continuous Magnetic Recordings for the August/September 1859 Storms...Digitized Continuous Magnetic Recordings for the August/September 1859 Storms...
Digitized Continuous Magnetic Recordings for the August/September 1859 Storms...
 
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
 
CW marking grid Analytical BS - M Ahmad.docx
CW  marking grid Analytical BS - M Ahmad.docxCW  marking grid Analytical BS - M Ahmad.docx
CW marking grid Analytical BS - M Ahmad.docx
 
Principles & Formulation of Hair Care Products
Principles & Formulation of Hair Care  ProductsPrinciples & Formulation of Hair Care  Products
Principles & Formulation of Hair Care Products
 
MARKER ASSISTED SELECTION IN CROP IMPROVEMENT
MARKER ASSISTED SELECTION IN CROP IMPROVEMENTMARKER ASSISTED SELECTION IN CROP IMPROVEMENT
MARKER ASSISTED SELECTION IN CROP IMPROVEMENT
 
Identification of Superclusters and Their Properties in the Sloan Digital Sky...
Identification of Superclusters and Their Properties in the Sloan Digital Sky...Identification of Superclusters and Their Properties in the Sloan Digital Sky...
Identification of Superclusters and Their Properties in the Sloan Digital Sky...
 
Applied Biochemistry feedback_M Ahwad 2023.docx
Applied Biochemistry feedback_M Ahwad 2023.docxApplied Biochemistry feedback_M Ahwad 2023.docx
Applied Biochemistry feedback_M Ahwad 2023.docx
 
Gene transfer in plants agrobacterium.pdf
Gene transfer in plants agrobacterium.pdfGene transfer in plants agrobacterium.pdf
Gene transfer in plants agrobacterium.pdf
 
Physics Serway Jewett 6th edition for Scientists and Engineers
Physics Serway Jewett 6th edition for Scientists and EngineersPhysics Serway Jewett 6th edition for Scientists and Engineers
Physics Serway Jewett 6th edition for Scientists and Engineers
 
Main Exam Applied biochemistry final year
Main Exam Applied biochemistry final yearMain Exam Applied biochemistry final year
Main Exam Applied biochemistry final year
 
Cheminformatics tools supporting dissemination of data associated with US EPA...
Cheminformatics tools supporting dissemination of data associated with US EPA...Cheminformatics tools supporting dissemination of data associated with US EPA...
Cheminformatics tools supporting dissemination of data associated with US EPA...
 
IB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptxIB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptx
 
Substances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening TestSubstances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening Test
 
001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...
001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...
001 Case Study - Submission Point_c1051231_attempt_2023-11-23-14-08-42_ABS CW...
 
Controlling Parameters of Carbonate platform Environment
Controlling Parameters of Carbonate platform EnvironmentControlling Parameters of Carbonate platform Environment
Controlling Parameters of Carbonate platform Environment
 
M.Pharm - Question Bank - Drug Delivery Systems
M.Pharm - Question Bank - Drug Delivery SystemsM.Pharm - Question Bank - Drug Delivery Systems
M.Pharm - Question Bank - Drug Delivery Systems
 
Basic Concepts in Pharmacology in molecular .pptx
Basic Concepts in Pharmacology in molecular  .pptxBasic Concepts in Pharmacology in molecular  .pptx
Basic Concepts in Pharmacology in molecular .pptx
 

EKAW2014 - A Hybrid Semantic Approach to Building 
Dynamic Maps of Research Communities

  • 1. A Hybrid Semantic Approach to Building Dynamic Maps of Research Communities Francesco Osborne, Beppe Scavo, Enrico Motta KMi, The Open University, United Kingdom November 27th 2014
  • 2. Research communities The engine of research.
  • 3. We need to understand how scientific communities adapt and cooperate to implement visions into concrete technologies.
  • 4. Research communities Communities of academic authors are usually identified by using standard community detection algorithms, which typically exploit co-authorship or citation graphs.
  • 5. Temporal topic-based communities (TTC) A different type of community we investigated is formed by the set of researchers who, at a given time, are following a shared research trajectory, i.e. they are working on the same topics at the same time. Osborne, F., Scavo, G., & Motta, E. (2014). Identifying diachronic topic-based research communities by clustering shared research trajectories. In The Semantic Web: Trends and Challenges (pp. 114-129). Springer International Publishing.
  • 6. Research Communities Map Builder • RCMB is able to automatically link diachronic topic-based communities over subsequent time intervals to identify significant events. • These include topic shifts within a community; the appearance and fading of a community; communities splitting, merging, spawning other communities; etc. • The output of RCMB is a map of research communities, annotated with the detected events, which provides a concise visual representation of the dynamics of a research area.
  • 7. RCMB steps: 1. Applies the Temporal Semantic Topic-Based Clustering (TST) algorithm to find temporal topic-based communities in different time intervals; 2. Detects topic shifts; 3. Links communities in different years; 4. Detects key events.
  • 8. RCMB steps: 1. Applies the Temporal Semantic Topic-Based Clustering (TST) algorithm to find temporal topic-based communities in different time intervals; 2. Detects topic shifts in following years; 3. Links communities in different years; 4. Detects key events. Temporal Semantic Topic-Based Clustering: Osborne, F., Scavo, G., & Motta, E. (2014). Identifying diachronic topic-based research communities by clustering shared research trajectories. In The Semantic Web: Trends and Challenges (pp. 114-129). Springer International Publishing.
  • 9. TST in short 1. It augments the topics semantically using an automatically generated OWL ontology and represents each author as a semantic topic distribution over subsequent years. 2. It weighs each topic according to its relationship with the main topic, to highlight the communities strongly related to the main topic. 3. It clusters authors using the ATTS (Adjusted Temporal Topic Similarity), which is computed by averaging the cosine similarities of the topic vectors over progressively smaller intervals of time.
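The averaging idea behind step 3 can be illustrated with a minimal sketch. This is not the authors' ATTS implementation: the slides do not say how the intervals are chosen or what the "adjustment" is, so the halving scheme below (whole period, then halves, down to single years), the function name `temporal_similarity`, and the dictionary layout are all assumptions made for illustration.

```python
import math

def cosine(u, v):
    """Cosine similarity of two sparse topic vectors (dict: topic -> weight)."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def merge(vectors):
    """Sum several sparse topic vectors into one."""
    total = {}
    for vec in vectors:
        for topic, weight in vec.items():
            total[topic] = total.get(topic, 0.0) + weight
    return total

def temporal_similarity(a, b, years):
    """Average cosine similarity over progressively smaller intervals.

    a, b: dict year -> topic vector for two authors (hypothetical layout).
    Assumption: the interval size is halved at each level until it
    reaches a single year.
    """
    sims = []
    size = len(years)
    while size >= 1:
        for start in range(0, len(years), size):
            window = years[start:start + size]
            sims.append(cosine(merge(a.get(y, {}) for y in window),
                               merge(b.get(y, {}) for y in window)))
        size //= 2
    return sum(sims) / len(sims)
```

Under these assumptions, two authors who cover the same topics but in different years score lower than two authors who cover them in the same years, which is the property a trajectory-based similarity needs.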
  • 10. Detecting Topic Shifts We use a sliding window algorithm that checks for a topic shift by comparing the initial topic distribution at time t with the topic distributions at times t+1, t+2, …, t+n. Topic distributions of the Information Extraction/Semantic Annotation community:
      2002: Information Extraction 26%, Natural Language 17%, Named Entity 12%, Machine Learning 9%, Knowledge Base 9%
      2006: Semantic Annotation 25%, Knowledge Base 15%, Semantic Wiki 11%, Information Extraction 10%, Semantic Information 8%, Natural Language 6%, Information Retrieval 6%
      2010: Linked Data 16%, Natural Language 15%, Semantic Annotation 15%, SW Technology 10%, Information Retrieval 10%, Knowledge Base 9%, Semantic Wiki 9%
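A sliding window of this kind might look like the sketch below. It is only an illustration under stated assumptions: the slides do not say what happens after a shift is found, so restarting the window at the year of the shift is a guess, and the total variation distance is a simple stand-in for the statistical test used in the talk. The names `find_topic_shifts`, `total_variation`, and `is_shift` are hypothetical.

```python
def total_variation(p, q):
    """Total variation distance between two topic distributions.

    Stand-in comparison only; the talk uses a statistical test instead.
    """
    topics = set(p) | set(q)
    return 0.5 * sum(abs(p.get(t, 0.0) - q.get(t, 0.0)) for t in topics)

def find_topic_shifts(dist_by_year, n, is_shift):
    """Compare the distribution at year t with those at t+1 .. t+n.

    dist_by_year: dict year -> topic distribution (dict topic -> share).
    is_shift: callable(initial, later) -> bool, supplied by the caller.
    Assumption: after a shift the window restarts from the shifted year;
    otherwise it advances by one year.
    """
    years = sorted(dist_by_year)
    shifts = []
    i = 0
    while i < len(years):
        next_i = i + 1
        last = min(i + n, len(years) - 1)
        for j in range(i + 1, last + 1):
            if is_shift(dist_by_year[years[i]], dist_by_year[years[j]]):
                shifts.append((years[i], years[j]))
                next_i = j
                break
        i = next_i
    return shifts
```

Passing the comparison in as `is_shift` keeps the window logic separate from the choice of test, so a significance test can replace the distance threshold without touching the loop.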
  • 11. Detecting Topic Shifts We define a topic shift as a statistically significant change (detected via a chi-square test) in the topic distribution of a community that occurred in a certain time interval. To detect which topics were the main protagonists of this shift, we apply the same test again, excluding a different topic each time, and select the topic whose absence yields the largest increase in the p-value.
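The test and the leave-one-out step can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes the two distributions are available as raw topic counts (the slides show only percentages), uses a standard chi-square test of homogeneity on the resulting 2 x K table, and computes the p-value with the closed form that holds for integer degrees of freedom. The function names are hypothetical.

```python
import math

def chi2_sf(x, dof):
    """P(X >= x) for a chi-square variable with integer dof (closed form)."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if dof % 2 == 0:
        series = sum(half ** i / math.factorial(i) for i in range(dof // 2))
        return math.exp(-half) * series
    series = sum(half ** (i - 0.5) / math.gamma(i + 0.5)
                 for i in range(1, (dof - 1) // 2 + 1))
    return math.erfc(math.sqrt(half)) + math.exp(-half) * series

def shift_p_value(before, after):
    """Chi-square homogeneity test on two topic-count dicts."""
    topics = [t for t in set(before) | set(after)
              if before.get(t, 0) + after.get(t, 0) > 0]
    n_before = sum(before.get(t, 0) for t in topics)
    n_after = sum(after.get(t, 0) for t in topics)
    if len(topics) < 2 or n_before == 0 or n_after == 0:
        return 1.0  # nothing to compare
    total = n_before + n_after
    stat = 0.0
    for t in topics:
        column = before.get(t, 0) + after.get(t, 0)
        for observed, row in ((before.get(t, 0), n_before),
                              (after.get(t, 0), n_after)):
            expected = row * column / total
            stat += (observed - expected) ** 2 / expected
    return chi2_sf(stat, len(topics) - 1)

def main_shift_topic(before, after):
    """Topic whose exclusion raises the p-value the most."""
    base = shift_p_value(before, after)

    def gain(topic):
        b = {t: c for t, c in before.items() if t != topic}
        a = {t: c for t, c in after.items() if t != topic}
        return shift_p_value(b, a) - base

    return max(set(before) | set(after), key=gain)
```

For example, with hypothetical counts `{"x": 40, "y": 30, "z": 30}` before and `{"x": 5, "y": 48, "z": 47}` after, the change is significant, and removing `x` leaves two topics in nearly unchanged proportion, so `x` is reported as the main driver of the shift.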
  • 12. Community linking We are interested in two different kinds of link between communities: • The strong link is defined as a link that connects the same community in subsequent timeframes. • The weak link is defined as a link that connects community C1 with community C2 in a subsequent timeframe, if C1 has an impact on C2 in terms of migrating authors and/or topics.
  • 14. Community linking We take the minimum values of ts and tw that minimize the MEF using the Nelder-Mead algorithm.
  • 15. Key Events detection If a community has no strong links with any community in the preceding interval, we detect the appearance of a community. (Diagram: example with communities C1, C2 and C3 across 2006 and 2007.)
  • 16. Key Events detection If a community has no strong links with any community in the subsequent interval, we detect the fading of a community. (Diagram: example with communities C1, C2 and C3 across 2006 and 2007.)
  • 17. Key Events detection If a community is linked to more than one community in the subsequent interval and one of the links is a strong one, we detect the forking of one or more communities out of the community characterized by the strong link. (Diagram: example with communities C1 and C2 across 2006 and 2007.)
  • 18. Key Events detection If a community is linked to more than one community in the subsequent interval and none of the links is a strong one, we detect the splitting of a community into multiple communities. (Diagram: example with communities C1, C2 and C3 across 2006 and 2007.)
  • 19. Key Events detection If two or more communities are linked to one community in the subsequent interval and one of the inlinks is a strong link, we detect the assimilation of one or more communities into the community C characterized by the strong link. If the communities fade after the event, they are labelled as absorbed into C. (Diagram: example with communities C1 and C2 across 2006 and 2007.)
  • 20. Key Events detection If two or more communities are linked to one community in the subsequent interval and none of the inlinks is a strong link, we detect the merging of two or more communities into a new community C. If the communities fade after the event, they are labelled as merged into C. (Diagram: example with communities C1, C2 and C3 across 2006 and 2007.)
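The six rules on slides 15 to 20 can be written down directly once the links between two consecutive intervals are known. The sketch below is illustrative only: the representation of links as `(source, target, kind)` tuples and the function name `detect_key_events` are assumptions, and it applies each rule literally, so a community produced by a merge is also reported as an appearance.

```python
from collections import defaultdict

def detect_key_events(prev, curr, links):
    """Classify key events between two consecutive intervals.

    prev, curr: community labels in the earlier and later interval.
    links: iterable of (source, target, kind), kind in {"strong", "weak"};
           assumed to hold at most one link per (source, target) pair.
    Returns a list of (event, community) tuples.
    """
    outgoing = defaultdict(list)  # earlier community -> kinds of its outlinks
    incoming = defaultdict(list)  # later community -> kinds of its inlinks
    for source, target, kind in links:
        outgoing[source].append(kind)
        incoming[target].append(kind)

    events = []
    for community in curr:
        kinds = incoming[community]
        if "strong" not in kinds:
            events.append(("appearance", community))
        if len(kinds) >= 2:
            event = "assimilation" if "strong" in kinds else "merging"
            events.append((event, community))
    for community in prev:
        kinds = outgoing[community]
        if "strong" not in kinds:
            events.append(("fading", community))
        if len(kinds) >= 2:
            event = "forking" if "strong" in kinds else "splitting"
            events.append((event, community))
    return events
```

For instance, `detect_key_events(["C1"], ["C2", "C3"], [("C1", "C2", "weak"), ("C1", "C3", "weak")])` reports the splitting and fading of C1 together with the appearance of C2 and C3.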
  • 22. Case study We applied RCMB to two research areas: World Wide Web (WWW) and Semantic Web (SW). Our study was based on a dataset built from data retrieved by means of the API provided by Microsoft Academic Search. We first retrieved authors and papers labelled with WWW and SW or with their first 150 co-occurring topics. We then ran RCMB on WWW and SW in the 2000-2010 time interval with a granularity of 3. The average number of authors selected in each year was 932 for WWW and 646 for SW.
  • 24. WWW
  • 25. Future Work • Automatically generate comprehensive explanations for the identified dynamics. • Forecast topic shifts and key events, e.g., estimate the probability that a new topic will emerge in a certain community or that two communities will merge in the coming years.
  • 26. Questions? Interested in scholarly data? SAVE-SD 2015 Semantics, Analytics, Visualisation: Enhancing Scholarly Data Workshop at 24th International World Wide Web Conference May 19, 2015 - Florence, Italy Site: cs.unibo.it/save-sd