SlideShare una empresa de Scribd logo
1 de 26
1 
S C I E N C E  P A S S I O N  T E C H N O L O G Y 
Sequential Action Patterns in 
Collaborative Ontology-Engineering Projects: 
A Case-Study in the Biomedical Domain 
Simon Walk1, Philipp Singer2 and Markus Strohmaier2,3 
1 Graz University of Technology 
2 Gesis – Leibniz Institute for the Social Sciences 
3 University of Koblenz 
u Graz University of Technology CIKM2014
2 
Introduction & Motivation 
The importance of collaborative ontology-engineering 
projects increased over recent years due to an 
increase in 
• complexity of the modeled domains 
• requirements for the resulting ontology 
No individual is able to single-handedly cover the increased 
complexity and requirements. 
Hence, it is crucial to better understand and steer the 
underlying processes of how users collaboratively 
work on an ontology (i.e., via predictive models). 
u Graz University of Technology CIKM2014
3 
Approach & Objective 
To that extend we analyzed five collaborative ontology-engineering 
projects from the biomedical domain to: 
1. explore regularities and common patterns in user 
action sequences 
2. fit and select models using Markov chains of 
varying order 
3. predict user actions via the fitted Markov chains 
Our main objective is to predict future user actions 
in collaborative ontology-engineering projects. 
u Graz University of Technology CIKM2014
4 
Datasets 
Five collaborative ontology-engineering projects from 
the biomedical domain with varying sizes of features. 
Note that all ontologies were created with WebProtégé 
or derivatives of WebProtégé! 
u Graz University of Technology CIKM2014
5 
Types of Action Paths 
u Graz University of Technology CIKM2014
6 
Types of Action Paths 
u Graz University of Technology CIKM2014
7 
Types of Action Paths 
u Graz University of Technology CIKM2014
8 
Types of Action Paths 
u Graz University of Technology CIKM2014
9 
Extracted Action Paths 
1. Users for Classes 
 Sequences of users that changed a class. 
2. Change Types for Users & Classes 
 Sequences of change types performed by a user / on 
a class. 
3. Properties for Users & Classes 
 Sequences of properties changed by a user / for a 
class. 
u Graz University of Technology CIKM2014
10 
Exploring Regularities and 
Sequential Patterns 
u Graz University of Technology CIKM2014
11 
Exploring Regularities 
Randomness & Regularities 
Wald-Wolfowitz runs test 
 Adapted by O’Brien and Dyck (1985) 
 For ~60% of our paths, regularities could be detected.1 
Sequential Pattern Mining 
PrefixSpan to investigate commonly used sequential 
patterns. 
Only immediately succeeding states build patterns. 
 E.g., “A B C” contains “A B” and “B C” but not “A C” 
1https://github.com/psinger/RunsTest 
u Graz University of Technology CIKM2014
12 
Results for the Sequential Pattern Analysis 
Users for Classes Paths 
u Graz University of Technology CIKM2014
13 
Results for the Sequential Pattern Analysis 
Users for Classes Paths 
u Graz University of Technology CIKM2014
14 
Model Fitting & Selection 
u Graz University of Technology CIKM2014
15 
Modeling Fitting 
 Markov chains are stochastic processes 
representing transition probabilities between 
a countable number of known states. 
 A state space: listing all possible states 
 A transition matrix: listing all transition-probabilities 
between states 
 A Markov chain of n-th order means that n previous 
states contain predictive information about the next 
state. 
u Graz University of Technology CIKM2014
16 
Modeling Fitting & Selection 
We fitted Models from orders of zero to five.2 
 Lower order models are nested within higher order 
models. 
 Higher orders need exponentially more parameters 
and may result in overfitting. 
Bayesian model selection (Singer et al. 2014)2 
 Higher order models receive a penalty due to higher 
complexity. 
2 https://github.com/psinger/PathTools 
u Graz University of Technology CIKM2014
17 
Results Bayesian Model Selection 
u Graz University of Technology CIKM2014
18 
Predicting User Actions 
u Graz University of Technology CIKM2014
19 
K-Fold Cross-Fold Prediction Experiment 
1. Fit Markov chain model. 
 Split Paths into training and test set (stratified). 
 Rank transitions for each row in the transition matrix. 
1. Determine position of test set transition in the fitted 
Markov chain model. 
1. Calculate average over all positions. 
 Average Position of 1 equals best prediction 
accuracy. 
u Graz University of Technology CIKM2014
20 
K-Fold Cross-Fold Prediction Results 
u Graz University of Technology CIKM2014
21 
Results for the Prediction Task 
u Graz University of Technology CIKM2014
22 
Conclusions 
 A number of sequences were produced in a non-random 
way and frequent patterns can be extracted. 
 Memory effects (serial dependence) can increase 
prediction accuracy. 
 The resulting prediction models can (potentially) be 
used for 
 the creation of various recommendations as well as 
 to assess the impact of potential changes on the 
ontology and the community. 
u Graz University of Technology CIKM2014
23 
Future Work 
 Include additional data sources (e.g., Semantic 
MediaWikis). 
 Analyze higher order patterns and compare patterns 
of different data sources 
 Conduct live-lab experiments with generated 
prediction-models (recommendations). 
u Graz University of Technology CIKM2014
24 
Questions? 
u Graz University of Technology CIKM2014
25 
Thank you for your attention! 
uu Grraz Uniiverrsiitty off Technollogy CIKM2014
26 
References 
Wald and J. Wolfowitz. On a test whether two samples are from 
the same population. The Annals of Mathematical Statistics, 
11(2):147–162, 1940. 
P. C. O’Brien and P. J. Dyck. A runs test based on run lengths. 
Biometrics, pages 237–244, 1985. 
P. Singer, D. Helic, B. Taraghi, and M. Strohmaier. Detecting 
memory and structure in human navigation patterns using 
markov chain models of varying order. PloS one, 
9(7):e102070, 2014. 
u Graz University of Technology CIKM2014

Más contenido relacionado

Destacado

Music magazine evaluation
Music magazine evaluation Music magazine evaluation
Music magazine evaluation TomRobinson94
 
A2 Media Evaluation Full
A2 Media Evaluation FullA2 Media Evaluation Full
A2 Media Evaluation FullTomRobinson94
 
Brand Learing, Marketing Transformation Specialist
Brand Learing, Marketing Transformation SpecialistBrand Learing, Marketing Transformation Specialist
Brand Learing, Marketing Transformation SpecialistWalz Group, LLC.
 
The Medical Advantage, Inc. - EMR & Meaningful Use
The Medical Advantage, Inc. - EMR & Meaningful UseThe Medical Advantage, Inc. - EMR & Meaningful Use
The Medical Advantage, Inc. - EMR & Meaningful UseWalz Group, LLC.
 
Oliver Wyman Consulting on growing influence of ACOs: The ACO Surprise
Oliver Wyman Consulting on growing influence of ACOs: The ACO SurpriseOliver Wyman Consulting on growing influence of ACOs: The ACO Surprise
Oliver Wyman Consulting on growing influence of ACOs: The ACO SurpriseWalz Group, LLC.
 
Magazine conventions examples
Magazine conventions examplesMagazine conventions examples
Magazine conventions examplesTomRobinson94
 
Henry viii consistent
Henry viii consistentHenry viii consistent
Henry viii consistentTomRobinson94
 
Modeling and Mining Sequential Data
Modeling and Mining Sequential DataModeling and Mining Sequential Data
Modeling and Mining Sequential DataPhilipp Singer
 
Introduction to Bayesian Statistics
Introduction to Bayesian StatisticsIntroduction to Bayesian Statistics
Introduction to Bayesian StatisticsPhilipp Singer
 
Annabel Lee
Annabel LeeAnnabel Lee
Annabel Leebmtravis
 
AWStatsでS3&CloudFrontのアクセス解析
AWStatsでS3&CloudFrontのアクセス解析AWStatsでS3&CloudFrontのアクセス解析
AWStatsでS3&CloudFrontのアクセス解析Hiroyasu Suzuki
 
JAWS-UGコンテナ支部#2「EC2」から「ECS」へ
JAWS-UGコンテナ支部#2「EC2」から「ECS」へJAWS-UGコンテナ支部#2「EC2」から「ECS」へ
JAWS-UGコンテナ支部#2「EC2」から「ECS」へHiroyasu Suzuki
 
オリジナルAMIの作成(CentOS)
オリジナルAMIの作成(CentOS)オリジナルAMIの作成(CentOS)
オリジナルAMIの作成(CentOS)Hiroyasu Suzuki
 
VIOPS09 AWSで実現するクラウドと物理製品の融合
VIOPS09 AWSで実現するクラウドと物理製品の融合VIOPS09 AWSで実現するクラウドと物理製品の融合
VIOPS09 AWSで実現するクラウドと物理製品の融合Hiroyasu Suzuki
 

Destacado (17)

Music magazine evaluation
Music magazine evaluation Music magazine evaluation
Music magazine evaluation
 
A2 Media Evaluation Full
A2 Media Evaluation FullA2 Media Evaluation Full
A2 Media Evaluation Full
 
Campaign Management Guide
Campaign Management GuideCampaign Management Guide
Campaign Management Guide
 
Brand Learing, Marketing Transformation Specialist
Brand Learing, Marketing Transformation SpecialistBrand Learing, Marketing Transformation Specialist
Brand Learing, Marketing Transformation Specialist
 
The Medical Advantage, Inc. - EMR & Meaningful Use
The Medical Advantage, Inc. - EMR & Meaningful UseThe Medical Advantage, Inc. - EMR & Meaningful Use
The Medical Advantage, Inc. - EMR & Meaningful Use
 
Oliver Wyman Consulting on growing influence of ACOs: The ACO Surprise
Oliver Wyman Consulting on growing influence of ACOs: The ACO SurpriseOliver Wyman Consulting on growing influence of ACOs: The ACO Surprise
Oliver Wyman Consulting on growing influence of ACOs: The ACO Surprise
 
Magazine conventions examples
Magazine conventions examplesMagazine conventions examples
Magazine conventions examples
 
Henry viii consistent
Henry viii consistentHenry viii consistent
Henry viii consistent
 
CRM Strategic Analysis
CRM Strategic AnalysisCRM Strategic Analysis
CRM Strategic Analysis
 
Arrays
ArraysArrays
Arrays
 
Modeling and Mining Sequential Data
Modeling and Mining Sequential DataModeling and Mining Sequential Data
Modeling and Mining Sequential Data
 
Introduction to Bayesian Statistics
Introduction to Bayesian StatisticsIntroduction to Bayesian Statistics
Introduction to Bayesian Statistics
 
Annabel Lee
Annabel LeeAnnabel Lee
Annabel Lee
 
AWStatsでS3&CloudFrontのアクセス解析
AWStatsでS3&CloudFrontのアクセス解析AWStatsでS3&CloudFrontのアクセス解析
AWStatsでS3&CloudFrontのアクセス解析
 
JAWS-UGコンテナ支部#2「EC2」から「ECS」へ
JAWS-UGコンテナ支部#2「EC2」から「ECS」へJAWS-UGコンテナ支部#2「EC2」から「ECS」へ
JAWS-UGコンテナ支部#2「EC2」から「ECS」へ
 
オリジナルAMIの作成(CentOS)
オリジナルAMIの作成(CentOS)オリジナルAMIの作成(CentOS)
オリジナルAMIの作成(CentOS)
 
VIOPS09 AWSで実現するクラウドと物理製品の融合
VIOPS09 AWSで実現するクラウドと物理製品の融合VIOPS09 AWSで実現するクラウドと物理製品の融合
VIOPS09 AWSで実現するクラウドと物理製品の融合
 

Similar a Sequential Action Patterns in Collaborative Ontology Engineering Projects: A Case-study in the Biomedical Domain

Using AI Planning to Automate the Performance Analysis of Simulators
Using AI Planning to Automate the Performance Analysis of SimulatorsUsing AI Planning to Automate the Performance Analysis of Simulators
Using AI Planning to Automate the Performance Analysis of SimulatorsRoland Ewald
 
Sound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software TestingSound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software TestingJaguaraci Silva
 
Mobility model for convex areas
Mobility model for convex areasMobility model for convex areas
Mobility model for convex areasvasanthi32
 
Mpp Rsv 2008 Public
Mpp Rsv 2008 PublicMpp Rsv 2008 Public
Mpp Rsv 2008 Publiclab13unisa
 
Model Management in Model-Driven Engineering
Model Management in Model-Driven EngineeringModel Management in Model-Driven Engineering
Model Management in Model-Driven EngineeringAlfonso Pierantonio
 
Analytic Dependency Loops in Architectural Models of Cyber-Physical Systems
Analytic Dependency Loops in Architectural Models of Cyber-Physical SystemsAnalytic Dependency Loops in Architectural Models of Cyber-Physical Systems
Analytic Dependency Loops in Architectural Models of Cyber-Physical SystemsIvan Ruchkin
 
Charla evento TestingUY 2015 - Alternativas para el Testing de Aplicaciones S...
Charla evento TestingUY 2015 - Alternativas para el Testing de Aplicaciones S...Charla evento TestingUY 2015 - Alternativas para el Testing de Aplicaciones S...
Charla evento TestingUY 2015 - Alternativas para el Testing de Aplicaciones S...TestingUy
 
On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...IJDKP
 
On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...IJDKP
 
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...thanhdowork
 
accessible-streaming-algorithms
accessible-streaming-algorithmsaccessible-streaming-algorithms
accessible-streaming-algorithmsFarhan Zaki
 
Tutorial on Deep Generative Models
 Tutorial on Deep Generative Models Tutorial on Deep Generative Models
Tutorial on Deep Generative ModelsMLReview
 
A Survey On Tracking Moving Objects Using Various Algorithms
A Survey On Tracking Moving Objects Using Various AlgorithmsA Survey On Tracking Moving Objects Using Various Algorithms
A Survey On Tracking Moving Objects Using Various AlgorithmsIJMTST Journal
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityUniversity Medicine Greifswald
 
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.pptProto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.pptAnirbanBhar3
 
HEC Project Proposal_v1.0
HEC Project Proposal_v1.0HEC Project Proposal_v1.0
HEC Project Proposal_v1.0Awais Shibli
 
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...thanhdowork
 

Similar a Sequential Action Patterns in Collaborative Ontology Engineering Projects: A Case-study in the Biomedical Domain (20)

Using AI Planning to Automate the Performance Analysis of Simulators
Using AI Planning to Automate the Performance Analysis of SimulatorsUsing AI Planning to Automate the Performance Analysis of Simulators
Using AI Planning to Automate the Performance Analysis of Simulators
 
I05745368
I05745368I05745368
I05745368
 
Chemoinformatic
Chemoinformatic Chemoinformatic
Chemoinformatic
 
Sound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software TestingSound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software Testing
 
Mobility model for convex areas
Mobility model for convex areasMobility model for convex areas
Mobility model for convex areas
 
Mpp Rsv 2008 Public
Mpp Rsv 2008 PublicMpp Rsv 2008 Public
Mpp Rsv 2008 Public
 
Model Management in Model-Driven Engineering
Model Management in Model-Driven EngineeringModel Management in Model-Driven Engineering
Model Management in Model-Driven Engineering
 
Analytic Dependency Loops in Architectural Models of Cyber-Physical Systems
Analytic Dependency Loops in Architectural Models of Cyber-Physical SystemsAnalytic Dependency Loops in Architectural Models of Cyber-Physical Systems
Analytic Dependency Loops in Architectural Models of Cyber-Physical Systems
 
Charla evento TestingUY 2015 - Alternativas para el Testing de Aplicaciones S...
Charla evento TestingUY 2015 - Alternativas para el Testing de Aplicaciones S...Charla evento TestingUY 2015 - Alternativas para el Testing de Aplicaciones S...
Charla evento TestingUY 2015 - Alternativas para el Testing de Aplicaciones S...
 
On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...
 
On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...
 
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
 
accessible-streaming-algorithms
accessible-streaming-algorithmsaccessible-streaming-algorithms
accessible-streaming-algorithms
 
Tutorial on Deep Generative Models
 Tutorial on Deep Generative Models Tutorial on Deep Generative Models
Tutorial on Deep Generative Models
 
A Survey On Tracking Moving Objects Using Various Algorithms
A Survey On Tracking Moving Objects Using Various AlgorithmsA Survey On Tracking Moving Objects Using Various Algorithms
A Survey On Tracking Moving Objects Using Various Algorithms
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusability
 
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.pptProto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
 
Master Thesis
Master ThesisMaster Thesis
Master Thesis
 
HEC Project Proposal_v1.0
HEC Project Proposal_v1.0HEC Project Proposal_v1.0
HEC Project Proposal_v1.0
 
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
 

Último

Anne Frank A Beacon of Hope amidst darkness ppt.pptx
Anne Frank A Beacon of Hope amidst darkness ppt.pptxAnne Frank A Beacon of Hope amidst darkness ppt.pptx
Anne Frank A Beacon of Hope amidst darkness ppt.pptxnoorehahmad
 
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Krijn Poppe
 
The Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism PresentationThe Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism PresentationNathan Young
 
SBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation TrackSBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation TrackSebastiano Panichella
 
Chizaram's Women Tech Makers Deck. .pptx
Chizaram's Women Tech Makers Deck.  .pptxChizaram's Women Tech Makers Deck.  .pptx
Chizaram's Women Tech Makers Deck. .pptxogubuikealex
 
PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.
PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.
PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.KathleenAnnCordero2
 
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...marjmae69
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxmavinoikein
 
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSimulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSebastiano Panichella
 
PHYSICS PROJECT BY MSC - NANOTECHNOLOGY
PHYSICS PROJECT BY MSC  - NANOTECHNOLOGYPHYSICS PROJECT BY MSC  - NANOTECHNOLOGY
PHYSICS PROJECT BY MSC - NANOTECHNOLOGYpruthirajnayak525
 
Call Girls In Aerocity 🤳 Call Us +919599264170
Call Girls In Aerocity 🤳 Call Us +919599264170Call Girls In Aerocity 🤳 Call Us +919599264170
Call Girls In Aerocity 🤳 Call Us +919599264170Escort Service
 
Dutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
Dutch Power - 26 maart 2024 - Henk Kras - Circular PlasticsDutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
Dutch Power - 26 maart 2024 - Henk Kras - Circular PlasticsDutch Power
 
Event 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptxEvent 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptxaryanv1753
 
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATIONRACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATIONRachelAnnTenibroAmaz
 
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRRINDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRRsarwankumar4524
 
Early Modern Spain. All about this period
Early Modern Spain. All about this periodEarly Modern Spain. All about this period
Early Modern Spain. All about this periodSaraIsabelJimenez
 
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...漢銘 謝
 
Quality by design.. ppt for RA (1ST SEM
Quality by design.. ppt for  RA (1ST SEMQuality by design.. ppt for  RA (1ST SEM
Quality by design.. ppt for RA (1ST SEMCharmi13
 
Genshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptxGenshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptxJohnree4
 
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.comSaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.comsaastr
 

Último (20)

Anne Frank A Beacon of Hope amidst darkness ppt.pptx
Anne Frank A Beacon of Hope amidst darkness ppt.pptxAnne Frank A Beacon of Hope amidst darkness ppt.pptx
Anne Frank A Beacon of Hope amidst darkness ppt.pptx
 
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
 
The Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism PresentationThe Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism Presentation
 
SBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation TrackSBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation Track
 
Chizaram's Women Tech Makers Deck. .pptx
Chizaram's Women Tech Makers Deck.  .pptxChizaram's Women Tech Makers Deck.  .pptx
Chizaram's Women Tech Makers Deck. .pptx
 
PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.
PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.
PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.
 
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptx
 
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSimulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
 
PHYSICS PROJECT BY MSC - NANOTECHNOLOGY
PHYSICS PROJECT BY MSC  - NANOTECHNOLOGYPHYSICS PROJECT BY MSC  - NANOTECHNOLOGY
PHYSICS PROJECT BY MSC - NANOTECHNOLOGY
 
Call Girls In Aerocity 🤳 Call Us +919599264170
Call Girls In Aerocity 🤳 Call Us +919599264170Call Girls In Aerocity 🤳 Call Us +919599264170
Call Girls In Aerocity 🤳 Call Us +919599264170
 
Dutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
Dutch Power - 26 maart 2024 - Henk Kras - Circular PlasticsDutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
Dutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
 
Event 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptxEvent 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptx
 
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATIONRACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
 
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRRINDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
 
Early Modern Spain. All about this period
Early Modern Spain. All about this periodEarly Modern Spain. All about this period
Early Modern Spain. All about this period
 
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...
 
Quality by design.. ppt for RA (1ST SEM
Quality by design.. ppt for  RA (1ST SEMQuality by design.. ppt for  RA (1ST SEM
Quality by design.. ppt for RA (1ST SEM
 
Genshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptxGenshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptx
 
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.comSaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
 

Sequential Action Patterns in Collaborative Ontology Engineering Projects: A Case-study in the Biomedical Domain

  • 1. 1 S C I E N C E  P A S S I O N  T E C H N O L O G Y Sequential Action Patterns in Collaborative Ontology-Engineering Projects: A Case-Study in the Biomedical Domain Simon Walk1, Philipp Singer2 and Markus Strohmaier2,3 1 Graz University of Technology 2 Gesis – Leibniz Institute for the Social Sciences 3 University of Koblenz u Graz University of Technology CIKM2014
  • 2. 2 Introduction & Motivation The importance of collaborative ontology-engineering projects increased over recent years due to an increase in • complexity of the modeled domains • requirements for the resulting ontology No individual is able to single-handedly cover the increased complexity and requirements. Hence, it is crucial to better understand and steer the underlying processes of how users collaboratively work on an ontology (i.e., via predictive models). u Graz University of Technology CIKM2014
  • 3. 3 Approach & Objective To that extend we analyzed five collaborative ontology-engineering projects from the biomedical domain to: 1. explore regularities and common patterns in user action sequences 2. fit and select models using Markov chains of varying order 3. predict user actions via the fitted Markov chains Our main objective is to predict future user actions in collaborative ontology-engineering projects. u Graz University of Technology CIKM2014
  • 4. 4 Datasets Five collaborative ontology-engineering projects from the biomedical domain with varying sizes of features. Note that all ontologies were created with WebProtégé or derivatives of WebProtégé! u Graz University of Technology CIKM2014
  • 5. 5 Types of Action Paths u Graz University of Technology CIKM2014
  • 6. 6 Types of Action Paths u Graz University of Technology CIKM2014
  • 7. 7 Types of Action Paths u Graz University of Technology CIKM2014
  • 8. 8 Types of Action Paths u Graz University of Technology CIKM2014
  • 9. 9 Extracted Action Paths 1. Users for Classes  Sequences of users that changed a class. 2. Change Types for Users & Classes  Sequences of change types performed by a user / on a class. 3. Properties for Users & Classes  Sequences of properties changed by a user / for a class. u Graz University of Technology CIKM2014
  • 10. 10 Exploring Regularities and Sequential Patterns u Graz University of Technology CIKM2014
  • 11. 11 Exploring Regularities Randomness & Regularities Wald-Wolfowitz runs test  Adapted by O’Brien and Dyck (1985)  For ~60% of our paths, regularities could be detected.1 Sequential Pattern Mining PrefixSpan to investigate commonly used sequential patterns. Only immediately succeeding states build patterns.  E.g., “A B C” contains “A B” and “B C” but not “A C” 1https://github.com/psinger/RunsTest u Graz University of Technology CIKM2014
  • 12. 12 Results for the Sequential Pattern Analysis Users for Classes Paths u Graz University of Technology CIKM2014
  • 13. 13 Results for the Sequential Pattern Analysis Users for Classes Paths u Graz University of Technology CIKM2014
  • 14. 14 Model Fitting & Selection u Graz University of Technology CIKM2014
  • 15. 15 Modeling Fitting  Markov chains are stochastic processes representing transition probabilities between a countable number of known states.  A state space: listing all possible states  A transition matrix: listing all transition-probabilities between states  A Markov chain of n-th order means that n previous states contain predictive information about the next state. u Graz University of Technology CIKM2014
  • 16. 16 Modeling Fitting & Selection We fitted Models from orders of zero to five.2  Lower order models are nested within higher order models.  Higher orders need exponentially more parameters and may result in overfitting. Bayesian model selection (Singer et al. 2014)2  Higher order models receive a penalty due to higher complexity. 2 https://github.com/psinger/PathTools u Graz University of Technology CIKM2014
  • 17. 17 Results Bayesian Model Selection u Graz University of Technology CIKM2014
  • 18. 18 Predicting User Actions u Graz University of Technology CIKM2014
  • 19. 19 K-Fold Cross-Fold Prediction Experiment 1. Fit Markov chain model.  Split Paths into training and test set (stratified).  Rank transitions for each row in the transition matrix. 1. Determine position of test set transition in the fitted Markov chain model. 1. Calculate average over all positions.  Average Position of 1 equals best prediction accuracy. u Graz University of Technology CIKM2014
  • 20. 20 K-Fold Cross-Fold Prediction Results u Graz University of Technology CIKM2014
  • 21. 21 Results for the Prediction Task u Graz University of Technology CIKM2014
  • 22. 22 Conclusions  A number of sequences were produced in a non-random way and frequent patterns can be extracted.  Memory effects (serial dependence) can increase prediction accuracy.  The resulting prediction models can (potentially) be used for  the creation of various recommendations as well as  to assess the impact of potential changes on the ontology and the community. u Graz University of Technology CIKM2014
  • 23. 23 Future Work  Include additional data sources (e.g., Semantic MediaWikis).  Analyze higher order patterns and compare patterns of different data sources  Conduct live-lab experiments with generated prediction-models (recommendations). u Graz University of Technology CIKM2014
  • 24. 24 Questions? u Graz University of Technology CIKM2014
  • 25. 25 Thank you for your attention! uu Grraz Uniiverrsiitty off Technollogy CIKM2014
  • 26. 26 References Wald and J. Wolfowitz. On a test whether two samples are from the same population. The Annals of Mathematical Statistics, 11(2):147–162, 1940. P. C. O’Brien and P. J. Dyck. A runs test based on run lengths. Biometrics, pages 237–244, 1985. P. Singer, D. Helic, B. Taraghi, and M. Strohmaier. Detecting memory and structure in human navigation patterns using markov chain models of varying order. PloS one, 9(7):e102070, 2014. u Graz University of Technology CIKM2014

Notas del editor

  1. ICD-11: Classification-Scheme to encode diseases to inform decision makers of health-related spendings & insurance companies of what to charge ICTM: The same as ICD-11, and is planned to be merged into ICD-11, for traditional medicine! Multilingual (japanese, chinese, korean, english and traditional chinese) NCIt: It is a reference vocabulary covering areas for clinical care, translational and basic research, and cancer biology. BRO: A controlled terminology of resources, which is used to improve the sensitivity and specificity of web searches used for Biositemaps.
  2. Who is going to change a class next? What kind of change is a user going to perform next? What kind of change is performed next on a class? What property will be changed next by a user? What property will be changed next for a class?
  3. Support = the percentage of paths that exhibit a certain pattern. Pattern: Users for Classes 45.844 Sequences for ICD-11. If a user only changes 200 concepts => 45844 / 200 = 0.0044
  4. Given that there are sequential patterns of lengths 2 to 4 we argue that such patterns play a crucial role in the contributor logs of collaborative ontology-engineering projects at hand. Support: Small number of 1 Patterns, as they have a support close to 0. Look in Figure (a) at 0.2 - 0.4 Support. Aggregated number of patterns per Dataset == what we see in Figure (a)
  5. Interested in identifying higher order Markov chain models. Zero-order Markov chain as weighted random selection baseline.
  6. Zero Order Models => Random Baseline To calculate the transition probabilities between different states we calculate how often they occur in the data. Higher order models will fit at least as good as lower order models. Number of parameters: states^n * n-1
  7. Example for Determination of position