Brief description of the ArCo project with remarks on main issues about tool support for the ontology engineering process and some ongoing effort in my Lab to address them.
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Building the ArCo knowledge graph: process, experience and struggle with existing tools
1. Building the ArCo knowledge graph:
process, experience and struggle with
existing tools
Valentina Presutti
LILEC ‒ Alma Mater Studiorium University of Bologna
Institute of Cognitive Sciences and Technologies
1
https://w3id.org/arco
3. ArCo
The knowledge graph of the italian cultural heritage
3
• Valentina Anita Carriero, Aldo Gangemi, Maria Letizia Mancinelli, Andrea Giovanni Nuzzolese, Valentina Presutti, Chiara Veninata: Pattern-
based design applied to cultural heritage knowledge graphs. Semantic Web 12(2): 313-357 (2021), IOS Press
• Valentina Anita Carriero, Aldo Gangemi, Maria Letizia Mancinelli, Ludovica Marinucci, Andrea Giovanni Nuzzolese, Valentina Presutti, Chiara
Veninata: ArCo: The Italian Cultural Heritage Knowledge Graph. ISWC (2) 2019: 36-52
4. What ArCo is
An ontology network
representing cultural
heritage concepts
A knowledge graph
representing the italian
cultural heritage
4
5. ArCo KG in numbers
Ontology network
• 7 ontology modules
• 5,943 axioms
• 1,433 predicates
• reuse of 11 other ontologies
Data
• 169,151,644 triples
• 20,838 owl:sameAs linking to 20,479 distinct entities
in 8 other datasets and 2 controlled vocabularies
5
6. easily
installable
Where ArCo can be found
6
• Available on GitHub, as a docker, on Zenodo, LOV and DataHub
• Released with CC BY 4.0 license (Attribution-ShareAlike)
Docker container available on GitHub
+ its running instance online
https://w3id.org/arco multi-language
12. Italian Cultural Heritage
ICCD (MiBAC) coordinates
cataloguing activities and CH data
management
General Catalogue of
Italian CH
cataloguing standards,
controlled lists
almost 3M
catalogue records
+800K
catalogue records
~15M
catalogue numbers
released
30 types of cultural
properties
12
13. Using XD for developing ArCo KG
eXtreme Design
13
detailed
documentation
of the applied
methodology
• Valentina Anita Carriero, Aldo Gangemi, Maria Letizia Mancinelli, Andrea Giovanni Nuzzolese, Valentina Presutti, Chiara Veninata: Pattern-
based design applied to cultural heritage knowledge graphs. Semantic Web 12(2): 313-357 (2021), IOS Press
17. Involving third parties
• How ArCo data can be used?
• What use cases and requirements can we collect beyond
the need of ICCD?
early adopters
inclusive iterative, incremental open contest
17
18. requirements
collected in the form
of stories
Type: Linking my data to ArCo data
Title: Cultural heritage and residential property
Story: I am looking for a residential property to
buy, and I want to filter the results based on the
type of cultural heritage nearby.
applications
linking LOD
publishing LOD
e.g.
translated as
Competency Questions
and other constraints
CQs
Which is the geometry
of a cultural property?
e.g.
continuous
feedback
& new emerging
requirements
customer
team
design
team
ICCD
wider community
private & public
companies and
institutions
+
requirements from
beyond domain
experts
18
tools
19. Stories and Competency Questions
• Requirements are
collected as stories and
then transformed into
competency questions
(CQs)
• CQs feed both the
modeling and the testing
phases
• Priority is assigned based
on modeling issues and
their dependencies
19
21. design
team design principles
pattern-based design
ODP
repository
external
ontologies
OPLa
ontology
annotated with
12 reused ODPs
over the network
shortcut
binary relations
along with
n-ary relations
21
direct &
indirect
reuse
+ 1 new ODP
for recurrent event series
as collections of events
@ WOP2019
http://bit.do/fbQWk
1
2
3
e.g.
situation
collection
classification
componency
[…]
23. Exploring existing resources: reuse
• Identifying relevant existing ontologies
• Literature, LOV, projects à lots of manual and cognitive work
• Analysing the identified ontologies: what tools?
• Protégé and its plugins, LODview, LODE, etc.
23
24. OPLa annotations
24
OPLa ++
Pascal Hitzler, Aldo Gangemi, Krzysztof Janowicz, Adila Alfa Krisnadhi, Valentina Presutti:
Towards a Simple but Useful Ontology Design Pattern Representation Language. WOP@ISWC 2017
27. Modeling the ontologies
• Pattern-based ontology design: what tools?
27
CoModIDE (XD, patterns, OPLa)
https://comodide.com/
Cogan Shimizu, Karl Hammar
Good for learning ODP reuse
Yet many challenges (may not be CoModIDE goals)
Modular visualisation/editing of ontologies
Unit testing
Comparative views (for supporting reuse)
29. testing
testing
team
https://w3id.org/testalod
CQ verification
inference
verification
error
provocation
refactoring &
integration
design
team
release &
versioning
huge manual
effort
à prototype for
running unit tests
e.g. When was a cultural property created?
à SPARQL query
e.g. ex:MonumentX a :CulturalProperty .
ex:MonumentX
:hasCulturalPropertyComponent ex:ComponentY .
Expected inference: ex:MonumentX a
:ComplexCulturalProperty .
e.g. a-cd:Dating owl:disjointWith a-cd:Author .
ex:Example a a-cd:Dating, a-cd:Author .
Expected error: inconsistency
all tests on GitHub
http://bit.do/fbuHS
29
31. OWLunit (under development)
31
https://w3id.org/OWLunit
OWLUnit extends TESTaLOD in multiple
directions:
1. Inference Verification and Error
Provocation tests
2. Improved CQ verification tests:
checks for possible IRI changes
3. Execution of collections of tests
(test suites) represented with the
OWLunit ontology
Description of the an Error Provocation Test case
Description of a Test Suite
I didn’t know about themis (thanks Maria Keet to point it out on the chat!)
32. A summary of lessons learned
• Tool support for ontology engineering is still poor
• Ontology patterns are a powerful tool for improving modeling, reuse, testing and visualisation
• XD (or similar methodologies) work well for large, real-world ontology projects
• without it, it s a complete mess!
• Existing tools lack integrated support for these interdependent tasks
• Integrated support is mandatory!
• Requirements collection/engineering is overlooked, but it shall be supported and integrated with
modeling/testing components
• Domain experts and ontology engineers must work on the same platform (with different views)
• There is one additional main comment/question...
Lack of tool support is recognised as an important issue since (at least) twenty years
What is the motivation of such a lack of advance? Or did I miss something?
My2c? an OS Apache (or similar) project
32
34. References 1/2
34
Valentina Anita Carriero, Aldo Gangemi, Maria Letizia Mancinelli, Andrea Giovanni Nuzzolese, Valentina
Presutti, Chiara Veninata: Pattern-based design applied to cultural heritage knowledge graphs. Semantic
Web, Volume 12, Number 2 / 2021 (in press). Special Issue on Semantic Web for Cultural Heritage.
Valentina Anita Carriero, Aldo Gangemi, Maria Letizia Mancinelli, Ludovica Marinucci, Andrea Giovanni
Nuzzolese, Valentina Presutti, Chiara Veninata: ArCo: The Italian Cultural Heritage Knowledge Graph. ISWC
(2) 2019: 36-52
Valentina Anita Carriero, Aldo Gangemi, Maria Letizia Mancinelli, Ludovica Marinucci, Andrea Giovanni
Nuzzolese, Valentina Presutti, Chiara Veninata: ArCo ontology network and LOD on Italian Cultural
Heritage. ODOCH@CAiSE 2019: 97-102
Valentina Anita Carriero, Aldo Gangemi, Andrea Giovanni Nuzzolese, Valentina Presutti:
An Ontology Design Pattern for Representing Recurrent Events. WOP@ISWC 2019: 59-70
Valentina Anita Carriero, Fabio Mariani, Andrea Giovanni Nuzzolese, Valentina Pasqual, Valentina Presutti:
Agile Knowledge Graph Testing with TESTaLOD. ISWC Satellites 2019: 221-224
35. References 2/2
35
Valentina Anita Carriero, Aldo Gangemi, Maria Letizia Mancinelli, Andrea Giovanni Nuzzolese, Valentina
Presutti, Chiara Veninata: Pattern-based design applied to cultural heritage knowledge
graphs. CoRR abs/1911.07585 (2019)
Pascal Hitzler, Aldo Gangemi, Krzysztof Janowicz, Adila Alfa Krisnadhi, Valentina Presutti:
Towards a Simple but Useful Ontology Design Pattern Representation Language. WOP@ISWC 2017
Valentina Presutti, Giorgia Lodi, Andrea Giovanni Nuzzolese, Aldo Gangemi, Silvio Peroni, Luigi Asprino: The
Role of Ontology Design Patterns in Linked Data Projects. ER 2016: 113-121
Eva Blomqvist, Valentina Presutti, Enrico Daga, Aldo Gangemi: Experimenting with eXtreme Design. EKAW
2010: 120-134
Valentina Presutti, Enrico Daga, Aldo Gangemi, Eva Blomqvist: eXtreme Design with Content Ontology Design
Patterns. WOP 2009