SlideShare una empresa de Scribd logo
1 de 15
Descargar para leer sin conexión
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

Topic Introduction
Controlled vocabularies and humanities, a problematic relationship.
The functional categorization of historical place types and the problems it raises.

Giovanni Colavizza
Leibniz Institute of European History
Colavizza@ieg-mainz.de

1
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

The scenario
Controlled vocabulary: a selected list of terms, which refer to concepts, used for
categorization. Criteria of concept selection are usually domain specific.
Focus for this talk: vocabularies of concepts, not proper names.

2
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

The scenario
Controlled vocabulary: a selected list of terms, which refer to concepts, used for
categorization. Criteria of concept selection are usually domain specific.
Focus for this talk: vocabularies of concepts, not proper names.
The term - concept relation is often not specified: intended (?) use of natural
language, which is context and interpretation specific.
But there goes language independence!
@Dalia Varanka, A topographic feature
taxonomy for a US national topographic
mapping ontology, 2009.

2
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

The problem
Quantitative and computer-based methods scale-up our responsibilities together
with our means.

Retrieve

The data and metadata loop:

Reuse Extend

3

Share
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

The problem
Quantitative and computer-based methods scale-up our responsibilities together
with our means.

Retrieve

The data and metadata loop:

Reuse Extend

Share

More strict requirements: classification systems must be shared, to some extent.
Such shared part must be formally specified (machine-readable). The term concept bond has to become explicit.

3
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

New design requirements
•Allow for comparison beyond single project (data integration)
•Interoperability and portability
•Scalability
•More accurate retrieval
•Automatic classification
•Named entity recognition
•Reasoning...
One possible solution: integrate a more strict knowledge model on top of
controlled vocabularies. Express it via ontologies: simplified specifications of
(shared!) conceptualizations.
Already possible! ISO 25964 (data model), SKOS (web format)

4
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

IEG proposal - concept
•Keep both natural language vocabularies AND formalized ontologies
•An integrated approach:

1.develop back-end ontologies, well formalized and documented*
2.vocabularies are built as needed, in natural language, associating tags with
formally defined concepts (prevent late integration)

5
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

IEG proposal - concept
•Keep both natural language vocabularies AND formalized ontologies
•An integrated approach:

1.develop back-end ontologies, well formalized and documented*
2.vocabularies are built as needed, in natural language, associating tags with
formally defined concepts (prevent late integration)

But!
No 1-1 mapping between vocabularies and ontologies. Focus on what’s shared*.
Pareto principle: 80% effects (tags we need) come from 20% causes (concepts).

5
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

IEG proposal - implementation
Implementation is key:
1.Upper ontologies (integration among domains)
2.Domain ontologies (e.g. functions)
3.Labeling system
4.Controlled vocabularies
> Linked data enabled, user friendly (minimize learning curve and overhead),
single entry-point to standards: bridges tags and concepts.

6
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

IEG proposal - implementation
Implementation is key:
1.Upper ontologies (integration among domains)
2.Domain ontologies (e.g. functions)
3.Labeling system
4.Controlled vocabularies
> Linked data enabled, user friendly (minimize learning curve and overhead),
single entry-point to standards: bridges tags and concepts.
Large-scale collaborative and community-driven framework (numbers 1, 2, 3, in
part 4), few experts for back-end, many users for front-end, everything open.
Could we think about a Consortium for controlled vocabularies (like TEI)?

6
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

Historical place types
Quite problematic:
Same names mean different things in space, time, culture
Generic tags for specific meanings: ambiguity
Layers of interpretations: agents, socio-political context, historians

7
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

Historical place types
Quite problematic:
Same names mean different things in space, time, culture
Generic tags for specific meanings: ambiguity
Layers of interpretations: agents, socio-political context, historians

From nouns to verbs:
Most vocabularies of place types/features are already loosely classified by
functionality (economic activity, leisure facility, place of culture, etc.)
There are less verbs than nouns (Wordnet synsets: ~82k nouns, ~14k verbs)
Verbs brings us closer to concrete events (and linked data triples..)

7
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

Functional categorization - I

@Filippo De Vivo, Patrizi, informatori, barbieri. Politica e comunicazione a Venezia nella prima età moderna. Milan: Feltrinelli,
2012. In English: id., Information and communication in Venice: Rethinking Early Modern Politics. Oxford: Oxford University
Press, 2007.

8
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

Open questions
Is all this useful and feasible? (let’s try it)
Where to start (historical place types)
What to model (functions)
Design requirements
Explore technical solutions
How to integrate existing vocabularies
> Sketch guidelines
Partners, anyone? :)

9
Experts Workshop on Controlled Vocabularies
Mainz 10-11/10/2013

Giovanni Colavizza

Thanks!
Controlled vocabularies and humanities, a problematic relationship.
The functional categorization of historical place types and the problems it raises.

Giovanni Colavizza
Leibniz Institute of European History
Colavizza@ieg-mainz.de

10

Más contenido relacionado

Similar a Mainz Expert Workshop on Controlled Vocabularies 10/10/2013

Leipzig Functional Categorisation 11/12/2013
Leipzig Functional Categorisation 11/12/2013Leipzig Functional Categorisation 11/12/2013
Leipzig Functional Categorisation 11/12/2013Giovanni Colavizza
 
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...Daniel Vila Suero
 
Computer assisted text and corpus analysis
Computer assisted text and corpus analysisComputer assisted text and corpus analysis
Computer assisted text and corpus analysisRubyaShaheen
 
Corpus study design
Corpus study designCorpus study design
Corpus study designbikashtaly
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with PythonBenjamin Bengfort
 
TSS 2017: Terminology and Knowledge Organization Systems
TSS 2017: Terminology and Knowledge Organization SystemsTSS 2017: Terminology and Knowledge Organization Systems
TSS 2017: Terminology and Knowledge Organization SystemsMichael Wetzel
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - OntologiesSerge Linckels
 
Multilingual Knowledge Organization Systems Management: Best Practices
Multilingual Knowledge Organization Systems Management: Best PracticesMultilingual Knowledge Organization Systems Management: Best Practices
Multilingual Knowledge Organization Systems Management: Best PracticesMauro Dragoni
 
An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsTye Rausch
 
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...Scottish Language Dictionaries
 
Communicative-discursive models and cognitive linguistics
Communicative-discursive models and cognitive linguisticsCommunicative-discursive models and cognitive linguistics
Communicative-discursive models and cognitive linguisticsalaidarindira0202
 
The role of the arts in researching multilingually at the borders of language...
The role of the arts in researching multilingually at the borders of language...The role of the arts in researching multilingually at the borders of language...
The role of the arts in researching multilingually at the borders of language...RMBorders
 
Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...iwan_rg
 
The Corpus In The Classroom
The Corpus In The ClassroomThe Corpus In The Classroom
The Corpus In The ClassroomColin Graham
 
Syntax and lexis presentation final 3
Syntax and lexis presentation final 3Syntax and lexis presentation final 3
Syntax and lexis presentation final 3mohamed oubedda
 
Syntax and lexis presentation final 3
Syntax and lexis presentation final 3Syntax and lexis presentation final 3
Syntax and lexis presentation final 3mohamed oubedda
 
Inquiry on the Philosophy of Language.pptx
Inquiry on the Philosophy of Language.pptxInquiry on the Philosophy of Language.pptx
Inquiry on the Philosophy of Language.pptxutcrash88
 
Lexical Approach
Lexical ApproachLexical Approach
Lexical ApproachFiona Burns
 

Similar a Mainz Expert Workshop on Controlled Vocabularies 10/10/2013 (20)

Leipzig Functional Categorisation 11/12/2013
Leipzig Functional Categorisation 11/12/2013Leipzig Functional Categorisation 11/12/2013
Leipzig Functional Categorisation 11/12/2013
 
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
 
Computer assisted text and corpus analysis
Computer assisted text and corpus analysisComputer assisted text and corpus analysis
Computer assisted text and corpus analysis
 
Corpus study design
Corpus study designCorpus study design
Corpus study design
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 
TSS 2017: Terminology and Knowledge Organization Systems
TSS 2017: Terminology and Knowledge Organization SystemsTSS 2017: Terminology and Knowledge Organization Systems
TSS 2017: Terminology and Knowledge Organization Systems
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - Ontologies
 
Multilingual Knowledge Organization Systems Management: Best Practices
Multilingual Knowledge Organization Systems Management: Best PracticesMultilingual Knowledge Organization Systems Management: Best Practices
Multilingual Knowledge Organization Systems Management: Best Practices
 
An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical Semantics
 
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
 
Communicative-discursive models and cognitive linguistics
Communicative-discursive models and cognitive linguisticsCommunicative-discursive models and cognitive linguistics
Communicative-discursive models and cognitive linguistics
 
Roadmap for a multilingual BioPortal
Roadmap for a multilingual BioPortalRoadmap for a multilingual BioPortal
Roadmap for a multilingual BioPortal
 
The role of the arts in researching multilingually at the borders of language...
The role of the arts in researching multilingually at the borders of language...The role of the arts in researching multilingually at the borders of language...
The role of the arts in researching multilingually at the borders of language...
 
Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...
 
The Corpus In The Classroom
The Corpus In The ClassroomThe Corpus In The Classroom
The Corpus In The Classroom
 
Syntax and lexis presentation final 3
Syntax and lexis presentation final 3Syntax and lexis presentation final 3
Syntax and lexis presentation final 3
 
Syntax and lexis presentation final 3
Syntax and lexis presentation final 3Syntax and lexis presentation final 3
Syntax and lexis presentation final 3
 
Inquiry on the Philosophy of Language.pptx
Inquiry on the Philosophy of Language.pptxInquiry on the Philosophy of Language.pptx
Inquiry on the Philosophy of Language.pptx
 
Lexical Approach
Lexical ApproachLexical Approach
Lexical Approach
 
Language
LanguageLanguage
Language
 

Más de Giovanni Colavizza

Sul ruolo dell’umanista nelle Digital Humanities
Sul ruolo dell’umanista nelle Digital HumanitiesSul ruolo dell’umanista nelle Digital Humanities
Sul ruolo dell’umanista nelle Digital HumanitiesGiovanni Colavizza
 
La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...
La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...
La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...Giovanni Colavizza
 
The References of References: Enriching Library Catalogs via Domain-Specific ...
The References of References: Enriching Library Catalogs via Domain-Specific ...The References of References: Enriching Library Catalogs via Domain-Specific ...
The References of References: Enriching Library Catalogs via Domain-Specific ...Giovanni Colavizza
 
Notes de bas de page: d’un outil savant aux hyperliens
Notes de bas de page: d’un outil savant aux hyperliensNotes de bas de page: d’un outil savant aux hyperliens
Notes de bas de page: d’un outil savant aux hyperliensGiovanni Colavizza
 
Introduction to the Venice Time Machine
Introduction to the Venice Time MachineIntroduction to the Venice Time Machine
Introduction to the Venice Time MachineGiovanni Colavizza
 
Linked Books - DH Venice Fall School 2014
Linked Books - DH Venice Fall School 2014Linked Books - DH Venice Fall School 2014
Linked Books - DH Venice Fall School 2014Giovanni Colavizza
 
Venezia Biblioteche e Digital Humanities 28/10/2013
Venezia Biblioteche e Digital Humanities 28/10/2013Venezia Biblioteche e Digital Humanities 28/10/2013
Venezia Biblioteche e Digital Humanities 28/10/2013Giovanni Colavizza
 

Más de Giovanni Colavizza (7)

Sul ruolo dell’umanista nelle Digital Humanities
Sul ruolo dell’umanista nelle Digital HumanitiesSul ruolo dell’umanista nelle Digital Humanities
Sul ruolo dell’umanista nelle Digital Humanities
 
La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...
La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...
La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...
 
The References of References: Enriching Library Catalogs via Domain-Specific ...
The References of References: Enriching Library Catalogs via Domain-Specific ...The References of References: Enriching Library Catalogs via Domain-Specific ...
The References of References: Enriching Library Catalogs via Domain-Specific ...
 
Notes de bas de page: d’un outil savant aux hyperliens
Notes de bas de page: d’un outil savant aux hyperliensNotes de bas de page: d’un outil savant aux hyperliens
Notes de bas de page: d’un outil savant aux hyperliens
 
Introduction to the Venice Time Machine
Introduction to the Venice Time MachineIntroduction to the Venice Time Machine
Introduction to the Venice Time Machine
 
Linked Books - DH Venice Fall School 2014
Linked Books - DH Venice Fall School 2014Linked Books - DH Venice Fall School 2014
Linked Books - DH Venice Fall School 2014
 
Venezia Biblioteche e Digital Humanities 28/10/2013
Venezia Biblioteche e Digital Humanities 28/10/2013Venezia Biblioteche e Digital Humanities 28/10/2013
Venezia Biblioteche e Digital Humanities 28/10/2013
 

Último

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 

Último (20)

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 

Mainz Expert Workshop on Controlled Vocabularies 10/10/2013

  • 1. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza Topic Introduction Controlled vocabularies and humanities, a problematic relationship. The functional categorization of historical place types and the problems it raises. Giovanni Colavizza Leibniz Institute of European History Colavizza@ieg-mainz.de 1
  • 2. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza The scenario Controlled vocabulary: a selected list of terms, which refer to concepts, used for categorization. Criteria of concept selection are usually domain specific. Focus for this talk: vocabularies of concepts, not proper names. 2
  • 3. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza The scenario Controlled vocabulary: a selected list of terms, which refer to concepts, used for categorization. Criteria of concept selection are usually domain specific. Focus for this talk: vocabularies of concepts, not proper names. The term - concept relation is often not specified: intended (?) use of natural language, which is context and interpretation specific. But there goes language independence! @Dalia Varanka, A topographic feature taxonomy for a US national topographic mapping ontology, 2009. 2
  • 4. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza The problem Quantitative and computer-based methods scale-up our responsibilities together with our means. Retrieve The data and metadata loop: Reuse Extend 3 Share
  • 5. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza The problem Quantitative and computer-based methods scale-up our responsibilities together with our means. Retrieve The data and metadata loop: Reuse Extend Share More strict requirements: classification systems must be shared, to some extent. Such shared part must be formally specified (machine-readable). The term concept bond has to become explicit. 3
  • 6. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza New design requirements •Allow for comparison beyond single project (data integration) •Interoperability and portability •Scalability •More accurate retrieval •Automatic classification •Named entity recognition •Reasoning... One possible solution: integrate a more strict knowledge model on top of controlled vocabularies. Express it via ontologies: simplified specifications of (shared!) conceptualizations. Already possible! ISO 25964 (data model), SKOS (web format) 4
  • 7. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza IEG proposal - concept •Keep both natural language vocabularies AND formalized ontologies •An integrated approach: 1.develop back-end ontologies, well formalized and documented* 2.vocabularies are built as needed, in natural language, associating tags with formally defined concepts (prevent late integration) 5
  • 8. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza IEG proposal - concept •Keep both natural language vocabularies AND formalized ontologies •An integrated approach: 1.develop back-end ontologies, well formalized and documented* 2.vocabularies are built as needed, in natural language, associating tags with formally defined concepts (prevent late integration) But! No 1-1 mapping between vocabularies and ontologies. Focus on what’s shared*. Pareto principle: 80% effects (tags we need) come from 20% causes (concepts). 5
  • 9. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza IEG proposal - implementation Implementation is key: 1.Upper ontologies (integration among domains) 2.Domain ontologies (e.g. functions) 3.Labeling system 4.Controlled vocabularies > Linked data enabled, user friendly (minimize learning curve and overhead), single entry-point to standards: bridges tags and concepts. 6
  • 10. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza IEG proposal - implementation Implementation is key: 1.Upper ontologies (integration among domains) 2.Domain ontologies (e.g. functions) 3.Labeling system 4.Controlled vocabularies > Linked data enabled, user friendly (minimize learning curve and overhead), single entry-point to standards: bridges tags and concepts. Large-scale collaborative and community-driven framework (numbers 1, 2, 3, in part 4), few experts for back-end, many users for front-end, everything open. Could we think about a Consortium for controlled vocabularies (like TEI)? 6
  • 11. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza Historical place types Quite problematic: Same names mean different things in space, time, culture Generic tags for specific meanings: ambiguity Layers of interpretations: agents, socio-political context, historians 7
  • 12. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza Historical place types Quite problematic: Same names mean different things in space, time, culture Generic tags for specific meanings: ambiguity Layers of interpretations: agents, socio-political context, historians From nouns to verbs: Most vocabularies of place types/features are already loosely classified by functionality (economic activity, leisure facility, place of culture, etc.) There are less verbs than nouns (Wordnet synsets: ~82k nouns, ~14k verbs) Verbs brings us closer to concrete events (and linked data triples..) 7
  • 13. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza Functional categorization - I @Filippo De Vivo, Patrizi, informatori, barbieri. Politica e comunicazione a Venezia nella prima età moderna. Milan: Feltrinelli, 2012. In English: id., Information and communication in Venice: Rethinking Early Modern Politics. Oxford: Oxford University Press, 2007. 8
  • 14. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza Open questions Is all this useful and feasible? (let’s try it) Where to start (historical place types) What to model (functions) Design requirements Explore technical solutions How to integrate existing vocabularies > Sketch guidelines Partners, anyone? :) 9
  • 15. Experts Workshop on Controlled Vocabularies Mainz 10-11/10/2013 Giovanni Colavizza Thanks! Controlled vocabularies and humanities, a problematic relationship. The functional categorization of historical place types and the problems it raises. Giovanni Colavizza Leibniz Institute of European History Colavizza@ieg-mainz.de 10