5. Martin-Rodilla, P.; Gonzalez-Perez, C. TEEM 2018 5
Uncertain Information In Humanities
Archives & Curation Modelling Approaches OO & Humanities Tech Modelling Approaches
Linked Open Data metamodels (in RDF, XML)
Text Encoding Initiative (TEI)
CIDOC-CRM and similar developments
Thesaurus, ad hoc creation of ontologies & folksonomies
Only “custom” software search and indexing systems will be able to deal with vagueness.
No explicit support for vagueness (ontological or epistemic).
CIDOC extension for uncertainty (subjectivity, i.e. some epistemic scenarios).
No explicit support for vagueness (ontological or epistemic).
Instance-level solutions, not at the metamodel level.
Some ad hoc adapted solutions: TEI note tag, XML tags for vagueness and probabilistic information.
Uncertain Information Outside Humanities
Statistical approaches: vagueness as a margin of error to be mitigated
- Probability functions as indicators of precision (used in inferential statistics)
- Degree of certainty of attribute values (i.e. error measurements for a given value).
Strongly mathematical approaches:
- Margin-of-error approaches such as Interval Predictor Models
- Fuzzy logic approaches (e.g. fuzzy sets, rule bases, linguistic fuzzy description of variables or fuzzy quantifiers)
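As an illustration of the fuzzy set idea mentioned above, the following minimal sketch computes degrees of membership for a linguistic variable. The variable ("distance"), its terms and all breakpoints are hypothetical values chosen for the example, not taken from the slides.

```python
from typing import Dict, Tuple


def triangular(x: float, low: float, peak: float, high: float) -> float:
    """Degree of membership (0..1) of x in a triangular fuzzy set."""
    if x <= low or x >= high:
        return 0.0
    if x <= peak:
        return (x - low) / (peak - low)
    return (high - x) / (high - peak)


# Hypothetical linguistic terms for "distance"; breakpoints in km are
# illustrative assumptions only.
DISTANCE_TERMS: Dict[str, Tuple[float, float, float]] = {
    "near": (0.0, 10.0, 30.0),
    "medium": (20.0, 50.0, 80.0),
    "far": (60.0, 100.0, 140.0),
}


def fuzzify(km: float) -> Dict[str, float]:
    """Map a crisp distance to its membership degree in each term."""
    return {term: triangular(km, *abc) for term, abc in DISTANCE_TERMS.items()}
```

A value such as 25 km belongs partly to "near" and partly to "medium", which is the kind of graded statement a crisp attribute cannot express.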
Software Engineering approaches:
- Ontological vagueness: probability and possibility of existence of entities in the information models.
- Epistemic vagueness: modellable characteristics (set membership, interval membership, incompleteness).
- Work still in progress (2016 OMG group for UML vagueness, some UML solutions based on stereotypes).
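The three epistemic characteristics listed above (set membership, interval membership, incompleteness) could be sketched as small value types. This is only one possible reading: the class names, the `admits` method and the sample values are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import FrozenSet, Union


@dataclass(frozen=True)
class Interval:
    """Interval membership: the true value lies somewhere in [low, high]."""
    low: float
    high: float

    def __post_init__(self) -> None:
        if self.low > self.high:
            raise ValueError("low must not exceed high")

    def admits(self, value: float) -> bool:
        return self.low <= value <= self.high


@dataclass(frozen=True)
class OneOf:
    """Set membership: the true value is one of a known set of candidates."""
    candidates: FrozenSet[str]

    def admits(self, value: str) -> bool:
        return value in self.candidates


class Unknown:
    """Incompleteness: nothing is known, so no value can be ruled out."""

    def admits(self, value: object) -> bool:
        return True


Vague = Union[Interval, OneOf, Unknown]

# Hypothetical sample values, for illustration only.
founding_year = Interval(-3100, -2900)
region = OneOf(frozenset({"Algeria", "Tunisia"}))
distance_km = Unknown()
```

Each type answers the same question (is a candidate value compatible with what is known?), so code that consumes vague attributes does not need to know which kind of vagueness it is dealing with.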
DICTOMAGRED ConML Class Diagram
Classes:
Toponym
  Name: 1 Text
  UsedIn: 1 Time
  CurrentName: 0..* Text
GeographicArea
  XCoord: 1 Number
  YCoord: 1 Number
  Region: 1..* enum Regions
HistoricalSource
  IdentificationNumber: 1 Number
TextualHistoricalSource (specializes HistoricalSource; discriminant: Source Type)
  Title: 1 Text
  PublicationDate: 1 Time
  Author: 1 Text
OralHistoricalSource (specializes HistoricalSource; discriminant: Source Type)
  Informer: 0..1 Text
  Duration: 1 Time
  IsInterview: 1 Boolean
ToponymDistance
  CalculatedIn: 1 Text
  OriginalReference: 1 Text
  ReliabilityLevel: 1 enum ReliabilityLevel
  KmDistance: 1 Number
Associations (ends not recoverable from the extracted text):
  RefersTo (two associations), IsDestinyTo, IsOriginFrom
  Multiplicity labels shown in the diagram: 1..*, 0..*, 0..*, 0..*, 1, 0..*, 0..*, 0..*
Enumerated types:
Regions
  Maghreb
    Tunisia
    Morocco
    Algeria
  Egypt
    Upper Egypt
    Lower Egypt
  NearEast
    Fertile Crescent
ReliabilityLevel
  Very High
  High
  Medium
  Low
  Very Low
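For readers who think in code, the classes and enumerated types of the diagram could be approximated with Python dataclasses. This is a rough sketch, not a ConML implementation: the Python names are assumptions, Time attributes are simplified to strings, the enumeration is flattened, and associations are left out because their ends cannot be read from the extracted diagram.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Region(Enum):
    MAGHREB = "Maghreb"
    TUNISIA = "Tunisia"
    MOROCCO = "Morocco"
    ALGERIA = "Algeria"
    EGYPT = "Egypt"
    UPPER_EGYPT = "Upper Egypt"
    LOWER_EGYPT = "Lower Egypt"
    NEAR_EAST = "NearEast"
    FERTILE_CRESCENT = "Fertile Crescent"


class ReliabilityLevel(Enum):
    VERY_HIGH = "Very High"
    HIGH = "High"
    MEDIUM = "Medium"
    LOW = "Low"
    VERY_LOW = "Very Low"


@dataclass
class Toponym:
    name: str                                               # Name: 1 Text
    used_in: str                                            # UsedIn: 1 Time (simplified)
    current_names: List[str] = field(default_factory=list)  # CurrentName: 0..* Text


@dataclass
class GeographicArea:
    x_coord: float         # XCoord: 1 Number
    y_coord: float         # YCoord: 1 Number
    regions: List[Region]  # Region: 1..* enum Regions

    def __post_init__(self) -> None:
        # Enforce the 1..* cardinality of Region.
        if not self.regions:
            raise ValueError("GeographicArea needs at least one Region")


@dataclass
class HistoricalSource:
    identification_number: int  # IdentificationNumber: 1 Number


@dataclass
class TextualHistoricalSource(HistoricalSource):
    title: str             # Title: 1 Text
    publication_date: str  # PublicationDate: 1 Time (simplified)
    author: str            # Author: 1 Text


@dataclass
class OralHistoricalSource(HistoricalSource):
    duration: str                   # Duration: 1 Time (simplified)
    is_interview: bool              # IsInterview: 1 Boolean
    informer: Optional[str] = None  # Informer: 0..1 Text


@dataclass
class ToponymDistance:
    calculated_in: str                   # CalculatedIn: 1 Text
    original_reference: str              # OriginalReference: 1 Text
    reliability_level: ReliabilityLevel  # ReliabilityLevel: 1 enum
    km_distance: float                   # KmDistance: 1 Number
```

Note that plain types like `float` leave no room for the vagueness the slides are about; that limitation is the motivation for the ConML mechanisms.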
DICTOMAGRED Object Diagram
ths1: TextualHistoricalSource
  IdentificationNumber = 0000943
  Title = "Muʿŷam al-buldān"
  PublicationDate = 1977
  Author = "Yāqūt, Šihāb al-dīn Abū ʿAbd Allāh"
ths2: TextualHistoricalSource
  IdentificationNumber = 0000987
  Title = "Kitāb al-masālik"
  PublicationDate = 1992
  Author = "al-Bakrī, Abū ʿUbayd ʿAbd Allāh"
top1: Toponym
  Name = "Tubna"
  UsedIn = Pre-Islamic Age (*)
  CurrentName = null
top2: Toponym
  Name = "Biskra"
  UsedIn = 3000 BC (+)
  CurrentName = "Biskra"
topDis: ToponymDistance
  CalculatedIn = "1 Camel journey"
  OriginalReference = "Yāqūt"
  ReliabilityLevel = Low
  KmDistance = unknown
ga1: GeographicArea
  XCoord = 35.384470 (~)
  YCoord = 5.357035 (~)
  Region = Maghreb
ga2: GeographicArea
  XCoord = 34.838945 (*)
  YCoord = 5.749731 (*)
  Region = Algeria
Link labels shown in the diagram (ends not recoverable from the extracted text): RefersTo, IsOriginFrom, IsDestinyTo
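The object diagram attaches markers such as (*), (+) and (~) to individual values and distinguishes `null` from `unknown`. A minimal sketch of how such annotated values might be carried in code follows. The `Qualified` wrapper, the `UNKNOWN` sentinel and the helper function are hypothetical names; the markers are stored verbatim because their meaning is not spelled out on this slide.

```python
from dataclasses import dataclass
from typing import Generic, Optional, TypeVar

T = TypeVar("T")


class _Unknown:
    """Sentinel for 'a value exists but is not known' (distinct from None/null)."""

    def __repr__(self) -> str:
        return "unknown"


UNKNOWN = _Unknown()


@dataclass(frozen=True)
class Qualified(Generic[T]):
    """A value plus the marker written next to it in the diagram, if any."""
    value: T
    marker: Optional[str] = None

    def __str__(self) -> str:
        return f"{self.value} ({self.marker})" if self.marker else str(self.value)


@dataclass
class Toponym:
    name: str
    used_in: Qualified[str]
    current_name: Optional[str]  # None plays the role of null


@dataclass
class ToponymDistance:
    calculated_in: str
    original_reference: str
    reliability_level: str
    km_distance: object  # a number, or UNKNOWN


# Values copied from the object diagram.
top1 = Toponym("Tubna", Qualified("Pre-Islamic Age", "*"), None)
top2 = Toponym("Biskra", Qualified("3000 BC", "+"), "Biskra")
top_dis = ToponymDistance("1 Camel journey", "Yāqūt", "Low", UNKNOWN)


def has_known_distance(distance: ToponymDistance) -> bool:
    """True only when a numeric distance in km is actually recorded."""
    return distance.km_distance is not UNKNOWN
```

Keeping `null` and `unknown` as separate values preserves the difference between "there is no current name" and "there is a distance, but nobody has been able to state it in kilometres".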
Conclusions & Future Steps
Vagueness is a valuable intrinsic characteristic in humanities.
We presented here a theoretical framework and specific modelling mechanisms in ConML for the
expression of ontological and epistemic vagueness in Digital Humanities.
The DICTOMAGRED project illustrates how ConML mechanisms allow humanities researchers to express imprecision and
uncertainty in their own models.
More work is needed…
…e.g. the evaluation of new implementation techniques based on fuzzy logic and other
quantitative methods, so that computation may take place on the vagueness expressed by
models.