Using Semantic Technologies to Create Virtual Families from Historical Vital Records
Christophe Debruyne (1,2), Oya Beyan (1), Stefan Decker (1) and Sandra Collins (2)

(1) Insight @ NUI Galway
(2) Digital Repository of Ireland

2014-09-25 @ EUON 2014

Irish Record Linkage, 1864-1913
Developing a platform applying semantic technologies to historical birth, death and marriage certificates.

Answering questions such as: “How accurate are historic maternal mortality rates (MMR) and infant mortality rates (IMR) for Dublin?”

The team consists of researchers (historians), digital archivists, and knowledge engineers.

[Diagram labels: Knowledge and Linked Data Engineers; Digital Historians; Archivists]

General Records Office
• Vital registration data
– Birth certificates
– Death certificates
– Marriage records
• Digitised TIFF images of hardcopy indexes and registers
• 2 TB of data
• Database describing the digitised records, allowing searches on some fields
© General Records Office of Ireland 2014

Challenges
• With respect to requirements
– Identifying certified causes of death that can be attributed to maternal death
– Death certificates with no corresponding birth certificate
– Terminology used pre-1900
– Capturing the socio-economic status of the families via, for instance, the professions and ranks of fathers
– …
• With respect to the platform
– Data protection
– Records vs. Knowledge
– Provenance vs. Interpretation

Separation of Concerns
[Diagram labels: GRO Triplestore; Triplestore 2; Data Analysis]
Transformation from one model to another
• SPIN – SPARQL Inferencing Notation
• SWRL / RuleML
• SPARQL CONSTRUCT
• …
Obviously, due to the sensitive nature of the data, data protection is key.
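
The transformation from one model to another can be pictured with a small, library-free sketch. It imitates what a SPARQL CONSTRUCT query does: match a pattern in the record-level graph and emit new triples for the analysis graph. Everything specific here is an assumption made for illustration: the `REC` and `FAM` namespaces, the property names, and the sample certificate are invented, and the project's actual rules are not shown in the slides.

```python
# Hypothetical namespaces: REC for the record-level model, FAM for the analysis model.
REC = "http://example.org/record#"
FAM = "http://example.org/family#"


def construct_births(record_triples):
    """Mimic a CONSTRUCT query: match birth certificates, emit analysis triples."""
    # Index the record graph by subject so each certificate can be matched as a unit.
    by_subject = {}
    for s, p, o in record_triples:
        by_subject.setdefault(s, {})[p] = o

    derived = set()
    for cert, props in by_subject.items():
        if props.get("rdf:type") != REC + "BirthCertificate":
            continue
        child = props.get(REC + "childName")
        mother = props.get(REC + "motherName")
        if child is None or mother is None:
            continue  # incomplete certificate: derive nothing
        birth = cert + "/birth"
        derived.add((birth, "rdf:type", FAM + "Birth"))
        derived.add((birth, FAM + "childName", child))
        derived.add((birth, FAM + "motherName", mother))
        # Keep a pointer back to the source record (provenance vs. interpretation).
        derived.add((birth, FAM + "derivedFrom", cert))
    return derived


# Invented sample data, not taken from any real register.
records = [
    ("urn:cert:1", "rdf:type", REC + "BirthCertificate"),
    ("urn:cert:1", REC + "childName", "Mary Byrne"),
    ("urn:cert:1", REC + "motherName", "Anne Byrne"),
    ("urn:cert:2", "rdf:type", REC + "DeathCertificate"),
]
analysis = construct_births(records)
```

Because the function only reads the record triples and returns a new set, the source graph stays untouched, which is the point of keeping the two triplestores separate.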

Development of 2 ontologies
• 2 ontologies were developed – separation of concerns
• First ontology for describing the contents of records
– OWL 2, shallow, “flat ontology”
– Created by “lifting” the structure of the vital records
– (Marriage) Record, (Birth|Death) Certificate, Return
• Second ontology for data analysis
– OWL 2 + rules to capture background and domain knowledge
– Created by means of Competency Questions (Grüninger and Fox)
– Person, Birth, Marriage, Death, withChild, motherOf, …
Grüninger, M., Fox, M.S.: The role of competency questions in enterprise engineering. In: Benchmarking Theory and Practice, pp. 22-31. Springer (1995)
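
As a rough illustration of how a rule can add domain knowledge on top of the second ontology, the sketch below derives `motherOf` facts from birth events. The slides name `withChild` and `motherOf` but do not define them, so the reading used here (a birth event points to its child via `withChild`) is an assumption, and `withMother` is a hypothetical property introduced only for this example. The rule is written in plain Python rather than SWRL or SPIN.

```python
def apply_mother_rule(triples):
    """Rule sketch: withChild(b, c) and withMother(b, m) imply motherOf(m, c)."""
    children, mothers = {}, {}
    for s, p, o in triples:
        if p == "withChild":
            children.setdefault(s, set()).add(o)
        elif p == "withMother":  # hypothetical property, not named in the slides
            mothers.setdefault(s, set()).add(o)

    inferred = set()
    for birth, kids in children.items():
        # A birth event without a recorded mother yields no inference.
        for mother in mothers.get(birth, ()):
            for child in kids:
                inferred.add((mother, "motherOf", child))
    return inferred


# Invented facts for illustration.
facts = [
    ("birth1", "withChild", "mary"),
    ("birth1", "withMother", "anne"),
    ("birth2", "withChild", "tom"),
    ("birth2", "withMother", "anne"),
    ("birth3", "withChild", "kate"),  # mother unknown
]
inferred = apply_mother_rule(facts)
```

Grouping the inferred `motherOf` facts by mother gives one simple reading of a "virtual family": here `anne` is linked to both `mary` and `tom`, while `kate` stays unlinked.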

Tool for the Digital Archivist
• Records are encoded using spreadsheets – a tool the digital archivist is familiar with
• RDB-to-RDF mapping files were defined to generate RDF from the in-memory databases created for each spreadsheet.
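
The spreadsheet-to-RDF step can be approximated with a short sketch that reads spreadsheet rows exported as CSV and prints N-Triples. It skips the in-memory database and the RDB-to-RDF mapping files the project used, so it only shows the general shape of the conversion. The `BASE` and `VOCAB` namespaces, the column names, and the sample rows are all invented for illustration, and the `id` column is assumed to contain only characters that are safe inside an IRI.

```python
import csv
import io

BASE = "http://example.org/record/"   # hypothetical namespace for record IRIs
VOCAB = "http://example.org/vocab#"   # hypothetical vocabulary namespace
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def escape_literal(value):
    """Escape backslashes, quotes and newlines for an N-Triples string literal."""
    return (value.replace("\\", "\\\\")
                 .replace('"', '\\"')
                 .replace("\n", "\\n"))


def rows_to_ntriples(csv_text):
    """Turn each spreadsheet row into one typed resource plus one triple per filled cell."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = f"<{BASE}{row['id']}>"
        lines.append(f"{subject} <{RDF_TYPE}> <{VOCAB}DeathCertificate> .")
        for column, value in row.items():
            if column == "id" or not value:
                continue  # empty cells produce no triple
            lines.append(f'{subject} <{VOCAB}{column}> "{escape_literal(value)}" .')
    return lines


# Invented sample sheet, not taken from any real register.
sheet = "id,name,causeOfDeath\nd1,Anne Byrne,Puerperal fever\nd2,John Doyle,\n"
triples = rows_to_ntriples(sheet)
for line in triples:
    print(line)
```

Leaving empty cells out, rather than emitting empty literals, keeps the record graph limited to what was actually transcribed.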

Next steps
• Encoding a significant number of vital records in the Excel files
– To create the first triplestore; and
– To obtain a dataset for validating the transformations; and
– As a consequence, to validate the second ontology.
• Investigating suitable ways for the historians to interact with the data.
• Linking the data with additional context, e.g., Linked Logainm
– http://data.logainm.ie/
– Nuno Lopes, Rebecca Grant, Brian Ó Raghallaigh, Eoghan Ó Carragáin, Sandra Collins, Stefan Decker: Linked Logainm: Enhancing Library Metadata Using Linked Data of Irish Place Names. TPDL Workshops 2013: 65-76

More information
• @IRL_Project
• Project website: http://irishrecordlinkage.wordpress.com/
• In partnership with [partner logos not reproduced]
