SlideShare una empresa de Scribd logo
1 de 44
A Publication Approach to
Linked Data in Archaeology
A Publication Approach to
Linked Data in Archaeology
Eric C. Kansa
UC Berkeley / OpenContext.org
Unless otherwise indicated, this work is licensed under a Creative Commons Attribution
3.0 License <http://creativecommons.org/licenses/by/3.0/>
• Started in 2007
• Open access / open data
publishing for archaeology
• Archiving by California
Digital Library
• Referenced by NSF and
NEH for grant data
management
• Started in 2007
• Open access / open data
publishing for archaeology
• Archiving by California
Digital Library
• Referenced by NSF and
NEH for grant data
management
My Precious DataMy Precious Data
?
Data Sharing as Publication
• Several projects studying
editorial + publishing
workflows
• Current Funding: ACLS,
NEH, Sloan, EOL
Data Sharing as Publication
• Several projects studying
editorial + publishing
workflows
• Current Funding: ACLS,
NEH, Sloan, EOL
Web of DataWeb of Data
Cross-discipline Connections
Open Context links with
humanities data (CIDOC,
Pleiades, British Museum), and
natural sciences (EOL, UBERON)
Pelagios API
EOL Computable Data
Challenge
(Ben Arbuckle, Sarah Kansa,
Eric Kansa)
EOL Computable Data
Challenge
1. 15 different sites
2. 34 zooarchaeologists
3. Publishing: decoding, cleanup,
metadata documentation
4. Linked Data annotation (EOL,
UBERON, biometrics)
5. Collaborative analysis
6. Reuse itself studied by
DIPIR.org (U. Michigan
ISchool)
EOL Computable Data
Challenge
1. 15 different sites
2. 34 zooarchaeologists
3. Publishing: decoding, cleanup,
metadata documentation
4. Linked Data annotation (EOL,
UBERON, biometrics)
5. Collaborative analysis
6. Reuse itself studied by
DIPIR.org (U. Michigan
ISchool)
Data Publishing
Google / Open Refine
1. Check consistency
2. Edit functions
3. All changes logged, can be
rolled back
Google / Open Refine
1. Check consistency
2. Edit functions
3. All changes logged, can be
rolled back
Bibliography
• Bibliographic references
expressed as Linked Data
(modeled after S. Heath)
• Associates publication
citation with Open Access
variants
Bibliography
• Bibliographic references
expressed as Linked Data
(modeled after S. Heath)
• Associates publication
citation with Open Access
variants
Why UBERON?
1. Expresses relevant expert knowledge,
tremendous effort. Why ignore or
duplicate this effort?
2. Anatomic entities related to
embryology, genetic networks. New
research opportunities for zooarch?
3. Zooarchaeology gains stakeholders
(biometric data of wide interest)
Why UBERON?
1. Expresses relevant expert knowledge,
tremendous effort. Why ignore or
duplicate this effort?
2. Anatomic entities related to
embryology, genetic networks. New
research opportunities for zooarch?
3. Zooarchaeology gains stakeholders
(biometric data of wide interest)
“Ovis aries”
http://eol.org/pages/311906/
Code: 14
Domestic
sheep
Code: 70
Code: 16
Ovis aries
Code: 15
Sheep
O. aries
Schaf
Sh.
“Distal epiphysis unfused”
http://opencontext.org/vocabularies/open-context-zooarch/zoo-0058
dist.
unfused
d. uf.
30
uf. dist.,
f. prox.
Distal epiph.
unfused
Distal end unf.
Sheep/Goat Distal Femur FusionSheep/Goat Distal Femur Fusion
Karain B Cave (N=53) Pınarbaşı (N=3) Çukuriçi Höyük (N=13)
Suberde (N=0) Domuztepe (N=28) Ulucak (N=15)
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Unfused
Fused
“Distal epiphysis unfused”
http://opencontext.org/vocabularies/open-context-zooarch/zoo-0058
DIPIR: Data Documentation PracticesDIPIR: Data Documentation Practices
I use an Excel spreadsheet…which I … inherited from my research
advisers. …my dissertation advisor was still recording data for each
specimen on paper when I was in graduate school so that's what I
started …then quickly, I was like, "This is ridiculous.“… I just started
using an Excel spreadsheet that has sort of slowly gotten bigger and
bigger over time with more variables or columns…I've added …color
coding…I also use…a very sort of primitive numerical coding system,
again, that I inherited from my research advisers…So, this little book
that goes with me of codes which is sort of odd, but …we all know
that a 14 is a sheep.” (CCU13)
A long way to go before we
get usable, intelligible data
CC-BY (Eduardo Otubo)
http://www.flickr.com/photos/otubo/5091378744
SPARQL endpoint easy to break (too big of a graph
to query).
Needed a work-around, so I also use the normal
(“plain web”) index to query the British Museum.
(1) Keyword
search for
relevant term.
(2) Scrape results
(blech!) for item
identifiers
(“objectid”
parameter in
URLs)
(3) Use ObjectIDs
in SPARQL queries
(limits size of
graph queried, so
server doesn’t
die).
SELECT ?s ?oPart ?oThes ?oLab
WHERE
{
?s
<http://collection.britishmuseum.org/id/c
rm/bm-extensions/codex_id>
'$objectID';
<http://collection.britishmuseum.org/id/c
rm/P46F.is_composed_of> ?oPart.
?oPart
<http://collection.britishmuseum.org/id/c
rm/P45F.consists_of> ?oThes.
?oThes
<http://www.w3.org/2004/02/skos/core#
prefLabel> ?oLab.
} LIMIT 10
Why is linked
data important?
Why is linked
data important?
1. Improve data quality, expert
curation of concepts +
vocabularies
2. Develop ties with other
research communities (can
feedback to collect new /
different data)
3. Increasingly sophisticated
open source tools, support
services
4. Part of the Web, not just on
the Web
1. Improve data quality, expert
curation of concepts +
vocabularies
2. Develop ties with other
research communities (can
feedback to collect new /
different data)
3. Increasingly sophisticated
open source tools, support
services
4. Part of the Web, not just on
the Web
… but
participating
in Linked Data
requires
effort!
… but
participating
in Linked Data
requires
effort!
Why is linked
data important?
Why is linked
data important?
Image Credit: Copyright Newline Cinema
One does not simply
share usable data…
Data are challenging
1. “Raw data” often problematic,
even with documentation (10X
effort needed with decoded data)
2. Tension between modeling needs
and familiarity with tools (Excel)
3. More work needed modeling
research methods (esp. sampling,
see DIPIR.org outcomes)
4. You’re never going to be done!
Data are challenging
1. “Raw data” often problematic,
even with documentation (10X
effort needed with decoded data)
2. Tension between modeling needs
and familiarity with tools (Excel)
3. More work needed modeling
research methods (esp. sampling,
see DIPIR.org outcomes)
4. You’re never going to be done!

Más contenido relacionado

La actualidad más candente

Linking Data, Linking People
Linking Data, Linking PeopleLinking Data, Linking People
Linking Data, Linking PeoplefereiraJ
 
Finding sci tech grey literature information
Finding sci tech grey literature informationFinding sci tech grey literature information
Finding sci tech grey literature informationMatthew Von Hendy
 
Interpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open ContextInterpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open ContextEric Kansa
 
Texas sla presentation finding sci tech grey literature information
Texas sla presentation  finding sci tech grey literature informationTexas sla presentation  finding sci tech grey literature information
Texas sla presentation finding sci tech grey literature informationMatthew Von Hendy
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...ICZN
 
DataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRefDataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRefCrossref
 
The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...Hilmar Lapp
 
Mcb database resources workshop 2013
Mcb database resources workshop 2013Mcb database resources workshop 2013
Mcb database resources workshop 2013UCT
 
Building the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeBuilding the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeTrish Rose-Sandler
 
Content Mining of Science and Medicine
Content Mining of Science and MedicineContent Mining of Science and Medicine
Content Mining of Science and MedicineTheContentMine
 
Finding and accessing human genome data with Repositive
Finding and accessing human genome data with RepositiveFinding and accessing human genome data with Repositive
Finding and accessing human genome data with RepositiveManuel Corpas
 
Text and Data Mining explained at FTDM
Text and Data Mining explained at FTDMText and Data Mining explained at FTDM
Text and Data Mining explained at FTDMpetermurrayrust
 
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...marcosmartinezromero
 
Introduction to FundRef Webinar
Introduction to FundRef WebinarIntroduction to FundRef Webinar
Introduction to FundRef WebinarCrossref
 
DAS game: how a programmer thinks
DAS game: how a programmer thinksDAS game: how a programmer thinks
DAS game: how a programmer thinksRafael C. Jimenez
 

La actualidad más candente (20)

Linking Data, Linking People
Linking Data, Linking PeopleLinking Data, Linking People
Linking Data, Linking People
 
Finding sci tech grey literature information
Finding sci tech grey literature informationFinding sci tech grey literature information
Finding sci tech grey literature information
 
Interpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open ContextInterpretation, Context, and Metadata: Examples from Open Context
Interpretation, Context, and Metadata: Examples from Open Context
 
Texas sla presentation finding sci tech grey literature information
Texas sla presentation  finding sci tech grey literature informationTexas sla presentation  finding sci tech grey literature information
Texas sla presentation finding sci tech grey literature information
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
 
DataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRefDataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRef
 
The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...
 
Mcb database resources workshop 2013
Mcb database resources workshop 2013Mcb database resources workshop 2013
Mcb database resources workshop 2013
 
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
 
Building the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeBuilding the new open linked library: Theory and Practice
Building the new open linked library: Theory and Practice
 
Content Mining of Science and Medicine
Content Mining of Science and MedicineContent Mining of Science and Medicine
Content Mining of Science and Medicine
 
Finding and accessing human genome data with Repositive
Finding and accessing human genome data with RepositiveFinding and accessing human genome data with Repositive
Finding and accessing human genome data with Repositive
 
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
 
Text and Data Mining explained at FTDM
Text and Data Mining explained at FTDMText and Data Mining explained at FTDM
Text and Data Mining explained at FTDM
 
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
 
Top 10 web
Top 10 webTop 10 web
Top 10 web
 
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
 
Introduction to FundRef Webinar
Introduction to FundRef WebinarIntroduction to FundRef Webinar
Introduction to FundRef Webinar
 
DAS game: how a programmer thinks
DAS game: how a programmer thinksDAS game: how a programmer thinks
DAS game: how a programmer thinks
 
Open Annotation Model
Open Annotation ModelOpen Annotation Model
Open Annotation Model
 

Similar a #LAWDI Open Context, publishing linked data in archaeology

ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
An Open Context for Archaeology
An Open Context for ArchaeologyAn Open Context for Archaeology
An Open Context for Archaeologyguest756e05
 
It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011Ross Singer
 
Idcc kansa-kansa-arbuckle
Idcc kansa-kansa-arbuckleIdcc kansa-kansa-arbuckle
Idcc kansa-kansa-arbuckleEric Kansa
 
Data Sharing as Publication: A View from Archaeology
Data Sharing as Publication: A View from ArchaeologyData Sharing as Publication: A View from Archaeology
Data Sharing as Publication: A View from ArchaeologyEric Kansa
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
Ontologies neo4j-graph-workshop-berlin
Ontologies neo4j-graph-workshop-berlinOntologies neo4j-graph-workshop-berlin
Ontologies neo4j-graph-workshop-berlinSimon Jupp
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptxvijayapraba1
 
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Cyndy Parr
 
Content Mining at Wellcome Trust
Content Mining at Wellcome TrustContent Mining at Wellcome Trust
Content Mining at Wellcome TrustTheContentMine
 
Importing life science at a into Neo4j
Importing life science at a into Neo4jImporting life science at a into Neo4j
Importing life science at a into Neo4jSimon Jupp
 
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...Trevor Owens
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK Cyndy Parr
 
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseAnita de Waard
 
Maximising your communication impact – making altmetrics workss
Maximising your communication impact – making altmetrics workssMaximising your communication impact – making altmetrics workss
Maximising your communication impact – making altmetrics workssCiarán Quinn
 
Metadata for researchers
Metadata for researchers Metadata for researchers
Metadata for researchers Getaneh Alemu
 

Similar a #LAWDI Open Context, publishing linked data in archaeology (20)

ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
An Open Context for Archaeology
An Open Context for ArchaeologyAn Open Context for Archaeology
An Open Context for Archaeology
 
It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011
 
Idcc kansa-kansa-arbuckle
Idcc kansa-kansa-arbuckleIdcc kansa-kansa-arbuckle
Idcc kansa-kansa-arbuckle
 
Data Sharing as Publication: A View from Archaeology
Data Sharing as Publication: A View from ArchaeologyData Sharing as Publication: A View from Archaeology
Data Sharing as Publication: A View from Archaeology
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Ontologies neo4j-graph-workshop-berlin
Ontologies neo4j-graph-workshop-berlinOntologies neo4j-graph-workshop-berlin
Ontologies neo4j-graph-workshop-berlin
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx
 
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
 
Content Mining at Wellcome Trust
Content Mining at Wellcome TrustContent Mining at Wellcome Trust
Content Mining at Wellcome Trust
 
Importing life science at a into Neo4j
Importing life science at a into Neo4jImporting life science at a into Neo4j
Importing life science at a into Neo4j
 
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
 
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Peer Review and Science2.0
Peer Review and Science2.0Peer Review and Science2.0
Peer Review and Science2.0
 
Reference Rot and Linked Data: Threat and Remedy
Reference Rot and Linked Data: Threat and RemedyReference Rot and Linked Data: Threat and Remedy
Reference Rot and Linked Data: Threat and Remedy
 
Maximising your communication impact – making altmetrics workss
Maximising your communication impact – making altmetrics workssMaximising your communication impact – making altmetrics workss
Maximising your communication impact – making altmetrics workss
 
Metadata for researchers
Metadata for researchers Metadata for researchers
Metadata for researchers
 
FAIRy Stories
FAIRy StoriesFAIRy Stories
FAIRy Stories
 

Último

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Último (20)

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

#LAWDI Open Context, publishing linked data in archaeology

  • 1. A Publication Approach to Linked Data in Archaeology A Publication Approach to Linked Data in Archaeology Eric C. Kansa UC Berkeley / OpenContext.org Unless otherwise indicated, this work is licensed under a Creative Commons Attribution 3.0 License <http://creativecommons.org/licenses/by/3.0/>
  • 2. • Started in 2007 • Open access / open data publishing for archaeology • Archiving by California Digital Library • Referenced by NSF and NEH for grant data management • Started in 2007 • Open access / open data publishing for archaeology • Archiving by California Digital Library • Referenced by NSF and NEH for grant data management
  • 3. My Precious DataMy Precious Data ?
  • 4.
  • 5. Data Sharing as Publication • Several projects studying editorial + publishing workflows • Current Funding: ACLS, NEH, Sloan, EOL Data Sharing as Publication • Several projects studying editorial + publishing workflows • Current Funding: ACLS, NEH, Sloan, EOL
  • 6.
  • 7.
  • 8. Web of DataWeb of Data Cross-discipline Connections Open Context links with humanities data (CIDOC, Pleiades, British Museum), and natural sciences (EOL, UBERON)
  • 10. EOL Computable Data Challenge (Ben Arbuckle, Sarah Kansa, Eric Kansa)
  • 11. EOL Computable Data Challenge 1. 15 different sites 2. 34 zooarchaeologists 3. Publishing: decoding, cleanup, metadata documentation 4. Linked Data annotation (EOL, UBERON, biometrics) 5. Collaborative analysis 6. Reuse itself studied by DIPIR.org (U. Michigan ISchool) EOL Computable Data Challenge 1. 15 different sites 2. 34 zooarchaeologists 3. Publishing: decoding, cleanup, metadata documentation 4. Linked Data annotation (EOL, UBERON, biometrics) 5. Collaborative analysis 6. Reuse itself studied by DIPIR.org (U. Michigan ISchool)
  • 12. Data Publishing Google / Open Refine 1. Check consistency 2. Edit functions 3. All changes logged, can be rolled back Google / Open Refine 1. Check consistency 2. Edit functions 3. All changes logged, can be rolled back
  • 13.
  • 14. Bibliography • Bibliographic references expressed as Linked Data (modeled after S. Heath) • Associates publication citation with Open Access variants Bibliography • Bibliographic references expressed as Linked Data (modeled after S. Heath) • Associates publication citation with Open Access variants
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22. Why UBERON? 1. Expresses relevant expert knowledge, tremendous effort. Why ignore or duplicate this effort? 2. Anatomic entities related to embryology, genetic networks. New research opportunities for zooarch? 3. Zooarchaeology gains stakeholders (biometric data of wide interest) Why UBERON? 1. Expresses relevant expert knowledge, tremendous effort. Why ignore or duplicate this effort? 2. Anatomic entities related to embryology, genetic networks. New research opportunities for zooarch? 3. Zooarchaeology gains stakeholders (biometric data of wide interest)
  • 23.
  • 24.
  • 25. “Ovis aries” http://eol.org/pages/311906/ Code: 14 Domestic sheep Code: 70 Code: 16 Ovis aries Code: 15 Sheep O. aries Schaf Sh.
  • 26.
  • 27.
  • 29. Sheep/Goat Distal Femur FusionSheep/Goat Distal Femur Fusion Karain B Cave (N=53) Pınarbaşı (N=3) Çukuriçi Höyük (N=13) Suberde (N=0) Domuztepe (N=28) Ulucak (N=15) 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Unfused Fused
  • 31.
  • 32.
  • 33.
  • 34.
  • 35. DIPIR: Data Documentation PracticesDIPIR: Data Documentation Practices I use an Excel spreadsheet…which I … inherited from my research advisers. …my dissertation advisor was still recording data for each specimen on paper when I was in graduate school so that's what I started …then quickly, I was like, "This is ridiculous.“… I just started using an Excel spreadsheet that has sort of slowly gotten bigger and bigger over time with more variables or columns…I've added …color coding…I also use…a very sort of primitive numerical coding system, again, that I inherited from my research advisers…So, this little book that goes with me of codes which is sort of odd, but …we all know that a 14 is a sheep.” (CCU13) A long way to go before we get usable, intelligible data
  • 37. SPARQL endpoint easy to break (too big of a graph to query). Needed a work-around, so I also use the normal (“plain web”) index to query the British Museum.
  • 38. (1) Keyword search for relevant term. (2) Scrape results (blech!) for item identifiers (“objectid” parameter in URLs) (3) Use ObjectIDs in SPARQL queries (limits size of graph queried, so server doesn’t die).
  • 39. SELECT ?s ?oPart ?oThes ?oLab WHERE { ?s <http://collection.britishmuseum.org/id/c rm/bm-extensions/codex_id> '$objectID'; <http://collection.britishmuseum.org/id/c rm/P46F.is_composed_of> ?oPart. ?oPart <http://collection.britishmuseum.org/id/c rm/P45F.consists_of> ?oThes. ?oThes <http://www.w3.org/2004/02/skos/core# prefLabel> ?oLab. } LIMIT 10
  • 40. Why is linked data important? Why is linked data important? 1. Improve data quality, expert curation of concepts + vocabularies 2. Develop ties with other research communities (can feedback to collect new / different data) 3. Increasingly sophisticated open source tools, support services 4. Part of the Web, not just on the Web 1. Improve data quality, expert curation of concepts + vocabularies 2. Develop ties with other research communities (can feedback to collect new / different data) 3. Increasingly sophisticated open source tools, support services 4. Part of the Web, not just on the Web
  • 41. … but participating in Linked Data requires effort! … but participating in Linked Data requires effort! Why is linked data important? Why is linked data important?
  • 42. Image Credit: Copyright Newline Cinema
  • 43. One does not simply share usable data…
  • 44. Data are challenging 1. “Raw data” often problematic, even with documentation (10X effort needed with decoded data) 2. Tension between modeling needs and familiarity with tools (Excel) 3. More work needed modeling research methods (esp. sampling, see DIPIR.org outcomes) 4. You’re never going to be done! Data are challenging 1. “Raw data” often problematic, even with documentation (10X effort needed with decoded data) 2. Tension between modeling needs and familiarity with tools (Excel) 3. More work needed modeling research methods (esp. sampling, see DIPIR.org outcomes) 4. You’re never going to be done!