SlideShare a Scribd company logo
1 of 40
Download to read offline
Grab a bucket!



                                            It’s raining data!
Photo: http://www.flickr.com/photos/peasap/655111542/
                                                                          Dorothea Salo
                                                                 University of Wisconsin
                                                                            Access 2009
the...




Painting: “Cassandra,” Evelyn de Morgan
Photo: http://commons.wikimedia.org/wiki/File:Cassandra1.jpeg
                                                                of Open Access
I’ve got nothing against




                                       but the reality was...
Photo: http://www.flickr.com/photos/y2bk/528300692/
... blurrier.
     goals?

                    means?

                               something for nothing?

                                             fit between content and container?

                                                            fit between user needs and system?

                       and so now, I may be becoming
Photo: http://www.flickr.com/photos/jennsstuff/2965783700/
the...




         of Data Curation?
What do we know about data?




Photo: http://www.flickr.com/photos/kentbye/2053916246/
There’s a lot of data.




Photo: http://www.flickr.com/photos/noelzialee/2126153623/
Photo: http://www.flickr.com/photos/jonevans/1032687817/



          Data are there to be interacted with.
Data are wildly diverse in nature...




         ... as are their technical environments.
Photo: http://www.flickr.com/photos/28481088@N00/670258156/
Data are already out there.




Photo: NASA (via http://nasaimages.org/), “Multiwavelength M81”
A lot of data are analog...




                     ... but really want to be digital.
Photo: http://www.flickr.com/photos/mrbill/3452943573/
Data are project-based.




http://www.exploringthehyper.net/
Data are sloppy.


Photo: http://www.flickr.com/photos/midorisyu/2622024163/
Data aren’t standardized.




Photo: http://www.flickr.com/photos/mikewade/3463334719/
Our Big Bucket:




 the digital library
Our other Big Bucket:




the institutional repository
Photo: http://www.flickr.com/photos/peasap/655111542/



                                    Impedance mismatches
What do we know about these?
Photo: http://www.flickr.com/photos/schex/193912573/
Carefully built and tended




                   http://www.collectionscanada.gc.ca/naskapi/index-e.html
Production is a Taylorist’s dream.
                     Photo: http://www.flickr.com/photos/villeneuve53/1808995620/
when it isn’t a Taylorist’s nightmare.
Photo: http://www.flickr.com/photos/elsie/97542274/
What do we know about these?
We’re caged up




                                   inside our institutions.
Photo: http://www.flickr.com/photos/annia316/115439737/
Photo: http://commons.wikimedia.org/wiki/File:Black_Ford_Model_T_in_HK.JPG




                                                      Any color...
Bring it on; we’ll take anything!




                 ... as long as it’s static and final.
Photo: http://www.flickr.com/photos/orblivio/146691405/
Right, anything you’ve got!




                                           ... one file at a time.
Photo: http://www.flickr.com/photos/jetalone/39990302/
Any look and feel...
Any metadata you want!




                ... as long as it’s key-value pairs.
Photo: http://www.flickr.com/photos/rattodisabina/2460905893/
Do anything you want...




                       ... as long as it’s “download.”
Photo: http://www.flickr.com/photos/procsilas/306417902/
Content models




           Enough said.
So where does all that leave us?




Photo: http://www.flickr.com/photos/library_of_congress/2162653769/
Photo: http://www.flickr.com/photos/jonevans/1032687817/



                  We need bigger, better buckets.
Silos are both necessary




                                         and unacceptable.
Photo: http://www.flickr.com/photos/jojakeman/2818910104/
We have a lot of modeling to do.




                                        And meta-modeling.
Photo: http://www.flickr.com/photos/crobj/727348790/
We have a lot of code to write.
Photo: http://www.flickr.com/photos/fienna/170559081/
We can’t code or model in isolation.
Photo: http://www.flickr.com/photos/naus3a01/240614578/
Fedora is the new world.




                           But Fedora must change.
Photo: http://www.flickr.com/photos/mythwhisper/3361907495/
Solr brings it all together
Photo: http://www.flickr.com/photos/chantrybee/2911840052/
... the




Vermeer: the Muse Clio, from “The Allegory of Painting”
                                                          of Data Curation.
Thank you!



This presentation is available under a Creative
Commons Attribution 3.0 United States license.

More Related Content

What's hot

How much radical openness does innovation need? Intimacy vs. Openness Deathma...
How much radical openness does innovation need? Intimacy vs. Openness Deathma...How much radical openness does innovation need? Intimacy vs. Openness Deathma...
How much radical openness does innovation need? Intimacy vs. Openness Deathma...
Matteo Cassese
 
Freak Out, Geek Out, or Seek Out
Freak Out, Geek Out, or Seek OutFreak Out, Geek Out, or Seek Out
Freak Out, Geek Out, or Seek Out
David King
 
Assembly feb.7th
Assembly feb.7thAssembly feb.7th
Assembly feb.7th
ndbekah
 
Are We Really Better Safe than Sorry - Notes
Are We Really Better Safe than Sorry - NotesAre We Really Better Safe than Sorry - Notes
Are We Really Better Safe than Sorry - Notes
Kathryn Bergeron
 
My Life as A Sponge #altc2013 Invited Speaker session
My Life as A Sponge  #altc2013 Invited Speaker session My Life as A Sponge  #altc2013 Invited Speaker session
My Life as A Sponge #altc2013 Invited Speaker session
Sheila MacNeill
 
Are We Really Better Safe Than Sorry
Are We Really Better Safe Than SorryAre We Really Better Safe Than Sorry
Are We Really Better Safe Than Sorry
Kathryn Bergeron
 

What's hot (20)

Tervezz szokást! - WIAD, Mobile Hungary - Kolozsi István, kolboid
Tervezz szokást! - WIAD, Mobile Hungary - Kolozsi István, kolboidTervezz szokást! - WIAD, Mobile Hungary - Kolozsi István, kolboid
Tervezz szokást! - WIAD, Mobile Hungary - Kolozsi István, kolboid
 
A Day In The Life v4.2
A Day In The Life v4.2A Day In The Life v4.2
A Day In The Life v4.2
 
Digital Portfolios - Presented at GREAT12 Dublin
Digital Portfolios - Presented at GREAT12 DublinDigital Portfolios - Presented at GREAT12 Dublin
Digital Portfolios - Presented at GREAT12 Dublin
 
21st Century Bricoleurs v3
21st Century Bricoleurs v321st Century Bricoleurs v3
21st Century Bricoleurs v3
 
21st Century Bricoleurs
21st Century Bricoleurs21st Century Bricoleurs
21st Century Bricoleurs
 
How much radical openness does innovation need? Intimacy vs. Openness Deathma...
How much radical openness does innovation need? Intimacy vs. Openness Deathma...How much radical openness does innovation need? Intimacy vs. Openness Deathma...
How much radical openness does innovation need? Intimacy vs. Openness Deathma...
 
Your time is limited, so don’t waste it living someone’s life! - Steve Jobs
Your time is limited, so don’t waste it living someone’s life! - Steve JobsYour time is limited, so don’t waste it living someone’s life! - Steve Jobs
Your time is limited, so don’t waste it living someone’s life! - Steve Jobs
 
Getting The Word Out V2
Getting The Word Out V2Getting The Word Out V2
Getting The Word Out V2
 
Kaliya Hamlin on unconferences at Ignite Bay Area
Kaliya Hamlin on unconferences at Ignite Bay AreaKaliya Hamlin on unconferences at Ignite Bay Area
Kaliya Hamlin on unconferences at Ignite Bay Area
 
When LES Is More
When LES Is MoreWhen LES Is More
When LES Is More
 
Freak Out, Geek Out, or Seek Out
Freak Out, Geek Out, or Seek OutFreak Out, Geek Out, or Seek Out
Freak Out, Geek Out, or Seek Out
 
Telling Photo Tales v5
Telling Photo Tales v5Telling Photo Tales v5
Telling Photo Tales v5
 
Assembly feb.7th
Assembly feb.7thAssembly feb.7th
Assembly feb.7th
 
Are We Really Better Safe than Sorry - Notes
Are We Really Better Safe than Sorry - NotesAre We Really Better Safe than Sorry - Notes
Are We Really Better Safe than Sorry - Notes
 
Multitasking
MultitaskingMultitasking
Multitasking
 
Ebooks: Landscape & Impl
Ebooks: Landscape & ImplEbooks: Landscape & Impl
Ebooks: Landscape & Impl
 
How Ebooks, File Types, and DRM Affect your Library
 How Ebooks, File Types, and DRM Affect your Library How Ebooks, File Types, and DRM Affect your Library
How Ebooks, File Types, and DRM Affect your Library
 
Generation Y by @orsnemes
Generation Y by @orsnemesGeneration Y by @orsnemes
Generation Y by @orsnemes
 
My Life as A Sponge #altc2013 Invited Speaker session
My Life as A Sponge  #altc2013 Invited Speaker session My Life as A Sponge  #altc2013 Invited Speaker session
My Life as A Sponge #altc2013 Invited Speaker session
 
Are We Really Better Safe Than Sorry
Are We Really Better Safe Than SorryAre We Really Better Safe Than Sorry
Are We Really Better Safe Than Sorry
 

Similar to Grab a bucket! It's raining data!

Similar to Grab a bucket! It's raining data! (20)

21st Century Bricoleurs v1.1
21st Century Bricoleurs v1.121st Century Bricoleurs v1.1
21st Century Bricoleurs v1.1
 
Holiday Science Lecture: Art, Life and Programming
Holiday Science Lecture: Art, Life and ProgrammingHoliday Science Lecture: Art, Life and Programming
Holiday Science Lecture: Art, Life and Programming
 
The Importance of Storytelling in Web Design, WordCamp Miami 2013
The Importance of Storytelling in Web Design, WordCamp Miami 2013The Importance of Storytelling in Web Design, WordCamp Miami 2013
The Importance of Storytelling in Web Design, WordCamp Miami 2013
 
21st Century Bricoleurs v2 part two
21st Century Bricoleurs v2 part two21st Century Bricoleurs v2 part two
21st Century Bricoleurs v2 part two
 
Some Kind of Wonderful
Some Kind of WonderfulSome Kind of Wonderful
Some Kind of Wonderful
 
What Can I Do Now? (web 2.0 pedagogy) v3.3
What Can I Do Now? (web 2.0 pedagogy) v3.3What Can I Do Now? (web 2.0 pedagogy) v3.3
What Can I Do Now? (web 2.0 pedagogy) v3.3
 
Parent Technology Certificate Session 3
Parent Technology Certificate Session 3Parent Technology Certificate Session 3
Parent Technology Certificate Session 3
 
21st Century Bricoleurs v2 part one
21st Century Bricoleurs v2 part one21st Century Bricoleurs v2 part one
21st Century Bricoleurs v2 part one
 
Science in the Open
Science in the OpenScience in the Open
Science in the Open
 
Socialmedialiverpool 120209011224-phpapp02
Socialmedialiverpool 120209011224-phpapp02Socialmedialiverpool 120209011224-phpapp02
Socialmedialiverpool 120209011224-phpapp02
 
Extreme (web 2.0) Lesson Makeover v3.1
Extreme (web 2.0) Lesson Makeover v3.1Extreme (web 2.0) Lesson Makeover v3.1
Extreme (web 2.0) Lesson Makeover v3.1
 
What Can I Do Now? (web 2.0 pedagogy) v3.9
What Can I Do Now? (web 2.0 pedagogy) v3.9What Can I Do Now? (web 2.0 pedagogy) v3.9
What Can I Do Now? (web 2.0 pedagogy) v3.9
 
What Can I Do Now? (web 2.0 pedagogy) v4
What Can I Do Now? (web 2.0 pedagogy) v4What Can I Do Now? (web 2.0 pedagogy) v4
What Can I Do Now? (web 2.0 pedagogy) v4
 
What Can I Do Now? (web 2.0 pedagogy) v3.8
What Can I Do Now? (web 2.0 pedagogy) v3.8What Can I Do Now? (web 2.0 pedagogy) v3.8
What Can I Do Now? (web 2.0 pedagogy) v3.8
 
What Can I Do Now? (web 2.0 pedagogy) v3.2
What Can I Do Now? (web 2.0 pedagogy) v3.2What Can I Do Now? (web 2.0 pedagogy) v3.2
What Can I Do Now? (web 2.0 pedagogy) v3.2
 
What Can I Do Now? (web 2.0 pedagogy) v3.4
What Can I Do Now? (web 2.0 pedagogy) v3.4What Can I Do Now? (web 2.0 pedagogy) v3.4
What Can I Do Now? (web 2.0 pedagogy) v3.4
 
Letting Go
Letting GoLetting Go
Letting Go
 
Lettinggo 110902081541-phpapp02
Lettinggo 110902081541-phpapp02Lettinggo 110902081541-phpapp02
Lettinggo 110902081541-phpapp02
 
Letting go
Letting goLetting go
Letting go
 
Freak Out, Geek Out, or Seek Out: Dealing with Tech Change and Customer Engag...
Freak Out, Geek Out, or Seek Out: Dealing with Tech Change and Customer Engag...Freak Out, Geek Out, or Seek Out: Dealing with Tech Change and Customer Engag...
Freak Out, Geek Out, or Seek Out: Dealing with Tech Change and Customer Engag...
 

More from Dorothea Salo

Risk management and auditing
Risk management and auditingRisk management and auditing
Risk management and auditing
Dorothea Salo
 
MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archives
Dorothea Salo
 
RDF, RDA, and other TLAs
RDF, RDA, and other TLAsRDF, RDA, and other TLAs
RDF, RDA, and other TLAs
Dorothea Salo
 

More from Dorothea Salo (20)

Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)
 
Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!
 
Encryption
EncryptionEncryption
Encryption
 
Privacy and libraries
Privacy and librariesPrivacy and libraries
Privacy and libraries
 
Paying for it
Paying for itPaying for it
Paying for it
 
Risk management and auditing
Risk management and auditingRisk management and auditing
Risk management and auditing
 
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
 
Preservation and institutional repositories for the digital arts and humanities
Preservation and institutional repositories for the digital arts and humanitiesPreservation and institutional repositories for the digital arts and humanities
Preservation and institutional repositories for the digital arts and humanities
 
Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?
 
MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archives
 
Library Linked Data
Library Linked DataLibrary Linked Data
Library Linked Data
 
FRBR and RDA
FRBR and RDAFRBR and RDA
FRBR and RDA
 
Research Data and Scholarly Communication
Research Data and Scholarly CommunicationResearch Data and Scholarly Communication
Research Data and Scholarly Communication
 
Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)
 
Manufacturing Serendipity
Manufacturing SerendipityManufacturing Serendipity
Manufacturing Serendipity
 
What We Organize
What We OrganizeWhat We Organize
What We Organize
 
Occupy Copyright!
Occupy Copyright!Occupy Copyright!
Occupy Copyright!
 
RDF, RDA, and other TLAs
RDF, RDA, and other TLAsRDF, RDA, and other TLAs
RDF, RDA, and other TLAs
 
I own copyright, so I pwn you!
I own copyright, so I pwn you!I own copyright, so I pwn you!
I own copyright, so I pwn you!
 
Librarians love data!
Librarians love data!Librarians love data!
Librarians love data!
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 

Grab a bucket! It's raining data!