SlideShare una empresa de Scribd logo
1 de 16
Descargar para leer sin conexión
Curation roles in theory and practice	
Mark A. Parsons
!
!
!
!
American Geophysical Union Fall Meeting
13 December 2013

Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
Outline
• Two Theories: Power of Metaphor and Generative Value
• An examination of roles with focus on “Curation” — a hard to define practice.
• Comparison of three publications defining curation-style roles as well as personal
experience.
• Suggestions for practice (and theory)
Theory: Power of metaphor
Language gets its power because it is defined relative to
frames, prototypes, metaphor, narratives, images and
emotions. Part of its power comes from unconscious
aspects: we are not consciously aware of all that it evokes
in us, but it is there, hidden, always at work. If we hear the
same language over and over, we will think more and more
in terms of the frames and metaphors activated by that
language.
	 	 —George Lakoff (2008)
as quoted in Parsons & Fox. 2013. Is data publication the right metaphor? Data Science
Journal 12.
Theory: Generative value of data
• Generative Value per Jonathan Zittrain (2008) as
interpreted and extended to data by John Wilbanks:
“the capacity to produce unanticipated change through
unfiltered contributions from broad and varied
audiences.” —J. Zittrain
• Data become more generative by being more adaptable,
more easily mastered, more accessible, and more
connected and influential.
• Not net present value but net potential value.
accessibility

adaptability

ease
of mastery

leverage

slide courtesy John Wilbanks 2013

Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
slide courtesy John Wilbanks 2013

Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
EASY TO USE
NO OPEN LICENSE

accessibility

adaptability

ease
of mastery

leverage

slide courtesy John Wilbanks 2013

Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
accessibility

NO OPEN LICENSE
DOWNLOAD AVAILABLE

adaptability

ease
of mastery

leverage

Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
slide courtesy John Wilbanks

2013
What is the Practice?
• What was my role at the National Snow and Ice Data Center?
• data manager
• data scientist
• information scientist
• informaticist
• data steward
• data curator
• project manager
• operations manager

• “In between” work
Curation
Oxford English Dictionary:
cuˈration, n.
Etymology:  Middle English, < Old French curacion , < Latin cūrātiōn-em ,
noun of action < cūrāre to cure v.
 1. The action of curing; healing, cure. (c1374—1677)
 2. Curatorship, guardianship. (1769—1774)

	 b. The supervision by a curator of a collection of preserved or exhibited
items

	 	 (1979—1986)
Parsons (for today’s purposes):
• to add generative value
Four perspectives on roles, i.e. practice
Lawrence B, et al.. 2011. Citation and peer review of data: Moving towards
formal data publication. International Journal of Digital Curation 6 (2).
• A data publication perspective
Baker KS and GC Bowker. 2007. Information ecology: Open system
environment for data, memories, and knowing. Journal of Intelligent
Information Systems 29 (1): 127-144.
• An ecological infrastructuring perspective
Schopf JM. 2012. Treating data like software: A case for production quality
data. Proceedings of the Joint Conference on Digital Libraries 11-14 June
2012, Washington DC.
• A production software perspective
My experience at NSIDC and how the center defined and redefined curation
roles over the years in response to evolving user needs and project mandates.
• A hybrid, ad hoc perspective
Controls in the process
• Who controls what? when?
• “gatekeepers” and “controllers”
• “assessors” and “analysers”
• “reviewers”
• “testing” by machine and human.
!

• Drivers at NSIDC
• reduce user enquiries, so as users have evolved the controls and their
associated roles have changed.
• funding requirements
• different types of data streams (e.g. NRT satellite data vs. indigenous
knowledge).
Controls and value
• Controls usually increase the generative value of the data.
• Focus is often on mastery.
• Controls can delay the release of data.
• For example, QC and documentation to avoid “misuse”
• Issues: When is this an excuse to hoard? When should the QC occur?
Shouldn’t it be continuous? Who can certify quality? The provider, user,
curator, peer (who’s that)?
• Controls can reduce the adaptability and connection of the data through
controlled access mechanisms or data models.
• There can be tensions between the different aspects of generativity.
• Flexibility in controls can help prioritize and handle growing demands and
sometimes conflicting requirements.
Some suggested practice
1. Consciously consider different perspectives and different terminologies
when designing data workflows.
a. Ask “how” questions as well as “what” questions of users and
providers.
2. Assess how control steps both augment and potentially inhibit attributes
of generative value.
3. Iterate controls in steps and optimize across multiple dimensions of value.
4. Allow multiple players (users, curators, providers, stakeholders) to
annotate, adapt, and update data.
5. Define and explain different “levels of service”.
6. 3 - 5 require well documented and referenced versioning.
Practice guiding theory
• How can we define and measure “objective” value of data (and associated
information including code) as opposed to “subjective” quality of data.
• How do different metaphors and models of data stewardship/publication/
production/presentation affect attitudes and actions of different
stakeholders in the data lifecycle.
An example of “Bricolage:”

creation from a diverse range of materials or sources.
Uncredited artwork from old aluminum cans at the
Spier Hotel in Stellenbosch, South Africa.

Mark A.Parsons
parsom3@rpi.edu

Más contenido relacionado

Destacado (6)

Arte
ArteArte
Arte
 
Adoración de los pastores
Adoración de los pastoresAdoración de los pastores
Adoración de los pastores
 
Is data publication the right metaphor?
Is data publication the right metaphor?Is data publication the right metaphor?
Is data publication the right metaphor?
 
Q4 2009 ppt_e_final
Q4 2009 ppt_e_finalQ4 2009 ppt_e_final
Q4 2009 ppt_e_final
 
Partes de la eucaristía
Partes de la eucaristíaPartes de la eucaristía
Partes de la eucaristía
 
Parsons scidatacon2016
Parsons scidatacon2016Parsons scidatacon2016
Parsons scidatacon2016
 

Similar a Curation roles in theory and practice

Curating for Value in Different Data Stewardship Paradigms
Curating for Value in Different Data Stewardship ParadigmsCurating for Value in Different Data Stewardship Paradigms
Curating for Value in Different Data Stewardship ParadigmsMark Parsons
 
Empirical user studies in Semantic Web contexts
Empirical user studies in Semantic Web contextsEmpirical user studies in Semantic Web contexts
Empirical user studies in Semantic Web contextsCatia Pesquita
 
Is data publication the right metaphor?
Is data publication the right metaphor?Is data publication the right metaphor?
Is data publication the right metaphor?Research Data Alliance
 
Software Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanSoftware Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanMicah Altman
 
RDA, Data Citation, and PIDs for DataOne
RDA, Data Citation, and PIDs for DataOneRDA, Data Citation, and PIDs for DataOne
RDA, Data Citation, and PIDs for DataOneResearch Data Alliance
 
Owning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsOwning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsRobert H. McDonald
 
Open Data is not Enough (final version)
Open Data is not Enough (final version)Open Data is not Enough (final version)
Open Data is not Enough (final version)Research Data Alliance
 
LIBRARY ASSESSMENT
LIBRARY ASSESSMENTLIBRARY ASSESSMENT
LIBRARY ASSESSMENTJen Rutner
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCarole Goble
 
Choosing What to Hold and What to Fold: Database Quality Decisions in Tough ...
Choosing What to Hold and What to Fold: Database Quality Decisions in Tough ...Choosing What to Hold and What to Fold: Database Quality Decisions in Tough ...
Choosing What to Hold and What to Fold: Database Quality Decisions in Tough ...tfons
 
Interfaces for User-Controlled and Transparent Recommendations
Interfaces for User-Controlled and Transparent RecommendationsInterfaces for User-Controlled and Transparent Recommendations
Interfaces for User-Controlled and Transparent RecommendationsPeter Brusilovsky
 
Uconn Coiro Assessment 2008
Uconn Coiro Assessment 2008Uconn Coiro Assessment 2008
Uconn Coiro Assessment 2008Julie Coiro
 
User experience at Imperial: a case study of qualitative approaches to Primo ...
User experience at Imperial: a case study of qualitative approaches to Primo ...User experience at Imperial: a case study of qualitative approaches to Primo ...
User experience at Imperial: a case study of qualitative approaches to Primo ...Andrew Preater
 
Impact the UX of Your Website with Contextual Inquiry
Impact the UX of Your Website with Contextual InquiryImpact the UX of Your Website with Contextual Inquiry
Impact the UX of Your Website with Contextual InquiryRachel Vacek
 
An RDM Guide for Researchers: Presentation at BIDS Reproducibility Working Group
An RDM Guide for Researchers: Presentation at BIDS Reproducibility Working GroupAn RDM Guide for Researchers: Presentation at BIDS Reproducibility Working Group
An RDM Guide for Researchers: Presentation at BIDS Reproducibility Working GroupJohn Borghi
 
ASA conference Feb 2013
ASA conference Feb 2013ASA conference Feb 2013
ASA conference Feb 2013mrkwr
 
RDAP 15: Research Data Integration in the Purdue Libraries
RDAP 15: Research Data Integration in the Purdue LibrariesRDAP 15: Research Data Integration in the Purdue Libraries
RDAP 15: Research Data Integration in the Purdue LibrariesASIS&T
 

Similar a Curation roles in theory and practice (20)

Curating for Value in Different Data Stewardship Paradigms
Curating for Value in Different Data Stewardship ParadigmsCurating for Value in Different Data Stewardship Paradigms
Curating for Value in Different Data Stewardship Paradigms
 
Empirical user studies in Semantic Web contexts
Empirical user studies in Semantic Web contextsEmpirical user studies in Semantic Web contexts
Empirical user studies in Semantic Web contexts
 
Is data publication the right metaphor?
Is data publication the right metaphor?Is data publication the right metaphor?
Is data publication the right metaphor?
 
Software Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanSoftware Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental Scan
 
RDA, Data Citation, and PIDs for DataOne
RDA, Data Citation, and PIDs for DataOneRDA, Data Citation, and PIDs for DataOne
RDA, Data Citation, and PIDs for DataOne
 
Owning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsOwning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your Patrons
 
Open Data is not Enough (final version)
Open Data is not Enough (final version)Open Data is not Enough (final version)
Open Data is not Enough (final version)
 
LIBRARY ASSESSMENT
LIBRARY ASSESSMENTLIBRARY ASSESSMENT
LIBRARY ASSESSMENT
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teams
 
Qs1 group a
Qs1 group a Qs1 group a
Qs1 group a
 
Choosing What to Hold and What to Fold: Database Quality Decisions in Tough ...
Choosing What to Hold and What to Fold: Database Quality Decisions in Tough ...Choosing What to Hold and What to Fold: Database Quality Decisions in Tough ...
Choosing What to Hold and What to Fold: Database Quality Decisions in Tough ...
 
WISE2019 presentation
WISE2019 presentationWISE2019 presentation
WISE2019 presentation
 
Interfaces for User-Controlled and Transparent Recommendations
Interfaces for User-Controlled and Transparent RecommendationsInterfaces for User-Controlled and Transparent Recommendations
Interfaces for User-Controlled and Transparent Recommendations
 
Uconn Coiro Assessment 2008
Uconn Coiro Assessment 2008Uconn Coiro Assessment 2008
Uconn Coiro Assessment 2008
 
User experience at Imperial: a case study of qualitative approaches to Primo ...
User experience at Imperial: a case study of qualitative approaches to Primo ...User experience at Imperial: a case study of qualitative approaches to Primo ...
User experience at Imperial: a case study of qualitative approaches to Primo ...
 
Impact the UX of Your Website with Contextual Inquiry
Impact the UX of Your Website with Contextual InquiryImpact the UX of Your Website with Contextual Inquiry
Impact the UX of Your Website with Contextual Inquiry
 
Alamw15 VIVO
Alamw15 VIVOAlamw15 VIVO
Alamw15 VIVO
 
An RDM Guide for Researchers: Presentation at BIDS Reproducibility Working Group
An RDM Guide for Researchers: Presentation at BIDS Reproducibility Working GroupAn RDM Guide for Researchers: Presentation at BIDS Reproducibility Working Group
An RDM Guide for Researchers: Presentation at BIDS Reproducibility Working Group
 
ASA conference Feb 2013
ASA conference Feb 2013ASA conference Feb 2013
ASA conference Feb 2013
 
RDAP 15: Research Data Integration in the Purdue Libraries
RDAP 15: Research Data Integration in the Purdue LibrariesRDAP 15: Research Data Integration in the Purdue Libraries
RDAP 15: Research Data Integration in the Purdue Libraries
 

Último

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Último (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Curation roles in theory and practice

  • 1. Curation roles in theory and practice Mark A. Parsons ! ! ! ! American Geophysical Union Fall Meeting 13 December 2013 Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
  • 2. Outline • Two Theories: Power of Metaphor and Generative Value • An examination of roles with focus on “Curation” — a hard to define practice. • Comparison of three publications defining curation-style roles as well as personal experience. • Suggestions for practice (and theory)
  • 3. Theory: Power of metaphor Language gets its power because it is defined relative to frames, prototypes, metaphor, narratives, images and emotions. Part of its power comes from unconscious aspects: we are not consciously aware of all that it evokes in us, but it is there, hidden, always at work. If we hear the same language over and over, we will think more and more in terms of the frames and metaphors activated by that language. —George Lakoff (2008) as quoted in Parsons & Fox. 2013. Is data publication the right metaphor? Data Science Journal 12.
  • 4. Theory: Generative value of data • Generative Value per Jonathan Zittrain (2008) as interpreted and extended to data by John Wilbanks: “the capacity to produce unanticipated change through unfiltered contributions from broad and varied audiences.” —J. Zittrain • Data become more generative by being more adaptable, more easily mastered, more accessible, and more connected and influential. • Not net present value but net potential value.
  • 5. accessibility adaptability ease of mastery leverage slide courtesy John Wilbanks 2013 Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
  • 6. slide courtesy John Wilbanks 2013 Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
  • 7. EASY TO USE NO OPEN LICENSE accessibility adaptability ease of mastery leverage slide courtesy John Wilbanks 2013 Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
  • 8. accessibility NO OPEN LICENSE DOWNLOAD AVAILABLE adaptability ease of mastery leverage Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License slide courtesy John Wilbanks 2013
  • 9. What is the Practice? • What was my role at the National Snow and Ice Data Center? • data manager • data scientist • information scientist • informaticist • data steward • data curator • project manager • operations manager • “In between” work
  • 10. Curation Oxford English Dictionary: cuˈration, n. Etymology:  Middle English, < Old French curacion , < Latin cūrātiōn-em , noun of action < cūrāre to cure v.  1. The action of curing; healing, cure. (c1374—1677)  2. Curatorship, guardianship. (1769—1774)
 b. The supervision by a curator of a collection of preserved or exhibited items
 (1979—1986) Parsons (for today’s purposes): • to add generative value
  • 11. Four perspectives on roles, i.e. practice Lawrence B, et al.. 2011. Citation and peer review of data: Moving towards formal data publication. International Journal of Digital Curation 6 (2). • A data publication perspective Baker KS and GC Bowker. 2007. Information ecology: Open system environment for data, memories, and knowing. Journal of Intelligent Information Systems 29 (1): 127-144. • An ecological infrastructuring perspective Schopf JM. 2012. Treating data like software: A case for production quality data. Proceedings of the Joint Conference on Digital Libraries 11-14 June 2012, Washington DC. • A production software perspective My experience at NSIDC and how the center defined and redefined curation roles over the years in response to evolving user needs and project mandates. • A hybrid, ad hoc perspective
  • 12. Controls in the process • Who controls what? when? • “gatekeepers” and “controllers” • “assessors” and “analysers” • “reviewers” • “testing” by machine and human. ! • Drivers at NSIDC • reduce user enquiries, so as users have evolved the controls and their associated roles have changed. • funding requirements • different types of data streams (e.g. NRT satellite data vs. indigenous knowledge).
  • 13. Controls and value • Controls usually increase the generative value of the data. • Focus is often on mastery. • Controls can delay the release of data. • For example, QC and documentation to avoid “misuse” • Issues: When is this an excuse to hoard? When should the QC occur? Shouldn’t it be continuous? Who can certify quality? The provider, user, curator, peer (who’s that)? • Controls can reduce the adaptability and connection of the data through controlled access mechanisms or data models. • There can be tensions between the different aspects of generativity. • Flexibility in controls can help prioritize and handle growing demands and sometimes conflicting requirements.
  • 14. Some suggested practice 1. Consciously consider different perspectives and different terminologies when designing data workflows. a. Ask “how” questions as well as “what” questions of users and providers. 2. Assess how control steps both augment and potentially inhibit attributes of generative value. 3. Iterate controls in steps and optimize across multiple dimensions of value. 4. Allow multiple players (users, curators, providers, stakeholders) to annotate, adapt, and update data. 5. Define and explain different “levels of service”. 6. 3 - 5 require well documented and referenced versioning.
  • 15. Practice guiding theory • How can we define and measure “objective” value of data (and associated information including code) as opposed to “subjective” quality of data. • How do different metaphors and models of data stewardship/publication/ production/presentation affect attitudes and actions of different stakeholders in the data lifecycle.
  • 16. An example of “Bricolage:”
 creation from a diverse range of materials or sources. Uncredited artwork from old aluminum cans at the Spier Hotel in Stellenbosch, South Africa. Mark A.Parsons parsom3@rpi.edu