Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
Open Data is Not Enough
Making Data Sharing Work
Mark A. Parsons

0000-0002-7723-0950

Secretary General 

RDA meets the Italian scientific community

Pisa, Italy

14 July 2015
The many representations (and conceptions) of sea ice
Evolution of sea ice data products at NSIDC, presented by Ruth Duerr
March 10, 2015, RDA 5th Plenary, San Diego
Figure courtesy Donna Scott
The generative value of data
• Generative value per Jonathan Zittrain (2008) as
interpreted and extended to data by John Wilbanks:
“the capacity to produce unanticipated change through
unfiltered contributions from broad and varied
audiences.” —J. Zittrain
• Data become more generative by being more adaptable,
more easily mastered, more accessible, and more
connected and influential.
• Not net present value but net potential value.
accessibility
adaptability
leverage
ease
of mastery
slide courtesy John Wilbanks 2013
To make open work we need
• Curation—increasing the (generative) value of the data
• Context—of both data and application of provider and user
• Trust—of data, information, organisations, institutions….
• Interfaces, connections, relationships, mediation—Bridges
• People
Research Data Alliance
Vision
Researchers and innovators openly share data across
technologies, disciplines, and countries to address the
grand challenges of society.
Mission
RDA builds the social and technical bridges that enable
open sharing of data.
Dynamics of Infrastructure
Edwards, et al. 2007 Understanding Infrastructure: Dynamics,
Tensions, and Design.	
• Infrastructures become “ubiquitous, accessible, reliable, and
transparent” as they mature. 

• Systems → Networks → Inter-networks

• “system-building, characterized by the deliberate and successful
design of technology-based services.” 

• “technology transfer across domains and locations results in
variations on the original design, as well as the emergence of
competing systems.”

• Finally, “a process of consolidation characterized by gateways
that allow dissimilar systems to be linked into networks.”
Not what, but
	 When is infrastructure?
Not what, but
	 When and
	 	 Who is infrastructure?
Bridges and
Gateways
Gateways are often wrongly
understood as “technologies,”
i.e. hardware or software
alone. A more accurate
approach conceives them as
combining a technical solution
with a social choice, i.e. a
standard, both of which must
be integrated into existing
users’ communities of
practice. Because of this,
gateways rarely perform
perfectly.

	 — Edwards et al. 2007
Infrastructure is
Relationships, interactions, and connections
	 between people, technologies, and institutions
Fran Berman, Research Data Alliance
“Create - Adopt - Use”
(in 12-18 months)
Systems Interoperability
Adopted Policy
Sustainable Economics
Common Types, Standards, Metadata
Adopted Community Practice
Training, Education, Workforce
Traffic image: Mike Gonzalez
Shared Principles
• Openness
• Consensus
• Balance
• Harmonization
• Community Driven
• Non-profit
Fran Berman, Research Data Alliance
RDA: Accelerate Data Sharing and Interoperability Across Cultures, Communities, Scales, Technologies
▪ Technical parts of the data engine:
  ▪ Data type registries reference model
  ▪ Wheat data interoperability framework
▪ Rules of the road:
  ▪ Common agreement on data citation
  ▪ Common practice for data repositories
  ▪ Principles of legal interoperability
▪ Better drivers
  • Summer schools in data science and cloud computing in the developing world (with CODATA)
  • Active data management plan development and monitoring
Policy and Practice
Systems Interoperability
Sustainable Economics
Common Types, Standards, Metadata
Training, Education, Workforce
• A basic vocabulary of foundational terminology and query tool to make sure we know what we’re
talking about.

• A data type model and registry (“MIME-types” for data) to help tools interpret, display, and process
data.

• A persistent identifier type registry to help search engines understand what they are pointing to and
retrieving.

• A basic set of machine-actionable rules to enhance trust.

• Coming soon: 

• A metadata standards directory so we can describe similar things consistently

• A dynamic-data citation methodology so we can reference precise subsets of changing data.

• Semantically linked terms describing wheat data so we can share harvest and related
information around the world

• Services and methods for finding data across multiple registries, to help cross-disciplinary and
multi-faceted discovery.

• A unified repository certification scheme to reduce confusion and improve trust.
Initial Products—adopt one today!
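As a rough illustration of the "MIME-types for data" idea above, the following Python sketch shows how a registry could let a tool look up how to interpret a payload from a type identifier. It is a minimal sketch under stated assumptions: the class names, the `example/csv-floats` identifier, and the parser are hypothetical and are not taken from the RDA Data Type Registries output.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class DataType:
    """One registered type. Field names are hypothetical, for illustration only."""
    type_id: str                        # made-up identifier for the type record
    description: str                    # human-readable summary
    parser: Callable[[bytes], object]   # how a tool could interpret the bytes


class TypeRegistry:
    """In-memory stand-in for a data type registry."""

    def __init__(self) -> None:
        self._types: Dict[str, DataType] = {}

    def register(self, data_type: DataType) -> None:
        # Refuse to silently overwrite an existing type definition.
        if data_type.type_id in self._types:
            raise ValueError(f"type already registered: {data_type.type_id}")
        self._types[data_type.type_id] = data_type

    def interpret(self, type_id: str, payload: bytes) -> object:
        # Look up the type, then apply its parser to the raw payload.
        try:
            data_type = self._types[type_id]
        except KeyError:
            raise LookupError(f"unknown data type: {type_id}") from None
        return data_type.parser(payload)


def parse_csv_floats(payload: bytes) -> List[float]:
    """Parse a single line of comma-separated numbers."""
    return [float(x) for x in payload.decode("utf-8").split(",") if x.strip()]


registry = TypeRegistry()
registry.register(
    DataType("example/csv-floats", "comma-separated floats", parse_csv_floats)
)
values = registry.interpret("example/csv-floats", b"1.5, 2.0, 3.25")
```

The point of the design is that the tool only needs the type identifier travelling with the data; everything about how to display or process the payload comes from the registry entry.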
RDA Working Groups
1. Brokering Governance
2. Data Citation WG
3. Data Description Registry Interoperability
4. Data Foundation and Terminology WG
5. Data Type Registries WG
6. Metadata Standards Directory Working Group
7. PID Information Types WG
8. Practical Policy WG
9. RDA/CODATA Summer Schools in Data Science and Cloud Computing in the Developing World
10. RDA/WDS Publishing Data Bibliometrics WG
11. RDA/WDS Publishing Data Services WG
12. RDA/WDS Publishing Data Workflows WG
13. Repository Audit and Certification DSA–WDS Partnership WG
14. Repository Platforms for Research Data*
15. The BioSharing Registry: connecting data policies, standards & databases in life sciences*
16. Wheat Data Interoperability WG
* in review
RDA Interest Groups
1. Agricultural Data Interoperability IG
2. Active Data Management Plans*
3. Big Data Analytics IG
4. Biodiversity Data Integration IG
5. Brokering IG
6. Community Capability Model IG
7. Data Fabric IG
8. Data for Development
9. Data Foundations and Terminology IG*
10. Data in Context IG
11. Development of cloud computing capacity and education in developing world research
12. Digital Practices in History and Ethnography IG
13. Domain Repositories Interest Group
14. Education and Training on handling of research data
15. ELIXIR Bridging Force IG
16. Engagement IG
17. Federated Identity Management
18. Geospatial IG
19. Libraries for Research Data
20. Long tail of research data IG
21. Marine Data Harmonization IG
22. Metabolomics
23. Metadata IG
24. PID Interest Group
25. Preservation e-Infrastructure IG
26. Quality of Urban Life Interest Group
27. RDA/CODATA Legal Interoperability IG
28. RDA/CODATA Materials Data, Infrastructure & Interoperability IG
29. RDA/WDS Certification of Digital Repositories IG
30. RDA/WDS Publishing Data Cost Recovery for Data Centres
31. RDA/WDS Publishing Data IG
32. Reproducibility IG
33. Research data needs of the Photon and Neutron Science community
34. Research Data Provenance
35. Service Management IG
36. Structural Biology IG
37. Toxicogenomics Interoperability IG
* in review
The Research Data Alliance Community Today
Total RDA Community Members: 3029
from 103 countries
RDA Organisational Members and Affiliates
Some themes amidst the difference
1. Persistent Identifiers for data, documents, people,
organisations, instruments—Everything!

2. Certifying Trust in assertions, evidence, organisations,
processes… 

3. The value of Conversations, Relationships, and Mediation
— an agile network effect.
An Area of Convergence and Agreement
Internet Domain
nodes with IP numbers
packages being exchanged
standardized protocols
Data Domain
objects with PID numbers
objects being exchanged
standardized protocols
Slide courtesy P. Wittenburg from L. Lannom from D. Clark
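The analogy between IP-addressed nodes and PID-addressed objects can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `PidResolver` class, the record fields, the PID string, and the URLs are all hypothetical, and the sketch does not reproduce the API of any real identifier system.

```python
from dataclasses import dataclass, replace
from typing import Dict


@dataclass(frozen=True)
class PidRecord:
    """What a PID resolves to. Fields are hypothetical, for illustration only."""
    location: str   # where the object currently lives
    type_id: str    # what kind of object it is
    checksum: str   # lets a client verify what it retrieved


class PidResolver:
    """In-memory stand-in for a resolution service."""

    def __init__(self) -> None:
        self._records: Dict[str, PidRecord] = {}

    def mint(self, pid: str, record: PidRecord) -> None:
        # A PID is assigned once and never reused for another object.
        if pid in self._records:
            raise ValueError(f"PID already exists: {pid}")
        self._records[pid] = record

    def move(self, pid: str, new_location: str) -> None:
        # The object moves; the identifier stays the same.
        self._records[pid] = replace(self._records[pid], location=new_location)

    def resolve(self, pid: str) -> PidRecord:
        try:
            return self._records[pid]
        except KeyError:
            raise LookupError(f"unknown PID: {pid}") from None


resolver = PidResolver()
pid = "example-prefix/object-42"  # made-up identifier
resolver.mint(
    pid,
    PidRecord("https://old.example.org/objects/42", "example/csv-floats", "abc123"),
)
resolver.move(pid, "https://new.example.org/objects/42")
record = resolver.resolve(pid)
```

Citations and links that use the PID keep working after the move, much as a host name keeps working when the machine behind it changes address.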
Increasing Complexity of
Mediation
From: C. Borgman, 2008, NSF
Cyberlearning Report
An Agile Manifesto for Organisations
(courtesy Bruce Caron)
We value:

• Individuals and interactions over processes and tools

• Working volunteers over comprehensive documentation

• Member collaboration over contract negotiation

• Responding to change over following a plan.
Working Groups Clusters
[Figure: RDA Working Groups plotted in four quadrants (Q1 to Q4). Solutions dimension: Social to Technical. Beneficiary dimension: Data providers to Data consumers.]
Groups shown in the figure:
• RDA/WDS Publishing Data Bibliometrics
• Data Citation
• Data Foundation and Terminology
• Repository Audit and Certification DSA–WDS Partnership
• Brokering Governance
• PID Information Types
• Data Type Registries
• RDA/WDS Publishing Data Workflows
• RDA/CODATA Summer Schools in Data Science and Cloud Computing in the Developing World
• Metadata Standards Directory
• Practical Policy
• The BioSharing Registry
• Wheat Data Interoperability
• RDA/WDS Publishing Data Services
• Data Description Registry Interoperability
• Standardisation of Data Categories and Codes
The “Data Fabric”
[Figure: RDA groups positioned on the data fabric, built up over six slides. Labels as extracted: CITDD, PROV BROK, CERT, CERTBDA, REP, REPRO, DMP, DOM, FIM, PP]
Slide courtesy Peter Wittenburg
WG/IG Tags and Titles
• Only words that occur twice or more

• ‘Data’ and common words removed

• Only half the Groups have tags
Regional RDAs
• Australian National Data Service, RDA/United States, RDA/Europe,

• Implement RDA deliverables locally and enhance adoption.

• Ensure regional or national issues are addressed globally.

• Support plenaries and support attendance at plenaries.
Initial Impact
• Data are having their day! RDA is both cause and effect.
• Collaborative value
• Accelerating harmonization—the citation story.
• Discovering shared themes—PIDs, the “platform” story
• New insight from rethinking old paradigms—roadmaps, architectures, and lifecycles are
passé. Reuse, agility and bridging are hip.
• Real deliverables in 2 years.
• Learning fast through openness and 18-month delivery cycles.
• demonstration of delivery from outputs book
• Money
• funding success is advancing the field
• measurable return on investment
• http://datastories.jiscinvolve.org/wp/
Preservation of LHC data
100 PB growing to ~5 EB for decades
Veni … to RDA plenaries, WG & IG meetings
Vidi … how other disciplines attacked the problem
Vici … developed, refined and now implementing a strategy, including cost model and business case
➔ A better solution, more sustainable and advanced by years vs. “go it alone”
slide courtesy Jamie Shiers, CERN
Fran Berman, Research Data Alliance
Next Steps for RDA: Stay Pragmatic, Focus on Impact
• Next Plenaries (Plenaries are both community and working meetings. Meetings held twice yearly around the world.):
  – September, 2015: Paris, France (P6)
  – March, 2016: Tokyo, Japan (P7)
  – September, 2016: ~ Washington, DC (P8)
  – March, 2017: Barcelona, Spain (P9)
Continuing pipeline of infrastructure deliverables adopted, used, coordinated and amplified to accelerate data sharing
Increasing coordination and collaboration between domains, sectors, organizations, communities.
Effective advocacy for national and international data issues and communities.
Stronger partnerships with industry, governments, domains, organizations.
Substantive engagement of students and early career professionals, greater spectrum of international cultures.
More Infrastructure
Impact-focused
Outreach
More effective
Community
Joining RDA:
Go to rd-alliance.org and register
• Must agree to RDA principles (openness, community-driven, etc.)
• Free for individuals
Plenary 6
and Data Challenge!
CNAM, Paris, France

23 - 25 September 2015
Info:

enquiries@rd-alliance.org
@resdatall



Funders Forum
Interest Groups: domain coordination, idea generation, maintenance, …
RDA Membership
Working Groups: implementable, impactful outcomes
Council: organisational vision and strategy
Technical Advisory Board: socio-technical vision and strategy
Secretariat: administration and operations
Organisational Advisory Board: needs, adoption, business advice
RDA Foundation

Más contenido relacionado

La actualidad más candente

Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Sandra Gesing
 
2009-C&T-NodeXL and social queries - a social media network analysis toolkit
2009-C&T-NodeXL and social queries - a social media network analysis toolkit2009-C&T-NodeXL and social queries - a social media network analysis toolkit
2009-C&T-NodeXL and social queries - a social media network analysis toolkit
Marc Smith
 
Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...
Research Data Alliance
 
Network effectiveness presentation materials
Network effectiveness presentation materialsNetwork effectiveness presentation materials
Network effectiveness presentation materials
guestb12b087
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
SEAD
 
Net Effectiveness Oct 6
Net Effectiveness Oct 6Net Effectiveness Oct 6
Net Effectiveness Oct 6
dianascearce
 

La actualidad más candente (20)

Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017
 
Data Commons Garvan - 2016
Data Commons Garvan -  2016 Data Commons Garvan -  2016
Data Commons Garvan - 2016
 
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
2009-C&T-NodeXL and social queries - a social media network analysis toolkit
2009-C&T-NodeXL and social queries - a social media network analysis toolkit2009-C&T-NodeXL and social queries - a social media network analysis toolkit
2009-C&T-NodeXL and social queries - a social media network analysis toolkit
 
Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...Learning from past infrastructure to embrace friction and create the Research...
Learning from past infrastructure to embrace friction and create the Research...
 
The Key Success Factor in Knowledge Management... What Else? Change Management
The Key Success Factor in Knowledge Management... What Else? Change ManagementThe Key Success Factor in Knowledge Management... What Else? Change Management
The Key Success Factor in Knowledge Management... What Else? Change Management
 
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
 
Network effectiveness presentation materials
Network effectiveness presentation materialsNetwork effectiveness presentation materials
Network effectiveness presentation materials
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 
How can the cultural heritage community best meet the challenges of email arc...
How can the cultural heritage community best meet the challenges of email arc...How can the cultural heritage community best meet the challenges of email arc...
How can the cultural heritage community best meet the challenges of email arc...
 
Net Effectiveness Oct 6
Net Effectiveness Oct 6Net Effectiveness Oct 6
Net Effectiveness Oct 6
 
Transcript - DOIs to support citation of grey literature
Transcript - DOIs to support citation of grey literatureTranscript - DOIs to support citation of grey literature
Transcript - DOIs to support citation of grey literature
 
Linked Data Approach for Integration of Human Health & Environmental Data
Linked Data Approach for Integration of Human Health & Environmental DataLinked Data Approach for Integration of Human Health & Environmental Data
Linked Data Approach for Integration of Human Health & Environmental Data
 
Net Effectiveness April7
Net Effectiveness April7Net Effectiveness April7
Net Effectiveness April7
 
The current challenges of upgrading the infrastructure
The current challenges of upgrading the infrastructureThe current challenges of upgrading the infrastructure
The current challenges of upgrading the infrastructure
 
NetHope World Economic Forum - Data Driven Development
NetHope World Economic Forum - Data Driven DevelopmentNetHope World Economic Forum - Data Driven Development
NetHope World Economic Forum - Data Driven Development
 
Repository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRepository Federation: Towards Data Interoperability
Repository Federation: Towards Data Interoperability
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
 
Government Linked Data Projects in the Wild
Government Linked Data Projects in the WildGovernment Linked Data Projects in the Wild
Government Linked Data Projects in the Wild
 

Destacado

Search Engine Powerpoint
Search Engine PowerpointSearch Engine Powerpoint
Search Engine Powerpoint
201014161
 

Destacado (14)

IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...
IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...
IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...
 
Data Shapes and Data Transformations
Data Shapes and Data TransformationsData Shapes and Data Transformations
Data Shapes and Data Transformations
 
Symbology mediation
Symbology mediationSymbology mediation
Symbology mediation
 
Spark Study Notes
Spark Study NotesSpark Study Notes
Spark Study Notes
 
Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management
 
Maj Gen R C Padhi
Maj Gen R C PadhiMaj Gen R C Padhi
Maj Gen R C Padhi
 
Canterbury Plugfest: Geospatial Interoperability Works!
Canterbury Plugfest: Geospatial Interoperability Works!Canterbury Plugfest: Geospatial Interoperability Works!
Canterbury Plugfest: Geospatial Interoperability Works!
 
Core Geospatial Ontologies
Core Geospatial OntologiesCore Geospatial Ontologies
Core Geospatial Ontologies
 
Testbed-12 Semantic Portrayal, Registry and Mediation Engineering Report Pr...
Testbed-12  Semantic Portrayal, Registry and Mediation  Engineering Report Pr...Testbed-12  Semantic Portrayal, Registry and Mediation  Engineering Report Pr...
Testbed-12 Semantic Portrayal, Registry and Mediation Engineering Report Pr...
 
Interoperability & standards
Interoperability & standardsInteroperability & standards
Interoperability & standards
 
Ontology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and moreOntology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and more
 
Ncar globally accessible user environment
Ncar globally accessible user environmentNcar globally accessible user environment
Ncar globally accessible user environment
 
Data Integration Ontology Mapping
Data Integration Ontology MappingData Integration Ontology Mapping
Data Integration Ontology Mapping
 
Search Engine Powerpoint
Search Engine PowerpointSearch Engine Powerpoint
Search Engine Powerpoint
 

Similar a Open Data is not Enough (final version)

Going Glocal—Polar Data in a Global Infrastructure
Going Glocal—Polar Data in a Global InfrastructureGoing Glocal—Polar Data in a Global Infrastructure
Going Glocal—Polar Data in a Global Infrastructure
Research Data Alliance
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
Kristi Holmes
 

Similar a Open Data is not Enough (final version) (20)

The Research Data Alliance--Creating the culture and technology for an intern...
The Research Data Alliance--Creating the culture and technology for an intern...The Research Data Alliance--Creating the culture and technology for an intern...
The Research Data Alliance--Creating the culture and technology for an intern...
 
Keynote: Mark Parsons - Plans are Useless, But Planning is Essential
Keynote: Mark Parsons - Plans are Useless, But Planning is EssentialKeynote: Mark Parsons - Plans are Useless, But Planning is Essential
Keynote: Mark Parsons - Plans are Useless, But Planning is Essential
 
Infrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAInfrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDA
 
RDA, Data Citation, and PIDs for DataOne
RDA, Data Citation, and PIDs for DataOneRDA, Data Citation, and PIDs for DataOne
RDA, Data Citation, and PIDs for DataOne
 
RDA Update
RDA UpdateRDA Update
RDA Update
 
RDA Presentation to the International Federation of Library Associations
RDA Presentation to the International Federation of Library AssociationsRDA Presentation to the International Federation of Library Associations
RDA Presentation to the International Federation of Library Associations
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
 
Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018
 
Going Glocal—Polar Data in a Global Infrastructure
Going Glocal—Polar Data in a Global InfrastructureGoing Glocal—Polar Data in a Global Infrastructure
Going Glocal—Polar Data in a Global Infrastructure
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
 
My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
 
FAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDA
 
Research into Practice case study 2: Library linked data implementations an...
	Research into Practice case study 2:  Library linked data implementations an...	Research into Practice case study 2:  Library linked data implementations an...
Research into Practice case study 2: Library linked data implementations an...
 
Sentara Linked Data Workshop - Sept 10, 2012
Sentara Linked Data Workshop - Sept 10, 2012Sentara Linked Data Workshop - Sept 10, 2012
Sentara Linked Data Workshop - Sept 10, 2012
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Report from RDAPlenary 3 to DataCitation Community in Australia
Report from RDAPlenary 3 to DataCitation Community in AustraliaReport from RDAPlenary 3 to DataCitation Community in Australia
Report from RDAPlenary 3 to DataCitation Community in Australia
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 

Más de Research Data Alliance

Más de Research Data Alliance (20)

RDA in a Nutshell - September 2020
RDA in a Nutshell - September 2020RDA in a Nutshell - September 2020
RDA in a Nutshell - September 2020
 
RDA in a Nutshell - August 2020
RDA in a Nutshell - August 2020RDA in a Nutshell - August 2020
RDA in a Nutshell - August 2020
 
RDA in a Nutshell - July 2020
RDA in a Nutshell - July 2020RDA in a Nutshell - July 2020
RDA in a Nutshell - July 2020
 
RDA in a Nutshell - June 2020
RDA in a Nutshell - June 2020RDA in a Nutshell - June 2020
RDA in a Nutshell - June 2020
 
RDA in a Nutshell - May 2020
RDA in a Nutshell - May 2020RDA in a Nutshell - May 2020
RDA in a Nutshell - May 2020
 
RDA in a Nutshell - April 2020
RDA in a Nutshell - April 2020RDA in a Nutshell - April 2020
RDA in a Nutshell - April 2020
 
RDA in a Nutshell - March 2020
RDA in a Nutshell - March 2020RDA in a Nutshell - March 2020
RDA in a Nutshell - March 2020
 
RDA in a Nutshell - February 2020
RDA in a Nutshell - February 2020RDA in a Nutshell - February 2020
RDA in a Nutshell - February 2020
 
RDA in a Nutshell - January 2020
RDA in a Nutshell - January 2020RDA in a Nutshell - January 2020
RDA in a Nutshell - January 2020
 
Rda in a Nutshell - December 2019
Rda in a Nutshell - December 2019Rda in a Nutshell - December 2019
Rda in a Nutshell - December 2019
 
Rda in a Nutshell - November 2019
Rda in a Nutshell - November 2019Rda in a Nutshell - November 2019
Rda in a Nutshell - November 2019
 
RDA in a Nutshell - October 2019
RDA in a Nutshell - October 2019RDA in a Nutshell - October 2019
RDA in a Nutshell - October 2019
 
The Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to IndividualsThe Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to Individuals
 
The Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to IndividualsThe Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to Individuals
 
RDA Value for Infrastructure Providers
RDA Value for Infrastructure ProvidersRDA Value for Infrastructure Providers
RDA Value for Infrastructure Providers
 
Rda in a nutshell september 2019
Rda in a nutshell september 2019Rda in a nutshell september 2019
Rda in a nutshell september 2019
 
The Value of the Rda Value for Organisations Performing Research
The Value of the Rda Value for Organisations Performing ResearchThe Value of the Rda Value for Organisations Performing Research
The Value of the Rda Value for Organisations Performing Research
 
RDA Value for Libraries
RDA Value for LibrariesRDA Value for Libraries
RDA Value for Libraries
 
The Value of the RDA for Funders
The Value of the RDA for FundersThe Value of the RDA for Funders
The Value of the RDA for Funders
 
Rda in a nutshell august 2019
Rda in a nutshell august 2019Rda in a nutshell august 2019
Rda in a nutshell august 2019
 

Open Data is not Enough (final version)

  • 1. Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License Open Data is Not Enough Making Data Sharing Work Mark A. Parsons 0000-0002-7723-0950 Secretary General RDA meets the Italian scientific community Pisa, Italy 14 July 2015
  • 2.
  • 3.
  • 4.
  • 5. The many representations (and conceptions) of sea ice Evolution of sea ice data products at NSIDC, presented by Ruth Duerr March 10, 2015, RDA 5th Plenary, San Diego Figure courtesy Donna Scott
  • 6. The generative value of data • Generative value per Jonathan Zittrain (2008) as interpreted and extended to data by John Wilbanks: “the capacity to produce unanticipated change through unfiltered contributions from broad and varied audiences.” —J. Zittrain • Data become more generative by being more adaptable, more easily mastered, more accessible, and more connected and influential. • Not net present value but net potential value.
  • 7. Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License accessibility adaptability leverage ease of mastery slide courtesy John Wilbanks 2013
  • 8. To make open work we need • Curation—increasing the (generative) value of the data • Context—of both data and application of provider and user • Trust—of data, information, organisations, institutions…. • Interfaces, connections, relationships, mediation—Bridges • People
  • 9. Research Data Alliance Vision Researchers and innovators openly share data across technologies, disciplines, and countries to address the grand challenges of society. Mission RDA builds the social and technical bridges that enable open sharing of data.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14. Dynamics of Infrastructure Edwards, et al. 2007 Understanding Infrastructure: Dynamics, Tensions, and Design. • Infrastructures become “ubiquitous, accessible, reliable, and transparent” as they mature. • Systems Networks Inter-networks • “system-building, characterized by the deliberate and successful design of technology-based services.” • “technology transfer across domains and locations results in variations on the original design, as well as the emergence of competing systems.” • Finally, “a process of consolidation characterized by gateways that allow dissimilar systems to be linked into networks.”
  • 15. Not what, but When is infrastructure?
  • 16. Not what, but When and Who is infrastructure?
  • 17. Bridges and Gateways Gateways are often wrongly understood as “technologies,” i.e. hardware or software alone. A more accurate approach conceives them as combining a technical solution with a social choice, i.e. a standard, both of which must be integrated into existing users’ communities of practice. Because of this, gateways rarely perform perfectly.
 — Edwards et al. 2007
  • 18. Infrastructure is Relationships, interactions, and connections between people, technologies, and institutions
  • 19. Fran Berman, Research Data Alliance “Create - Adopt - Use” (in 12-18 months) • Systems Interoperability • Adopted Policy • Sustainable Economics • Common Types, Standards, Metadata • Adopted Community Practice • Training, Education, Workforce • Traffic image: Mike Gonzalez
  • 20. Shared Principles • Openness • Consensus • Balance • Harmonization • Community Driven • Non-profit
  • 21. Fran Berman, Research Data Alliance RDA: Accelerate Data Sharing and Interoperability Across Cultures, Communities, Scales, Technologies • Technical parts of the data engine: Data type registries reference model; Wheat data interoperability framework • Rules of the road: Common agreement on data citation; Common practice for data repositories; Principles of legal interoperability • Better drivers: Summer schools in data science and cloud computing in the developing world (with CODATA); Active data management plan development and monitoring • Policy and Practice • Systems Interoperability • Sustainable Economics • Common Types, Standards, Metadata • Training, Education, Workforce
  • 22. Initial Products: adopt one today! • A basic vocabulary of foundational terminology and query tool to make sure we know what we’re talking about. • A data type model and registry (“MIME-types” for data) to help tools interpret, display, and process data. • A persistent identifier type registry to help search engines understand what they are pointing to and retrieving. • A basic set of machine-actionable rules to enhance trust. • Coming soon: • A metadata standards directory so we can describe similar things consistently. • A dynamic-data citation methodology so we can reference precise subsets of changing data. • Semantically linked terms describing wheat data so we can share harvest and related information around the world. • Services and methods for finding data across multiple registries, to help cross-disciplinary and multi-faceted discovery. • A unified repository certification scheme to reduce confusion and improve trust.
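The “MIME-types for data” idea on this slide can be pictured with a toy example. The following is only a minimal sketch, assuming a registry that maps a type identifier to a description and a parser; the class names, the identifier `example/sea-ice-extent-csv`, and the parser are all hypothetical and are not taken from the RDA Data Type Registries output.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class DataType:
    """One registered data type (hypothetical record layout for this sketch)."""
    type_id: str                       # made-up identifier, not a real registry entry
    description: str                   # human-readable meaning of the type
    parser: Callable[[bytes], object]  # how a tool should interpret the raw bytes


class TypeRegistry:
    """In-memory stand-in for a shared data type registry."""

    def __init__(self) -> None:
        self._types: Dict[str, DataType] = {}

    def register(self, data_type: DataType) -> None:
        # Refuse duplicates so one identifier always means one thing.
        if data_type.type_id in self._types:
            raise ValueError(f"type already registered: {data_type.type_id}")
        self._types[data_type.type_id] = data_type

    def interpret(self, type_id: str, payload: bytes) -> object:
        # Look up the declared type, then apply its parser to the payload.
        try:
            data_type = self._types[type_id]
        except KeyError:
            raise LookupError(f"unknown data type: {type_id}") from None
        return data_type.parser(payload)


def parse_csv_floats(payload: bytes) -> List[float]:
    """Parse comma-separated numbers, ignoring empty fields."""
    return [float(x) for x in payload.decode("utf-8").split(",") if x.strip()]


registry = TypeRegistry()
registry.register(
    DataType(
        type_id="example/sea-ice-extent-csv",
        description="Comma-separated sea ice extent values (illustrative only)",
        parser=parse_csv_floats,
    )
)

# A tool that knows only the type identifier can now process the payload.
values = registry.interpret("example/sea-ice-extent-csv", b"14.2, 13.9,12.5")
```

The design point is the same one the slide makes: the data carry a type identifier, and the registry, not each individual tool, holds the knowledge of how to interpret that type.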
  • 23. RDA Working Groups 1. Brokering Governance 2. Data Citation WG 3. Data Description Registry Interoperability 4. Data Foundation and Terminology WG 5. Data Type Registries WG 6. Metadata Standards Directory Working Group 7. PID Information Types WG 8. Practical Policy WG 9. RDA/CODATA Summer Schools in Data Science and Cloud Computing in the Developing World 10. RDA/WDS Publishing Data Bibliometrics WG 11. RDA/WDS Publishing Data Services WG 12. RDA/WDS Publishing Data Workflows WG 13. Repository Audit and Certification DSA–WDS Partnership WG 14. Repository Platforms for Research Data* 15. The BioSharing Registry: connecting data policies, standards & databases in life sciences* 16. Wheat Data Interoperability WG * in review
  • 24. RDA Interest Groups 1. Agricultural Data Interoperability IG 2. Active Data Management Plans* 3. Big Data Analytics IG 4. Biodiversity Data Integration IG 5. Brokering IG 6. Community Capability Model IG 7. Data Fabric IG 8. Data for Development 9. Data Foundations and Terminology IG* 10. Data in Context IG 11. Development of cloud computing capacity and education in developing world research 12. Digital Practices in History and Ethnography IG 13. Domain Repositories Interest Group 14. Education and Training on handling of research data 15. ELIXIR Bridging Force IG 16. Engagement IG 17. Federated Identity Management 18. Geospatial IG 19. Libraries for Research Data 20. Long tail of research data IG 21. Marine Data Harmonization IG 22. Metabolomics 23. Metadata IG 24. PID Interest Group 25. Preservation e-Infrastructure IG 26. Quality of Urban Life Interest Group 27. RDA/CODATA Legal Interoperability IG 28. RDA/CODATA Materials Data, Infrastructure & Interoperability IG 29. RDA/WDS Certification of Digital Repositories IG 30. RDA/WDS Publishing Data Cost Recovery for Data Centres 31. RDA/WDS Publishing Data IG 32. Reproducibility IG 33. Research data needs of the Photon and Neutron Science community 34. Research Data Provenance 35. Service Management IG 36. Structural Biology IG 37. Toxicogenomics Interoperability IG * in review
  • 25. The Research Data Alliance Community Today Total RDA Community Members: 3029 from 103 countries
  • 27. Some themes amidst the difference 1. Persistent Identifiers for data, documents, people, organisations, instruments—Everything! 2. Certifying Trust in assertions, evidence, organisations, processes… 3. The value of Conversations, Relationships, and Mediation — an agile network effect.
  • 28. An Area of Convergence and Agreement • Internet Domain: nodes with IP numbers; packages being exchanged; standardized protocols • Slide courtesy P. Wittenburg from L. Lannom from D. Clark
  • 29. An Area of Convergence and Agreement • Internet Domain: nodes with IP numbers; packages being exchanged; standardized protocols • Data Domain: objects with PID numbers; objects being exchanged; standardized protocols • Slide courtesy P. Wittenburg from L. Lannom from D. Clark
  • 30. Some themes amidst the difference 1. Persistent Identifiers for data, documents, people, organisations, instruments—Everything! 2. Certifying Trust in assertions, evidence, organisations, processes… 3. The value of Conversations, Relationships, and Mediation — an agile network effect.
  • 31. Increasing Complexity of Mediation From: C. Borgman, 2008, NSF Cyberlearning Report
  • 32. An Agile Manifesto for Organisations (courtesy Bruce Caron) We value: • Individuals and interactions over processes and tools • Working volunteers over comprehensive documentation • Member collaboration over contract negotiation • Responding to change over following a plan.
  • 33. Working Groups Clusters (quadrant figure, Q1 to Q4) • Solutions dimension: Social, Technical • Beneficiary dimension: Data providers, Data consumers • Groups plotted: RDA/WDS Publishing Data Bibliometrics; Data Citation; Data Foundation and Terminology; Repository Audit and Certification DSA–WDS Partnership; Brokering Governance; PID Information Types; Data Type Registries; RDA/WDS Publishing Data Workflows; RDA/CODATA Summer Schools in Data Science and Cloud Computing in the Developing World; Metadata Standards Directory; Practical Policy; The BioSharing Registry; Wheat Data Interoperability; RDA/WDS Publishing Data Services; Data Description Registry Interoperability; Standardisation of Data Categories and Codes
  • 34. The “Data Fabric” Slide courtesy Peter Wittenburg
  • 35. The “Data Fabric” DMP Slide courtesy Peter Wittenburg
  • 38. The “Data Fabric” CIT DD PROV BROK CERT CERT BDA REP REPRO DMP DOM PP Slide courtesy Peter Wittenburg
  • 39. The “Data Fabric” CIT DD PROV BROK CERT CERT BDA REP REPRO DMP DOM FIM PP Slide courtesy Peter Wittenburg
  • 40. WG/IG Tags and Titles • Only words that occur twice or more • ‘Data’ and common words removed • Only half the Groups have tags
  • 41. Regional RDAs • Australian National Data Service, RDA/United States, RDA/Europe, • Implement RDA deliverables locally and enhance adoption. • Ensure regional or national issues are addressed globally. • Support plenaries and support attendance at plenaries.
  • 42. Initial Impact • Data are having their day! RDA is both cause and effect. • Collaborative value • Accelerating harmonization—the citation story. • Discovering shared themes—PIDs, the “platform” story • New insight from rethinking old paradigms—roadmaps, architectures, and lifecycles are passé. Reuse, agility and bridging are hip. • Real deliverables in 2 years. • Learning fast through openness and 18 months. • demonstration of delivery from outputs book • Money • funding success is advancing the field • measurable return on investment • http://datastories.jiscinvolve.org/wp/
  • 43. Preservation of LHC data: 100PB growing to ~5EB for decades • Veni … to RDA plenaries, WG & IG meetings • Vidi … how other disciplines attacked the problem • Vici … developed, refined and now implementing a strategy, including cost model and business case ➔ A better solution, more sustainable and advanced by years vs. “go it alone” • Slide courtesy Jamie Shiers, CERN
  • 44. Fran Berman, Research Data Alliance Next Steps for RDA: Stay Pragmatic, Focus on Impact • Next Plenaries (Plenaries are both community and working meetings. Meetings held twice yearly around the world.): – September, 2015: Paris, France (P6) – March, 2016: Tokyo, Japan (P7) – September, 2016: ~ Washington, DC (P8) – March, 2017: Barcelona, Spain (P9) • Continuing pipeline of infrastructure deliverables adopted, used, coordinated and amplified to accelerate data sharing • Increasing coordination and collaboration between domains, sectors, organizations, communities. • Effective advocacy for national and international data issues and communities. • Stronger partnerships with industry, governments, domains, organizations. • Substantive engagement of students and early career professionals, greater spectrum of international cultures. • More Infrastructure • Impact-focused Outreach • More effective Community • Joining RDA: Go to rd-alliance.org and register • Must agree to RDA principles (openness, community-driven, etc.) • Free for individuals
  • 45. Plenary 6 and Data Challenge! CNAM, Paris, France 23 - 25 September 2015
  • 47. RDA organisational structure (diagram) • Funders Forum • Interest Groups: domain coordination, idea generation, maintenance, … • RDA Membership • Working Groups: implementable, impactful outcomes • Council: organisational vision and strategy • Technical Advisory Board: socio-technical vision and strategy • Secretariat: administration and operations • Organisational Advisory Board: needs, adoption, business advice • RDA Foundation