SlideShare una empresa de Scribd logo
1 de 37
Sharing Data-Rich Research
Through Repository Layering
Stephen Abrams
California Digital Library
Angela Rizk-Jackson
Julia Kochi
University of California, San Francisco
Noah Wittman
University of California, Berkeley
Why is data curation important?
 Accelerating scientific progress
 Enabling appropriate scrutiny and verification of results
 Promoting integrity and debate
 Facilitating new collaborations
 Avoiding needless duplication of effort
 Increasingly, complying with institutional policies, publication
requirements, and funder mandates
Cf. White and Teds (2011), “Making the case for research data management” DCC briefing paper,
www.dcc.ac.uk/resources/briefing-papers/making-case-rdm
The library’s role
 A continuation of its long-standing mission and practice to
connect patrons with content of interest in meaningful ways
across barriers of space and time
Cf. Tenopir et al. (2012), “Academic librarians and research data services: Preparation and attitudes,” 78th
IFLA General Conference and Assembly, Helsinki, conference.ifla.org/past/ifla78/116-tenopir-en.pdf
 Offering solutions that enhance the natural points of
alignment between the scholarly research and information
lifecycles
Publish
Reuse
ShareCreate
Discover
Collect
PreserveAccessResearchResearch CurationCuration
Scholarly lifecycle Information lifecycle
Merritt
 Curation repository available to the UC community and
external partners
 Preservation and access
 Content agnostic, model free
 Highly decentralized micro-services architecture
Cf. Abrams, Cruse, Kunze, and Minor (2011), “Curation micro-services: A pipeline metaphor for
repositories,” Journal of Digital Information 12(2), journals.tdl.org/jodi/article/view/1605
 26 curatorial units
 271 collections
 325,000 objects
 450,000 versions
 4,500,000 files
 13 TB
www.cdlib.org/uc3/merritt
merritt.cdlib.org
Merritt
Storage node
Storage
broker
Inventory
ONEShare UNM
storage node
Storage node
UI/API
UI/API
UI/API
LDAP
LDAP
LDAP
RDBMS
Fixity
User
agent
Message
queue
RDBMS
Load
balancer
Ingest
Load
balancer
Ingest
Ingest
EZID
No-SQL
DataCite
…
DataONE
member node
RDBMS
RDBMS
DataONE
coord’ing node
…
IDF
Load
balancer
Web of
Knowledge
Primo
SAN
SDSC
cloud
(Some) issues to address
 Scale
 Individual objects ranging from 0 to 47,000 files
 Individual files ranging from 0 to 14 GB
 Maintaining control
 Concern over potential loss of control over dissemination and
use of data
 User experience
 Switch from organizational to individual interaction
www.flickr.com/photos/vixon/116447718www.flickr.com/photos/traftery/4319529821www.flickr.com/photos/32195273@N05/51076852642
(Some) issues to address
 Scale
 Individual objects ranging from 0 to 47,000 files
 Individual files ranging from 0 to 14 GB
 Maintaining control
 Concern over potential loss of control over dissemination and
use of data
 User experience
 Switch from organizational to individual interaction
Augment repository function by composition (when possible)
and addition (when necessary)
 Loosely-coupled integration with external community supported
systems and services
Scale
 Avoiding client timeout
 ≤ 2 GB: File-based  stream-based AIP-to-DIP processing
 > 2 GB: Asynchronous delivery
 Email notification with personalized, time-limited URL
 Streamlined storage provisioning
 SDSC cloud
cloud.sdsc.edu
www.kevatron.co.uk/converting-8-24-bit-samples-in-coreaudio-on-ios www.flickr.com/photos/paulbhartzog/680749585
Control
 Data use agreements (DUAs)
 Explicit assertion of license requirements and terms of use
 Curatorial and consumer notification of acceptance
Cf. Brazhnik and Jones (2007), “Anatomy of data integration,” Journal of Biomedical Informatics 40(3): 252-
69, doi:10.1016/j.jbi.2006.09.001
From: no-reply-merritt@ucop.edu
Subject:Merritt DUA acceptance
Name: Stephen Abrams
Affiliation: California Digital Library
Collection: UCSF DataShare
Object: Frontotemporal Lobar Degeneration (FTLD)
Date: 2013-05-3109:50:34PDT
Terms of use: As part of this agreement, Consumer submits to the following
statements:
(1) I will receive access to de-identified data and will not attempt to establish the
identity of any of the study subjects.
(2) I will share these data only with my immediate co-workers, and I will not transfer
these data to other research groups. I understand that these data are available to
other research groups through the process by which I obtain them.
(3) I will require anyone in my group who utilizes these data, or anyone with whom I
share these data to comply with this data use agreement
...
User experience
 Due to its open eligibility policy, Merritt will always provide a
more generic UX than special-purpose or disciplinary systems
 Shifting user roles, shifting expectations
 Institutional  individual researcher
 Behavioral expectations set by the commercial/mobile web
User experience
 Due to its open eligibility policy, Merritt will always provide a
more generic UX than special-purpose or disciplinary systems
 Shifting user roles, shifting expectations
 Institutional  individual researcher
 Behavioral expectations set by the commercial web
 Integration with extant services that better provide the
desired UX
 DataShare
 Research Hub
DataShare
 “The goal of the DataShare project is to catalyze widespread
sharing of scientific research data”
datashare.ucsf.edu
 UCSF Clinical and Translational Science Institute
ctsi.ucsf.edu
 UCSF Library
www.library.ucsf.edu
 UCSF Center for Imaging of Neurodegenerative Disease
www.radiology.ucsf.edu/cind
 Architecture
 DataShare submission client (Ruby/Rails)
 Merritt curation repository
 DataShare discovery portal (XTF/Java)
DataShare
 Prepare
 Describe
 Upload
 Curate
 Discover
 Share
DataShare
 Prepare
 Best practice advice
 Describe
 Upload
 Curate
 Discover
 Share
DataShare
 Prepare
 Describe
 Schema-directed
metadata editor
 DataCite schema
schema.datacite.org
 Upload
 Curate
 Discover
 Share
DataShare
 Prepare
 Describe
 Upload
 File browse or
drag-n-drop
 Curate
 Discover
 Share
DataShare
 Prepare
 Describe
 Upload
 File browse or
drag-n-drop
 Curate
 Discover
 Share
DataShare
 Prepare
 Describe
 Upload
 Curate
 Manage datasets
 Discover
 Share
DataShare
 Prepare
 Describe
 Upload
 Curate
 Discover
 Faceted search and
browse
 Share
DataShare
 Prepare
 Describe
 Upload
 Curate
 Discover
 Share
 DataONE
 DataCite
 (soon) Primo
Web of Knowledge
 SEO
Merritt + DataShare
Storage node
Storage
broker
Inventory
ONEShare UNM
storage node
Storage node
UI/API
UI/API
UI/API
LDAP
LDAP
LDAP
RDBMS
Fixity
User
agent
Message
queue
RDBMS
Load
balancer
Ingest
Load
balancer
Ingest
Ingest
EZID
No-SQL
DataCite
…
DataONE
member node
RDBMS
RDBMS
DataONE
coord’ing node
…
IDF
Load
balancer
Web of
Knowledge
Primo
SAN
SDSC
cloud
DataShare
upload
Collection
Atom feed
XTF
xtf.cdlib.org
DataShare
portal
Lucene
Research Hub
 “Research Hub provides powerful tools for content
management and collaboration”
hub.berkeley.edu
 Alfresco CMS
www.alfresco.com
 770 projects, 3,900 users
 Personal file management
 Project collaboration
 Departmental resource pooling
 Research data management
 Desktop sync, mobile app, Adobe Creative Suite
 UC Berkeley Information Services and Technology
ist.berkeley.edu
Research Hub
 Prepare
 Acquire and
arrange
 Describe
 Upload
 Curate
 Discover
 Share
Research Hub
 Prepare
 Describe
 Schema-directed
metadata editors
 Upload
 Curate
 Discover
 Share
Research Hub
 Prepare
 Describe
 Upload
 Direct action
 Curate
 Discover
 Share
 Prepare
 Describe
 Upload
 Direct action
 Curate
 Discover
 Share
Research Hub
Research Hub
 Prepare
 Describe
 Upload
 Policy-based
workflow rules
 Curate
 Discover
 Share
Research Hub
 Prepare
 Describe
 Upload
 Drag-and-drop
 Curate
 Discover
 Share
Research Hub
 Prepare
 Describe
 Upload
 Confirmation
 Curate
 Discover
 Share
Research Hub
 Prepare
 Describe
 Upload
 Curate
 Manage datasets
 Discover
 Share
Research Hub
 Prepare
 Describe
 Upload
 Curate
 Discover
 Share
Research Hub
 Prepare
 Describe
 Upload
 Curate
 Discover
 Share
Merritt + DataShare + Research Hub
Storage node
Storage
broker
Inventory
ONEShare UNM
storage node
Storage node
UI/API
UI/API
UI/API
LDAP
LDAP
LDAP
RDBMS
Fixity
User
agent
Message
queue
RDBMS
Load
balancer
Ingest
Load
balancer
Ingest
Ingest
EZID
No-SQL
DataCite
…
DataONE
member node
RDBMS
RDBMS
DataONE
coord’ing node
…
IDF
Load
balancer
Web of
Knowledge
Primo
SAN
SDSC
cloud
DataShare
upload
Collection
Atom feed
XTF
xtf.cdlib.org
DataShare
portal
Lucene
Research
Hub
Next steps
 Self-service account
registration
 UCTrust and InCommon
Shibboleth federations
 Additional cloud-based
replication
 Outreach
 Integration with Open Context
archaeological portal
opencontext.org
 Atom-based submission
 Integration with Nuxeo
www.nuxeo.com
 UC system-wide DAMS solution
 Integration with Islandora
islandora.ca
 Collaboration with UCLA Library
 Tuque API
 Integration with DPN
www.dpn.org
Sharing research through repositories
 Conform to institutional policy, publication requirements, and
funder mandates
 Pro-active curation of valuable research outputs
 Stable citation and access
 High visibility publication and discovery
 Use metrics
Sharing research through repositories
 Conform to institutional policy, publication requirements, and
funder mandates
 Pro-active curation of valuable research outputs
 Stable citation and access
 High visibility publication and discovery
 Use metrics
 Repository layering as an appropriate division of labor
 Exploiting existing capabilities already in local use
For more information
 Merritt
www.cdlib.org/uc3/merritt
uc3@ucop.edu
Stephen Abrams David Loy
Patricia Cruse Mark Reyes
Shirin Faenza Joan Starr
Scott Fisher Carly Strasser
Erik Hetzner Marisa Strong
Joshua Hubbard Bhavitavya Vedula
Greg Janée Kenneth Weiss
John Kunze Perry Willet
Rosalie Lack
 DataShare
datashare.ucsf.edu
Geoffrey Boushey Julia Kochi
Anirvan Chatterjee Angela Rizk-Jackson
Maninder Kahlon Michael Weiner
 Research Hub
hub.berkeley.edu
Ian Crew Michael McCarthy (Tribloom)
Noah Wittman Patrick McGrath
www.slideshare.net/UC3/or-2013abramssharingdatarichresearch

Más contenido relacionado

La actualidad más candente

D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013ECNOfficer
 
A Social Content Delivery Network for Scientific Cooperation: Vision, Design...
A Social Content Delivery Network for Scientific Cooperation: Vision,  Design...A Social Content Delivery Network for Scientific Cooperation: Vision,  Design...
A Social Content Delivery Network for Scientific Cooperation: Vision, Design...Simon Caton
 
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Jenn Riley
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive trackGeorge Komatsoulis
 
Data Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumData Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumMerce Crosas
 
ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsSEAD
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)SEAD
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD
 
Dataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTagsDataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTagsMerce Crosas
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...Robert Grossman
 
On Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a ServiceOn Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a ServiceHong-Linh Truong
 
Open Data is not Enough (final version)
Open Data is not Enough (final version)Open Data is not Enough (final version)
Open Data is not Enough (final version)Research Data Alliance
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?Robert Grossman
 
Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015George Komatsoulis
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsRobert Grossman
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataRobert Grossman
 

La actualidad más candente (20)

D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013
 
A Social Content Delivery Network for Scientific Cooperation: Vision, Design...
A Social Content Delivery Network for Scientific Cooperation: Vision,  Design...A Social Content Delivery Network for Scientific Cooperation: Vision,  Design...
A Social Content Delivery Network for Scientific Cooperation: Vision, Design...
 
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive track
 
Data Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumData Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access Symposium
 
ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and Tools
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)
 
Dataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTagsDataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTags
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
Digital Curation 101 - Taster
Digital Curation 101 - TasterDigital Curation 101 - Taster
Digital Curation 101 - Taster
 
Linked data in pharma R&D
Linked data in pharma R&DLinked data in pharma R&D
Linked data in pharma R&D
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
On Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a ServiceOn Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a Service
 
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
 
Open Data is not Enough (final version)
Open Data is not Enough (final version)Open Data is not Enough (final version)
Open Data is not Enough (final version)
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
 
Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data Platforms
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 

Destacado

Overcoming Obstacles to Sharing Research Data
Overcoming Obstacles to Sharing Research DataOvercoming Obstacles to Sharing Research Data
Overcoming Obstacles to Sharing Research DataBrian Hole
 
Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Managementslabrams
 
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...University of California Curation Center
 
NISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDLNISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDLCarly Strasser
 

Destacado (7)

Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Management
 
Overcoming Obstacles to Sharing Research Data
Overcoming Obstacles to Sharing Research DataOvercoming Obstacles to Sharing Research Data
Overcoming Obstacles to Sharing Research Data
 
Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Management
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
 
NISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDLNISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDL
 
What does "data publication" mean to researchers?
What does "data publication" mean to researchers?What does "data publication" mean to researchers?
What does "data publication" mean to researchers?
 

Similar a Or 2013-abrams-sharing-data-rich-research

Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management Gary Wilhelm
 
How Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-useHow Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-useMatthew Vaughn
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Laurent Alquier
 
Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster LEARN Project
 
GlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening KeynoteGlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening KeynoteGlobus
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourKNOWeSCAPE2014
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...Kathleen Jagodnik
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceLizLyon
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVEUDAT
 
Grid Computing July 2009
Grid Computing July 2009Grid Computing July 2009
Grid Computing July 2009Ian Foster
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals FederationManjulaPatel
 
Impact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationImpact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationMANENDRASINGH30
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 

Similar a Or 2013-abrams-sharing-data-rich-research (20)

Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management
 
Cyberistructure
CyberistructureCyberistructure
Cyberistructure
 
How Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-useHow Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-use
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
 
Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster
 
DataShare for UC Campuses
DataShare for UC CampusesDataShare for UC Campuses
DataShare for UC Campuses
 
GlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening KeynoteGlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening Keynote
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalface
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROV
 
Grid Computing July 2009
Grid Computing July 2009Grid Computing July 2009
Grid Computing July 2009
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
 
Impact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationImpact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and Education
 
SomeSlides
SomeSlidesSomeSlides
SomeSlides
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 

Más de University of California Curation Center

ETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of CaliforniaETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of CaliforniaUniversity of California Curation Center
 
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchThe UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchUniversity of California Curation Center
 

Más de University of California Curation Center (20)

ETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of CaliforniaETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of California
 
Dash UCCSC 2016
Dash UCCSC 2016Dash UCCSC 2016
Dash UCCSC 2016
 
Uc3 ucacc-2015-11-16
Uc3 ucacc-2015-11-16Uc3 ucacc-2015-11-16
Uc3 ucacc-2015-11-16
 
Dash: data sharing made easy
Dash: data sharing made easyDash: data sharing made easy
Dash: data sharing made easy
 
CDL research lifecycle
CDL research lifecycleCDL research lifecycle
CDL research lifecycle
 
Ucmp 20150407
Ucmp 20150407Ucmp 20150407
Ucmp 20150407
 
Researcher perspectives on publication and peer review of data.
Researcher perspectives on publication and peer review of data.Researcher perspectives on publication and peer review of data.
Researcher perspectives on publication and peer review of data.
 
Enhancing DMPTool: Further Streamlineing Data Mangement Planning Process
Enhancing DMPTool: Further Streamlineing Data Mangement Planning ProcessEnhancing DMPTool: Further Streamlineing Data Mangement Planning Process
Enhancing DMPTool: Further Streamlineing Data Mangement Planning Process
 
DataShare: Empowering Researcher Data Curation
DataShare: Empowering Researcher Data CurationDataShare: Empowering Researcher Data Curation
DataShare: Empowering Researcher Data Curation
 
Future of web archiving
Future of web archivingFuture of web archiving
Future of web archiving
 
Data preservation 101
Data preservation 101Data preservation 101
Data preservation 101
 
Creating superior data management plans with the DMPTool
Creating superior data management plans with the DMPToolCreating superior data management plans with the DMPTool
Creating superior data management plans with the DMPTool
 
ESA Ignite talk on the DMPTool by S Abrams
ESA Ignite talk on the DMPTool by S AbramsESA Ignite talk on the DMPTool by S Abrams
ESA Ignite talk on the DMPTool by S Abrams
 
DMPTool2 Webinar #1 for Administrators
DMPTool2 Webinar #1 for AdministratorsDMPTool2 Webinar #1 for Administrators
DMPTool2 Webinar #1 for Administrators
 
DMPTool2 Administrator Webinar #2
DMPTool2 Administrator Webinar #2DMPTool2 Administrator Webinar #2
DMPTool2 Administrator Webinar #2
 
Helping librarians use the DMPTool as a centerpiece for data management
Helping librarians use the DMPTool as a centerpiece for data managementHelping librarians use the DMPTool as a centerpiece for data management
Helping librarians use the DMPTool as a centerpiece for data management
 
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchThe UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
 
Dataset Metadata Publication Through EZID
Dataset Metadata Publication Through EZIDDataset Metadata Publication Through EZID
Dataset Metadata Publication Through EZID
 
DMPTool2: Improvements and Outreach
DMPTool2: Improvements and Outreach DMPTool2: Improvements and Outreach
DMPTool2: Improvements and Outreach
 
DMPTool Webinar 11: Complementary Tools
DMPTool Webinar 11: Complementary ToolsDMPTool Webinar 11: Complementary Tools
DMPTool Webinar 11: Complementary Tools
 

Último

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 

Último (20)

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 

Or 2013-abrams-sharing-data-rich-research

  • 1. Sharing Data-Rich Research Through Repository Layering Stephen Abrams California Digital Library Angela Rizk-Jackson Julia Kochi University of California, San Francisco Noah Wittman University of California, Berkeley
  • 2. Why is data curation important?  Accelerating scientific progress  Enabling appropriate scrutiny and verification of results  Promoting integrity and debate  Facilitating new collaborations  Avoiding needless duplication of effort  Increasingly, complying with institutional policies, publication requirements, and funder mandates Cf. White and Teds (2011), “Making the case for research data management” DCC briefing paper, www.dcc.ac.uk/resources/briefing-papers/making-case-rdm
  • 3. The library’s role  A continuation of its long-standing mission and practice to connect patrons with content of interest in meaningful ways across barriers of space and time Cf. Tenopir et al. (2012), “Academic librarians and research data services: Preparation and attitudes,” 78th IFLA General Conference and Assembly, Helsinki, conference.ifla.org/past/ifla78/116-tenopir-en.pdf  Offering solutions that enhance the natural points of alignment between the scholarly research and information lifecycles Publish Reuse ShareCreate Discover Collect PreserveAccessResearchResearch CurationCuration Scholarly lifecycle Information lifecycle
  • 4. Merritt  Curation repository available to the UC community and external partners  Preservation and access  Content agnostic, model free  Highly decentralized micro-services architecture Cf. Abrams, Cruse, Kunze, and Minor (2011), “Curation micro-services: A pipeline metaphor for repositories,” Journal of Digital Information 12(2), journals.tdl.org/jodi/article/view/1605  26 curatorial units  271 collections  325,000 objects  450,000 versions  4,500,000 files  13 TB www.cdlib.org/uc3/merritt merritt.cdlib.org
  • 5. Merritt Storage node Storage broker Inventory ONEShare UNM storage node Storage node UI/API UI/API UI/API LDAP LDAP LDAP RDBMS Fixity User agent Message queue RDBMS Load balancer Ingest Load balancer Ingest Ingest EZID No-SQL DataCite … DataONE member node RDBMS RDBMS DataONE coord’ing node … IDF Load balancer Web of Knowledge Primo SAN SDSC cloud
  • 6. (Some) issues to address  Scale  Individual objects ranging from 0 to 47,000 files  Individual files ranging from 0 to 14 GB  Maintaining control  Concern over potential loss of control over dissemination and use of data  User experience  Switch from organizational to individual interaction www.flickr.com/photos/vixon/116447718www.flickr.com/photos/traftery/4319529821www.flickr.com/photos/32195273@N05/51076852642
  • 7. (Some) issues to address  Scale  Individual objects ranging from 0 to 47,000 files  Individual files ranging from 0 to 14 GB  Maintaining control  Concern over potential loss of control over dissemination and use of data  User experience  Switch from organizational to individual interaction Augment repository function by composition (when possible) and addition (when necessary)  Loosely-coupled integration with external community supported systems and services
  • 8. Scale  Avoiding client timeout  ≤ 2 GB: File-based  stream-based AIP-to-DIP processing  > 2 GB: Asynchronous delivery  Email notification with personalized, time-limited URL  Streamlined storage provisioning  SDSC cloud cloud.sdsc.edu www.kevatron.co.uk/converting-8-24-bit-samples-in-coreaudio-on-ios www.flickr.com/photos/paulbhartzog/680749585
  • 9. Control  Data use agreements (DUAs)  Explicit assertion of license requirements and terms of use  Curatorial and consumer notification of acceptance Cf. Brazhnik and Jones (2007), “Anatomy of data integration,” Journal of Biomedical Informatics 40(3): 252- 69, doi:10.1016/j.jbi.2006.09.001 From: no-reply-merritt@ucop.edu Subject:Merritt DUA acceptance Name: Stephen Abrams Affiliation: California Digital Library Collection: UCSF DataShare Object: Frontotemporal Lobar Degeneration (FTLD) Date: 2013-05-3109:50:34PDT Terms of use: As part of this agreement, Consumer submits to the following statements: (1) I will receive access to de-identified data and will not attempt to establish the identity of any of the study subjects. (2) I will share these data only with my immediate co-workers, and I will not transfer these data to other research groups. I understand that these data are available to other research groups through the process by which I obtain them. (3) I will require anyone in my group who utilizes these data, or anyone with whom I share these data to comply with this data use agreement ...
  • 10. User experience  Due to its open eligibility policy, Merritt will always provide a more generic UX than special-purpose or disciplinary systems  Shifting user roles, shifting expectations  Institutional  individual researcher  Behavioral expectations set by the commercial/mobile web
  • 11. User experience  Due to its open eligibility policy, Merritt will always provide a more generic UX than special-purpose or disciplinary systems  Shifting user roles, shifting expectations  Institutional  individual researcher  Behavioral expectations set by the commercial web  Integration with extant services that better provide the desired UX  DataShare  Research Hub
  • 12. DataShare  “The goal of the DataShare project is to catalyze widespread sharing of scientific research data” datashare.ucsf.edu  UCSF Clinical and Translational Science Institute ctsi.ucsf.edu  UCSF Library www.library.ucsf.edu  UCSF Center for Imaging of Neurodegenerative Disease www.radiology.ucsf.edu/cind  Architecture  DataShare submission client (Ruby/Rails)  Merritt curation repository  DataShare discovery portal (XTF/Java)
  • 13. DataShare  Prepare  Describe  Upload  Curate  Discover  Share
  • 14. DataShare  Prepare  Best practice advice  Describe  Upload  Curate  Discover  Share
  • 15. DataShare  Prepare  Describe  Schema-directed metadata editor  DataCite schema schema.datacite.org  Upload  Curate  Discover  Share
  • 16. DataShare  Prepare  Describe  Upload  File browse or drag-n-drop  Curate  Discover  Share
  • 17. DataShare  Prepare  Describe  Upload  File browse or drag-n-drop  Curate  Discover  Share
  • 18. DataShare  Prepare  Describe  Upload  Curate  Manage datasets  Discover  Share
  • 19. DataShare  Prepare  Describe  Upload  Curate  Discover  Faceted search and browse  Share
  • 20. DataShare  Prepare  Describe  Upload  Curate  Discover  Share  DataONE  DataCite  (soon) Primo Web of Knowledge  SEO
  • 21. Merritt + DataShare Storage node Storage broker Inventory ONEShare UNM storage node Storage node UI/API UI/API UI/API LDAP LDAP LDAP RDBMS Fixity User agent Message queue RDBMS Load balancer Ingest Load balancer Ingest Ingest EZID No-SQL DataCite … DataONE member node RDBMS RDBMS DataONE coord’ing node … IDF Load balancer Web of Knowledge Primo SAN SDSC cloud DataShare upload Collection Atom feed XTF xtf.cdlib.org DataShare portal Lucene
  • 22. Research Hub  “Research Hub provides powerful tools for content management and collaboration” hub.berkeley.edu  Alfresco CMS www.alfresco.com  770 projects, 3,900 users  Personal file management  Project collaboration  Departmental resource pooling  Research data management  Desktop sync, mobile app, Adobe Creative Suite  UC Berkeley Information Services and Technology ist.berkeley.edu
  • 23. Research Hub  Prepare  Acquire and arrange  Describe  Upload  Curate  Discover  Share
  • 24. Research Hub  Prepare  Describe  Schema-directed metadata editors  Upload  Curate  Discover  Share
  • 25. Research Hub  Prepare  Describe  Upload  Direct action  Curate  Discover  Share
  • 26.  Prepare  Describe  Upload  Direct action  Curate  Discover  Share Research Hub
  • 27. Research Hub  Prepare  Describe  Upload  Policy-based workflow rules  Curate  Discover  Share
  • 28. Research Hub  Prepare  Describe  Upload  Drag-and-drop  Curate  Discover  Share
  • 29. Research Hub  Prepare  Describe  Upload  Confirmation  Curate  Discover  Share
  • 30. Research Hub  Prepare  Describe  Upload  Curate  Manage datasets  Discover  Share
  • 31. Research Hub  Prepare  Describe  Upload  Curate  Discover  Share
  • 32. Research Hub  Prepare  Describe  Upload  Curate  Discover  Share
  • 33. Merritt + DataShare + Research Hub Storage node Storage broker Inventory ONEShare UNM storage node Storage node UI/API UI/API UI/API LDAP LDAP LDAP RDBMS Fixity User agent Message queue RDBMS Load balancer Ingest Load balancer Ingest Ingest EZID No-SQL DataCite … DataONE member node RDBMS RDBMS DataONE coord’ing node … IDF Load balancer Web of Knowledge Primo SAN SDSC cloud DataShare upload Collection Atom feed XTF xtf.cdlib.org DataShare portal Lucene Research Hub
  • 34. Next steps  Self-service account registration  UCTrust and InCommon Shibboleth federations  Additional cloud-based replication  Outreach  Integration with Open Context archaeological portal opencontext.org  Atom-based submission  Integration with Nuxeo www.nuxeo.com  UC system-wide DAMS solution  Integration with Islandora islandora.ca  Collaboration with UCLA Library  Tuque API  Integration with DPN www.dpn.org
  • 35. Sharing research through repositories  Conform to institutional policy, publication requirements, and funder mandates  Pro-active curation of valuable research outputs  Stable citation and access  High visibility publication and discovery  Use metrics
  • 36. Sharing research through repositories  Conform to institutional policy, publication requirements, and funder mandates  Pro-active curation of valuable research outputs  Stable citation and access  High visibility publication and discovery  Use metrics  Repository layering as an appropriate division of labor  Exploiting existing capabilities already in local use
  • 37. For more information  Merritt www.cdlib.org/uc3/merritt uc3@ucop.edu Stephen Abrams David Loy Patricia Cruse Mark Reyes Shirin Faenza Joan Starr Scott Fisher Carly Strasser Erik Hetzner Marisa Strong Joshua Hubbard Bhavitavya Vedula Greg Janée Kenneth Weiss John Kunze Perry Willet Rosalie Lack  DataShare datashare.ucsf.edu Geoffrey Boushey Julia Kochi Anirvan Chatterjee Angela Rizk-Jackson Maninder Kahlon Michael Weiner  Research Hub hub.berkeley.edu Ian Crew Michael McCarthy (Tribloom) Noah Wittman Patrick McGrath www.slideshare.net/UC3/or-2013abramssharingdatarichresearch

Notas del editor

  1. Copyright © 2013 by The Regents of the University of CaliforniaThis work is made available under the terms of the Creative Commons Attribution-ShareAlike 3.0 license
  2. MatijaGrguric, Scale up! – Curved elements!, http://www.flickr.com/photos/32195273@N05/5107685264Tom Rafferty, Remote control!, http://www.flickr.com/photos/67945918@N00/4319529821Barry Egan, File rio 2006, http://www.flickr.com/photos/vixon/116447718
  3. Kevin Smith, Converting 8.24 bit samples in CoreAudio on iOS, http://www.kevatron.co.uk/converting-8-24-bit-samples-in-coreaudio-on-ios/Paul B. Hartzog, Server room, http://www.flickr.com/photos/paulbhartzog/680749585