SlideShare una empresa de Scribd logo
1 de 23
Descargar para leer sin conexión
Johns	
  Hopkins	
  University	
  Data	
  
Management	
  Services	
  
                   Sayeed	
  Choudhury	
  and	
  Barbara	
  Pralle	
  
            Digital	
  Library	
  Federation	
  –	
  October	
  31,	
  2011	
  

   Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Powered	
  by	
  Data	
  Conservancy	
  

•    JHU	
  Data	
  Management	
  Service	
  (DMS)	
  
     represents	
  the	
  culmination	
  of	
  two	
  years	
  of	
  
     research,	
  design,	
  development	
  and	
  
     implementation	
  of	
  Data	
  Conservancy	
  
•    Service	
  launched	
  in	
  July	
  2011	
  
•    DC	
  instance	
  launched	
  in	
  October	
  2011	
  
•    Important,	
  essential	
  foundations	
  in	
  place	
  
•    There	
  remains	
  work	
  to	
  be	
  done	
  

                     Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Data	
  Conservancy	
  

•    The	
  Data	
  Conservancy	
  has	
  developed	
  a	
  blueprint	
  
     for	
  institutions	
  that	
  view	
  scientific	
  data	
  curation	
  
     as	
  a	
  means	
  to	
  collect,	
  organize,	
  validate	
  and	
  
     preserve	
  data	
  so	
  that	
  scientists	
  can	
  find	
  new	
  
     ways	
  to	
  address	
  the	
  grand	
  research	
  challenges	
  
     that	
  face	
  society.	
  	
  
Technology-­‐related	
  Accomplishments	
  	
  

•    Preservation-­‐based,	
  flexible	
  data	
  model	
  
     (inspired	
  by	
  PLANETS	
  project)	
  
•    Demonstration	
  of	
  modularity	
  through	
  Archival	
  
     Storage	
  framework	
  
•    Feature	
  Extraction	
  Framework	
  
•    Demonstration	
  of	
  interoperability	
  with	
  NSIDC	
  
     glacier	
  photo	
  service,	
  arXiv,	
  IVOA	
  and	
  Sakai	
  
•    Development	
  of	
  DCS	
  instance	
  

                    Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Notion	
  of	
  the	
  DCS	
  Instance	
  (1)	
  

•    An	
  instance	
  of	
  the	
  DCS	
  (Data	
  Conservancy	
  System)	
  
     software	
  stack	
  
•    A	
  deployment	
  infrastructure	
  (hardware,	
  servers,	
  etc.)	
  
     for	
  the	
  DCS	
  
•    A	
  defined	
  (sub)set	
  of	
  DCS	
  services	
  to	
  be	
  exposed/
     provided	
  
•    A	
  defined/understood/expected	
  “context”	
  to	
  be	
  
     addressed	
  (e.g.	
  science	
  domain,	
  institutional	
  domain,	
  	
  
     etc.)	
  

                                                                                    5
Notion	
  of	
  the	
  DCS	
  Instance	
  (2)	
  

•    A	
  local	
  policy	
  framework	
  in	
  place	
  for	
  the	
  System	
  
     Operation	
  
•    Personnel/staffing	
  to	
  setup,	
  manage	
  and	
  support	
  
     the	
  continued	
  operation	
  of	
  the	
  system	
  
•    A	
  sustainability/business	
  plan	
  in	
  place	
  to	
  operate	
  
     the	
  instance	
  post	
  NSF	
  funding	
  (cost	
  model,	
  user	
  
     base,	
  etc.)	
  
•    Key	
  integrators	
  such	
  as	
  spatial,	
  temporal	
  and	
  
     taxonomic	
  queries	
  or	
  data	
  replication	
  
                                                                               6
Definition	
  of	
  Data	
  Preservation	
  

•    “Data	
  preservation	
  involves	
  providing	
  enough	
  
     representation	
  information,	
  context,	
  metadata,	
  
     fixity,	
  etc.	
  such	
  that	
  someone	
  other	
  than	
  the	
  
     original	
  data	
  producer	
  can	
  use	
  and	
  interpret	
  the	
  
     data.”	
  
      -    Ruth	
  Duerr,	
  National	
  Snow	
  and	
  Ice	
  Data	
  Center	
  




                                                                                    7
Architecture	
  mapped	
  to	
  OAIS	
  




Open	
  Archival	
  Information	
  System	
  
          Functional	
  Entities	
  


                                                                           Data	
  Conservancy	
  Service	
  
                                                                          Architecture	
  Block	
  Diagram	
  

                             Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Data	
  Model:	
  Research	
  

•    Versions	
  vs.	
  (format)	
  
     migration	
  
      -    Modification	
  of	
  significant	
  
           properties	
  versus	
  
           transformation	
  of	
  data	
  
•    Replication	
  and	
  provenance	
  
     (verifiable	
  snapshots)	
  
•    Layered	
  data	
  model	
  
      -    Preservation	
  view,	
  Persistence	
  
           view,	
  Information	
  view	
  
Data	
  Model:	
  Application	
  

•    Multiple	
  Data	
  Models	
  
•    Content	
  models	
  for	
  
     describing	
  the	
  contents	
  of	
  a	
  
     Manifestation	
  
•    General	
  Model	
  used	
  to	
  
     correlate	
  model	
  entities	
  
     across	
  heterogeneous	
  
     datasets	
  
      -    geo-­‐reference,	
  time	
  of	
  
           observation,	
  etc…	
  
Feature	
  Extraction	
  Framework:	
  Design	
  

•    Must	
  accommodate	
  a	
  
     variety	
  of	
  data	
  formats	
  
•    No	
  assumption	
  made	
  
     regarding	
  the	
  form	
  of	
  data	
  
     input	
  or	
  output	
  
•    Not	
  coupled	
  to	
  a	
  specific	
  
     execution	
  model	
  
Feature	
  Extraction	
  Framework:	
  Application	
  

•    Subsetting	
  
      -    Returning	
  a	
  portion	
  of	
  a	
  
           dataset	
  
•    Indexing	
  
      -    Output	
  suitable	
  for	
  indexing	
  by	
  
           the	
  Query	
  Framework	
  
•    Workflows	
  
      -    Process	
  Orchestration,	
  
           Meandre,	
  Taverna,	
  Kepler	
  
•    Execution	
  environment	
  for	
  
     analysis	
  
      -    Stateless	
  Mappings	
  basis	
  for	
  
           MapReduce	
  
Implementation:	
  Archival	
  Storage	
  
•    Responsible	
  for	
  long	
  term	
  storage	
  
     	
  of	
  content	
  and	
  metadata	
  (AIP)	
  
      -    Evolving	
  storage	
  technologies	
  	
  
           (including	
  cloud)	
  
      -    Policy	
  implementation	
  (e.g.	
  multiple	
  copies)	
  

•    Archival	
  Storage	
  API	
  abstracts	
  
     underlying	
  technology	
  
      -    Fedora	
  (object	
  based),	
  ELM	
  (file	
  based)	
  

•    Persistent	
  Storage	
  Framework	
  (Y2)	
  
      -    Instrumented	
  file	
  systems	
  
      -    File	
  semantics	
  over	
  block,	
  cloud,	
  content-­‐
           addressed	
  storage	
  
      -    Storage	
  services	
  (e.g.	
  integrity	
  checking)	
  
Data	
  Management	
  Layers	
  
Layers	
           Examples	
                            Implication	
  for	
  PI	
           Implication	
  relative	
  
                                                                                              to	
  NSF	
  
Curation	
         Future	
  JHU	
  Data	
  Archive	
   •  Feature	
  Extraction	
            •  Competitive	
  
                   and	
  other	
  DCS	
  instances	
   •  New	
  query	
                        advantage	
  
                                                           capabilities	
                     •  New	
  
                                                        •  Cross-­‐disciplinary	
                opportunities	
  
Preservation	
   JHU	
  Data	
  Archive	
                •  Ability	
  to	
  use	
  own	
     •  Satisfies	
  NSF	
  
                 Portico	
                                  data	
  in	
  the	
  future	
          needs	
  across	
  
                 ICPSR	
                                    (e.g.	
  5	
  yrs)	
                   directorates	
  
                                                         •  Data	
  sharing	
  	
             	
  
Archiving	
        CUAHSI	
                              •  Provides	
  identifiers	
          •  Could	
  satisfy	
  
                   NEES	
                                   for	
  sharing,	
                    most	
  NSF	
  
                   Dataverse	
                              references,	
  etc.	
                requirements	
  
Storage	
          Server	
  in	
  Lab	
                 •  Responsible	
  for:	
             •  Could	
  be	
  enough	
  
                   Website	
                                 •  Restore	
                        for	
  now	
  but	
  not	
  
                   Amazon	
  S3	
                            •  Sharing	
                        near-­‐term	
  future	
  
                                                             •  Staffing	
  
                               Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Defining	
  Sustainability	
  
                    •    “Ensuring	
  that	
  valuable	
  
                         digital	
  assets	
  will	
  be	
  
                         available	
  for	
  future	
  use	
  is	
  
                         not	
  simply	
  a	
  matter	
  of	
  
                         finding	
  sufficient	
  funds.	
  It	
  
                         is	
  about	
  mobilizing	
  
                         resources—human,	
  
                         technical,	
  and	
  financial—
                         across	
  a	
  spectrum	
  of	
  
                         stakeholders	
  diffuse	
  over	
  
                         both	
  space	
  and	
  time.”	
  
Establishing	
  the	
  JHU	
  DMS	
  
              	
  
•    May	
  2010	
  NSF	
  announces	
  DMP	
  expectations	
  
•    Services	
  incubated	
  and	
  scoped	
  summer/fall	
  2010	
  
      -    Build	
  on	
  Data	
  Conservancy	
  expertise	
  	
  
•    Proposed	
  in	
  January	
  and	
  launched	
  in	
  July	
  2011	
  
      -  Consultative	
  data	
  management	
  planning	
  services	
  to	
  
         support	
  NSF	
  proposals	
  
      -  Post	
  award	
  data	
  management	
  services	
  




                        Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Resources	
  Needed	
  for	
  Launch	
  	
  

•    Talented	
  team	
  and	
  expertise	
  
•    Flexible	
  process	
  
•    Tools	
  that	
  support	
  service	
  provision	
  
•    Systems	
  developed	
  by	
  the	
  DC	
  to	
  manage	
  data	
  
•    Marketing	
  and	
  outreach	
  partners	
  
•    Mechanism	
  for	
  supporting	
  financial	
  model	
  



                    Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
People	
  and	
  Expertise	
  
            	
  
•    Create	
  data	
  management	
  consultant	
  position	
  	
  
•    Position	
  description	
  recognizes	
  diversity	
  of	
  
     experience	
  and	
  talent	
  that	
  can	
  support	
  service	
  
•    Recruited	
  people	
  with	
  skills	
  that	
  match	
  with	
  the	
  
     JHU	
  DMS	
  objectives	
  and	
  services	
  
•    Knowledge	
  transfer	
  through	
  hands	
  on	
  work	
  and	
  
     interactions	
  with	
  DC	
  partners	
  


                     Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Day	
  in	
  the	
  Life	
  of	
  a	
  JHU	
  DM	
  Consultant	
  (1)	
  

•    Consultative	
  process	
  emulates	
  a	
  reference	
  
     interview	
  
•    Adapt	
  to	
  PI	
  timeframe	
  and	
  deadlines	
  
•    Gather	
  of	
  information,	
  identify	
  gaps,	
  understand	
  
     options,	
  prepare	
  and	
  iterate	
  plan	
  
•    Support	
  better	
  data	
  management	
  by	
  
     encouraging	
  systematic	
  cultural	
  change	
  through	
  
     PI	
  interactions	
  

                     Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Day	
  in	
  the	
  Life	
  of	
  a	
  JHU	
  DM	
  Consultant	
  (2)	
  

•    In-­‐depth	
  planning	
  after	
  award	
  received	
  
•    Helping	
  PI	
  understand	
  service	
  levels	
  within	
  JHU	
  
     Data	
  Archive	
  
•    Planning	
  and	
  preparing	
  data	
  for	
  deposit	
  
•    Reviewing	
  data	
  management	
  plan	
  and	
  
     identifying	
  solutions	
  for	
  next	
  phase	
  of	
  data	
  
     management	
  


                     Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Challenges	
  

•    Timing	
  of	
  consultative	
  support,	
  deadlines,	
  and	
  
     marketing	
  of	
  service	
  
•    Building	
  awareness	
  of	
  data	
  management	
  value	
  
•    Establishing	
  common	
  vocabulary	
  ex.	
  One	
  PIs	
  
     ‘storage’	
  is	
  another	
  PIs	
  ‘archiving’	
  
•    Condensing	
  key	
  information	
  into	
  two	
  pages	
  
•    Navigating	
  different	
  data	
  retention	
  policies	
  
•    Responding	
  to	
  widely	
  ranging	
  domains	
  

                   Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Opportunities	
  

•    Grow	
  the	
  researcher/grad	
  student	
  
     understanding	
  of	
  data	
  management	
  
     -    Support	
  good	
  stewardship	
  of	
  data	
  across	
  institution	
  
•    Establish	
  an	
  archive	
  specifically	
  designed	
  for	
  
     data,	
  enabling	
  future	
  discovery	
  and	
  use	
  	
  
•    Build	
  our	
  collective	
  expertise	
  in	
  data	
  
     management	
  



                       Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  
Acknowledgements	
  and	
  Resources	
  

•    NSF	
  Award	
  OCI-­‐0830976	
  
•    Sheridan	
  Libraries	
  financial	
  support	
  
•    Johns	
  Hopkins	
  University	
  financial	
  support	
  
•    Elliot	
  Metsger	
  for	
  infrastructure	
  slides	
  
•    Tim	
  DiLauro	
  for	
  inspiration	
  about	
  layers	
  
•    Data	
  Conservancy	
  colleagues	
  for	
  their	
  exceptional	
  work	
  
     and	
  patience	
  
•    http://dataconservancy.org	
  
•    http://dmp.data.jhu.edu	
  

                     Johns	
  Hopkins	
  University	
  Sheridan	
  Libraries	
  

Más contenido relacionado

La actualidad más candente

Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?LIBER Europe
 
Library support for life cycle
Library support for life cycleLibrary support for life cycle
Library support for life cycleSherry Lake
 
Hw09 Terapot Email Archiving With Hadoop
Hw09   Terapot  Email Archiving With HadoopHw09   Terapot  Email Archiving With Hadoop
Hw09 Terapot Email Archiving With HadoopCloudera, Inc.
 
[Hadoop] NexR Terapot: Massive Email Archiving
[Hadoop] NexR Terapot: Massive Email Archiving[Hadoop] NexR Terapot: Massive Email Archiving
[Hadoop] NexR Terapot: Massive Email ArchivingJinho Jung
 
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...University of Missouri
 
Silverton cleversafe-object-based-dispersed-storage
Silverton cleversafe-object-based-dispersed-storageSilverton cleversafe-object-based-dispersed-storage
Silverton cleversafe-object-based-dispersed-storageAccenture
 

La actualidad más candente (6)

Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?
 
Library support for life cycle
Library support for life cycleLibrary support for life cycle
Library support for life cycle
 
Hw09 Terapot Email Archiving With Hadoop
Hw09   Terapot  Email Archiving With HadoopHw09   Terapot  Email Archiving With Hadoop
Hw09 Terapot Email Archiving With Hadoop
 
[Hadoop] NexR Terapot: Massive Email Archiving
[Hadoop] NexR Terapot: Massive Email Archiving[Hadoop] NexR Terapot: Massive Email Archiving
[Hadoop] NexR Terapot: Massive Email Archiving
 
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
The Reality of the Cloud: Implications of Cloud Computing for Mobile Library ...
 
Silverton cleversafe-object-based-dispersed-storage
Silverton cleversafe-object-based-dispersed-storageSilverton cleversafe-object-based-dispersed-storage
Silverton cleversafe-object-based-dispersed-storage
 

Destacado

Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...ASIS&T
 
arXiv Sustainability Initiative Oya Reiger RDAP12
arXiv Sustainability Initiative Oya Reiger RDAP12 arXiv Sustainability Initiative Oya Reiger RDAP12
arXiv Sustainability Initiative Oya Reiger RDAP12 ASIS&T
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12ASIS&T
 
Curation Service Models - Michael Witt - RDAP12
Curation Service Models - Michael Witt - RDAP12Curation Service Models - Michael Witt - RDAP12
Curation Service Models - Michael Witt - RDAP12ASIS&T
 
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...ASIS&T
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)Syaifuddin Ismail
 
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)ASIS&T
 

Destacado (7)

Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
 
arXiv Sustainability Initiative Oya Reiger RDAP12
arXiv Sustainability Initiative Oya Reiger RDAP12 arXiv Sustainability Initiative Oya Reiger RDAP12
arXiv Sustainability Initiative Oya Reiger RDAP12
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12
 
Curation Service Models - Michael Witt - RDAP12
Curation Service Models - Michael Witt - RDAP12Curation Service Models - Michael Witt - RDAP12
Curation Service Models - Michael Witt - RDAP12
 
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)
 
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
 

Similar a Dc sheridan dlf_2011_final

ESI Supplemental 1 E-research Support Slides
ESI Supplemental 1   E-research Support SlidesESI Supplemental 1   E-research Support Slides
ESI Supplemental 1 E-research Support SlidesDuraSpace
 
Institutional repository
Institutional repositoryInstitutional repository
Institutional repositoryWaqas Ahmed
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?Graham Pryor
 
2012 02 pre_hbs_grid_overview_ianstokesrees_pt2
2012 02 pre_hbs_grid_overview_ianstokesrees_pt22012 02 pre_hbs_grid_overview_ianstokesrees_pt2
2012 02 pre_hbs_grid_overview_ianstokesrees_pt2Boston Consulting Group
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesGeoffrey Fox
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesGeoffrey Fox
 
Relational
RelationalRelational
Relationaldieover
 
Crushing, Blending, and Stretching Data
Crushing, Blending, and Stretching DataCrushing, Blending, and Stretching Data
Crushing, Blending, and Stretching DataRay Schwartz
 
Rdap12 wrap up reagan moore
Rdap12 wrap up reagan mooreRdap12 wrap up reagan moore
Rdap12 wrap up reagan mooreASIS&T
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsGDi Techno Solutions
 
Sakai09 Repo Case Study
Sakai09 Repo Case StudySakai09 Repo Case Study
Sakai09 Repo Case Studyjrmdkc
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Jian Qin
 
MetadataTheory: Learning Repositories Technologies (9th of 10)
MetadataTheory: Learning Repositories Technologies (9th of 10)MetadataTheory: Learning Repositories Technologies (9th of 10)
MetadataTheory: Learning Repositories Technologies (9th of 10)Nikos Palavitsinis, PhD
 
RUresearch: Supporting the Management and Preservation of Research Data - Ale...
RUresearch: Supporting the Management and Preservation of Research Data - Ale...RUresearch: Supporting the Management and Preservation of Research Data - Ale...
RUresearch: Supporting the Management and Preservation of Research Data - Ale...ASIS&T
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web ServicesJose Enrique Ruiz
 
OAI7 Research Objects
OAI7 Research ObjectsOAI7 Research Objects
OAI7 Research Objectsseanb
 
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrLarge Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrGrant Ingersoll
 
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrLarge Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrGrant Ingersoll
 
Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017Dr. Anita Goel
 

Similar a Dc sheridan dlf_2011_final (20)

ESI Supplemental 1 E-research Support Slides
ESI Supplemental 1   E-research Support SlidesESI Supplemental 1   E-research Support Slides
ESI Supplemental 1 E-research Support Slides
 
Institutional repository
Institutional repositoryInstitutional repository
Institutional repository
 
Hawaii Pacific GIS Conference 2012: GIS in Education: K-12 and University - H...
Hawaii Pacific GIS Conference 2012: GIS in Education: K-12 and University - H...Hawaii Pacific GIS Conference 2012: GIS in Education: K-12 and University - H...
Hawaii Pacific GIS Conference 2012: GIS in Education: K-12 and University - H...
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?
 
2012 02 pre_hbs_grid_overview_ianstokesrees_pt2
2012 02 pre_hbs_grid_overview_ianstokesrees_pt22012 02 pre_hbs_grid_overview_ianstokesrees_pt2
2012 02 pre_hbs_grid_overview_ianstokesrees_pt2
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software Architectures
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software Architectures
 
Relational
RelationalRelational
Relational
 
Crushing, Blending, and Stretching Data
Crushing, Blending, and Stretching DataCrushing, Blending, and Stretching Data
Crushing, Blending, and Stretching Data
 
Rdap12 wrap up reagan moore
Rdap12 wrap up reagan mooreRdap12 wrap up reagan moore
Rdap12 wrap up reagan moore
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno Solutions
 
Sakai09 Repo Case Study
Sakai09 Repo Case StudySakai09 Repo Case Study
Sakai09 Repo Case Study
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08
 
MetadataTheory: Learning Repositories Technologies (9th of 10)
MetadataTheory: Learning Repositories Technologies (9th of 10)MetadataTheory: Learning Repositories Technologies (9th of 10)
MetadataTheory: Learning Repositories Technologies (9th of 10)
 
RUresearch: Supporting the Management and Preservation of Research Data - Ale...
RUresearch: Supporting the Management and Preservation of Research Data - Ale...RUresearch: Supporting the Management and Preservation of Research Data - Ale...
RUresearch: Supporting the Management and Preservation of Research Data - Ale...
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
 
OAI7 Research Objects
OAI7 Research ObjectsOAI7 Research Objects
OAI7 Research Objects
 
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrLarge Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
 
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and SolrLarge Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
Large Scale Search, Discovery and Analytics with Hadoop, Mahout and Solr
 
Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017
 

Último

TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIShubhangi Sonawane
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 

Último (20)

TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 

Dc sheridan dlf_2011_final

  • 1. Johns  Hopkins  University  Data   Management  Services   Sayeed  Choudhury  and  Barbara  Pralle   Digital  Library  Federation  –  October  31,  2011   Johns  Hopkins  University  Sheridan  Libraries  
  • 2. Powered  by  Data  Conservancy   •  JHU  Data  Management  Service  (DMS)   represents  the  culmination  of  two  years  of   research,  design,  development  and   implementation  of  Data  Conservancy   •  Service  launched  in  July  2011   •  DC  instance  launched  in  October  2011   •  Important,  essential  foundations  in  place   •  There  remains  work  to  be  done   Johns  Hopkins  University  Sheridan  Libraries  
  • 3. Data  Conservancy   •  The  Data  Conservancy  has  developed  a  blueprint   for  institutions  that  view  scientific  data  curation   as  a  means  to  collect,  organize,  validate  and   preserve  data  so  that  scientists  can  find  new   ways  to  address  the  grand  research  challenges   that  face  society.    
  • 4. Technology-­‐related  Accomplishments     •  Preservation-­‐based,  flexible  data  model   (inspired  by  PLANETS  project)   •  Demonstration  of  modularity  through  Archival   Storage  framework   •  Feature  Extraction  Framework   •  Demonstration  of  interoperability  with  NSIDC   glacier  photo  service,  arXiv,  IVOA  and  Sakai   •  Development  of  DCS  instance   Johns  Hopkins  University  Sheridan  Libraries  
  • 5. Notion  of  the  DCS  Instance  (1)   •  An  instance  of  the  DCS  (Data  Conservancy  System)   software  stack   •  A  deployment  infrastructure  (hardware,  servers,  etc.)   for  the  DCS   •  A  defined  (sub)set  of  DCS  services  to  be  exposed/ provided   •  A  defined/understood/expected  “context”  to  be   addressed  (e.g.  science  domain,  institutional  domain,     etc.)   5
  • 6. Notion  of  the  DCS  Instance  (2)   •  A  local  policy  framework  in  place  for  the  System   Operation   •  Personnel/staffing  to  setup,  manage  and  support   the  continued  operation  of  the  system   •  A  sustainability/business  plan  in  place  to  operate   the  instance  post  NSF  funding  (cost  model,  user   base,  etc.)   •  Key  integrators  such  as  spatial,  temporal  and   taxonomic  queries  or  data  replication   6
  • 7. Definition  of  Data  Preservation   •  “Data  preservation  involves  providing  enough   representation  information,  context,  metadata,   fixity,  etc.  such  that  someone  other  than  the   original  data  producer  can  use  and  interpret  the   data.”   -  Ruth  Duerr,  National  Snow  and  Ice  Data  Center   7
  • 8. Architecture  mapped  to  OAIS   Open  Archival  Information  System   Functional  Entities   Data  Conservancy  Service   Architecture  Block  Diagram   Johns  Hopkins  University  Sheridan  Libraries  
  • 9. Data  Model:  Research   •  Versions  vs.  (format)   migration   -  Modification  of  significant   properties  versus   transformation  of  data   •  Replication  and  provenance   (verifiable  snapshots)   •  Layered  data  model   -  Preservation  view,  Persistence   view,  Information  view  
  • 10. Data  Model:  Application   •  Multiple  Data  Models   •  Content  models  for   describing  the  contents  of  a   Manifestation   •  General  Model  used  to   correlate  model  entities   across  heterogeneous   datasets   -  geo-­‐reference,  time  of   observation,  etc…  
  • 11. Feature  Extraction  Framework:  Design   •  Must  accommodate  a   variety  of  data  formats   •  No  assumption  made   regarding  the  form  of  data   input  or  output   •  Not  coupled  to  a  specific   execution  model  
  • 12. Feature  Extraction  Framework:  Application   •  Subsetting   -  Returning  a  portion  of  a   dataset   •  Indexing   -  Output  suitable  for  indexing  by   the  Query  Framework   •  Workflows   -  Process  Orchestration,   Meandre,  Taverna,  Kepler   •  Execution  environment  for   analysis   -  Stateless  Mappings  basis  for   MapReduce  
  • 13. Implementation:  Archival  Storage   •  Responsible  for  long  term  storage    of  content  and  metadata  (AIP)   -  Evolving  storage  technologies     (including  cloud)   -  Policy  implementation  (e.g.  multiple  copies)   •  Archival  Storage  API  abstracts   underlying  technology   -  Fedora  (object  based),  ELM  (file  based)   •  Persistent  Storage  Framework  (Y2)   -  Instrumented  file  systems   -  File  semantics  over  block,  cloud,  content-­‐ addressed  storage   -  Storage  services  (e.g.  integrity  checking)  
  • 14. Data  Management  Layers   Layers   Examples   Implication  for  PI   Implication  relative   to  NSF   Curation   Future  JHU  Data  Archive   •  Feature  Extraction   •  Competitive   and  other  DCS  instances   •  New  query   advantage   capabilities   •  New   •  Cross-­‐disciplinary   opportunities   Preservation   JHU  Data  Archive   •  Ability  to  use  own   •  Satisfies  NSF   Portico   data  in  the  future   needs  across   ICPSR   (e.g.  5  yrs)   directorates   •  Data  sharing       Archiving   CUAHSI   •  Provides  identifiers   •  Could  satisfy   NEES   for  sharing,   most  NSF   Dataverse   references,  etc.   requirements   Storage   Server  in  Lab   •  Responsible  for:   •  Could  be  enough   Website   •  Restore   for  now  but  not   Amazon  S3   •  Sharing   near-­‐term  future   •  Staffing   Johns  Hopkins  University  Sheridan  Libraries  
  • 15. Defining  Sustainability   •  “Ensuring  that  valuable   digital  assets  will  be   available  for  future  use  is   not  simply  a  matter  of   finding  sufficient  funds.  It   is  about  mobilizing   resources—human,   technical,  and  financial— across  a  spectrum  of   stakeholders  diffuse  over   both  space  and  time.”  
  • 16. Establishing  the  JHU  DMS     •  May  2010  NSF  announces  DMP  expectations   •  Services  incubated  and  scoped  summer/fall  2010   -  Build  on  Data  Conservancy  expertise     •  Proposed  in  January  and  launched  in  July  2011   -  Consultative  data  management  planning  services  to   support  NSF  proposals   -  Post  award  data  management  services   Johns  Hopkins  University  Sheridan  Libraries  
  • 17. Resources  Needed  for  Launch     •  Talented  team  and  expertise   •  Flexible  process   •  Tools  that  support  service  provision   •  Systems  developed  by  the  DC  to  manage  data   •  Marketing  and  outreach  partners   •  Mechanism  for  supporting  financial  model   Johns  Hopkins  University  Sheridan  Libraries  
  • 18. People  and  Expertise     •  Create  data  management  consultant  position     •  Position  description  recognizes  diversity  of   experience  and  talent  that  can  support  service   •  Recruited  people  with  skills  that  match  with  the   JHU  DMS  objectives  and  services   •  Knowledge  transfer  through  hands  on  work  and   interactions  with  DC  partners   Johns  Hopkins  University  Sheridan  Libraries  
  • 19. Day  in  the  Life  of  a  JHU  DM  Consultant  (1)   •  Consultative  process  emulates  a  reference   interview   •  Adapt  to  PI  timeframe  and  deadlines   •  Gather  of  information,  identify  gaps,  understand   options,  prepare  and  iterate  plan   •  Support  better  data  management  by   encouraging  systematic  cultural  change  through   PI  interactions   Johns  Hopkins  University  Sheridan  Libraries  
  • 20. Day  in  the  Life  of  a  JHU  DM  Consultant  (2)   •  In-­‐depth  planning  after  award  received   •  Helping  PI  understand  service  levels  within  JHU   Data  Archive   •  Planning  and  preparing  data  for  deposit   •  Reviewing  data  management  plan  and   identifying  solutions  for  next  phase  of  data   management   Johns  Hopkins  University  Sheridan  Libraries  
  • 21. Challenges   •  Timing  of  consultative  support,  deadlines,  and   marketing  of  service   •  Building  awareness  of  data  management  value   •  Establishing  common  vocabulary  ex.  One  PIs   ‘storage’  is  another  PIs  ‘archiving’   •  Condensing  key  information  into  two  pages   •  Navigating  different  data  retention  policies   •  Responding  to  widely  ranging  domains   Johns  Hopkins  University  Sheridan  Libraries  
  • 22. Opportunities   •  Grow  the  researcher/grad  student   understanding  of  data  management   -  Support  good  stewardship  of  data  across  institution   •  Establish  an  archive  specifically  designed  for   data,  enabling  future  discovery  and  use     •  Build  our  collective  expertise  in  data   management   Johns  Hopkins  University  Sheridan  Libraries  
  • 23. Acknowledgements  and  Resources   •  NSF  Award  OCI-­‐0830976   •  Sheridan  Libraries  financial  support   •  Johns  Hopkins  University  financial  support   •  Elliot  Metsger  for  infrastructure  slides   •  Tim  DiLauro  for  inspiration  about  layers   •  Data  Conservancy  colleagues  for  their  exceptional  work   and  patience   •  http://dataconservancy.org   •  http://dmp.data.jhu.edu   Johns  Hopkins  University  Sheridan  Libraries