SlideShare a Scribd company logo
1 of 15
GETTING THE MOST OUT OF DATANET: A
PANEL DISCUSSION OF THE NSF FUNDED
DATANET PARTNERSHIPS

Robert H. McDonald – SEAD – Indiana University
Catherine Fitch – TerraPop – Minnesota Population Center
Richard Marciano – Datanet Federation Consortium – University of
North Carolina
Sayeed Choudhury – Data Conservancy – Johns Hopkins University
William Michener – DataOne – University of New Mexico


                 NSF DATANET PROGRAM-
            OFFICE OF CYBERINFRASTRUCTURE
DATANET ONLINE & TWITTER
 Twitter
    @SEADdatanet @dataconservancy @DateONEorg
 Web
    http://www.sead-data.net
    http://www.pop.umn.edu
    http://dataconservancy.org
    http://www.dataone.org
 Tagging
    #dlfforum
    #datanet
NSF DATANET PROGRAM
• DataNet efforts effectively balance:
  • Production infrastructure for operational data
    curation services
  • Research to create next generation data
    cyberininfrastructure
• DataNet awards are partnerships:
  • Responsive to user communities to define their
    meaningful and useful scope
  • Form a coordinated network to provide national,
    interdisciplinary data models and infrastructure
SEAD
Sustainable Environment – Actionable
Data
http://sead-data.net
@SEADdatanet


                                  #OCI0940824
SEAD TEAM

University of Michigan: Margaret Hedstrom (UM PI), Ann
Zimmerman (Co-PI and Project Manager), George Alter, Bryan
Beecher, Charles Severance, Karen Woollams, Jude Yew. Indiana
University: Beth Plale (IU PI), Katy Borner, Robert H.
McDonald, Kavitha Chandrasekar, Robert Ping, Stacy
Kowalczyk, Robert Light. University of Illinois: Praveen Kumar
(UIUC PI), Rob Kooper, Luigi Marini, Terry McLaren. Rensselaer
Polytechnic Institute: Jim Myers (RPI PI), Ram Prasanna Govind
Krishnan, Lindsay Todd, Adam Wilson.




                                                          #OCI0940824
SEAD PARTNERSHIP



                                  Beth Plale
Margaret Hedstrom, PI             Katy Börner
Ann Zimmerman                     Robert H. McDonald




        Praveen                             James Myers
        Kumar




           George Alter & Bryan Beecher
Sustainability
      Science

              Science


Cooperation               Technology




  Policy                  Economics

              Poverty &
               Justice




                                       7
Data
challenges
•   Heterogeneity
    of all kinds
•   Multiple scales
•   Multidisciplinary
•   Many small
    datasets
Provide innovative new
models and tools for
serving the long tail of
scientific research
SEAD’S GOALS
 Provide data services that address the pressing needs of
  researchers working toward sustainability
 Integrate these services into an generalizable “Active and
  Social Curation” infrastructure well-suited to the social
  structure and economics of long-tail research
  communities
 Develop capabilities to package and migrate datasets to
  a federated repository infrastructure for long-term
  preservation
 Education, outreach, & training, to maximize value and
  disseminate SEAD’s contributions to other projects and
  communities
SEAD’S STRATEGY

Move data curation upstream in the
 data life cycle
 • Involve domain scientists in setting
   priorities for evolution of data and
   services
 • Use a wide variety of mechanisms to
   remain resilient in a dynamic research
   and technology environment
ACTIVE AND SOCIAL
CURATION
• Engage researchers during projects, not at the
  end
• Use information that is automatically captured
  or generated through tools to reduce the costs
  of metadata collection and to capture its value
  in actionable form
• Further reduce costs by re-engineering curation
  processes to leverage this rich metadata and
  volunteered effort
ACTIVE CURATION MODEL

 Active Curation                     Social Media

                                                    Review
Workflows                                           Rating
                              Data                  Commenting




                   Metadata
SEAD LAYERCAKE VIEW
                                                  Network of Data
                                                    Producers


 Services over an
 active content layer
                                                 Web User Interface
 that is backed
                                              Active Content Repository
 by/harvested into a
                                                   Services Provided
 federated archive                    Content     Curation      Archival
                                                                  data
                                                                              Other
                                      Mining      Decisions                  services
 infrastructure based                                          generation


 on institutional                                  Virtual Archives

 resources                                      Institutional Repositories

                           Data          IU          RPI        UIUC         UM         ICPSR
                        Conservancy


                                                    User Network
ACKNOWLEDGMENTS

SEAD is funded by the National Science
Foundation under cooperative agreement
#OCI0940824

More Related Content

What's hot

Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...SEAD
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceRobert H. McDonald
 
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...SEAD
 
RDAP14: DataNet Federal Consortium Update
RDAP14: DataNet Federal Consortium Update RDAP14: DataNet Federal Consortium Update
RDAP14: DataNet Federal Consortium Update ASIS&T
 
Preservation, Publishing, and People: A SEAD View
Preservation, Publishing, and  People: A SEAD ViewPreservation, Publishing, and  People: A SEAD View
Preservation, Publishing, and People: A SEAD ViewInna Kouper
 
Repository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRepository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRobert H. McDonald
 
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...ASIS&T
 
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectASIS&T
 
Ignite@AGU14
Ignite@AGU14Ignite@AGU14
Ignite@AGU14SEAD
 
Research Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyResearch Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyRobin Rice
 
RDAP14: David Van Riper of Terra Populus
RDAP14: David Van Riper of Terra Populus RDAP14: David Van Riper of Terra Populus
RDAP14: David Van Riper of Terra Populus ASIS&T
 
Data discovery and sharing at UCLH
Data discovery and sharing at UCLHData discovery and sharing at UCLH
Data discovery and sharing at UCLHJisc
 
RDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for EarthRDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for EarthASIS&T
 
RDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data WorkRDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data WorkASIS&T
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodeiASIS&T
 

What's hot (20)

Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability Science
 
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
 
RDAP14: DataNet Federal Consortium Update
RDAP14: DataNet Federal Consortium Update RDAP14: DataNet Federal Consortium Update
RDAP14: DataNet Federal Consortium Update
 
Preservation, Publishing, and People: A SEAD View
Preservation, Publishing, and  People: A SEAD ViewPreservation, Publishing, and  People: A SEAD View
Preservation, Publishing, and People: A SEAD View
 
Repository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRepository Federation: Towards Data Interoperability
Repository Federation: Towards Data Interoperability
 
Wheeler & Benedict -- Enabling the Preservation Relay
Wheeler & Benedict -- Enabling the Preservation RelayWheeler & Benedict -- Enabling the Preservation Relay
Wheeler & Benedict -- Enabling the Preservation Relay
 
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
 
Engaging the Researcher in RDM
Engaging the Researcher in RDMEngaging the Researcher in RDM
Engaging the Researcher in RDM
 
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
 
Ignite@AGU14
Ignite@AGU14Ignite@AGU14
Ignite@AGU14
 
Research Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyResearch Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional Policy
 
RDAP14: David Van Riper of Terra Populus
RDAP14: David Van Riper of Terra Populus RDAP14: David Van Riper of Terra Populus
RDAP14: David Van Riper of Terra Populus
 
Data discovery and sharing at UCLH
Data discovery and sharing at UCLHData discovery and sharing at UCLH
Data discovery and sharing at UCLH
 
RDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for EarthRDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for Earth
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data WorkRDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodei
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 

Similar to Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)

CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...SEAD
 
SEAD: Sustainable Environment-Actionable Data - Robert McDonald - RDAP12
SEAD: Sustainable Environment-Actionable Data - Robert McDonald - RDAP12 SEAD: Sustainable Environment-Actionable Data - Robert McDonald - RDAP12
SEAD: Sustainable Environment-Actionable Data - Robert McDonald - RDAP12 ASIS&T
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data CommonsVivien Bonazzi
 
100503 bioinfo instsymp
100503 bioinfo instsymp100503 bioinfo instsymp
100503 bioinfo instsympNick Jones
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
Open Data is not Enough (final version)
Open Data is not Enough (final version)Open Data is not Enough (final version)
Open Data is not Enough (final version)Research Data Alliance
 
The NIH Commons: A Cloud-based Training Environment
The NIH Commons: A Cloud-based Training EnvironmentThe NIH Commons: A Cloud-based Training Environment
The NIH Commons: A Cloud-based Training EnvironmentPhilip Bourne
 
SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science Robert H. McDonald
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...Kathleen Jagodnik
 
Toward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemToward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemGlobus
 
BeSTGRID OpenGridForum 29 GIN session
BeSTGRID OpenGridForum 29 GIN sessionBeSTGRID OpenGridForum 29 GIN session
BeSTGRID OpenGridForum 29 GIN sessionNick Jones
 
ELIXIR . Technical Coordinator
ELIXIR. Technical CoordinatorELIXIR. Technical Coordinator
ELIXIR . Technical CoordinatorRafael C. Jimenez
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceGarethKnight
 
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...TERN Australia
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identificationguest453b14
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identificationguest453b14
 

Similar to Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011) (20)

CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
 
SEAD: Sustainable Environment-Actionable Data - Robert McDonald - RDAP12
SEAD: Sustainable Environment-Actionable Data - Robert McDonald - RDAP12 SEAD: Sustainable Environment-Actionable Data - Robert McDonald - RDAP12
SEAD: Sustainable Environment-Actionable Data - Robert McDonald - RDAP12
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
 
Or 2013-abrams-sharing-data-rich-research
Or 2013-abrams-sharing-data-rich-researchOr 2013-abrams-sharing-data-rich-research
Or 2013-abrams-sharing-data-rich-research
 
100503 bioinfo instsymp
100503 bioinfo instsymp100503 bioinfo instsymp
100503 bioinfo instsymp
 
100503 bioinfo instsymp
100503 bioinfo instsymp100503 bioinfo instsymp
100503 bioinfo instsymp
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Open Data is not Enough (final version)
Open Data is not Enough (final version)Open Data is not Enough (final version)
Open Data is not Enough (final version)
 
The NIH Commons: A Cloud-based Training Environment
The NIH Commons: A Cloud-based Training EnvironmentThe NIH Commons: A Cloud-based Training Environment
The NIH Commons: A Cloud-based Training Environment
 
Ndsa 2013-abrams-integrating-repositories-for-data-sharing
Ndsa 2013-abrams-integrating-repositories-for-data-sharingNdsa 2013-abrams-integrating-repositories-for-data-sharing
Ndsa 2013-abrams-integrating-repositories-for-data-sharing
 
SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
 
Toward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemToward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data Ecosystem
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
BeSTGRID OpenGridForum 29 GIN session
BeSTGRID OpenGridForum 29 GIN sessionBeSTGRID OpenGridForum 29 GIN session
BeSTGRID OpenGridForum 29 GIN session
 
ELIXIR . Technical Coordinator
ELIXIR. Technical CoordinatorELIXIR. Technical Coordinator
ELIXIR . Technical Coordinator
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
 
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 

More from SEAD

Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...SEAD
 
Practical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationPractical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationSEAD
 
Preservation, Publishing, and People: A SEAD View
Preservation, Publishing, and People: A SEAD ViewPreservation, Publishing, and People: A SEAD View
Preservation, Publishing, and People: A SEAD ViewSEAD
 
An Overview of Plans for SEAD
An Overview of Plans for SEADAn Overview of Plans for SEAD
An Overview of Plans for SEADSEAD
 
SEAD Prototype: Data Curation and Preservation for Sustainability Science
SEAD Prototype: Data Curation and Preservation for Sustainability ScienceSEAD Prototype: Data Curation and Preservation for Sustainability Science
SEAD Prototype: Data Curation and Preservation for Sustainability ScienceSEAD
 
SEAD: Opening Data in the "Long Tail" for Active and Social Curation
SEAD: Opening Data in the "Long Tail" for Active and Social CurationSEAD: Opening Data in the "Long Tail" for Active and Social Curation
SEAD: Opening Data in the "Long Tail" for Active and Social CurationSEAD
 
SEAD: A system to support social and active data curation
SEAD: A system to support social and active data curationSEAD: A system to support social and active data curation
SEAD: A system to support social and active data curationSEAD
 

More from SEAD (7)

Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
 
Practical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationPractical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object Preservation
 
Preservation, Publishing, and People: A SEAD View
Preservation, Publishing, and People: A SEAD ViewPreservation, Publishing, and People: A SEAD View
Preservation, Publishing, and People: A SEAD View
 
An Overview of Plans for SEAD
An Overview of Plans for SEADAn Overview of Plans for SEAD
An Overview of Plans for SEAD
 
SEAD Prototype: Data Curation and Preservation for Sustainability Science
SEAD Prototype: Data Curation and Preservation for Sustainability ScienceSEAD Prototype: Data Curation and Preservation for Sustainability Science
SEAD Prototype: Data Curation and Preservation for Sustainability Science
 
SEAD: Opening Data in the "Long Tail" for Active and Social Curation
SEAD: Opening Data in the "Long Tail" for Active and Social CurationSEAD: Opening Data in the "Long Tail" for Active and Social Curation
SEAD: Opening Data in the "Long Tail" for Active and Social Curation
 
SEAD: A system to support social and active data curation
SEAD: A system to support social and active data curationSEAD: A system to support social and active data curation
SEAD: A system to support social and active data curation
 

Recently uploaded

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 

Recently uploaded (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)

  • 1. GETTING THE MOST OUT OF DATANET: A PANEL DISCUSSION OF THE NSF FUNDED DATANET PARTNERSHIPS Robert H. McDonald – SEAD – Indiana University Catherine Fitch – TerraPop – Minnesota Population Center Richard Marciano – Datanet Federation Consortium – University of North Carolina Sayeed Choudhury – Data Conservancy – Johns Hopkins University William Michener – DataOne – University of New Mexico NSF DATANET PROGRAM- OFFICE OF CYBERINFRASTRUCTURE
  • 2. DATANET ONLINE & TWITTER  Twitter  @SEADdatanet @dataconservancy @DateONEorg  Web  http://www.sead-data.net  http://www.pop.umn.edu  http://dataconservancy.org  http://www.dataone.org  Tagging  #dlfforum  #datanet
  • 3. NSF DATANET PROGRAM • DataNet efforts effectively balance: • Production infrastructure for operational data curation services • Research to create next generation data cyberininfrastructure • DataNet awards are partnerships: • Responsive to user communities to define their meaningful and useful scope • Form a coordinated network to provide national, interdisciplinary data models and infrastructure
  • 4. SEAD Sustainable Environment – Actionable Data http://sead-data.net @SEADdatanet #OCI0940824
  • 5. SEAD TEAM University of Michigan: Margaret Hedstrom (UM PI), Ann Zimmerman (Co-PI and Project Manager), George Alter, Bryan Beecher, Charles Severance, Karen Woollams, Jude Yew. Indiana University: Beth Plale (IU PI), Katy Borner, Robert H. McDonald, Kavitha Chandrasekar, Robert Ping, Stacy Kowalczyk, Robert Light. University of Illinois: Praveen Kumar (UIUC PI), Rob Kooper, Luigi Marini, Terry McLaren. Rensselaer Polytechnic Institute: Jim Myers (RPI PI), Ram Prasanna Govind Krishnan, Lindsay Todd, Adam Wilson. #OCI0940824
  • 6. SEAD PARTNERSHIP Beth Plale Margaret Hedstrom, PI Katy Börner Ann Zimmerman Robert H. McDonald Praveen James Myers Kumar George Alter & Bryan Beecher
  • 7. Sustainability Science Science Cooperation Technology Policy Economics Poverty & Justice 7
  • 8. Data challenges • Heterogeneity of all kinds • Multiple scales • Multidisciplinary • Many small datasets
  • 9. Provide innovative new models and tools for serving the long tail of scientific research
  • 10. SEAD’S GOALS  Provide data services that address the pressing needs of researchers working toward sustainability  Integrate these services into an generalizable “Active and Social Curation” infrastructure well-suited to the social structure and economics of long-tail research communities  Develop capabilities to package and migrate datasets to a federated repository infrastructure for long-term preservation  Education, outreach, & training, to maximize value and disseminate SEAD’s contributions to other projects and communities
  • 11. SEAD’S STRATEGY Move data curation upstream in the data life cycle • Involve domain scientists in setting priorities for evolution of data and services • Use a wide variety of mechanisms to remain resilient in a dynamic research and technology environment
  • 12. ACTIVE AND SOCIAL CURATION • Engage researchers during projects, not at the end • Use information that is automatically captured or generated through tools to reduce the costs of metadata collection and to capture its value in actionable form • Further reduce costs by re-engineering curation processes to leverage this rich metadata and volunteered effort
  • 13. ACTIVE CURATION MODEL Active Curation Social Media Review Workflows Rating Data Commenting Metadata
  • 14. SEAD LAYERCAKE VIEW Network of Data Producers  Services over an active content layer Web User Interface that is backed Active Content Repository by/harvested into a Services Provided federated archive Content Curation Archival data Other Mining Decisions services infrastructure based generation on institutional Virtual Archives resources Institutional Repositories Data IU RPI UIUC UM ICPSR Conservancy User Network
  • 15. ACKNOWLEDGMENTS SEAD is funded by the National Science Foundation under cooperative agreement #OCI0940824

Editor's Notes

  1. Currently, these data are difficult to find, obtain, and use because people from disciplines across the natural and social sciences collect, describe and store their data in many different ways. These data could have significant value if it was possible to connect data collectors with potential users of data and if it was easy for individuals to search for, aggregate, and maintain valuable data for the long term.
  2. To expand a bit on the previous slide … We characterize the needs of sustainability scientists as a “long tail problem” where scientists need diverse data from multiple different sources that overlap in geographic coverage and time, but also have gaps in location, time, resolution, and types of measurements. The data are heterogeneous and vary in format, metadata, size, and quality. One of the biggest challenges we face is supporting diverse needs for heterogeneous data.Our strategies for coping with the diversity of data effectively are based several underlying principles for long tail phenomena: While the aggregate demand for SEAD’s service is large and growing, demand for any particular collection of data is small and focused. Therefore, the investments SEAD makes in any particular set of data have to be quite low. Deciding which data merits investment in curation should be driven by its value to the community and its potential for productive use. Building on a strong foundation of existing infrastructure, collaborative relationships, and expertise, the SEAD team will be able to tackle challenging problems in the long tail with innovative, forwarding-looking and outward facing approaches.
  3. Mention something about the 18-month prototype and that the tasks during this time-frame focus on the first 3 bullets.
  4. Additional text for bullet 1: Provide tools and services that provide benefits to data providers during active projects Provide tools and services that allow data users to collaboratively curate data
  5. We will build usable and useful tools that scientists can take advantage of as they collect, generate and organize data in their active projects. This Active Curation approach will be designed with a great deal of user input to make sure that the tools are light-weight, easy to learn, easy to use, and more effective than the painstaking, hand-crafted approach that many sustainability scientists use today. The Active Curation approach will make data management easier for data producers and lower the curation costs to SEAD.Another part of our strategy is to deploy a variety of social networking and social-media inspired tools to engage the community of data producers and users. These include tools for annotation, rating and commentary on data sets, visualizations of publication and citation networks that map the invisible college of sustainability science researchers, and social networking tools that help build network effects.  We have designed our program with multiple mechanisms to encourage participation in SEAD and adoption of its approach. These include domain engagement workshops to surface needs and requirements, ensure usability of tools, and enlisting key leaders in sustainability as early adopters and promoters of SEAD. These strategies along with support for centralized curation services, education, outreach and training will create a model for sustainable access and preservation of heterogeneous data for sustainability science and other small science disciplines in the long tail.
  6. Robert, I wanted to illustrate the long-term repository piece, but couldn’t find anything very good from previous slides. I put this in for now, but you may have something better.