This presentation was jointly given by Kevin Read and Alisa Surkis of New York University during the two-part NISO webinar, Digital and Data Literacy, held on September 20, 2017.
3. Data-related Competencies
National Postdoctoral Association
● Establishing data back up protocols
● Sharing of data with collaborators
● Ownership and access to data
Clinical and Translational Science Awards Core Competencies
● Implement quality assurance systems with control procedures for data intake,
management, and monitoring for different study designs.
NIH Rigor and Reproducibility Guidelines
● Require formal instruction in scientific rigor and transparency to enhance
reproducibility for all individuals supported by institutional training grants
4. Training Shortcomings
Lack of data management training
● Barone L, Williams J, Micklos D. Unmet Needs for Analyzing Biological Big
Data: A Survey of 704 NSF Principal Investigators. bioRxiv. 2017.
Need for improved training in data sharing, management and collection practices
● Jahnke L, Asher A, Henry C, Keralis SDC. The Problem of Data. Council on
Library and Information Resources. 2012:1-43.
6. NLM New Data Hub at NIH
“NLM is now poised to build on its activities in computational-based research, data
dissemination, and training to assume the NIH leadership role in data science.”
- Advisory Committee to the NIH Director
“The challenges of big data—its size, variability, and accessibility—align with the
strengths of the library. “
- NLM Director, Patti Brennan
7. Widespread data management training
● Basic science institute
● Postdoctoral scholars
● Clinical and translational science trainees
● Individual research groups
Research Data Management @ NYU Langone Health
8. Required 1-credit course for first-year PhD, MD/PhD students
Titled: Fundamental Skills and Tools for Research
Includes:
● Research data management
● Rigor and reproducibility
● Open science/data
● Data visualization
● Git and GitHub
Research Data Management @ NYU Langone Health
9. Data Day to Day Series
● 8-12+ data classes offered 3 times/year
● Mix of library-taught, and classes offered by collaborators
10. Classes offered
Summer 2016 Fall 2016 Spring 2017 Summer 2017
Data Visualization (2) REDCap (2) Intro to R (2) Intro to R (2)
RDM Essentials Data Visualization Data Visualization (2) Data viz w/ggplot2
Intro to REDCap & i2b2 Qualitative Study Designs REDCap (2) Data Viz w/GraphPad Prism (3)
Qualitative Data Analysis QGIS (2) Git & GitHub Git & GitHub
Big Data in Medicine Qualitative Interviews RDM Workshop REDCap (3)
Data Wrangling Intro to R Data Saturation in Qualitative Res. Regression Analysis
Clinical Data Management Research w/ Limited English Pop Research w/Limited English Pop
Hypothesis Testing in R Survey Design
High Performance Computing
Machine Learning
SPSS
12. Continuing education classes for medical librarians
NIH Big Data to Knowledge training grant
National Network of Libraries of Medicine funded pilot
Data Management Training for Medical Librarians
13. Learning Objectives:
● Current roles libraries play in research data management (RDM)
● Research process
● Differences between clinical and bench science researchers and their RDM
needs
● Current climate around data management and sharing
● Best practices in data documentation and description
● Relevance of standards to data management
● Issues in storage, preservation, and sharing of data
Continuing Education for Medical Librarians
14. NIH Big Data to Knowledge Initiative
● Launched in 2014 to facilitate broad use of biomedical big data
● Develop and disseminate analysis methods and software
● Enhance training relevant for large-scale data analysis
● Establish centers of excellence for biomedical big data
Awarded training grant R25 LM012283 2015-2017
● Preparing Medical Librarians to Understand and Teach Research Data
Management
15. Online Librarian Training
Practice of research
Story of data
Fundamentals of RDM
Cohort of biomedical librarians
Varied teaching experience
Slides + script
Evaluation methods
“Edutainment” videos
Observing librarians teach
Follow-up interviews with
librarians
Knowledge gain
Satisfaction
Intent to practice
Dissemination to
Biomedical Library
Community
Piloting
In-person
Curriculum
Knowledge gain & satisfaction
Comfort level/Intent to teach
Qualitative follow-up
Librarians Teaching Researchers
Librarian
Evaluation
Piloting
Web-based
Curriculum
Evaluation
Researcher
17. Variable
Name
Field type Description Choices
subject_id Text Unique identifier for each
subject
last_name Text Subject last name
first_name Text Subject first name
birth_date Date Subject birth date
sex Integer Subject sex
RDM Best Practices
18. Variable
Name
Field type Description Choices
subject_id Text Unique identifier for each
subject
last_name Text Subject last name
first_name Text Subject first name
birth_date Date Subject birth date
sex Integer Subject sex
DNA by pedro baños cancer https://thenounproject.com/search/?q=genome&i=182407
lab flasks by Carla Dias https://thenounproject.com/search/?q=lab+animal&i=546042
RDM Best Practices Understanding Research Context
40. Health Science Libraries
Data Community
NNLM
Region
NYU
Langone
SUNY
Stony
Brook
University
of Buffalo
Temple
University
Drexel
University
University
of
Delaware
41. Health Science Libraries
Data Community
NNLM
Region
NYU
Langone
SUNY
Stony
Brook
University
of Buffalo
Temple
University
Drexel
University
University
of
Delaware
42. Project Goals
Support health sciences libraries in developing a viable research data management
component to their service offerings
Establish a community of librarians and libraries offering data services
43. Project Components
Training
Online Modules: http://bit.ly/RDM_Modules
Service
Teaching Toolkit: http://bit.ly/2s1Nmqg
Data Interviews: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4511052/
Data Series: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5370612/
45. Philosophy of why we designed the interviews
Necessary for gaining a better understanding of researchers data management
challenges and opportunities
Identified researchers conducting basic science and clinical research
Used active grant system to identify a range of researchers
47. Effectiveness
Associates library with data
Helps librarians understand the language and perceptions of researchers
Led to:
● New training opportunities
● Better context and understanding when communicating with researchers about
their data
● New strategies for developing library services and resources
49. Why a data series?
● Creates/strengthens partnership with data expertise across institution
● Wider range of classes draws more attention to library classes
● Focus marketing efforts
50. How/who we’ve partnered with
● Institute for Innovations in Medical Education
● Department of Population Health
● College of Nursing
● Medical Center IT
● DataCore
● Main Campus Library Data Services