3. Autodidactic Librarian’s Guide to Learning Data Science
Photo by John (cygnus921) http://www.flickr.com/photos/cygnus921/2678359760
4. WHAT?
Basics of Data
Software
Library Components
Photo by Johnathan Nightengale http://www.flickr.com/photos/johnath/7270799658/
5. Gold, Anna. Cyberinfrastructure, Data, and
Librarians: Parts 1 & 2. D-Lib.
September/October 2007:13(9/10).
Salo, Dorothea. Retooling Libraries for the
Data Challenge. Ariadne. July 2010:64.
2012 Priscilla Mayden Lecture from the
University of Utah: eScience and the
Evolution of Library Services
For advanced readings, review Dorothea
Salo’s 855 syllabus.
http://dsalo.info/wp-content/uploads/2011/05/855syll2012.pdf
16. Librarian Research Data
Data Management Plans
IRB
Metadata
Reuse potential
Data Discovery
17. Desk Metrics
Locate
Digitize/clean
Provide key/metadata
Visualizations
Store and Publish
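As a rough illustration of the Desk Metrics steps above (clean, provide key/metadata, summarize for a visualization, store and publish), here is a minimal Python sketch. Everything in it is an assumption made for the example: the sample tallies, the column names, the data dictionary wording, and the helper names clean, totals_by_type, and publish are hypothetical and do not come from any real desk statistics system.

```python
import csv
import io
import json
from collections import Counter

# Hypothetical hand-keyed desk tallies; columns and values are invented for illustration.
RAW = """date,question_type,count
2012-09-04, Reference ,3
2012-09-04,directional,5
2012-09-05,REFERENCE,2
2012-09-05,directional,
"""

# Key/metadata step: a small data dictionary describing each column (assumed wording).
DATA_DICTIONARY = {
    "date": "Day the tally was recorded, ISO 8601 (YYYY-MM-DD)",
    "question_type": "Category of desk question, lower case",
    "count": "Number of questions tallied; blank entries treated as 0",
}


def clean(raw_text):
    """Trim whitespace, lower-case the category, and turn blank counts into 0."""
    rows = []
    for row in csv.DictReader(io.StringIO(raw_text)):
        rows.append({
            "date": row["date"].strip(),
            "question_type": row["question_type"].strip().lower(),
            "count": int(row["count"].strip() or 0),
        })
    return rows


def totals_by_type(rows):
    """Sum counts per question type, ready to hand to a charting tool."""
    totals = Counter()
    for row in rows:
        totals[row["question_type"]] += row["count"]
    return dict(totals)


def publish(rows):
    """Return the cleaned table as CSV text and the data dictionary as JSON text."""
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=["date", "question_type", "count"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue(), json.dumps(DATA_DICTIONARY, indent=2)


rows = clean(RAW)
summary = totals_by_type(rows)
cleaned_csv, metadata_json = publish(rows)
print(summary)
```

The cleaned CSV and the JSON data dictionary would travel together when the data is stored or published, so that someone reusing the numbers can tell what each column means.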
18. Citations
Slide 3: Photo by John (cygnus921) http://www.flickr.com/photos/cygnus921/2678359760
Slide 4: Photo by Johnathan Nightengale http://www.flickr.com/photos/johnath/7270799658/
Slide 5: Gold, Anna. Cyberinfrastructure, Data, and Librarians: Parts 1 & 2. D-Lib.
September/October 2007:13(9/10).
Salo, Dorothea. Retooling Libraries for the Data Challenge. Ariadne. July 2010:64.
2012 Priscilla Mayden Lecture from the University of Utah: eScience and the Evolution
of Library Services http://library.med.utah.edu/or/pmayden/home.php
Salo, Dorothea. Syllabus: LIS 855, Digital Curation.
http://dsalo.info/wp-content/uploads/2011/05/855syll2012.pdf
Slide 6: Stuart, David. Facilitating Access to the Web of Data: a Guide for Librarians. October 2011:
Facet Publishing. London.
Milton, Michael. Head First Excel. 2010: O’Reilly. Sebastopol, CA.
Caldwell, Sally. Statistics Unplugged. 3rd Ed. 2010: Cengage Learning. Wadsworth, MA.
Slide 7: http://www.icpsr.umich.edu/icpsrweb/ICPSR/
http://www.arl.org/sparc/
http://www.ala.org/lita/
http://www.asis.org/Conferences/RDAP11/
http://www.iassistdata.org/
http://www.ala.org/alcts/
http://strataconf.com/stratany2012
Slide 8: http://datacurationprofiles.org/
http://www.data.gov/
https://dmp.cdlib.org/
http://www.wolframalpha.com/
http://www.gapminder.org/
http://databib.org
http://code.google.com/p/google-refine/
Why am I on this stage?
*AAAS Conference
*Started at UIC in Dec 2010 (Public Small Town to Urban R1 Medical)
*Just following the NSF DMP Mandate
*Database Designer: been working with data, cleaning, building, so this appealed.
*Started seeing data-related things turning up in job ads
*Who is doing it in libraries? GIS, IT, and Catalogers (linked library data, FRBR, RDA)
So I’m supposed to be at least slightly autodidactic, right? I dug, I read, I listened. Today we’re going to go through the what, where, and who that I’ve identified thus far. NOT comprehensive, but a launch point for you or someone at your institution who is trying to get up to speed.
Basics of Data:
*Comes in all shapes and sizes
*Clean
*Good?
*Good doesn’t always mean complex, or vice versa
*I charge by the hour to clean data
*Also includes STATISTICS and DATA ANALYSIS
*We don’t all need to be statisticians
Software: SPSS/SAS/R, Mathematica (what’s behind Wolfram Alpha), and Advanced Excel. Excel also works for grocery lists, though it’s harder to do calculations on those tables.
Library Components:
*Metadata and Ontologies
*Sources of Data
*Data Management Plans
Basically: Find/Acquire, Describe, and Provide Access. Gee, now doesn’t that sound familiar….
Metadata: a lot easier to find, since we’ve been doing that for much longer; now it’s about finding content-appropriate ontologies.
Sources: the best thing I can recommend is to go out and look. I’ll give you some links; go digging around in other people’s LibGuides. E.g., Federal Government – alcohol.
DMPs are a natural for us.