This document discusses the need to manage and preserve research data. It notes that scholars increasingly generate large amounts of data in their work but that support for managing, storing, and ensuring future access to research data is still developing. It argues that all researchers, regardless of field or the size of their work, should consider how to ensure their data can be discovered, understood, and reused over time. Saving and caring for research data properly requires cooperation across researchers, libraries, and other stakeholders.
2. Cyberinfrastructure
[Word-cloud slide: Grid computing · Data mining · Terabytes · Petabytes · Exabytes · E-Research or E-Science · Collaboration · Identity · Data Curation · Metadata · Standards · IT? Faculty? Libraries?]
9. What I will not talk about today
• Collaboration technology
• Identity management, authentication, authorization, etc.
• Grid computing
• Instrument science
• Open Notebook Science
Of course these are important.
I'm just not competent to opine.
Fortunately, you have Melissa!
14. In case you're wondering...
"Converting PDF to XML is a bit like converting hamburgers into cows."
—Michael Kay
<http://lists.xml.org/archives/xml-dev/200607/msg00509.html>
15. Do we have to keep data?
SOMETIMES.
(but it's often a good idea even if you don't have to)
20. What can be done with data?
• Experimental validation
• Meta-analysis, data-mining, mashups
• Interdisciplinary investigation
• Historical investigation
• Modeling and model validation
• ... the possibilities are endless -- IF we have the cows (the data).
32. Librarians
["But what I see happening is..." -- slide quotation garbled in extraction; its recoverable fragments speak of the "hybrid" people who help others find, access, and understand information and make it usable, and of librarians' part in that success.]
33. Grant administrators
Cows don't corral themselves.
Neither do researchers.
34. The big gray area
Informaticists?
Researchers who code?
IT pros who grok metadata?
Librarians who model data?
40. Ten Questions
1. What is the story of your data?
2. What form and format are the data in?
3. What is the expected lifecycle of your data?
4. How could your data be used, reused, and repurposed?
5. How large is your dataset, and what is its rate of
growth?
6. Who are the potential audiences for your data?
7. Who owns the data?
8. Does the dataset include any sensitive information?
9. What publications or discoveries have resulted from the
data?
10. How should the data be made accessible?
—Michael Witt and Jake Carlson, Purdue University
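As a thought experiment, the answers to Witt and Carlson's ten questions could travel alongside the dataset itself as a small machine-readable record. A minimal sketch in Python; every field name and value below is illustrative, not any standard:

```python
import json

# Hypothetical answers to the ten data-interview questions, kept as a
# plain dictionary so the record can live next to the data it describes.
data_record = {
    "story": "Water-quality samples from three Wisconsin lakes, 2005-2008",
    "form_and_format": "CSV tables exported from lab instruments",
    "expected_lifecycle": "active analysis 3 years, archival retention 10+",
    "reuse_potential": ["meta-analysis", "historical comparison"],
    "size_and_growth": {"size_gb": 12, "growth_gb_per_year": 4},
    "audiences": ["limnologists", "state environmental agencies"],
    "owner": "PI and home institution (check grant terms)",
    "sensitive_information": False,
    "resulting_publications": [],
    "access_method": "public download after embargo",
}

# Serializing the record keeps the context and the bits together.
print(json.dumps(data_record, indent=2))
```

A record like this is exactly the header row that, chopped off, would render the dataset meaningless later.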
Good morning, and thank you for coming. My name is Dorothea Salo, and I work for the University of Wisconsin System as an odd sort of digital archivist. I do have strong interests in the area of cyberinfrastructure, as I hope to prove to you today, and so Melissa asked me to come here and talk to you a little bit about my angle on the whole cyberinfrastructure thing. And I promise you will understand the title by the time I'm done talking. Cross my heart.
So, when we say the word cyberinfrastructure, some of the first things that come to mind are grid computing, in which we throw a whole lot of little computers working together at huge, massive computational problems, and data mining, in which we throw those computing resources at huge amounts of data on a scale we could never have considered before. (CLICK) Of course, these processes create new data. Terabytes and petabytes of it. And now all the librarians listening to me are wincing, because our shock-and-awe sensors tripped as soon as you could fit the Library of Alexandria on a USB thumb drive, you know what I'm saying? (CLICK) And then the grid computing people start tossing around exabytes, and look, my brain just shuts down. (CLICK) In the UK, what we call cyberinfrastructure is often called "e-science." This, of course, betrays an assumption. (CLICK) So we don't use "e-science" here, because it's not just the physicists and the astronomers and the climatologists; (CLICK) we say "e-research" instead, because it's certainly true that the social sciences, the arts, and the humanities are joining the party too. And with that, we add concerns over collaboration, especially across institutions and across disciplines -- and doing cross-disciplinary collaboration creates sticky issues around identity and authorization, and it all gets very evil and nasty and complicated very quickly. (CLICK) And while we're at it, let's not forget the data I mentioned. An emerging professional specialty, though exactly *where* it's emerging is a really good question, is that of data curation. This brings up questions of metadata, a thing dear to librarian hearts that just made the IT professionals here cringe, and data standards. We have a few of those, in a few disciplines, but not nearly enough, and unstandardized, non-uniform data is something that I think we can all agree makes us ALL cringe! (CLICK) And then there's the question of who's going to do data curation.
Is it an IT function? Are faculty responsible? After all, it's their data! And what about those libraries? (CLICK) And by this time much screaming has ensued and much hair is being torn out. Not least because wow, that is one ugly, ugly slide.
Scholars are using computers, in a number of different form factors, including big old server racks like this one, in their research. This, I am sure, is not news to anyone!
All this computation produces data, sometimes as the point of the exercise, sometimes as a sort of side effect. Data takes all kinds of forms; it's not just numbers. Word-clouds, scanned manuscripts, maps, images on wildly different scales -- it's all bits and bytes; it's all reusable and recomputable -- it's all data!
This is in addition to the books and journals that librarians are familiar with and already care for. Interestingly, as these materials go digital themselves (CLICK), they too can be treated as data, as grist for the computational mill. This doesn't happen as much as it should, honestly, and the reason for that is that even when these materials are digital, they're locked up behind pay-access firewalls to protect the current scholarly-publishing business model, so the computers can't get in to crunch on them. This is a major argument for open access to the literature -- and for those of you who know me and what I do, I hereby reassure you that it's the only open-access argument I'm going to make in this presentation. So to recap a bit, we have our researchers, and they're using computers, and they're generating data.
And that support, librarians, has to happen throughout the entire data lifecycle. And that support, IT professionals, is absolutely not limited to providing computational horsepower and storage. And that support, scholars and researchers, has to include verification and documentation of data-gathering methods, so that everyone knows that everything's on the level, and it's got to include ways to refer back to other people's data that you've used; that's what I mean by "certification" here.
So that's the cyberinfrastructure puzzle as I see it. There are large swathes of it that I'm not going to talk about today...
Now here we are. This is data, right? Nice bar graphs and charts, with a nice key in the corner; you can imagine this on a web page or equally well on a print journal page. (CLICK) NO. No. Not data. This is not data in the sense I mean it.
For optimum reusability, we need to save data before it's distilled into charts and graphs and tables. We need to save the cows before they become hamburger!
So in tight budget times, a very good question to ask is whether it's actually necessary to solve this problem. Even if it is, do we have to solve it now? Do we have to keep all these data? (CLICK) The answer is a resounding -- sometimes. But I do want to add that even when it's not absolutely required, it's often a really good idea. On the Madison campus, we have collected a number of stories of researchers who wish they'd done a better job keeping their data, because a new use turned up for it, often years or decades later! So in what cases is it mandatory?
(mention NIH, distinguish articles from data)
Most of the funders requiring open data are in Europe at the moment, but that's not true of journals. I can't give you a laundry list, because it's very discipline-dependent and also very volatile, but we are seeing more and more science journals instituting data-retention policies. Now, the ones I've seen have usually been time-limited; five or ten years is common. My question is this: if you're going to do it for five or ten years, why not plan for longer? Sure, it makes sense to reassess every now and again, because some datasets do become obsolete. But don't let your thinking be governed by journal requirements; most of the work of keeping a dataset happens before the bits hit storage, so keeping them longer often costs very little at the margin.
There's nothing stopping a journal or a funder from creating an unfunded mandate to keep and preserve data. A few have. And we, collectively, researchers and librarians and IT professionals, are left dangling on the hook, figuring out how to comply. Okay. So that's the stick. Now for the carrot. We're keeping all these data. Why? What's the use?
I've answered this already, for those who were listening at the beginning, but for anybody who came late, and just to reiterate: there's an image of cyberinfrastructure that assumes it's all about the Higgs bosons of this world. Physics, astronomy, and biomedicine. That's who's got all the data, just like they've got all the money.
A broader concern is so-called "small science," which is science without the big bucks -- which is, frankly, most scientists, not that that surprises anyone. The big guns have mostly worked out their data issues, as I've said. The small-science folks -- a lot of them hardly seem to know where to begin. (CLICK) And the sting in the tail here is that there are a lot MORE small-science researchers than big-science ones. This means that if you pile up all their data, there's probably a lot more of it! Each individual data herd is pretty small by comparison with the Large Hadron Collider, granted. But add all those herds together, and we are talking a LOT of cows.
And my dearest loves, the arts and humanities, are hardly devoid of data. A digitized image is data. A digitized book is data, and can be computed upon. The performing arts are pushing out huge amounts of audio and video -- and while we're talking storage capacity, digital video is an unbelievable headache because of file sizes. I like to think about folklorists and ethnographers while I consider digital data in the arts and humanities. Anything you can imagine is grist for their analysis mill, and yes, they are both analyzing digital data and recording their conclusions digitally. So we've all got data, one way or another.
And here's the other thing... We don't have a service-provision model for this. Not in libraries. Not in IT. Not in most regular research practice. Nobody's sure how it's going to get done yet. This is part of why I'm here today. UW Milwaukee is busily trying to sort out how to do all this, in addition to all the other cyberinfrastructure-related things I told you at the beginning I wasn't going to talk about.
We know that apathy is not a solution. And here we often hear someone grumbling that if this were all just paper, it'd be fine; it's this stupid digital stuff that's the problem. Leaving aside that data on paper are completely useless as data, we shouldn't ignore the incredibly complex safety net that libraries have built around paper. Paper doesn't preserve itself either; librarians preserve it! Digital data are no different. We have to take intentional action to keep data viable.
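One concrete form that intentional action takes is routine fixity checking: record a checksum for each file when it enters storage, then recompute and compare later to catch silent corruption. A minimal sketch in Python (the manifest format here is my own invention, not any preservation standard):

```python
import hashlib
from pathlib import Path

def sha256_of(path: Path) -> str:
    """Compute a SHA-256 checksum, reading in chunks to handle large files."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()

def make_manifest(root: Path) -> dict[str, str]:
    """Record a checksum for every file under root (the deposit step)."""
    return {str(p.relative_to(root)): sha256_of(p)
            for p in sorted(root.rglob("*")) if p.is_file()}

def verify(root: Path, manifest: dict[str, str]) -> list[str]:
    """Return the files whose bits no longer match the recorded checksums."""
    return [name for name, digest in manifest.items()
            if sha256_of(root / name) != digest]
```

Run `make_manifest` at deposit time, `verify` on a schedule; a non-empty result means it's time to restore from a second copy, which is why preservation always means more than one copy.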
Right, so who's "we"? Okay. Show of hands. Librarians? IT pros? Faculty and researchers? Research support, grant administrators and the like? Right. If you raised your hand at any point, part of this is probably your problem. Which part, I don't know, and anybody who tells you they know is lying and probably trying to sell you something.
So, can you tell a Holstein from an Angus? (I'm just going to die if there's a dairy researcher in the room.) (CLICK) No, I can't either. I can tell you that the Anguses are on the left, because I dug up the photos, but I swear that's the only reason I know. The point of this little parable is that we know absolutely that data curation can't happen without researchers helping and cooperating with other people in the village. This is because data without context and interpretation are meaningless, like a spreadsheet with the header row chopped off -- and researchers are the people with the context and with the ability to interpret. Librarians and IT pros don't automatically understand how a given dataset fits together, how it was created, how other people will expect to search for it or use it, what different parts of it even MEAN. Researchers will have to learn to express these things, if they don't already know how!
IT pros, you're going to be running the big iron. No surprises there. But there are surprises for you in this, such as time horizons you're not used to, mass file-format migrations, and metadata -- internal, external, and relational -- that we can hardly imagine yet... and so on. Don't panic, we're all in this together, and we have examples to work from, especially at the larger scales -- but by the same token, don't make the mistake of thinking you can just sail in and solve this one. It's complicated.
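A first step toward planning those format migrations is simply knowing what formats you hold. A rough sketch, using file extensions as a crude proxy (real format-identification tools go by file signatures, not names):

```python
from collections import Counter
from pathlib import Path

def format_inventory(root: Path) -> Counter:
    """Tally file extensions under root as a rough proxy for file formats."""
    return Counter(p.suffix.lower() or "(no extension)"
                   for p in root.rglob("*") if p.is_file())

# The most common extensions show where migration effort will go first.
```

An inventory like this is only reconnaissance, but it turns "migrate everything someday" into a ranked to-do list.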
Librarians, this is your call to arms. Step up and sit at the table, or the table is going to forget that we exist. This isn't good for the table, and it's not good for us, either. Sure, we're used to dealing with the published literature, and we're fond of its authority and finality. (CLICK) But we're going to have to look earlier in the lifecycle for our greatest impact.
And then there's the big gray area. When I said I didn't know who would do all this? This is what I meant. Some researchers say that the solution is to teach themselves -- or up-and-coming newcomers -- information-management skills so that they become informaticists. Some researchers say that the answer is for researchers to learn to code. All of this will probably happen, in some fields and at some levels. I don't know how it will all shake out, in the long run. But cross-functional training, no matter what end of the research enterprise you're on, is probably the wave of the future.
Infrastructure is more than computers. It's also a policy-and-procedures infrastructure, without which none of this can happen. And finally, as I dearly hope I've made clear, infrastructure is people. Fancy supercomputers aren't worth a penny without people to use them, care for them, and take care of what they compute.
Everyone in this room can do this, and I hope you will. So -- what do you say?
mention Educause
I used so many Creative Commons-licensed photos that I have to actually roll the credits here... while that's happening, let me ask if there are any questions?