Building data infrastructures for science
Vince Smith

Informatics Horizons, London
24 July 2013
Overview

1. (my) Background
• Lice to data infrastructures!
• Why data infrastructures at the NHM

2. Building data infrastructures
• Recent core investment in NHM infrastructures
• Leveraging external investment in NHM infrastructures
• Infrastructure design principles & coordination

3. NHM 5-year data infrastructure horizons
• Collections digitisation
• Large-scale use of collections data
• New approaches to biodiversity discovery

4. Decadal community infrastructure challenges
• The long view – science data strategies
• Data modeling and real time monitoring as a unifying theme
1. (my) Background
Lice to data infrastructures!

Systematics (circa 1998)
- No high level keys
- Poor high level taxonomy
- Just one phylogeny
- Few living experts!

Circa 5,000 spp.
Mammals & birds
12,000 associations
15,000 potential hosts
My data infrastructure (circa 1998)
- Taxonomic names
- Authorities (name concepts)
- Citations
- Collection data
- Morphological characters
- Textual descriptions
- Diagnostic keys
- Illustrations
- Photographs

Palma, R.L., and R.L.C. Pilgrim. 2002. A revision of the genus Naubates (Insecta: Phthiraptera: Philopteridae). J. R. Soc. N.Z. 32:760.

142 pieces of “raw” data in 4 of 54 pages, in 1 of 9,110 taxonomic papers on lice

“The bane of my existence is doing things that I know the computer could do for me”
-- Dan Connolly, The XML Revolution (Nature, 1998)
My data infrastructures (circa 2004)

Images

Specimens

(SID)

LouseBASE

Glasgow version at:
http://darwin.zoology.gla.ac.uk/~rpage/LouseBase/2/

Lab Notebook

Literature

http://darwin.zoology.gla.ac.uk/~SID/

Host-Parasite Checklists

PHPBib
http://www2.flmnh.ufl.edu/pdb/

http://myphpbib.sourceforge.net/

http://www2.flmnh.ufl.edu/adb/
My publications in 2004 (enabled by these infrastructures)
Making louse research more efficient, more collaborative and more productive

Biol. Letters

Zoo. Scripta

Syst. Biol.

Specimens

Grzimek’s Ency.

Mol. Phyl. Evol.

Images

Ent. Abh.

Proc. R. Soc. B

Lab Notebooks

PLoS Biology

Science

Literature

Checklists
Why data infrastructures at the NHM: lots of potential

Card indices

Library

Archives

Staff

Frozen Tissue

Labels

Slides

Spirit

Dry
2. Building data infrastructures
Recent NHM investment in science data infrastructures

1. KE EMu (collections data)
• Improved interface (speed, complexity, data quality, support)
• Rapid Data Entry Web-Interface
• Improved import & export functionality (CLD & data portal)

2. DAMS (multimedia) ?
• Review (Digital Strategy Group)

3. NHM Virtual Library (literature)
• Integrated search & discovery of NHM resources
• Better integration with external resources

4. NHM Data Portal (access, citation & archival)
• Discovery & visualisation of collections data on the Web
• Web exposure & archival of NHM research datasets
• Sub-portals for collaborative projects
• As strategically important as the Web in 3 years' time!
Enabling the NHM mission?
Collections

Public Engagement

Research
What are Scratchpads? (http://scratchpads.eu)
External investment in science data infrastructures
1. ViBRANT (EU FP7 Infrastructures, 17 partners, €4.75M)
• Virtual Biodiversity Research & Access Network for Taxonomy
• Building & integrating tools supporting biodiversity research communities
(publishing, literature & vocabulary management, ID keys, conservation assessments,
mapping & visualisation tools, citizen science support)

2. e-Monocot (NERC Consortium; Kew, Oxford & NHM, £2.38M)
• Sustainable, integrated resource on Monocot plants
• Content and supporting digital infrastructure
(Complete family level keys & taxon pages; generic keys & pages for 8 families; select
species-level resources from European Monocots, Red-list species and Slipper orchids)

3. SYNTHESYS 1, 2 & 3 (EU FP5/6/7 Infrastructures, 18 partners, €10M)
• Support for physical access to participating collections
• JRA: Research into mass collections digitisation
(Image analysis, segmentation, transcription & crowdsourcing)

4. Others
• Open-UP
• BHL-EUROPE

Scratchpad VRE: foundation for ViBRANT & eMonocot

Taxa
(Classifications, taxon profiles, specimens, literature, images, maps, phenotypic, genotypic
& morphometric datasets, keys, phylogenies)

Conservation

Projects

Regions

Societies
Impact: Scratchpad usage (July 2013)

525 Scratchpad Communities by 6,550 active registered users covering 73,444 taxa in 535,317 pages.
In total more than 1,300,000 visitors.
81 paper citations in 2012.

Per month unique visitors to Scratchpad sites
65,000 unique visitors/month
119 NHM staff, 83 sites
3. Our near-term infrastructure horizons
Digital Ambition: NHM Science Strategy 2013-2017

A New Voyage of Discovery
Three Focal Areas
1. Scientific discovery
2. Scientific infrastructure
3. Scientific engagement

Five Challenges
1. The digital NHM
2. Origins, evolution & futures
3. Biodiversity discovery
4. Natural resources & hazards
5. Science, society & skills

Resources & funding
Measuring success

Collections digitisation
Large-scale use of collections data
New approaches to biodiversity discovery
Collections digitisation (data mobilisation)
Target
20M specimens available digitally in 5-years

Challenges
Current fragmented efforts
Heterogeneity of process
Existing data (2.8M lots; 400k geo.; 120k images)
Scale of operation (iCollections, 130k in 1 year)
Transcription (Citizen Sci. / crowdsourcing)
Data quality, annotation & feedback

Resources & funding
Expensive (£20-£60M @ £1-3 per specimen)
Linked to our public offer

Next steps (Sept. 2013)
Coll. Descriptions & protocols
Greater coordination of effort
Programme group with project portfolio?
Planning of digital access via NHM Data Portal
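The target, cost and scale figures on this slide can be checked with a few lines of arithmetic. The sketch below is illustrative only: the constant and function names are hypothetical, and it assumes that "130k in 1 year" is the annual throughput of iCollections and that the 20M target is spread evenly over the 5 years.

```python
# Figures are taken from the slide; how each is interpreted is an assumption.
TARGET_SPECIMENS = 20_000_000     # "20M specimens available digitally in 5-years"
TARGET_YEARS = 5
UNIT_COST_GBP = (1, 3)            # "£1-3 per specimen"
ICOLLECTIONS_PER_YEAR = 130_000   # "iCollections, 130k in 1 year", read as annual throughput


def cost_range(specimens, unit_cost):
    """Return the (low, high) total cost for a per-specimen cost range."""
    low, high = unit_cost
    return specimens * low, specimens * high


def required_rate(specimens, years):
    """Return specimens per year needed, assuming an even spread over the period."""
    return specimens / years


low_cost, high_cost = cost_range(TARGET_SPECIMENS, UNIT_COST_GBP)
rate = required_rate(TARGET_SPECIMENS, TARGET_YEARS)
scale_up = rate / ICOLLECTIONS_PER_YEAR

print(f"Cost range: £{low_cost / 1e6:.0f}M to £{high_cost / 1e6:.0f}M")
print(f"Required rate: {rate:,.0f} specimens per year")
print(f"Scale-up over the iCollections rate: about {scale_up:.0f}x")
```

Under these assumptions the cost range reproduces the slide's £20-£60M, and the target implies roughly a thirty-fold increase over the iCollections rate, which is one way to read the "Scale of operation" challenge.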
Large scale use of collections data (or why digitise)
Data applications help set digitisation priorities

Crop Wild Relatives
Invasive alien species
Impacts of climate change
Species conservation & protected areas
Impacts of human development
Biodiversity & human health
Food, farming & biofuels

Sustainable delivery of data

[Bar chart: values by plant family (Poaceae, Rosaceae, Solanaceae and others); y-axis 0 to 1000]
Potential applications for NHM data

NHM Data Portal
Promote access & reuse of data
Sub-portals for specific themes
Delivering content to third parties (e.g. GBIF)

Next steps (requirements)
Storage (Access, backup & archival)
Citation, linking & measuring impact (identifiers)
Data layering & visualisation
H.P.C. (Ecol. niche modeling & analysis)

Data visualisation
New approaches to biodiversity discovery (new types of data)
Take home messages from NHM Tropical Biodiversity Symposium (3-4 June 2013, NHM)

Molecular approaches
Molecular detection & monitoring of organisms is routine
Metagenomics (env. sequencing) commonplace
Whole genomes are normal
The primary route to understanding biodiversity for many

Ecological observatories

Automated biodiversity detection
Remote sensing (e.g. satellite & acoustic data, drones, camera traps)
Monitoring conspicuous, rare or invasive spp. (algal blooms, palms)
Monitoring human activity
Supplements field research, fills in gaps & scales

Digital infrastructure requirements
Very large quantities of data (2.5-10TB per researcher per yr.)
Doesn’t map to existing NHM collections infrastructures
Challenges current networking & storage capacity
Digital and physical collections become equally important?
22 July, 2013
4. Community decadal challenges
The long view: community informatics challenges

GBIF GBIC Report
(Coming soon)

EU Biodiversity Strategy
(2011)

Biodiv. Inf. Challenges
(2013)
Modeling the biosphere: a (the) 30 year goal?

A clear, singular long-term vision that NHM data can contribute to

Nature 2013, doi:10.1038/493295a
QUESTIONS
Infrastructure design principles*
= experience from 7 years with the Scratchpads
= lessons for building NHM data infrastructures?
1. Start with needs - focus on real user needs (not just the ‘official process’)
2. Do less - if someone else is doing it, link to it or use it
3. Design with data - prototype and test with real users on the live website
4. Do the hard work to make it simple - let the computer take the strain
5. Iterate. Then iterate again. - iteration reduces risk & is more sustainable
6. Build for inclusion – it’s easier in the long run
7. Understand context - we are designing for people, not a screen or a brand
8. Build digital services, not websites - there is life beyond the website
9. Be consistent, not uniform - every circumstance is different
10. Make things open: it makes things better - it’s more sustainable
*https://www.gov.uk/designprinciples
Better NHM digital coordination from 2013

Digital Strategy Group

Developing common vision
High level strategy
Director level engagement (Science, PEG & Corp. Services)

Digital Design Group

Digital Programme Group

Delivering & leading digital activities
Fund raising (internal & external)
Prioritisation

Administrative support
Resource management
Analysis of impact

Más contenido relacionado

La actualidad más candente

D4Science: An e-Infrastructure for Facilitating Fisheries and Aquaculture Re...
D4Science:An e-Infrastructure for Facilitating Fisheries and Aquaculture Re...D4Science:An e-Infrastructure for Facilitating Fisheries and Aquaculture Re...
D4Science: An e-Infrastructure for Facilitating Fisheries and Aquaculture Re...FAO
 
E cconcertation lyon-22-sep2011-v3
E cconcertation lyon-22-sep2011-v3E cconcertation lyon-22-sep2011-v3
E cconcertation lyon-22-sep2011-v3Alex Hardisty
 
Building the FAIR Research Commons: A Data Driven Society of Scientists
Building the FAIR Research Commons: A Data Driven Society of ScientistsBuilding the FAIR Research Commons: A Data Driven Society of Scientists
Building the FAIR Research Commons: A Data Driven Society of ScientistsCarole Goble
 
SYNTHESYS 3 Overview
SYNTHESYS 3 OverviewSYNTHESYS 3 Overview
SYNTHESYS 3 OverviewVince Smith
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarshiptsbbbu
 
AI & Bio Medical Presentation @JoshArnold et al
AI & Bio Medical Presentation @JoshArnold et alAI & Bio Medical Presentation @JoshArnold et al
AI & Bio Medical Presentation @JoshArnold et alClinton Arnold
 
Trees4Future general presentation June 2012
Trees4Future general presentation June 2012Trees4Future general presentation June 2012
Trees4Future general presentation June 2012Trees4Future
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesManjulaPatel
 
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Carole Goble
 
The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...ManjulaPatel
 
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceAndrew Sallans
 
Aaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridAaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridIan Foster
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...Carole Goble
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenHeinz Pampel
 
The value of digitally encoded information for libraries
The value of digitally encoded information for librariesThe value of digitally encoded information for libraries
The value of digitally encoded information for librariesLIBER Europe
 
Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Vince Smith
 
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Deborah McGuinness
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeVince Smith
 

La actualidad más candente (20)

D4Science: An e-Infrastructure for Facilitating Fisheries and Aquaculture Re...
D4Science:An e-Infrastructure for Facilitating Fisheries and Aquaculture Re...D4Science:An e-Infrastructure for Facilitating Fisheries and Aquaculture Re...
D4Science: An e-Infrastructure for Facilitating Fisheries and Aquaculture Re...
 
Keller geo edu
Keller geo eduKeller geo edu
Keller geo edu
 
E cconcertation lyon-22-sep2011-v3
E cconcertation lyon-22-sep2011-v3E cconcertation lyon-22-sep2011-v3
E cconcertation lyon-22-sep2011-v3
 
Building the FAIR Research Commons: A Data Driven Society of Scientists
Building the FAIR Research Commons: A Data Driven Society of ScientistsBuilding the FAIR Research Commons: A Data Driven Society of Scientists
Building the FAIR Research Commons: A Data Driven Society of Scientists
 
SYNTHESYS 3 Overview
SYNTHESYS 3 OverviewSYNTHESYS 3 Overview
SYNTHESYS 3 Overview
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
 
AI & Bio Medical Presentation @JoshArnold et al
AI & Bio Medical Presentation @JoshArnold et alAI & Bio Medical Presentation @JoshArnold et al
AI & Bio Medical Presentation @JoshArnold et al
 
Trees4Future general presentation June 2012
Trees4Future general presentation June 2012Trees4Future general presentation June 2012
Trees4Future general presentation June 2012
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural Sciences
 
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
 
The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...
 
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-Science
 
Aaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridAaas Data Intensive Science And Grid
Aaas Data Intensive Science And Grid
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
 
The value of digitally encoded information for libraries
The value of digitally encoded information for librariesThe value of digitally encoded information for libraries
The value of digitally encoded information for libraries
 
Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...
 
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics Landscape
 
Towards Knowledge-Enabled Society
Towards Knowledge-Enabled SocietyTowards Knowledge-Enabled Society
Towards Knowledge-Enabled Society
 

Destacado

RAVELLO LAB 2014 | L.I.S.A.
RAVELLO LAB 2014 | L.I.S.A.RAVELLO LAB 2014 | L.I.S.A.
RAVELLO LAB 2014 | L.I.S.A.Creactivitas
 
HRRHCongres day 3: The Forge
HRRHCongres day 3: The ForgeHRRHCongres day 3: The Forge
HRRHCongres day 3: The ForgeHRmagazine
 
University commercialization
University commercializationUniversity commercialization
University commercializationJack Brittain
 
Mr Guillermo A. Lemarchand - Science Policy Consultant, Division of Science P...
Mr Guillermo A. Lemarchand - Science Policy Consultant, Division of Science P...Mr Guillermo A. Lemarchand - Science Policy Consultant, Division of Science P...
Mr Guillermo A. Lemarchand - Science Policy Consultant, Division of Science P...UNESCO Venice Office
 
Burton Lee - University Research Panel - Intl Technology Law Assn 4th Confere...
Burton Lee - University Research Panel - Intl Technology Law Assn 4th Confere...Burton Lee - University Research Panel - Intl Technology Law Assn 4th Confere...
Burton Lee - University Research Panel - Intl Technology Law Assn 4th Confere...Burton Lee
 
What does 'open innovation' mean for the Cambridge high tech cluster?
What does 'open innovation' mean for the Cambridge high tech cluster? What does 'open innovation' mean for the Cambridge high tech cluster?
What does 'open innovation' mean for the Cambridge high tech cluster? Tim Minshall
 
Gert Vilhelmballing | OIS 2012 | Como construir ambientes de transferência de...
Gert Vilhelmballing | OIS 2012 | Como construir ambientes de transferência de...Gert Vilhelmballing | OIS 2012 | Como construir ambientes de transferência de...
Gert Vilhelmballing | OIS 2012 | Como construir ambientes de transferência de...Allagi Open Innovation Services
 
Open Innovation Seminar 2008 - Brazil - Henry Chesbrough
Open Innovation Seminar 2008 - Brazil - Henry ChesbroughOpen Innovation Seminar 2008 - Brazil - Henry Chesbrough
Open Innovation Seminar 2008 - Brazil - Henry ChesbroughAllagi Open Innovation Services
 
FITT Toolbox: Involving Researchers in Spinoffs
FITT Toolbox: Involving Researchers in SpinoffsFITT Toolbox: Involving Researchers in Spinoffs
FITT Toolbox: Involving Researchers in SpinoffsFITT
 
Spin up entrepreneurship training and coaching programme for USO
Spin up entrepreneurship training and coaching programme for USOSpin up entrepreneurship training and coaching programme for USO
Spin up entrepreneurship training and coaching programme for USOAna Barroca
 
Toolbox Involving Researchers In Spin Offs Ppt Final
Toolbox Involving Researchers In Spin Offs Ppt FinalToolbox Involving Researchers In Spin Offs Ppt Final
Toolbox Involving Researchers In Spin Offs Ppt FinalFITT
 
Institutional determinants of University spin -off quantity and quality..Mik...
Institutional  determinants of University spin -off quantity and quality..Mik...Institutional  determinants of University spin -off quantity and quality..Mik...
Institutional determinants of University spin -off quantity and quality..Mik...enterpriseresearchcentre
 
Sam Inkinen Open Innovation and Web 2.0
Sam Inkinen Open Innovation and Web 2.0Sam Inkinen Open Innovation and Web 2.0
Sam Inkinen Open Innovation and Web 2.0samink
 
TCI 2016 Open Innovation Platforms
TCI 2016 Open Innovation PlatformsTCI 2016 Open Innovation Platforms
TCI 2016 Open Innovation PlatformsTCI Network
 
Open innovation: only two things to remember
Open innovation: only two things to rememberOpen innovation: only two things to remember
Open innovation: only two things to rememberCREAX
 
The open innovation research landscape: Established perspectives and emerging...
The open innovation research landscape: Established perspectives and emerging...The open innovation research landscape: Established perspectives and emerging...
The open innovation research landscape: Established perspectives and emerging...Ian McCarthy
 
Open Innovation
Open Innovation Open Innovation
Open Innovation Alar Kolk
 
Open Innovation - global trends and examples
Open Innovation - global trends and examplesOpen Innovation - global trends and examples
Open Innovation - global trends and examplesJose Claudio Terra
 

Destacado (19)

RAVELLO LAB 2014 | L.I.S.A.
RAVELLO LAB 2014 | L.I.S.A.RAVELLO LAB 2014 | L.I.S.A.
RAVELLO LAB 2014 | L.I.S.A.
 
HRRHCongres day 3: The Forge
HRRHCongres day 3: The ForgeHRRHCongres day 3: The Forge
HRRHCongres day 3: The Forge
 
University commercialization
University commercializationUniversity commercialization
University commercialization
 
Mr Guillermo A. Lemarchand - Science Policy Consultant, Division of Science P...
Mr Guillermo A. Lemarchand - Science Policy Consultant, Division of Science P...Mr Guillermo A. Lemarchand - Science Policy Consultant, Division of Science P...
Mr Guillermo A. Lemarchand - Science Policy Consultant, Division of Science P...
 
Burton Lee - University Research Panel - Intl Technology Law Assn 4th Confere...
Burton Lee - University Research Panel - Intl Technology Law Assn 4th Confere...Burton Lee - University Research Panel - Intl Technology Law Assn 4th Confere...
Burton Lee - University Research Panel - Intl Technology Law Assn 4th Confere...
 
What does 'open innovation' mean for the Cambridge high tech cluster?
What does 'open innovation' mean for the Cambridge high tech cluster? What does 'open innovation' mean for the Cambridge high tech cluster?
What does 'open innovation' mean for the Cambridge high tech cluster?
 
Gert Vilhelmballing | OIS 2012 | Como construir ambientes de transferência de...
Gert Vilhelmballing | OIS 2012 | Como construir ambientes de transferência de...Gert Vilhelmballing | OIS 2012 | Como construir ambientes de transferência de...
Gert Vilhelmballing | OIS 2012 | Como construir ambientes de transferência de...
 
Open Innovation Seminar 2008 - Brazil - Henry Chesbrough
Open Innovation Seminar 2008 - Brazil - Henry ChesbroughOpen Innovation Seminar 2008 - Brazil - Henry Chesbrough
Open Innovation Seminar 2008 - Brazil - Henry Chesbrough
 
act4-b4
act4-b4act4-b4
act4-b4
 
FITT Toolbox: Involving Researchers in Spinoffs
FITT Toolbox: Involving Researchers in SpinoffsFITT Toolbox: Involving Researchers in Spinoffs
FITT Toolbox: Involving Researchers in Spinoffs
 
Spin up entrepreneurship training and coaching programme for USO
Spin up entrepreneurship training and coaching programme for USOSpin up entrepreneurship training and coaching programme for USO
Spin up entrepreneurship training and coaching programme for USO
 
Toolbox Involving Researchers In Spin Offs Ppt Final
Toolbox Involving Researchers In Spin Offs Ppt FinalToolbox Involving Researchers In Spin Offs Ppt Final
Toolbox Involving Researchers In Spin Offs Ppt Final
 
Institutional determinants of University spin -off quantity and quality..Mik...
Institutional  determinants of University spin -off quantity and quality..Mik...Institutional  determinants of University spin -off quantity and quality..Mik...
Institutional determinants of University spin -off quantity and quality..Mik...
 
Sam Inkinen Open Innovation and Web 2.0
Sam Inkinen Open Innovation and Web 2.0Sam Inkinen Open Innovation and Web 2.0
Sam Inkinen Open Innovation and Web 2.0
 
TCI 2016 Open Innovation Platforms
TCI 2016 Open Innovation PlatformsTCI 2016 Open Innovation Platforms
TCI 2016 Open Innovation Platforms
 
Open innovation: only two things to remember
Open innovation: only two things to rememberOpen innovation: only two things to remember
Open innovation: only two things to remember
 
The open innovation research landscape: Established perspectives and emerging...
The open innovation research landscape: Established perspectives and emerging...The open innovation research landscape: Established perspectives and emerging...
The open innovation research landscape: Established perspectives and emerging...
 
Open Innovation
Open Innovation Open Innovation
Open Innovation
 
Open Innovation - global trends and examples
Open Innovation - global trends and examplesOpen Innovation - global trends and examples
Open Innovation - global trends and examples
 

Similar a Building data infrastructures for science

Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince Smith
 
The biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveThe biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveVince Smith
 
Delivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information ageDelivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information ageVince Smith
 
Digital Preservation
Digital PreservationDigital Preservation
Digital PreservationSmita Chandra
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservationsmtcd
 
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...Vince Smith
 
AH-XLDBEurope-position-09 jun2011
AH-XLDBEurope-position-09 jun2011AH-XLDBEurope-position-09 jun2011
AH-XLDBEurope-position-09 jun2011Alex Hardisty
 
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresguest0dc425
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeEdward Baker
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeVince Smith
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityTERN Australia
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Peter Löwe
 
Current and emerging scientific data curation practices
Current and emerging scientific data curation practicesCurrent and emerging scientific data curation practices
Current and emerging scientific data curation practicesMichael Day
 
NIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWGNIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWGGeoffrey Fox
 
Lorna hughes 12 05-2013 NeDiMAH and ontology for DH
Lorna hughes 12 05-2013 NeDiMAH and ontology for DHLorna hughes 12 05-2013 NeDiMAH and ontology for DH
Lorna hughes 12 05-2013 NeDiMAH and ontology for DHlorna_hughes
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation HeidornBryan Heidorn
 
Scratchpads introductory presentation 45mins
Scratchpads introductory presentation   45minsScratchpads introductory presentation   45mins
Scratchpads introductory presentation 45minsDimitrios Koureas
 
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...e-ROSA
 

Building data infrastructures for science

  • 1. Building data infrastructures for science Vince Smith Informatics Horizons, London 24 July 2013
  • 2. Overview
    1. (my) Background
       - Lice to data infrastructures!
       - Why data infrastructures at the NHM
    2. Building data infrastructures
       - Recent core investment in NHM infrastructures
       - Leveraging external investment in NHM infrastructures
       - Infrastructure design principles & coordination
    3. NHM 5-year data infrastructure horizons
       - Collections digitisation
       - Large-scale use of collections data
       - New approaches to biodiversity discovery
    4. Decadal community infrastructure challenges
       - The long view – science data strategies
       - Data modeling and real time monitoring as a unifying theme
  • 4. Lice to data infrastructures!
    Systematics (circa 1998)
    - No high level keys
    - Poor high level taxonomy
    - Just one phylogeny
    - Few living experts!
    Circa 5,000 spp.
    Mammals & birds
    12,000 associations
    15,000 potential hosts
  • 5. My data infrastructure (circa 1998)
    - Taxonomic names
    - Authorities (name concepts)
    - Citations
    - Collection data
    - Morphological characters
    - Textual descriptions
    - Diagnostic keys
    - Illustrations
    - Photographs
    Palma, R.L., and R.L.C. Pilgrim. 2002. A revision of the genus Naubates (Insecta: Phthiraptera: Philopteridae). J. R. Soc. N.Z. 32:760.
    142 pieces of “raw” data in 4 of 54 pages, in 1 of 9,110 taxonomic papers on lice
  • 6. “The bane of my existence is doing things that I know the computer could do for me” -- Dan Connolly, The XML Revolution (Nature, 1998)
  • 7. My data infrastructures (circa 2004) Images Specimens (SID) LouseBASE Glasgow version at: http://darwin.zoology.gla.ac.uk/~rpage/LouseBase/2/ Lab Notebook Literature http://darwin.zoology.gla.ac.uk/~SID/ Host-Parasite Checklists PHPBib http://www2.flmnh.ufl.edu/pdb/ http://myphpbib.sourceforge.net/ http://www2.flmnh.ufl.edu/adb/
  • 8. My publications in 2004 (enabled by these infrastructures)
    Making louse research more efficient, more collaborative and more productive
    Publications: Biol. Letters; Zoo. Scripta; Syst. Biol.; Grzimek’s Ency.; Mol. Phyl. Evol.; Ent. Abh.; Proc. R. Soc. B; PLoS Biology; Science
    Infrastructures: Specimens; Images; Lab Notebooks; Literature; Checklists
  • 9. Why data infrastructures at the NHM: lots of potential
    Card indices, Library, Archives, Staff, Frozen Tissue, Labels, Slides, Spirit, Dry
  • 11. Recent NHM investment in science data infrastructures
    1. KE EMu (collections data)
       - Improved interface (speed, complexity, data quality, support)
       - Rapid Data Entry Web-Interface
       - Improved import & export functionality (CLD & data portal)
    2. DAMS (multimedia)?
       - Review (Digital Strategy Group)
    3. NHM Virtual Library (literature)
       - Integrated search & discovery of NHM resources
       - Better integration with external resources
    4. NHM Data Portal (access, citation & archival)
       - Discovery & visualisation of collections data on the Web
       - Web exposure & archival of NHM research datasets
       - Sub-portals for collaborative projects
       - As strategically important as the Web in 3 years' time!
    Enabling the NHM mission? Collections, Public Engagement, Research
  • 12. What are Scratchpads? (http://scratchpads.eu)
    External investment in science data infrastructures
    1. ViBRANT (EU FP7 Infrastructures, 17 partners, €4.75M)
       - Virtual Biodiversity Research & Access Network for Taxonomy
       - Building & integrating tools supporting biodiversity research communities (publishing, literature & vocabulary management, ID keys, conservation assessments, mapping & visualisation tools, citizen science support)
    2. e-Monocot (NERC Consortium; Kew, Oxford & NHM, £2.38M)
       - Sustainable, integrated resource on Monocot plants
       - Content and supporting digital infrastructure (Complete family level keys & taxon pages; generic keys & pages for 8 families; select species-level resources from European Monocots, Red-list species and Slipper orchids)
    3. SYNTHESYS 1, 2 & 3 (EU FP5/6/7 Infrastructures, 18 partners, €10M)
       - Support for physical access to participating collections
       - JRA: Research into mass collections digitisation (Image analysis, segmentation, transcription & crowdsourcing)
    4. Others
       - Open-UP
       - BHL-EUROPE
  • 13. Scratchpad VRE: foundation for ViBRANT & eMonocot
    - Taxa (Classifications, taxon profiles, specimens, literature, images, maps, phenotypic, genotypic & morphometric datasets, keys, phylogenies)
    - Conservation
    - Projects
    - Regions
    - Societies
  • 14. Impact: Scratchpad usage (July 2013)
    - 525 Scratchpad Communities by 6,550 active registered users covering 73,444 taxa in 535,317 pages
    - In total more than 1,300,000 visitors
    - 81 paper citations in 2012
    - 119 NHM staff, 83 sites
    - Per month unique visitors to Scratchpad sites: 65,000 unique visitors/month
  • 16. Digital Ambition: NHM Science Strategy 2013-2017
    A New Voyage of Discovery
    Three Focal Areas
    1. Scientific discovery
    2. Scientific infrastructure
    3. Scientific engagement
    Five Challenges
    1. The digital NHM
    2. Origins, evolution & futures
    3. Biodiversity discovery
    4. Natural resources & hazards
    5. Science, society & skills
    Resources & funding
    Measuring success
  • 17. Digital Ambition: NHM Science Strategy 2013-2017
    A New Voyage of Discovery
    Three Focal Areas
    1. Scientific discovery
    2. Scientific infrastructure
    3. Scientific engagement
    Five Challenges
    1. The digital NHM
    2. Origins, evolution & futures
    3. Biodiversity discovery
    4. Natural resources & hazards
    5. Science, society & skills
    Resources & funding
    Measuring success
    Highlighted horizons
    - Collections digitisation
    - Large-scale use of collections data
    - New approaches to biodiversity discovery
  • 18. Collections digitisation (data mobilisation)
    Target
    - 20M specimens available digitally in 5 years
    Challenges
    - Current fragmented efforts
    - Heterogeneity of process
    - Existing data (2.8M lots; 400k geo.; 120k images)
    - Scale of operation (iCollections, 130k in 1 year)
    - Transcription (Citizen Sci. / crowdsourcing)
    - Data quality, annotation & feedback
    Resources & funding
    - Expensive (£20-£60M @ £1-3 per specimen)
    - Linked to our public offer
    Next steps (Sept. 2013)
    - Coll. Descriptions & protocols
    - Greater coordination of effort
    - Programme group with project portfolio?
    - Planning of digital access via NHM Data Portal
  • 19. Large scale use of collections data (or why digitise)
    Data applications help set digitisation priorities
    Potential applications for NHM data
    - Crop Wild Relatives
    - Invasive alien species
    - Impacts of climate change
    - Species conservation & protected areas
    - Impacts of human development
    - Biodiversity & human health
    - Food, farming & biofuels
    [Figure: chart of counts (0 to 1000) by plant family, e.g. Poaceae, Rosaceae, Solanaceae]
    Sustainable delivery of data: NHM Data Portal
    - Promote access & reuse of data
    - Sub-portals for specific themes
    - Delivering content to third parties (e.g. GBIF)
    Next steps (requirements)
    - Storage (Access, backup & archival)
    - Citation, linking & measuring impact (identifiers)
    - Data layering & visualisation
    - H.P.C. (Ecol. niche modeling & analysis)
    - Data visualisation
  • 20. New approaches to biodiversity discovery (new types of data)
    Take home messages from NHM Tropical Biodiversity Symposium (3-4 June 2013, NHM)
    Molecular approaches
    - Molecular detection & monitoring of organisms is routine
    - Metagenomics (env. sequencing) commonplace
    - Whole genomes are normal
    - The primary route to understanding biodiversity for many
    Ecological observatories
    - Automated biodiversity detection
    - Remote sensing (e.g. satellite & acoustic data, drones, camera traps)
    - Monitoring conspicuous, rare or invasive spp. (algal blooms, palms)
    - Monitoring human activity
    - Supplement field research, fills in gaps & scales
    Digital infrastructure requirements
    - Very large quantities of data (2.5-10TB per researcher per yr.)
    - Doesn’t map to existing NHM collections infrastructures
    - Challenge current networking & storage capacity
    - Digital and physical collections become equally important?
    22 July, 2013
  • 22. The long view: community informatics challenges
    - GBIF GBIC Report (Coming soon)
    - EU Biodiversity Strategy (2011)
    - Biodiv. Inf. Challenges (2013)
  • 23. Modeling the biosphere: a (the) 30 year goal?
    A clear, singular long-term vision that NHM data can contribute to
    Nature 2013, doi:10.1038/493295a
  • 25. Infrastructure design principles*
    = experience from 7 years with the Scratchpads
    = lessons for building NHM data infrastructures?
    1. Start with needs - focus on real user needs (not just the ‘official process’)
    2. Do less - if someone else is doing it, link to it or use it
    3. Design with data - prototype and test with real users on the live website
    4. Do the hard work to make it simple - let the computer take the strain
    5. Iterate. Then iterate again. - iteration reduces risk & is more sustainable
    6. Build for inclusion - it’s easier in the long run
    7. Understand context - we are designing for people, not a screen or a brand
    8. Build digital services, not websites - there is life beyond the website
    9. Be consistent, not uniform - every circumstance is different
    10. Make things open: it makes things better - it’s more sustainable
    *https://www.gov.uk/designprinciples
  • 26. Better NHM digital coordination from 2013
    Digital Strategy Group
    - Developing common vision
    - High level strategy
    - Director level engagement (Science, PEG & Corp. Services)
    Digital Design Group
    Digital Programme Group
    - Delivering & leading digital activities
    - Fund raising (internal & external)
    - Prioritisation
    - Administrative support
    - Resource management
    - Analysis of impact
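
The cost range quoted on slide 18 (£20-£60M) follows from multiplying the 20M specimen target by the £1-3 per-specimen cost. A minimal Python sketch of that arithmetic is given here; it also compares the 5-year target with the iCollections rate of 130k specimens in 1 year. The function and constant names are illustrative and do not come from the talk, and the throughput comparison assumes a constant yearly rate, which is a simplification.

```python
# Figures taken from slide 18: 20M specimens in 5 years, £1-3 per specimen,
# and the iCollections rate of 130k specimens in 1 year.
TARGET_SPECIMENS = 20_000_000
TARGET_YEARS = 5
COST_PER_SPECIMEN_GBP = (1, 3)
ICOLLECTIONS_PER_YEAR = 130_000


def cost_range(specimens, per_specimen_range):
    """Return (low, high) total cost for a (low, high) per-specimen cost."""
    low, high = per_specimen_range
    return specimens * low, specimens * high


def required_rate(specimens, years):
    """Specimens per year needed to reach the target in the given time."""
    return specimens / years


def years_at_rate(specimens, per_year):
    """Years needed at a constant yearly rate (a simplifying assumption)."""
    return specimens / per_year


if __name__ == "__main__":
    low, high = cost_range(TARGET_SPECIMENS, COST_PER_SPECIMEN_GBP)
    print(f"Total cost: £{low / 1e6:.0f}M to £{high / 1e6:.0f}M")
    rate = required_rate(TARGET_SPECIMENS, TARGET_YEARS)
    print(f"Required rate: {rate:,.0f} specimens per year")
    print(f"Speed-up over iCollections rate: {rate / ICOLLECTIONS_PER_YEAR:.1f}x")
    years = years_at_rate(TARGET_SPECIMENS, ICOLLECTIONS_PER_YEAR)
    print(f"Years at iCollections rate: {years:.0f}")
```

On these figures the target implies 4,000,000 specimens per year, roughly 30 times the iCollections rate, which is one way to read the "scale of operation" challenge listed on the same slide.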