“Hot Topics: The DuraSpace Community Webinar Series, "Series Five: VIVO: Research Discovery and Networking.” Webinar #2: Case Studies: VIVO at Colorado, Brown, Duke, and Weill Cornell Medical College, June 4, 2013
Presented by: Alex Viggio, VIVO Implementation Lead, University of Colorado, Boulder, Steven McCauley, Brown University, Julia Trimmer, Duke University and Paul Albert, Weill Cornell Medical College.
1. Hot Topics: The DuraSpace
Community Webinar Series
Series Five:
“VIVO: Research Discovery &
Networking ”
Curated by Dean Krafft
2. Hot Topics: DuraSpace Community Webinar Series
Webinar 2:
Case Studies: VIVO at Colorado, Brown,
Duke, and Weill Cornell Medical College
Presented by:
Alex Viggio, University of Colorado Boulder
Julia Trimmer, Duke University
Steven McCauley, Brown University
Paul Albert, Weill Cornell Medical College
3. VIVO Community
Also see http://vivo.vivoweb.org
Hot Topics: DuraSpace Community Webinar Series
5. University of Colorado Boulder
Alex Viggio
FIS Lead Developer
Office of Faculty Affairs
alex.viggio@colorado.edu
Hot Topics: DuraSpace Community Webinar Series
6. VIVO at CU-Boulder
• Project led by Faculty Information System
(FIS) team in the Office of Faculty Affairs
• 1 domain expert FTE, 2.5 developer FTEs,
<0.5 system admin FTE
• Original developers and IT group had Java and
SQL/RDBMS expertise, but no prior Semantic
Web work experience
• Partner with campus IT Managed Services
group for web and database hosting
• Reuse existing FIS database and web servers
Hot Topics: DuraSpace Community Webinar Series
7. CU-Boulder Rollout
• Demos for CU-Boulder Provost, VC of Research and
Dean of the Graduate School in late 2010
• Implementation started in January 2011
• Initial campus launch in April 2011
• Public WWW launch in September 2012
• Current status
– Covers 64 academic units in seven schools and
colleges, as well as the libraries, 11 research
institutes and more than 40 non-academic units
– 1,750+ profiles updated twice a week
– No direct edits, no publications or grants data yet
• Publications ingest project starting Summer 2013
Hot Topics: DuraSpace Community Webinar Series
8. CU-Boulder FIS Overview
Hot Topics: DuraSpace Community Webinar Series
from VIVO: A Semantic Approach to Scholarly Networking and Discovery, Figure 4.1
http://www.morganclaypool.com/doi/abs/10.2200/S00428ED1V01Y201207WBE002
11. Regional Linked Data Efforts
Hot Topics: DuraSpace Community Webinar Series
Host institution of 2012 and 2013 VIVO Implementation Fests
http://2013vivoimplementationfest.sched.org
12. CU-Boulder Lessons Learned
Hot Topics: DuraSpace Community Webinar Series
• An incremental, value focused approach works for VIVO
implementation
• Address faculty concerns as a priority
– Limit launch to campus users to allow for review
• Data quality
– All data requires clean up before public display
– FIS VIVO Curation Module
• Building the campus initiative with internal PR
– Address perceived competition with similar efforts
• VIVO’s low cost – harder to justify resource needs
– A small, entrepreneurial team worked for us
• VIVO builds conversation about Big Data, Linked Open
Data, Open Access
13. Duke University
Julia Trimmer
Manager, Faculty Data Systems and Analysis
Office of the Provost
julia.trimmer@duke.edu
Hot Topics: DuraSpace Community Webinar Series
14. VIVO at Duke
• Project team under Provost’s Office
• Developers in University IT group
• Elements team in Library
• Currently: 3,500 faculty in 29
departments and centers, 100K pubs
• By Dec: adding 2,100 faculty in 16
schools and institutes, 60K pubs
• Replacing two legacy systems
Hot Topics: DuraSpace Community Webinar Series
16. Rollout Plan
• School of Medicine: May 13
• Business, Environment, Engineering,
Nursing: July 15
• Arts & Sciences, Divinity, Law, Public
Policy: September 3
• Faculty Annual Reporting tool: 2014
Hot Topics: DuraSpace Community Webinar Series
22. Support for Scholars@Duke
• Small army of “power users”
• First level of support for faculty
• Liaisons for issues or problems
• Support page lists power users plus
learning and support materials
Hot Topics: DuraSpace Community Webinar Series
24. VIVO at Brown
• 4-person team; 1 full-time staffer
• Sponsors: Office of the Vice President
for Research, Dean of the Faculty,
Division of Biology and Medicine,
University Library, Computing and
Information Services
• VIVO to replace DRRB
25. Implementation: Plans
• Data from faculty CVs
• VIVO for faculty review
• Profile self-editing
• Implications: ontology and interface
with broad expressivity and
representation
26. Implementation: Process
• Parse available CVs (Summer 2012)
• Harvest available databases (Fall-Winter 2012/13)
• Clean and load data (Fall-Winter 2012/13; ongoing)
• Public review (Winter 2012; ongoing)
• Phase out legacy system (future work)
• Integrate with other data sources, internal
and external (future work)
27.
28.
29.
30.
31.
32. Implementation: Problems + Solutions
• Bad data
Text processing, natural language processing,
reconciliation and LOD services
• Interface confusion
“Change management”; managing RDBMS
expectations
• Capturing scholarly activity
Meeting with domain specialists; reviewing
prior art; attention to the Humanities
33. Lessons Learned
• Being the bearer of bad news
GIGO and the perils of data integration
• Less is more
Better to have less data of high quality than more data of low quality
• Looks are everything
Users care more about appearances and accessories than the
architecture of the Semantic Web
• Communication is key
Identify power users and mediators: work on a department-
by-department basis
34. Weill Cornell Medical College
Paul Albert
Project Manager, VIVO
Weill Cornell Medical Library
paa2013@med.cornell.edu
Hot Topics: DuraSpace Community Webinar Series