2. Let’s talk about the data…
The Semantic Web isn't just about putting data
on the web. It is about making links, so that a
person or machine can explore the web of
data. With linked data, when you have some of
it, you can find other related data.
3. http://www.slate.com/
Images and data from http://en.wikipedia.org
Detailed Data Relationships
Minnesota
Vikings
Chicago
Bears
John Fox
NFC Northern
Division
Saturday Night Live
has head
coach
head
coach of
team colors of
has
membership in
has team
colors
organization with
membership for
features
featured in
organization with
membership for
has membership in
is skit on
has
skit
Connections among concepts illustrated
Current and accurate data revealed
Inverse relationships are created
has team
colors
Denver Broncos
team colors of
Bill Swerski’s
Superfans
has recurring
acting role in
skit
skit with recurring
acting role for
Robert
Smigel
Mike
Meyers
George
Wendt
John
Goodman
Chris
Farley
has featured
acting role
Featured
acting role
for
Mike Ditka
Michael Jordan
navy, orange,
white
had head
coach
head
coach of
5. Current challenges
• Research is increasingly more interdisciplinary
• How can you find collaborators, track competitors, and stay abreast of current
research inside large institutions, at other institutions, and globally?
• How can you find others with shared interests or expertise?
• How can you build diverse teams? Find mentors? Be identified as a partner
by community groups?
Faculty
• Library administration or directors of core facilities want to align their strategic
plan with the evolving research needs of their clientele.
• Identifying growth areas of research through increasing publications, focused
areas of research and grant dollars enables this task to become more
evidence-based.
Support: facilities and personnel
• Research institutions can be extremely large and diverse
• How can administrators showcase and monitor research activity, track
competitors, and stay abreast of current research inside large institutions, at
other institutions, and globally?
• How can you enhance visibility and present a unified picture of an institution?
Administrators
6. Research Networking
Information about scholars is optimized using a Web-based
infrastructure of standards and technologies which allows for a
distributable, machine readable description of data that allows for
stronger data and smart web application linkages across many
universities, agencies, societies both within the US and abroad.
Why is this important?
Linked data infrastructure allows for
• Visualizations, research and clinical data integration,
and deep semantic searching across multiple types
and sources of data
• By breaking data out of traditional database silos,
research networking platforms promote a network
effect within a single site and across multiple sites
– The value of the network increases with the amount of
linked data and applications that are available to
consume the linked data.
7. What is VIVO?
1. An open source
semantic web
application
2. An information model
3. An open community
8. What is VIVO?
1. An open source
semantic web
application
2. An information model
3. An open community
9. VIVO
An open-source semantic web application that
enables the discovery of research and scholarship
across disciplines in an institution.
VIVO harvests data from verified sources and
offers detailed profiles of faculty and researchers.
Public, structured linked data about investigators
interests, activities and accomplishments, and
tools to use that data to advance science.
VIVO enjoys a robust open community space to
support implementation, adoption, &development
efforts around the world.
See http://wiki.duraspace.org/display/VIVO
10. A VIVO profile allows you to:
Showcase credentials, expertise, skills, and professional
achievements for individuals and campus groups.
Connect within focus areas and geographic expertise.
Simplify reporting tasks and link data to external
applications – e.g., to generate biosketches or CV or for
reporting purposes.
Publish the URL or link the profile to other applications.
Discover potential colleagues or campus resources by
work area, authorship, & collaborations.
Display visualizations of expertise areas or complex
collaboration networks and relationships.
11.
12. What is VIVO?
1. An open source
semantic web
application
2. An information
model
3. An open community
13. CTSA: Recommendations and Best
Practices for Research Networking
The Research Networking Recommendations were approved by the CTSA Consortium
Executive and Steering Committee on October 25, 2011.
Recommendations for Research Networking:
• Recommendation: All CTSAs should encourage their institution(s) to implement
research networking tool(s) institution-wide that utilize RDF triples and an ontology
compatible with the VIVO ontology.
• Recommendation: Information in people profiles at institutions should be publicly
available as data as a general principle, specifically as Linked Open Data. To ensure
quality of information, authoritative electronic data sources versus manual entry should
be emphasized. Institutions will vary in the amount of information that they will include
and make publicly available but the value is enhanced by the quality and quantity of
information.
• Recommendation: Monitoring of the research networking landscape, technology, and
tools should continue to be overseen by experts from the CTSA consortium (e.g., the
Research Networking group of the Informatics KFC).
https://www.ctsacentral.org/recommendations-and-best-practices-research-networking
14. Building a large web of data, greater
than any one effort
Data Creators, Data
Aggregators, & Data
Consumers
Repositories. Tools.
Applications. Workflows
16. Brown University
• PLATFORM: VIVO
• DATA: Faculty CVs in Word or PDF converted to
structured data by bibliographic services vendor
• Data structures mapped to VIVO ontologies
• Data converted to RDF and ingested into VIVO
• Sparql queries create RDF graphs (relationships)
17. Brown University
• PLATFORM: VIVO
• DATA: Faculty CVs in Word or PDF converted to
structured data by bibliographic services vendor
• Data structures mapped to VIVO ontologies
• Data converted to RDF and ingested into VIVO
• Sparql queries create RDF graphs (relationships)
18. Weill Cornell Medical College
• PLATFORM: VIVO
• DATA: a variety of sources, applied and leveraged in
various ways…
http://libraryconnect.elsevier.com/articles/technology-content/2013-03/authoritative-researcher-metadata-one-place-vivo
19. Weill Cornell Medical College
• PLATFORM: VIVO
• DATA: a variety of sources, applied and leveraged in
various ways…
http://libraryconnect.elsevier.com/articles/technology-content/2013-03/authoritative-researcher-metadata-one-place-vivo
20. WCMC CTSC’s VIVO data sources
http://libraryconnect.elsevier.com/articles/technology-content/2013-03/authoritative-researcher-metadata-one-place-vivo
22. Duke University
• PLATFORM: VIVO
• DATA: a variety of sources
• REACH NC – Scopus data
• Symplectic Elements
23. Data, Tools and Scientists
http://vivosearch.org/
http://vivosearchlight.org/
http://vivo.cns.iu.edu/gallery.html
24. VIVO search scenarios
• Multiple campuses of one university
• Regional connections
• e.g., Illinois ties with regional federal labs
• Consortia – 62+ CTSAs, USDA plus land grant universities
• International
• 13 Netherlands universities and the National Library
• German Universities
• AgriVIVO – UN FAO
30. VIVOs
• 120+ impl. &
pilot projects
• 35+ countries
• 15+ CTSAs
Standards
• CTSAconnect
Integrated
Semantic
Framework
ontology
• ORCID
• CASRAI
• others
Partners
• Symplectic
• euroCRIS
• W3C, DERI,
ConceptWeb
Alliance,
OpenPHACTS
• Institutions/organi
zations
Events
• VIVO conference
Aug. 2015
• Spring
Implementation
Fest @ OHSU
• DuraSpace VIVO
webinars
Community
• VIVO wiki
• Listservs
• Weekly calls
• GitHub
• vivoweb.org
• @VIVOcollab
vivoweb.org
VIVO Updates
31. • Are a trusted, neutral entity
• Have a tradition of service and support
• Strive to serve all missions of the institution
• Are technology centers and have IT and data expertise
• Have skills—information organization, instruction,
usability, subject expertise
• Have close relationships with their clients (buy in)
• Understand user needs
• Understand the importance of collaboration and know how
to bring people together
• Have knowledge of institution, research, education, clinical
landscape
Library Staff:
Libraries:
What roles can the library play?
32. What roles can the library play?
Librarians are successfully stepping up to the
semantic web plate in a variety of roles related to
institutional research networking platforms.
• Outreach and adoption activities
• Education and training on the use of the platform
• Ontology and controlled vocabulary expertise,
extending the model
• Negotiations with data providers
• Programming, technical support
• Workgroup representation
• …and more!
Research networking also provides an opportunity for
libraries to become familiar with many concepts around
linked open data and the semantic web.
33. Acknowledgements
Funding:
• Northwestern University Clinical and
Translational Sciences Institute, NIH award
UL1TR000150
• VIVO, NIH award U24 RR029822
• VIVO/DuraSpace
Questions/Follow-up:
• kristi.holmes@northwestern.edu
• kristi@vivoweb.org
• Twitter: @kristiholmes Thanks!
GALTER
LIBRARY
Notas del editor
VIVO enables collaboration and understanding across an institution and among institutions.
As research efforts become more interdisciplinary, it can be hard to find collaborators outside your own area of expertise
VIVO will help facilitate communication and collaboration NOT ONLY for researchers but also for administrators, librarians, students, faculty, donors, funding agencies, and the public.
VIVO enables collaboration and understanding across an institution and among institutions
VIVO harvests much of its data automatically from verified sources so it is accurate and current, reducing the need for manual input.
The rich information in VIVO profiles can be repurposed and shared with other institutional web pages and consumers, reducing cost and increasing efficiencies across the institution.
Data is housed and maintained at the local institutions. There it can be updated on a regular basis.
Search results are faceted so information can be located rapidly and with less time spent sorting through information.
Profiles are largely created via automated data feeds, but can be customized to suit the needs of the individual.
Profiles are richer in content than typical [web pages or] social networking sites and will rank higher in general internet searches.
Across institutions VIVO provides a uniform semantic structure to enable a new class of tools using the data to advance science. …..visualizations, search, discovery, etc
Each institution provides its own VIVO system and data. Local governance determines data to be provided.
VIVO structures data in RDF triples using the VIVO ontology. Moreover, the recommendations state that as a general principle the profile data should be publically available as Linked Open Data. This announcement demonstrates the CTSA Consortium’s recognition of the value of semantic web standards and increasing momentum in support of semantic web technologies to facilitate research discovery. Examples of applications which consume these rich data, including: visualizations (Katy’s viz URL), enhanced multi-site search (VIVO search URL), and VIVO Searchlight (searchlight URL). Other utilities are in development across a wide range of functionalities.
Profiles are largely created via automated data feeds, but can be customized to suit the needs of the individual.
Information is open source (free) and is stored in a framework that allows for exporting to other applications.
Profiles are richer in content than typical [web pages or] social networking sites and will rank higher in general internet searches.
There are many beautiful visualizations, developed by Katy Borner’s group at Indiana University.
These include co-author and co-investigator networks and even temporal visualizations which allows discovery of grants and publications by defined groups over time within and beyond an institution.
Most recently, the visualization team implemented a Science Map visualization, which allows users to visually explore the scientific strengths of a university, school, department, or person in the VIVO instance. Users will be able to see where an organization or person’s interests lay across 13 major scientific disciplines or 554 sub-disciplines, and will be able to see how these disciplines and sub-disciplines interrelate with one another on the map of science.
VIVO uses linked open data concepts to provide data as RDF at URIs for each scientist. Critically important for building a web of data.
Predicates have addresses, sites point to objects in other triples stores.
Resolve queries across triple stores – “show investigators who genetic work is implicated in breast cancer.” VIVO won’t have information linkages between breast cancer and disease. Other resources will. But VIVO can link to external sources. “Mike worksOn GeneY”
So where does data about Interests, activities and accomplishments come from?
Archives. Data Aggregators. Publishers. Institutional repositories.
So now we turn to a few specific examples…
Data is reused and repurposed in a wide array of tools and settings.
Cornell University has done a stellar job of this – using VIVO data to provide current information about faculty and their interests for department and college websites; University of Florida reuses data from their VIVO for their CTSI member database – a move that other institutions are making, as well.
IICA
VIVO chooses to meet these participation challenges with the help of libraries – neutral campus entities that understand their user communities and their research environment.
Libraries are increasingly involved in IT decisions
Libraries have subject experts. Many librarians who work to support VIVO also provide reference and instructional support to the agricultural sciences, basic sciences, social sciences, and humanities.
Librarians have good knowledge of their user communities – they understand what drives their institution’s research environment.
Of course, librarians have a strong service ethic and have a long history of providing academic support
Facilitate the resolution of data integration issues endemic to legacy data/faculty reporting systems.