Pre-conference Workshop: Facilitate Research Communities Adoption of Open Science Publishing Principles: The Role of Repositories and the OpenAIRE-Connect Services.
COAR Annual Meeting, May 21, 2019 - Lyon, France
Hosted by The Center for Direct Scientific Communication (CCSD).
Similar a Facilitate Research Communities Adoption of Open Science Publishing Principles: The Role of Repositories and the OpenAIRE-Connect Services (20)
6. Project highlights
Research community services: offering
support for a uniform transition of research
communities towards OS publishing;
Pilot-driven approach
Content provider services: leveraging the
transition of CP towards OS publishing;
Testing, validation, deployment, integration
in OpenAIRE technical infrastructure.
REALIZATION OF
OPEN SCIENCE SERVICES
BUILDINGCOMMUNITYCAPACITIES
FOREUROPEANAND GLOBAL
ALIGNMENTONOPENSCIENCE
Extending the technological services and networking bridges
(human/social/support) today offered by the OpenAIRE
Engaging and supporting communities and
content providers (research community
open science desks, legal advisory board)
Interoperability guidelines to facilitate
metadata exchange
Support of OpenAIRE networking services
7. BlowingonthefireofOpenScience
Facilitate Research Communities adoption of
Open Science publishing principles by supporting
artefact publishing tools as-a-Service
Facilitate Content Providers at moving towards
Open Science publishing by supporting
notification-based research communication as-a-
Service.
8. OpenScience as-a-Service (OSaaS)
Content Providers
DashboardSoftware
Packages
Research Community
Dashboard
Search-Navigate-Monitor-
Research Impact
Subscription & Notification
Articles
Data
Researchers
Repositories
Articles
Data
Projects
Project community
FunderStream
Product
Publication Data Software
Organization
source
Software
9. ResearchCommunitiesandOpenSciencebenefits
• Can continue their publishing practices, but, if needed they have support for
deposition of any artefact
Common repository for publishing (deposition) of
datasets, methods, and packages
• Community information space to share, discovery, and reuse (reproduce)
scientific results
Collaborative curation of a community-specific
research communication domain
• Scientific reward strategies can be developed
Research impact and statistics
10. Research Community Dashboard
10
Data
Publications
Software
Project community
FunderStream
Product
Publication
Data Software
Organization
Research communities
can…
Claim products
Claim links
Search &
navigate content
Statistics on
research impact
Deposit
productsAdditional links
automatically generated
based on inferences
OpenAIRE Graph
(deduplicated & enriched)
ResearchCommunityDashboard
Repositories
& RI services
Harvest &
push
metadata
12. OpenAIRE Research Communities services status
Aims:publish,shareand discovery
COMMUNITY
Aims:researchimpact
RESEARCH INITIATIVE
March 2019
13. Repositories: Open Science as-a-Service benefits
• Enabling addition of links to artefacts of any kind
Extending repository metadata models to Open Science
• “Almost real-time” exchange of information: notifications about links to other
artefacts, missing properties, and missing artefacts
Keeping their collection up-to-date: enrichments and additions
• Enabling repositories to be notified of content of interest, enabling
construction of research-focused aggregators by notifications
Fostering notification-based and federated dissemination of
knowledge
14. OpenAIRE’s e-infrastructure Commons
Publications
repositories
Research Data
repositories
CRIS
systems
Registries
(e.g. projects)
OA
Journals
Software
Repositories
Validation
Cleaning De-duplication
Enrichment
By inference
CONTENT PROVIDERS
INFO SPACE SERVICES
Project initiative
FunderFunding
Result
Publication Data Software
Organization
GUIDE
LINES
TERMS
OF USE
Repositories in OpenAIRE may
be interested to acquire
metadata information about
publications that are
“potentially of interest
to them”
i.e. be part of their collection:
add new records, enrich the
records with extra metadata
information.
18. Research
communities
Researchers (All)
Content providers
Innovators
Research
managers
Funders
Building the OpenAIRE research graph and the Dashboard services
OpenAIRE Graph & Dashboards
Validation
Cleaning De-duplication
Inference
Research Graph Services
Project communiity
FunderFunding
Product
Publicatio
n
Data Software
Organizatio
n
TERMS
OF USE
Harvesting Uploading
Brokering
Source
ORP
Publications
repositories
Data
repositories
Hybrid
repositories
Registries
OA
Journals
Software
repositories
Content Providers Research
Infras
GUIDE
LINES
19. OpenAIRE: materializing the Open Science graph
Full-text mining
Harvesting
Deposition
Project community
FunderFunding
Product
Publication
Research
Data
Software
Organization
Source
Other res.
products
GUIDE
LINES
10Mi PDFs
12,000
sources
20. Building and maintaining an open metadata research graph of
interlinked scientific products, with Open Access information,
linked to funding information and communities
The OpenAIRE research graph
Complete
De-duplicated
Participatory
Decentralized
Trusted
Research Graph
21. ALL Literature, Research data, Software, Other research
products
Complete
OpenAIRE Content Acquisition Policy
Respecting the OpenAIRE guidelines
(DataCite metadata)
Using PIDs with resolvers
22. • Waiting for stable information space
inBETA
Completionoffine-grainedharvesting
Fine-tuningofmetadata ingestionworkflow
(e.g.DOIBoost)
• Resources: replica of theBETA Solr
Index Cluster
OpenAIRE Research Graph: the «switch»
Content acquisition policies
26,019,748 publications
1,130,981 datasets
95,850 software
17 funders
+6more
94,210,770
8,000,000
192,661
28
May 2019
23. Objectives of OpenAIRE’s Aggregation Policy
Content Acquisition Policy released 05-Oct-2018, https://doi.org/10.5281/zenodo.1446408
24. Metadata describing Open Access and
non-Open Access material will be
included and links to other products
will be resolved where this is possible
(i.e. the provided PIDs have a resolver).
as stated in the Content Acquisition Policy, published Oct. 2018
https://doi.org/10.5281/zenodo.1446408
28. • Publications
Products with “equivalent” PIDs, title, authors, dates are grouped
• Dataset
Products with “equivalent” PIDs are grouped
De-duplicated
• Software
Products with “equivalent” PIDs and original URLs
are grouped
• Other products
Products with “equivalent” PIDs, title, authors,
dates are grouped
29. • Rely on quality scholarly communication sources of different
kinds and giving them visibility by provenance
Institutional repositories, aggregators, data archives, software
repositories, research infrastructure sources, funder grant
repositories, entity registries, publishers
Participatory
• Include solutions and content from
any interested and trusted content
provider in scholarly communication
30. • Content of the graph is CC-BY (roadmap to CC-0) accessible via
APIs (DEVELOP)
• Via OpenAIRE-Broker Service it is re-distributed across
contributing sources, for enrichment of such sources and for
preservation beyond OpenAIRE
Decentralized
• The graph is exchanged with other graph
initiatives, to make the graph richer and
for “preservation beyond OpenAIRE”
31. • Authors in the loop to enrich their ORCID record (letter of
support from ORCID)
• Populate and curate an high-quality open graph for
Monitoring and assessment by organizations, funders, research
infrastructures, and researchers
Perform business by SMEs
Trusted
42. Interoperable metadata is key for
effective content sharing
Use our validation service and see how you can apply the
OpenAIRE Guidelines to expose your contents using
global standards.
VALIDATE
43. Reach a wider audience around the world
Register your datasource in OpenAIRE and be part of a
global interlinked network.
REGISTER
44. Improve your metadata.
Get more connections
OA Broker service offers a wealth of information on
scholarly communication data.ENRICH
Find out what interests you and subscribe to enrich your records.
More & Missing events that may enrich your Repository:
• Persistent identifiers
• Open Access Versions
• Projects
• Subjects
• Abstracts
… datasets, software
45. Open research impact empowers
Open Science
Open Metrics service by sharing your usage data.
Get the benefit of an aggregated environment to
broaden the mechanisms for impact assessment.
MEASURE
Get usage statistics reports for your datasource
51. Content Provider Dashboard: testing phase
1 Webinar, 59 Attendees
21 Repositories represented
Portugal Spain
Real users of the
Dashboard
• Repository Managers from
Portugal & Spain
2 Webinars (Demo)
• Main OpenAIRE services for
content providers
(Dashboard and Broker),
RCAAP and RECOLECTA.
Test drive
• Grant the access to the
Dashboard
• Guidance on the
functionalities usage
Collect feedback
• Questionnaire & Helpdesk
1 Webinar, 49 Attendees
28 Repositories represented
May/June 2018
53. Support materials for Content Providers Dashboard uptake
• Provide - How to validate and register your
repository
• Provide - How to enrich research artifacts
• Usage Statistics – How to track the usage
activity of your repository
• ScholExplorer - Literature & Data
interlinking
• Making your repository Open
Support – guides
• Make your content count - OpenAIRE
Content providers Dashboard: service for
repository managers
• OpenAIRE metrics service: usage statistics
• OpenAIRE Guidelines for data providers:
new Metadata Application Profile for
Literature Repositories
Training – webinars
59. Research Community Dashboard
59
Data
Publications
Software
Project community
FunderStream
Product
Publication
Data Software
Organization
Research communities
can…
Claim products
Claim links
Search &
navigate content
Statistics on
research impact
Deposit
productsAdditional links
automatically generated
based on inferences
OpenAIRE Graph
(deduplicated & enriched)
ResearchCommunityDashboard
Repositories
& RI services
Harvest &
push
metadata
61. • Community, e.g. group of scientists with common
research intents, interests, vision (discipline-driven)
Aims: publish, share and discovery
• Research Initiative, e.g. long-term (cross-project)
funded initiatives such as EGI, RDA and discipline
research infrastructures (ELIXIR, EPOS)
Aims: research impact
Research communities
The concept is more articulated
62. Communities requiring tools to
deposit, share, interlink, and find all
kinds of research products
relevant to their discipline.
Research Communities
Discovery!
Research Initiatives
Monitoring!
Organizations requiring tools for
monitoring their research impact in terms
of the research products they funded or
enabled/supported the creation of.
62
64. List of
subjects/keywords
List of projects
available in
OpenAIRE
List of content
provider from
those aggregated
by OpenAIRE
List of Zenodo
communities
List of other RCDs
or RIDs
List of ORCID
identifiers
Other criteria
suggested by
research
communities
Research Community Dashboard: configuration criteria
65. List of
subjects/keywords
List of projects
available in
OpenAIRE
List of content
provider from
those aggregated
by OpenAIRE
List of Zenodo
communities
List of other RCDs
or RIDs
List of ORCID
identifiers
Other criteria
suggested by
research
communities
Mining algorithm
(e.g. ack
statements)
Research Initiative Dashboard: configuration criteria
67. Import metadata
from CrossRef
and Datacite given
a list of DOIs
Import metadata
from ORCID given
an ORCID id
Deposit a product
in one of the
related Zenodo
communities
Deposit a product
in one of the
related content
provider
Use the linking
functionality to
add a product to
the community
Import metadata
from ePMC given
a list of PMID
Import metadata
from HAL given a
list of HAL ids
Actions of researchers to grow the graph
68. Research Communities on board
Fisheries & Aquaculture
Management
European Marine Science
Sustainable and
Development Solutions
Network - Greece
Neuroinformatics
Digital Humanities
and Cultural
Heritage
71. TESTING PLAN & STATUS
• Webinar demo for Community
Managers
• Live tests with 2 end-users
per Community
1st Testing Phase
(May-July 2018)
• Webinar for Community
Managers
• Webinar demo for 5 end users
per Community
• Live tests with 2 end users
per Community
• Tests within the Communities
workshops (outreach)
2nd Testing Phase
(ongoing) • Test the production ready-
ness of the RCD
• Tests within the Communities
outreach workshops
• Test user-friendly-ness and
effective acquisition of testing
results
3rd Testing Phase
(April-June 2019)
Beta
release 1
(April 2018)
Beta
release 2
(Dec. 2018)
Beta
release 2
improvements
72. EuroMarine Young Scientist Working Group (OYSTER)
• 28January2019,Cádiz,Spain(15participants)
Workshop - Community Test Drive
78. EuroMarine – Wider Community Awareness3.
http://www.euromarinenetwork.eu/
79. • Represented by members of the France Life Imaging (FLI)
collaboration
• Focus on e-infrastructure services to enable
interoperability between in-vivo image acquisition
platforms at National and international level (EGI)
• Use OpenAIRE-Connect services and promote their
adoption across neuroinformatics scientists
• Generate and share packages of research artefacts
Neuroinformatics community
80. SHAring NeurOImaging Resources
An open source web platform for neuro-imagingDownload
stored data
Support for
processed
(derived) data
Online
Visualization of
stored data
Data de-identification
and patient privacy
Download
Processed data
Support for multi-
centric research
studies
User access
control
Support for clinical and
neuropsychological scores
Web
Portal
Collect neuroimaging data from several
sources :
• Dicom CD / DVD
• PACS (via Dicom Q& R)
• Nifti / Analyze image files
a model build on an formal
ontology to
• Enhance data integrity
• Structure the data
• Manage data provenance
• Facilitate collaborative research
• Pool resources
81. Web portal
Users
1000+ registered users in October 2018
44 publications since 2011
Neuro-image analysisCancer therapy simulation
Prostate radiotherapy plan simulated
with GATE(L. Grevillot and D. Sarrut)
Image simulation
Echocardiography simulated with
FIELD-II (O. Bernard et al)
Modeling and optimization of
distributed computing systems
Acceleration yielded by non-clairvoyant
task replication (R. Ferreira da Silva et al)
Brain tissue segmentation
with Freesurfer
Scientific applications
Infrastructure
Supported by EGI Infrastructure
Uses biomed VO (~65 sites in Europe and beyond)
230 cumulated CPU years utilized by VIP applications in 1 year
DIRAC
France-Grilles
Application as a service
File transfer to/from grid
Virtual Imaging Platform (VIP)
https://vip.creatis.insa-lyon.fr
82. • Describe, publish, integrate and
execute command-line applications
across platforms
– facilitate application porting
– import and exchange of applications
– open and reproducible science
• Versatile JSON format to describe the
command-line, inputs and outputs
• Use of Linux containers to facilitate
application installation and sharing
• https://github.com/boutiques
Boutiques
Neuroinformatics software
descriptors published to
83. • Become providers/repositories from which OpenAire
Connect can harvest software (Boutiques pipelines,
Dockers, etc) and data
• Upload such products to Zenodo (and get a DOI)
• Boutiques implements “bosh publish” to Zenodo
with support for tags
• Enable interoperability and reproducibility
VIP and Shanoir CAN
84. Neuroinformatics Dashboard
• Gather and search for all
kind of research artefacts
from the neuroinformatics
community: literature,
datasets, software.
• Link datasets and software
to articles.
• Publish artefacts
automatically and directly
from data storage and
computing platforms.
• Enable open and
reproducible science.