Exploiting scientific data in
the international context -
The elements of success
Gergely Sipos
Head of Services, Solutions, Support
EGI Foundation (Amsterdam)
gergely.sipos@egi.eu
www.egi.eu
This work by the EGI Foundation is licensed under a
Creative Commons Attribution 4.0 International License
(http://creativecommons.org/licenses/by/4.0/)
LOFAR
IceCube
WeNMR
Bioimaging
SeaDataNet
ENES
…
?
LSST
~2000 2020+
>110 data centres from 34
countries for discovery and
access to marine data.
OPPORTUNITY
CHALLENGE
It is hard to import large amounts
of data into Ocean Data
Viewer to explore and analyse
them.
Data download is a big barrier
for climate scientists: download
takes a significant amount of
time; networks are unstable.
OCEAN CLIMATE
3-5 PB data from CMIP5
experiment alongside 30+
software tools targeting
different user communities.
https://portal.enes.org
www.seadatanet.org
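To put the download barrier in numbers, the sketch below estimates how long it would take to move the 3-5 PB CMIP5 archive over a single network link. Only the data volume comes from the slide; the link speeds (1 and 10 Gbit/s), the decimal definition of a petabyte and the assumption of a perfectly sustained transfer are illustrative assumptions, and `transfer_days` is a hypothetical helper name.

```python
def transfer_days(size_pb: float, link_gbps: float, efficiency: float = 1.0) -> float:
    """Return the days needed to move `size_pb` petabytes over one link.

    Assumptions (not from the slides): 1 PB = 10**15 bytes, the link runs
    at `link_gbps` gigabits per second, and `efficiency` is the fraction
    of that rate actually sustained (1.0 = ideal, never reached in practice).
    """
    size_bits = size_pb * 1e15 * 8                      # petabytes -> bits
    seconds = size_bits / (link_gbps * 1e9 * efficiency)
    return seconds / 86_400                             # seconds -> days


if __name__ == "__main__":
    # Archive sizes from the slide (3-5 PB); link speeds are assumed values.
    for size_pb in (3, 5):
        for link_gbps in (1, 10):
            days = transfer_days(size_pb, link_gbps)
            print(f"{size_pb} PB over {link_gbps} Gbit/s: about {days:.0f} days")
```

Even under these idealised conditions the transfer takes weeks to months, which is the motivation for bringing the analysis to the data instead of downloading it.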
ASTRONOMY
Transforming raw data to
science-ready data is a
challenge, limiting the impact
of LOFAR output.
31 stations in Europe with 30
million data products in 50+PB
storage.
www.astron.nl/telescopes/lofar
International user communities
including thousands of
researchers from China
A future outlook
• Cross-Research Infrastructure (RI) and cross-border access is difficult due
to different funding models, access and provisioning policies
• Data and service provisioning to international user communities is possible only when
supported by sound business models or existing collaboration agreements. Today only
a few structured international research groups have achieved this.
• Large investments are needed for the creation, processing, preservation,
access and reuse of research data. Will the funding match the
anticipated needs of future data-intensive science?
• Opportunities for economies of scale and aggregation of demand can arise with joint
provisioning of common infrastructure components
• Major separation between data preservation and data exploitation
infrastructures in many disciplines
• RIs and e-Infrastructures should collaborate to support the entire research workflow of
an experiment
Tomorrow’s scenario?
The Data Science Commons
A federation of research data,
computing, applications and other
open science resources,
responding to the problem of
scalable access to research data
through a new data provisioning
service approach that is
complementary to the traditional
data download model.
https://zenodo.org/record/4884748
Open Science Commons
(2014)
The Data Commons should...
Allow users to discover, access and analyze major research
datasets and information for third-party exploitation
Provide access to the data & data products close to
processing facilities while avoiding duplication of local data
storage & compute infrastructures across research
performing organizations in Europe
The Data Commons should…
• Offer a hybrid distributed compute platform (HTC, HPC, cloud) and
an integrated, rich portfolio of scientific application tools supporting
self-service provisioning
• Offer tools for scalable data movement across data preservation
infrastructures and a distributed, interconnected network of “data hubs”
• Provide integrated capabilities for publishing and sharing scientific
outputs from experiments to support open science
• Support federated authentication and authorization for use of existing
personal credentials and easy-to-use access channels
The Data Commons requires...
• Coordinated development, operations, procurement, management and
provisioning across RIs and e-Infrastructures sharing expertise and technologies
of common interest
• The federation of existing/future publicly funded digital infrastructures
leveraging national investments to achieve economies of scale across Europe
• Changes in the policies of the participating infrastructures to support both
national and international user groups - Infrastructures open by default
• International cooperation among digital infrastructures in the world, to scale
beyond Europe
https://eosc.eu/
• Provider onboarding
• Catalogue of services
• Service access management
• Helpdesk + FAQ
?
Joint position paper (Sep 2020)
The research e-Infrastructures claim
the need…
• to sustain the costs of re-use of
data by integrating data,
visualization, analysis and physical
resources to store and re-use data
for open science
Recommend…
• EOSC should play a role in
sustaining the cost of open data
re-use
Advocate that…
• the Member States need to play a
key role in the long-term service
provisioning funding to avoid
complex commercial transactions.
https://zenodo.org/record/4044010
EOSC project landscape
EGI Advanced Computing for EOSC
Mission: Implement the Compute Platform of the European
Open Science Cloud and contribute to the EOSC Data
Commons by delivering integrated computing, platforms,
data spaces and tools as an integrated solution that is
aligned with major European cloud federation projects
and HPC initiatives.
Start: January 2021
Duration: 30 months
Consortium: 33 participating partners, 23 third-parties
Budget: 12 Million EUR
57% of the budget allocated
to service provisioning
▪ 16 partners to deliver cloud, HTC
and HPC resources
▪ 12 user access and platform
solution providers
▪ 21 providers from 13 communities
to deliver data spaces
▪ 12 federation service providers
▪ EGI Foundation as coordinator
Service portfolio
Interactive
Computing
Distributed AI
training
Workload
Management
Automated Cluster
deployment
HPC
Public
Clouds
HTC
Research
Clouds
Federated Identity
Federated Compute Federated Data
Fundamental
Science Data
Space
Health Data
Space
Green Deal
Data Space
EOSC
portal
Service
Management,
Tools,
Processes,
Policies
Humanities
Data Space
Early
adopters
Data Spaces
and Analytics
Data and thematic
data analytics and
processing tools
Platforms
generic added-value
platform level
services
Federated
Access
Federation-wide
management of
data and computing
Federated
Resources
Compute and
storage facilities
Infrastructure services: Cloud, HTC, HPC
For users and communities within the project AND for new EOSC users
• Cloud-HTC compute and short-mid term storage:
▪ 80 Million CPU hours / 250,000 GPU hours / 20 PB storage
• Mix of funding mechanisms:
▪ Virtual Access:
▪ Local grants enabling policy-based access (EGI Cloud Federation +
new partners):
▪ Pay-for-use:
▪ Pilot HPC sites:
Moldova Hungary
Georgia
Latvia Austria USA China South Africa
Data spaces
Research data:
• ESGF data archive
Tools and applications:
• JupyterLab
• ENES libraries
• EC3
• Synda & DataHub
Compute infrastructure:
• Cloud and HPC sites
Compute Platform:
Resource allocation
International projects
Multi-national communities
Single users
Small groups
Experimental users
EOSC USERS
● High capacity demand
● Custom configurations
● Long term engagement
EGI-ACE Open calls
Evaluation every 2 months
EOSC Portal
● Ready-to-use resources/services
● Self-service configuration
● Short term engagement
Business-to-User Business-to-Business
Opportunity for joint EU-CN use cases!
https://www.egi.eu/projects/egi-ace/call-for-use-cases
Credits: Jianhui LI
Computer Network Information Center, CAS
EGI-ACE Objective 4: Contribute to the realization of a global Open Science Cloud
LOFAR
IceCube
WeNMR
Bioimaging
SeaDataNet
ENES
…
LSST
~2000 2020+
?
EOSC and EGI-ACE by
implementing the Data
Science Commons
Thank you!
gergely.sipos@egi.eu
www.egi.eu
Implementing the Data Science Commons through EOSC and EGI-ACE

Brighton SEO | April 2024 | Data Storytelling
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 

Implementing the Data Science Commons through EOSC and EGI-ACE

  • 1. Exploiting scientific data in the international context - The elements of success Gergely Sipos Head of Services, Solutions, Support EGI Foundation (Amsterdam) gergely.sipos@egi.eu www.egi.eu This work by the EGI Foundation is licensed under a Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/)
  • 3. OCEAN (www.seadatanet.org). Opportunity: >110 data centres from 34 countries for discovery and access to marine data. Challenge: it is hard to import large amounts of data into Ocean Data Viewer to explore and analyse them. CLIMATE (https://portal.enes.org). Opportunity: 3-5 PB of data from the CMIP5 experiment alongside 30+ software tools targeting different user communities. Challenge: data download is a big barrier for climate scientists; downloads take a significant amount of time and networks are unstable. ASTRONOMY (www.astron.nl/telescopes/lofar). Opportunity: 31 stations in Europe with 30 million data products in 50+ PB of storage. Challenge: transforming raw data into science-ready data is difficult, limiting the impact of LOFAR output.
  • 4.
  • 5. International user communities including thousands of researchers from China
  • 6.
  • 7. A future outlook • Cross-Research Infrastructure (RI) and cross-border access are difficult due to differing funding models and access and provisioning policies • Data and service provisioning to international user communities is possible only when supported by sound business models or existing collaboration agreements. Today only a few structured international research groups have achieved this. • Large investments are needed for the creation, processing, preservation, access and reuse of research data: will the funding match the anticipated needs of future data-intensive science? • Opportunities for economies of scale and aggregation of demand can arise from joint provisioning of common infrastructure components • There is a major separation between data preservation and data exploitation infrastructures in many disciplines • RIs and e-Infrastructures should collaborate to support the entire research workflow of an experiment
  • 8. Tomorrow’s scenario? The Data Science Commons: a federation of research data, computing, applications and other open science resources, responding to the problem of scalable access to research data through a new data provisioning service approach that is complementary to the traditional data download model. https://zenodo.org/record/4884748 Open Science Commons (2014)
  • 9. The Data Commons should... • Allow users to discover, access and analyze major research datasets and information for third-party exploitation • Provide access to the data and data products close to processing facilities, while avoiding duplication of local data storage and compute infrastructures across research performing organizations in Europe
  • 10. The Data Commons should… • Offer a hybrid distributed compute platform (HTC, HPC, cloud) and an integrated, rich portfolio of scientific application tools supporting self-service provisioning • Offer tools for scalable data movement across data preservation infrastructures and a distributed, interconnected network of “data hubs” • Provide integrated capabilities for publishing and sharing scientific outputs from experiments to support open science • Support federated authentication and authorization, allowing use of existing personal credentials and easy-to-use access channels
  • 11. The Data Commons requires... • Coordinated development, operations, procurement, management and provisioning across RIs and e-Infrastructures, sharing expertise and technologies of common interest • The federation of existing and future publicly funded digital infrastructures, leveraging national investments to achieve economies of scale across Europe • Changes in the policies of the participating infrastructures to support both national and international user groups: infrastructures open by default • International cooperation among digital infrastructures worldwide, to scale beyond Europe
  • 13. • Provider onboarding • Catalogue of services • Service access management • Helpdesk + FAQ
  • 14.
  • 16. Joint position paper (Sep 2020) The research e-Infrastructures claim the need… • to sustain the costs of re-use of data by integrating data, visualization, analysis and physical resources to store and re-use data for open science Recommend… • EOSC should play a role in sustaining the cost of open data re-use Advocate that… • the Member States need to play a key role in the long-term service provisioning funding to avoid complex commercial transactions. https://zenodo.org/record/4044010
  • 17. EOSC project landscape: EGI Advanced Computing for EOSC
  • 18. Mission: Implement the Compute Platform of the European Open Science Cloud and contribute to the EOSC Data Commons by delivering integrated computing, platforms, data spaces and tools as an integrated solution that is aligned with major European cloud federation projects and HPC initiatives. Start: January 2021. Duration: 30 months. Consortium: 33 participating partners, 23 third parties. Budget: 12 million EUR.
  • 19. 57% of the budget allocated to service provisioning ▪ 16 partners to deliver cloud, HTC and HPC resources ▪ 12 user access and platform solution providers ▪ 21 providers from 13 communities to deliver data spaces ▪ 12 federation service providers ▪ EGI Foundation as coordinator
  • 20. Service portfolio • Data Spaces and Analytics (data and thematic data analytics and processing tools): Fundamental Science Data Space, Health Data Space, Green Deal Data Space, Humanities Data Space, Early adopters • Platforms (generic added-value platform-level services): Interactive Computing, Distributed AI training, Workload Management, Automated Cluster deployment • Federated Access (federation-wide management of data and computing): Federated Identity, Federated Compute, Federated Data • Federated Resources (compute and storage facilities): HPC, Public Clouds, HTC, Research Clouds • Across all layers: EOSC portal; Service Management, Tools, Processes, Policies
  • 21. Infrastructure services: Cloud, HTC, HPC. For users and communities within the project AND for new EOSC users • Cloud-HTC compute and short- to mid-term storage: ▪ 80 million CPU hours / 250,000 GPU hours / 20 PB storage • Mix of funding mechanisms: ▪ Virtual Access: ▪ Local grants enabling policy-based access (EGI Cloud Federation + new partners): ▪ Pay-for-use: ▪ Pilot HPC sites: Moldova Hungary Georgia Latvia Austria USA China South Africa
  • 22. Data spaces. Research data: • ESGF data archive Tools and applications: • JupyterLab • ENES libraries • EC3 • Synda & DataHub Compute infrastructure: • Cloud and HPC sites
  • 23. Compute Platform: Resource allocation. EOSC users: international projects, multi-national communities, single users, small groups, experimental users. EGI-ACE open calls (evaluation every 2 months): ● High capacity demand ● Custom configurations ● Long-term engagement. EOSC Portal: ● Ready-to-use resources/services ● Self-service configuration ● Short-term engagement. Business-to-User; Business-to-Business. Opportunity for joint EU-CN use cases! https://www.egi.eu/projects/egi-ace/call-for-use-cases
  • 24. EGI-ACE Objective 4: Contribute to the realization of a global Open Science Cloud. Credits: Jianhui LI, Computer Network Information Center, CAS
  • 25. Timeline, ~2000 to 2020+: LOFAR, IceCube, WeNMR, Bioimaging, SeaDataNet, ENES, …, LSST, ? EOSC and EGI-ACE by implementing the Data Science Commons