SlideShare a Scribd company logo
1 of 17
Get Data to Computation
eudat.eu/b2stage
www.eudat.eu
B2STAGE
How to shift large amounts of data
Version 4
February 2016
This work is licensed under the Creative
Commons CC-BY 4.0 licence.
Attribution: EUDAT – www.eudat.eu
eudat.eu/b2stage
B2STAGE is…
a reliable, efficient, light-weight and easy-
to-use service to transfer research data
sets between EUDAT storage resources
and high-performance computing (HPC)
workspaces
2
eudat.eu/b2stage
A truly pan-European Infrastructure
3
EUDAT offers common data services
to both research communities and
individuals through a network of 35
European organisations.
EUDAT wants to enable
European researchers from any
discipline to preserve, find,
access, and process data in a
trusted environment, as part of a
Collaborative Data Infrastructure.
European infrastructures
Technology Providers
Research Communities
eudat.eu/b2stage
Community-Driven Solutions
4
PHYSICAL SCIENCES
& ENGINEERING
MATERIALS &
ANALYTICAL FACILITIES
MAPPER
BIOMEDICAL &
MEDICAL SCIENCES
EUDAT services are designed, built and implemented
based on user community requirements.
eudat.eu/b2stage
The EUDAT Service Suite
5
eudat.eu/b2stage
move large amounts of data between
data stores and high-performance
compute resources
re-ingest computational results back
into EUDAT
deposit large data sets into EUDAT
resources for long-term preservation
Facilitating communities to:
Features:
high-speed transfer
reliable and light-weight
manages permanent PIDs
6
B2STAGE Features
eudat.eu/b2stage
Why use B2STAGE?
7
Research challenges are getting larger and
more complex:
E.g. full-Earth climate simulation, coupled
simulations of multiple organs in the human
body, seismic analyses of earthquakes at
continental scale
Researcher data and compute demands are rising fast
Efficient transfer of data to high performance computing (HPC)
workspaces is essential especially in distributed computing,
where resources are geographically dispersed
eudat.eu/b2stage
Why use B2STAGE?
8
Facilitates transfer of large data
collections from EUDAT storage
resources to HPC facilities.
Provides the means to re-ingest computational results back
into the EUDAT infrastructure.
Ingests data sets into EUDAT resources for long-term
preservation.
Offers reliable, efficient, easy-to-use tools to manage data
transfers.
The Data Staging Script is the only tool handling data
transfer using PIDs.
eudat.eu/b2stage
Who can use B2STAGE?
Researchers can transfer large data collections from
EUDAT storage resources to HPC facilities for processing.
Community Managers can replicate community data
through a lightweight service and ingest data sets to
EUDAT storage resources for long term preservation.
9
eudat.eu/b2stage
How can you use B2STAGE?
EUDAT offers B2STAGE to all registered researchers and
interested communities, enabling them to make use of
the service to stage data out of EUDAT, and ingest
computational results back.
Access to remote HPC facilities should be negotiated
and arranged by individual users in parallel.
To help researchers use the B2STAGE service, EUDAT
offers documentation, training material and a service
helpdesk.
10
For more information please email:
eudat-datastaging@postit.csc.fi
eudat.eu/b2stage
How can you use B2STAGE?
11
eudat.eu/b2stage
How does B2STAGE work?
12
GridFTP server
iRODS-DSI
User desktop
GridFTP client
data
control
PID
Registry
PID
control
HPC
GridFTP server
eudat.eu/b2stage
User desktop
How does B2STAGE work?
13
GridFTP client
File system
GridFTP server
iRODS-DSI
PID
Registry
PID
data
control
eudat.eu/b2stage
B2STAGE User communities
VPH Community ingesting data onto EUDAT resources
Approximately 12TB will be ingested through this service
VPH data also replicated between RZG and PSNC sites
B2STAGE will foster the collaboration with EGI and PRACE to
develop cross-infrastructure usage:
B2STAGE will be the main service to enable the
interoperability of these infrastructures.
Numerous new communities to adopt it as part of the 2015
and 2016 Calls for Collaboration
14
eudat.eu/b2stage
B2STAGE summary
B2STAGE offers:
data staging functionalities to easily and efficiently
transfer data from EUDAT storage resources to HPC
facilities
a powerful mechanism to ingest data onto EUDAT
resources
a script to facilitate the staging, ingest and retrieval of
PID information of transferred data
B2STAGE is unique in handling PIDs for the data
15
eudat.eu/b2stage
Future features
The Data Staging Script will be replaced by a modular
and extensible python library which will furnish the users
with a programmable interface towards most of the
EUDAT services.
16
eudat.eu/b2stage
17
For more info: http://eudat.eu/services/b2stage
User documentation: http://eudat.eu/services/userdoc/b2stage
Thank you

More Related Content

What's hot

EUDAT B2Service Suite| - A new version is available at http://ow.ly/fsCi30grKHV
EUDAT B2Service Suite| - A new version is available at http://ow.ly/fsCi30grKHVEUDAT B2Service Suite| - A new version is available at http://ow.ly/fsCi30grKHV
EUDAT B2Service Suite| - A new version is available at http://ow.ly/fsCi30grKHVEUDAT
 
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu | How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu | EUDAT
 
Persistent Identifiers in EUDAT Services. B2HANDLE Python library | www.eudat...
Persistent Identifiers in EUDAT Services. B2HANDLE Python library | www.eudat...Persistent Identifiers in EUDAT Services. B2HANDLE Python library | www.eudat...
Persistent Identifiers in EUDAT Services. B2HANDLE Python library | www.eudat...EUDAT
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to MetadataEUDAT
 
B2FIND Overview February 2017 | www.eudat.eu |
B2FIND Overview February 2017 | www.eudat.eu | B2FIND Overview February 2017 | www.eudat.eu |
B2FIND Overview February 2017 | www.eudat.eu | EUDAT
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT
 
B2SHARE: Record lifecycle and HTTP API| www.eudat.eu |
B2SHARE: Record lifecycle and HTTP API| www.eudat.eu | B2SHARE: Record lifecycle and HTTP API| www.eudat.eu |
B2SHARE: Record lifecycle and HTTP API| www.eudat.eu | EUDAT
 
Introduction to eudat and its services
Introduction to eudat and its servicesIntroduction to eudat and its services
Introduction to eudat and its servicesEUDAT
 
Research engagement in EUDAT| www.eudat.eu |
Research engagement in EUDAT| www.eudat.eu | Research engagement in EUDAT| www.eudat.eu |
Research engagement in EUDAT| www.eudat.eu | EUDAT
 
Legal Issues in Research Data Collection and Sharing: An Introduction by EUDA...
Legal Issues in Research Data Collection and Sharing: An Introduction by EUDA...Legal Issues in Research Data Collection and Sharing: An Introduction by EUDA...
Legal Issues in Research Data Collection and Sharing: An Introduction by EUDA...EUDAT
 
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...EUDAT
 
Research Data Services: The EUDAT B2SERVICE SUITE | www.eudat.eu |
Research Data Services: The EUDAT B2SERVICE SUITE | www.eudat.eu | Research Data Services: The EUDAT B2SERVICE SUITE | www.eudat.eu |
Research Data Services: The EUDAT B2SERVICE SUITE | www.eudat.eu | EUDAT
 
Research data management & planning: an introduction
Research data management & planning: an introductionResearch data management & planning: an introduction
Research data management & planning: an introductionMaggie Neilson
 
Unpacking persistent identifiers for research
Unpacking persistent identifiers for researchUnpacking persistent identifiers for research
Unpacking persistent identifiers for researchARDC
 
B2DROP User Training | www.eudat.eu |
B2DROP User Training | www.eudat.eu | B2DROP User Training | www.eudat.eu |
B2DROP User Training | www.eudat.eu | EUDAT
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...EDINA, University of Edinburgh
 
EUDAT B2SHARE: How to store and publish research data | www.eudat.eu
EUDAT B2SHARE: How to store and publish research data | www.eudat.euEUDAT B2SHARE: How to store and publish research data | www.eudat.eu
EUDAT B2SHARE: How to store and publish research data | www.eudat.euEUDAT
 
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)EUDAT
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to MetadataJenn Riley
 

What's hot (20)

EUDAT B2Service Suite| - A new version is available at http://ow.ly/fsCi30grKHV
EUDAT B2Service Suite| - A new version is available at http://ow.ly/fsCi30grKHVEUDAT B2Service Suite| - A new version is available at http://ow.ly/fsCi30grKHV
EUDAT B2Service Suite| - A new version is available at http://ow.ly/fsCi30grKHV
 
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu | How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
 
Persistent Identifiers in EUDAT Services. B2HANDLE Python library | www.eudat...
Persistent Identifiers in EUDAT Services. B2HANDLE Python library | www.eudat...Persistent Identifiers in EUDAT Services. B2HANDLE Python library | www.eudat...
Persistent Identifiers in EUDAT Services. B2HANDLE Python library | www.eudat...
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to Metadata
 
B2FIND Overview February 2017 | www.eudat.eu |
B2FIND Overview February 2017 | www.eudat.eu | B2FIND Overview February 2017 | www.eudat.eu |
B2FIND Overview February 2017 | www.eudat.eu |
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
 
B2SHARE: Record lifecycle and HTTP API| www.eudat.eu |
B2SHARE: Record lifecycle and HTTP API| www.eudat.eu | B2SHARE: Record lifecycle and HTTP API| www.eudat.eu |
B2SHARE: Record lifecycle and HTTP API| www.eudat.eu |
 
Introduction to eudat and its services
Introduction to eudat and its servicesIntroduction to eudat and its services
Introduction to eudat and its services
 
Research engagement in EUDAT| www.eudat.eu |
Research engagement in EUDAT| www.eudat.eu | Research engagement in EUDAT| www.eudat.eu |
Research engagement in EUDAT| www.eudat.eu |
 
Legal Issues in Research Data Collection and Sharing: An Introduction by EUDA...
Legal Issues in Research Data Collection and Sharing: An Introduction by EUDA...Legal Issues in Research Data Collection and Sharing: An Introduction by EUDA...
Legal Issues in Research Data Collection and Sharing: An Introduction by EUDA...
 
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
 
Research Data Services: The EUDAT B2SERVICE SUITE | www.eudat.eu |
Research Data Services: The EUDAT B2SERVICE SUITE | www.eudat.eu | Research Data Services: The EUDAT B2SERVICE SUITE | www.eudat.eu |
Research Data Services: The EUDAT B2SERVICE SUITE | www.eudat.eu |
 
Research data management & planning: an introduction
Research data management & planning: an introductionResearch data management & planning: an introduction
Research data management & planning: an introduction
 
Unpacking persistent identifiers for research
Unpacking persistent identifiers for researchUnpacking persistent identifiers for research
Unpacking persistent identifiers for research
 
B2DROP User Training | www.eudat.eu |
B2DROP User Training | www.eudat.eu | B2DROP User Training | www.eudat.eu |
B2DROP User Training | www.eudat.eu |
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...
 
EUDAT B2SHARE: How to store and publish research data | www.eudat.eu
EUDAT B2SHARE: How to store and publish research data | www.eudat.euEUDAT B2SHARE: How to store and publish research data | www.eudat.eu
EUDAT B2SHARE: How to store and publish research data | www.eudat.eu
 
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
 
General concepts: DDI
General concepts: DDIGeneral concepts: DDI
General concepts: DDI
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to Metadata
 

Similar to B2STAGE- how to shift large amounts of data| www.eudat.eu |

Coupling HPC and Data Resources and services together - EUDAT Workshop at exd...
Coupling HPC and Data Resources and services together - EUDAT Workshop at exd...Coupling HPC and Data Resources and services together - EUDAT Workshop at exd...
Coupling HPC and Data Resources and services together - EUDAT Workshop at exd...EUDAT
 
Eudat presentation nov2013 | www.eudat.eu |
Eudat presentation nov2013 | www.eudat.eu | Eudat presentation nov2013 | www.eudat.eu |
Eudat presentation nov2013 | www.eudat.eu | EUDAT
 
EUDAT B2Service Suite - November 2017 | www.eudat.eu |
EUDAT B2Service Suite - November 2017 | www.eudat.eu |EUDAT B2Service Suite - November 2017 | www.eudat.eu |
EUDAT B2Service Suite - November 2017 | www.eudat.eu |EUDAT
 
EUDAT Research Data Services for all | www.eudat.eu |
EUDAT Research Data Services for all | www.eudat.eu | EUDAT Research Data Services for all | www.eudat.eu |
EUDAT Research Data Services for all | www.eudat.eu | EUDAT
 
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructureeROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructuree-ROSA
 
Data Processing and Analysis
Data Processing and AnalysisData Processing and Analysis
Data Processing and AnalysisEUDAT
 
OSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciencesOSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciencesOpen Science Fair
 
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructureeROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructuree-ROSA
 
Cross e-Infrastructure collaborations
Cross e-Infrastructure collaborationsCross e-Infrastructure collaborations
Cross e-Infrastructure collaborationsEUDAT
 
EUDAT CDI Architecture
EUDAT CDI ArchitectureEUDAT CDI Architecture
EUDAT CDI ArchitectureEUDAT
 
EUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdfEUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdfEUDAT
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euEUDAT
 
EUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederEUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederOpenAIRE
 
EUDAT Services Update
EUDAT Services UpdateEUDAT Services Update
EUDAT Services UpdateEUDAT
 
The EOSC Compute Platform with the EGI-ACE project
The EOSC Compute Platform with the EGI-ACE project The EOSC Compute Platform with the EGI-ACE project
The EOSC Compute Platform with the EGI-ACE project EGI Federation
 
EUDAT Brochure - B2SHARE.pdf
EUDAT Brochure - B2SHARE.pdfEUDAT Brochure - B2SHARE.pdf
EUDAT Brochure - B2SHARE.pdfEUDAT
 
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...Gergely Sipos
 
EOSC-hub service portfolio
EOSC-hub service portfolioEOSC-hub service portfolio
EOSC-hub service portfolioEOSC-hub project
 
EGI-EUDAT interoperability| www.eudat.eu |
EGI-EUDAT interoperability| www.eudat.eu | EGI-EUDAT interoperability| www.eudat.eu |
EGI-EUDAT interoperability| www.eudat.eu | EUDAT
 

Similar to B2STAGE- how to shift large amounts of data| www.eudat.eu | (20)

Coupling HPC and Data Resources and services together - EUDAT Workshop at exd...
Coupling HPC and Data Resources and services together - EUDAT Workshop at exd...Coupling HPC and Data Resources and services together - EUDAT Workshop at exd...
Coupling HPC and Data Resources and services together - EUDAT Workshop at exd...
 
Eudat presentation nov2013 | www.eudat.eu |
Eudat presentation nov2013 | www.eudat.eu | Eudat presentation nov2013 | www.eudat.eu |
Eudat presentation nov2013 | www.eudat.eu |
 
EUDAT B2Service Suite - November 2017 | www.eudat.eu |
EUDAT B2Service Suite - November 2017 | www.eudat.eu |EUDAT B2Service Suite - November 2017 | www.eudat.eu |
EUDAT B2Service Suite - November 2017 | www.eudat.eu |
 
EUDAT Research Data Services for all | www.eudat.eu |
EUDAT Research Data Services for all | www.eudat.eu | EUDAT Research Data Services for all | www.eudat.eu |
EUDAT Research Data Services for all | www.eudat.eu |
 
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructureeROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
 
Data Processing and Analysis
Data Processing and AnalysisData Processing and Analysis
Data Processing and Analysis
 
EUDAT B2SAFE & EOSC-hub
EUDAT B2SAFE & EOSC-hubEUDAT B2SAFE & EOSC-hub
EUDAT B2SAFE & EOSC-hub
 
OSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciencesOSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciences
 
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructureeROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
eROSA Stakeholder WS1: EUDAT – The pan-European data infrastructure
 
Cross e-Infrastructure collaborations
Cross e-Infrastructure collaborationsCross e-Infrastructure collaborations
Cross e-Infrastructure collaborations
 
EUDAT CDI Architecture
EUDAT CDI ArchitectureEUDAT CDI Architecture
EUDAT CDI Architecture
 
EUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdfEUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdf
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.eu
 
EUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederEUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan Broeder
 
EUDAT Services Update
EUDAT Services UpdateEUDAT Services Update
EUDAT Services Update
 
The EOSC Compute Platform with the EGI-ACE project
The EOSC Compute Platform with the EGI-ACE project The EOSC Compute Platform with the EGI-ACE project
The EOSC Compute Platform with the EGI-ACE project
 
EUDAT Brochure - B2SHARE.pdf
EUDAT Brochure - B2SHARE.pdfEUDAT Brochure - B2SHARE.pdf
EUDAT Brochure - B2SHARE.pdf
 
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
 
EOSC-hub service portfolio
EOSC-hub service portfolioEOSC-hub service portfolio
EOSC-hub service portfolio
 
EGI-EUDAT interoperability| www.eudat.eu |
EGI-EUDAT interoperability| www.eudat.eu | EGI-EUDAT interoperability| www.eudat.eu |
EGI-EUDAT interoperability| www.eudat.eu |
 

More from EUDAT

EUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
EUDAT_Brochure_Generica_Jan_UPDATED(5).pdfEUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
EUDAT_Brochure_Generica_Jan_UPDATED(5).pdfEUDAT
 
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdfEUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdfEUDAT
 
EUDAT Brochure - B2HANDLE.pdf
EUDAT Brochure - B2HANDLE.pdfEUDAT Brochure - B2HANDLE.pdf
EUDAT Brochure - B2HANDLE.pdfEUDAT
 
EUDAT Brochure - B2DROP.pdf
EUDAT Brochure - B2DROP.pdfEUDAT Brochure - B2DROP.pdf
EUDAT Brochure - B2DROP.pdfEUDAT
 
EUDAT Brochure - B2SAFE.pdf
EUDAT Brochure - B2SAFE.pdfEUDAT Brochure - B2SAFE.pdf
EUDAT Brochure - B2SAFE.pdfEUDAT
 
EUDAT Brochure - B2FIND(1).pdf
EUDAT Brochure - B2FIND(1).pdfEUDAT Brochure - B2FIND(1).pdf
EUDAT Brochure - B2FIND(1).pdfEUDAT
 
EUDAT Brochure - B2ACCESS.pdf
EUDAT Brochure - B2ACCESS.pdfEUDAT Brochure - B2ACCESS.pdf
EUDAT Brochure - B2ACCESS.pdfEUDAT
 
Rob Carrillo - Writing effective service documentation for EUDAT services
Rob Carrillo - Writing effective service documentation for EUDAT servicesRob Carrillo - Writing effective service documentation for EUDAT services
Rob Carrillo - Writing effective service documentation for EUDAT servicesEUDAT
 
Ariyo - EUDAT CDI B2 services documentation
Ariyo - EUDAT CDI B2 services documentationAriyo - EUDAT CDI B2 services documentation
Ariyo - EUDAT CDI B2 services documentationEUDAT
 
Using B2NOTE: The U.Porto Pilot
Using B2NOTE: The U.Porto PilotUsing B2NOTE: The U.Porto Pilot
Using B2NOTE: The U.Porto PilotEUDAT
 
OpenAIRE Advance - Kick off last week
OpenAIRE Advance - Kick off last weekOpenAIRE Advance - Kick off last week
OpenAIRE Advance - Kick off last weekEUDAT
 
European Open Science Cloud - Skills workshop
European Open Science Cloud - Skills workshopEuropean Open Science Cloud - Skills workshop
European Open Science Cloud - Skills workshopEUDAT
 
Linking service capabilities to data stweardship competences for professional...
Linking service capabilities to data stweardship competences for professional...Linking service capabilities to data stweardship competences for professional...
Linking service capabilities to data stweardship competences for professional...EUDAT
 
FAIRness of training materials
FAIRness of training materialsFAIRness of training materials
FAIRness of training materialsEUDAT
 
Training by EOSC-hub - Integrating and Managing services for the European Ope...
Training by EOSC-hub - Integrating and Managing services for the European Ope...Training by EOSC-hub - Integrating and Managing services for the European Ope...
Training by EOSC-hub - Integrating and Managing services for the European Ope...EUDAT
 
Draft Governance Framework for the EOSC
Draft Governance Framework for the EOSCDraft Governance Framework for the EOSC
Draft Governance Framework for the EOSCEUDAT
 
Building Interoperable AAI for Researchers
Building Interoperable AAI for ResearchersBuilding Interoperable AAI for Researchers
Building Interoperable AAI for ResearchersEUDAT
 
ENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science ThemeENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science ThemeEUDAT
 
Data for Science Service Portfolio
Data for Science Service PortfolioData for Science Service Portfolio
Data for Science Service PortfolioEUDAT
 
The ENVRI user landscape
The ENVRI user landscapeThe ENVRI user landscape
The ENVRI user landscapeEUDAT
 

More from EUDAT (20)

EUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
EUDAT_Brochure_Generica_Jan_UPDATED(5).pdfEUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
EUDAT_Brochure_Generica_Jan_UPDATED(5).pdf
 
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdfEUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
EUDAT_Brochure_Generica_Jan_UPDATED (1).pdf
 
EUDAT Brochure - B2HANDLE.pdf
EUDAT Brochure - B2HANDLE.pdfEUDAT Brochure - B2HANDLE.pdf
EUDAT Brochure - B2HANDLE.pdf
 
EUDAT Brochure - B2DROP.pdf
EUDAT Brochure - B2DROP.pdfEUDAT Brochure - B2DROP.pdf
EUDAT Brochure - B2DROP.pdf
 
EUDAT Brochure - B2SAFE.pdf
EUDAT Brochure - B2SAFE.pdfEUDAT Brochure - B2SAFE.pdf
EUDAT Brochure - B2SAFE.pdf
 
EUDAT Brochure - B2FIND(1).pdf
EUDAT Brochure - B2FIND(1).pdfEUDAT Brochure - B2FIND(1).pdf
EUDAT Brochure - B2FIND(1).pdf
 
EUDAT Brochure - B2ACCESS.pdf
EUDAT Brochure - B2ACCESS.pdfEUDAT Brochure - B2ACCESS.pdf
EUDAT Brochure - B2ACCESS.pdf
 
Rob Carrillo - Writing effective service documentation for EUDAT services
Rob Carrillo - Writing effective service documentation for EUDAT servicesRob Carrillo - Writing effective service documentation for EUDAT services
Rob Carrillo - Writing effective service documentation for EUDAT services
 
Ariyo - EUDAT CDI B2 services documentation
Ariyo - EUDAT CDI B2 services documentationAriyo - EUDAT CDI B2 services documentation
Ariyo - EUDAT CDI B2 services documentation
 
Using B2NOTE: The U.Porto Pilot
Using B2NOTE: The U.Porto PilotUsing B2NOTE: The U.Porto Pilot
Using B2NOTE: The U.Porto Pilot
 
OpenAIRE Advance - Kick off last week
OpenAIRE Advance - Kick off last weekOpenAIRE Advance - Kick off last week
OpenAIRE Advance - Kick off last week
 
European Open Science Cloud - Skills workshop
European Open Science Cloud - Skills workshopEuropean Open Science Cloud - Skills workshop
European Open Science Cloud - Skills workshop
 
Linking service capabilities to data stweardship competences for professional...
Linking service capabilities to data stweardship competences for professional...Linking service capabilities to data stweardship competences for professional...
Linking service capabilities to data stweardship competences for professional...
 
FAIRness of training materials
FAIRness of training materialsFAIRness of training materials
FAIRness of training materials
 
Training by EOSC-hub - Integrating and Managing services for the European Ope...
Training by EOSC-hub - Integrating and Managing services for the European Ope...Training by EOSC-hub - Integrating and Managing services for the European Ope...
Training by EOSC-hub - Integrating and Managing services for the European Ope...
 
Draft Governance Framework for the EOSC
Draft Governance Framework for the EOSCDraft Governance Framework for the EOSC
Draft Governance Framework for the EOSC
 
Building Interoperable AAI for Researchers
Building Interoperable AAI for ResearchersBuilding Interoperable AAI for Researchers
Building Interoperable AAI for Researchers
 
ENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science ThemeENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science Theme
 
Data for Science Service Portfolio
Data for Science Service PortfolioData for Science Service Portfolio
Data for Science Service Portfolio
 
The ENVRI user landscape
The ENVRI user landscapeThe ENVRI user landscape
The ENVRI user landscape
 

Recently uploaded

DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxdolaknnilon
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一F sss
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 

Recently uploaded (20)

DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptx
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 

B2STAGE- how to shift large amounts of data| www.eudat.eu |

  • 1. Get Data to Computation eudat.eu/b2stage www.eudat.eu B2STAGE How to shift large amounts of data Version 4 February 2016 This work is licensed under the Creative Commons CC-BY 4.0 licence. Attribution: EUDAT – www.eudat.eu
  • 2. eudat.eu/b2stage B2STAGE is… a reliable, efficient, light-weight and easy- to-use service to transfer research data sets between EUDAT storage resources and high-performance computing (HPC) workspaces 2
  • 3. eudat.eu/b2stage A truly pan-European Infrastructure 3 EUDAT offers common data services to both research communities and individuals through a network of 35 European organisations. EUDAT wants to enable European researchers from any discipline to preserve, find, access, and process data in a trusted environment, as part of a Collaborative Data Infrastructure. European infrastructures Technology Providers Research Communities
  • 4. eudat.eu/b2stage Community-Driven Solutions 4 PHYSICAL SCIENCES & ENGINEERING MATERIALS & ANALYTICAL FACILITIES MAPPER BIOMEDICAL & MEDICAL SCIENCES EUDAT services are designed, built and implemented based on user community requirements.
  • 6. eudat.eu/b2stage move large amounts of data between data stores and high-performance compute resources re-ingest computational results back into EUDAT deposit large data sets into EUDAT resources for long-term preservation Facilitating communities to: Features: high-speed transfer reliable and light-weight manages permanent PIDs 6 B2STAGE Features
  • 7. eudat.eu/b2stage Why use B2STAGE? 7 Research challenges are getting larger and more complex: E.g. full-Earth climate simulation, coupled simulations of multiple organs in the human body, seismic analyses of earthquakes at continental scale Researcher data and compute demands are rising fast Efficient transfer of data to high performance computing (HPC) workspaces is essential especially in distributed computing, where resources are geographically dispersed
  • 8. eudat.eu/b2stage Why use B2STAGE? 8 Facilitates transfer of large data collections from EUDAT storage resources to HPC facilities. Provides the means to re-ingest computational results back into the EUDAT infrastructure. Ingests data sets into EUDAT resources for long-term preservation. Offers reliable, efficient, easy-to-use tools to manage data transfers. The Data Staging Script is the only tool handling data transfer using PIDs.
  • 9. eudat.eu/b2stage Who can use B2STAGE? Researchers can transfer large data collections from EUDAT storage resources to HPC facilities for processing. Community Managers can replicate community data through a lightweight service and ingest data sets to EUDAT storage resources for long term preservation. 9
  • 10. eudat.eu/b2stage How can you use B2STAGE? EUDAT offers B2STAGE to all registered researchers and interested communities, enabling them to make use of the service to stage data out of EUDAT, and ingest computational results back. Access to remote HPC facilities should be negotiated and arranged by individual users in parallel. To help researchers use the B2STAGE service, EUDAT offers documentation, training material and a service helpdesk. 10 For more information please email: eudat-datastaging@postit.csc.fi
  • 11. eudat.eu/b2stage How can you use B2STAGE? 11
  • 12. eudat.eu/b2stage How does B2STAGE work? 12 GridFTP server iRODS-DSI User desktop GridFTP client data control PID Registry PID control HPC GridFTP server
  • 13. eudat.eu/b2stage User desktop How does B2STAGE work? 13 GridFTP client File system GridFTP server iRODS-DSI PID Registry PID data control
  • 14. eudat.eu/b2stage B2STAGE User communities VPH Community ingesting data onto EUDAT resources Approximately 12TB will be ingested through this service VPH data also replicated between RZG and PSNC sites B2STAGE will foster the collaboration with EGI and PRACE to develop cross-infrastructure usage: B2STAGE will be the main service to enable the interoperability of these infrastructures. Numerous new communities to adopt it as part of the 2015 and 2016 Calls for Collaboration 14
  • 15. eudat.eu/b2stage B2STAGE summary B2STAGE offers: data staging functionalities to easily and efficiently transfer data from EUDAT storage resources to HPC facilities a powerful mechanism to ingest data onto EUDAT resources a script to facilitate the staging, ingest and retrieval of PID information of transferred data B2STAGE is unique in handling PIDs for the data 15
  • 16. eudat.eu/b2stage Future features The Data Staging Script will be replaced by a modular and extensible python library which will furnish the users with a programmable interface towards most of the EUDAT services. 16
  • 17. eudat.eu/b2stage 17 For more info: http://eudat.eu/services/b2stage User documentation: http://eudat.eu/services/userdoc/b2stage Thank you

Editor's Notes

  1. The EUDAT datacentres store and replicate large amounts of data for the communities. But what about processing these data? And how do these data get into the EUDAT datacentres in the first place?
  2. B2STAGE is a reliable, efficient, light-weight and easy-to-use service to transfer research data sets between EUDAT storage resources and high-performance computing (HPC) workspaces
  3. B2STAGE is one of the services offered by EUDAT, the pan-European Infrastructure. EUDAT offers common data services, supporting multiple research communities as well as individuals, through a geographically distributed resilient network connecting general purpose data centres and community-specific data repositories. EUDAT wants to enable European researchers from any discipline to preserve, find, access, and process data in a trusted environment, as part of a Collaborative Data Infrastructure.
  4. The services offered by EUDAT are community driven as they are designed, built and implemented based on user community requirements. This means that the communities have direct influence on these services and contribute to the development of them. Services are defined not just by researchers in the EUDAT collaboration, but we also elicit requirements from other communities to ensure our services are as generic and applicable to as wide an audience as possible.
  5. The EUDAT service suite represents an integrated set of services to support researchers manage their data through the data lifecycle. As your data moves through the data lifecycle, EUDAT services will help you manage your data using best practices followed by some of the world’s largest communities. The services available cover a wide range of functionalities. B2SAFE enables communities to replicate and safely store their large-scale data on robust, reliable datacentres operated by the EUDAT partners. B2HANDLE registers all data on EUDAT with a unique identifier which can be globally resolved on the standard handle system. B2DROP allows EUDAT users to easily exchange working data, while B2SHARE allows to deposit and disseminate final research data at a smaller scale, but easier than with B2SAFE. B2FIND allows searches on the EUDAT metadata and is one of the key enablers of multi-disciplinary research on EUDAT. B2ACCESS is the simple and secure authorisation and authentication platform of EUDAT, which allows single sign-on on EUDAT’s public and internal service. B2STAGE, the subject of this talk, offers communities an entry-point to ingest and replicate into EUDAT large volumes of data. Data ingested through B2STAGE are registered with a Persistent Identifier using the mechanism adopted by B2SAFE.
  6. EUDAT offers the B2STAGE service, which allows big, research data to move efficiently between storage and computation. The service also takes care of depositing the computation output from the HPC facilities to EUDAT. B2STAGE can also be used to deposit the community data into the EUDAT facilities. B2STAGE uses the established gridFTP protocol to ensure high-speed transfer between the sites. Data transfer is reliable and requires very little user interaction. B2STAGE also assigns PIDs to computational output that the user elects to inject back into the EUDAT datacentres.
  7. B2STAGE was conceived to deal with modern day research challenges. As hardware and research software improve, scope for research is broadening. Communities now pursue large-scale simulations, for example developing models for climate simulation encompassing the whole of the Earth, as opposed to isolated regions. Scientists simulate not only organs in the human body, but also their interactions. Similarly, earthquake data are now collected and processed for areas as large as entire continents. The common requirement of such research challenges is that they generate and process increasing volumes of data, with typical workflows requiring data to be processed in a distributed fashion, so as to cope with the pace of data generation. In order for this to be possible, data need to be transferred in an efficient way to the high-performance or high-throughput computing resources, and this is where B2STAGE comes in.
  8. B2STAGE was developed to address specific user requirements. The fundamental use-case is to allow data already ingested into EUDAT to move to HPC facilities for processing. This is important not only in the case where the community that deposited the data process them as per their original intention, but also in more advanced scenarios of inter-disciplinary research on open data. In this case B2STAGE moves heterogeneous data for processing allowing data combinations that were not previously thought of. B2STAGE also allows users to push the results of the computation safely back into EUDAT, where they may be preserved and/or further replicated according to the community policies. B2STAGE is developed over the gridFTP protocol, which make data transfers reliable and efficient. To ease use, EUDAT has developed the companion Data Staging Script, a client-side tool that facilitates the data transfer commands and handles PIDs for the data resources involved.
  9. The main end-users are EUDAT researchers, who can transfer their data between storage and computation as part of their day-to-day workflows. Using B2STAGE to inject community data into EUDAT is generally a function of Community Managers.
  10. As per slide.
  11. The B2STAGE service is deployed on EUDAT datacentres and many HPC nodes. Access to EUDAT nodes is automatic for all EUDAT registered users, though users would need to arrange access to HPC nodes separately. The user can use clients running on their desktop or on other log-in servers that they have access to. Globus Online provides a GUI and a command-line interface, or the user may prefer to use the native GridFTP command-line interface, or other GridFTP clients like UberFTP. We spoke about the EUDAT Data Staging Script earlier. EUDAT is also working on an HTTP interface. In all cases, the user uses their client of choice to initiate transfers between B2STAGE instances on EUDAT and HPC centres, or between their desktop and nodes that are enabled with B2STAGE.
  12. (This is a continuation from the last sentence of the previous slide). This is better depicted in this figure. The user employs the GridFTP client of their choice, which interacts with B2STAGE instances on the sites involved in the transfer. Underneath the B2STAGE hood is a GridFTP server, enriched with the EUDAT Data Storage Interface component. When data arrive at an EUDAT node to be deposited, the B2SAFE service ensures that a PID is generated by B2HANDLE for each artefact, and this is recorded in the EPIC PID Register. The iRODS Server also handles any replication required for these artefacts, according to the community policies that apply to the user who initiated the transfer. If the user utilises the EUDAT DSS script, then any PIDs generated, and this again depends on the iRODS server configuration and the community agreement, are returned to them.
  13. The situation is similar when the user transfers data into an EUDAT centre.
  14. B2STAGE is used by EUDAT research communities. For example, the Virtual Physiological Human community is ingesting data onto EUDAT using B2STAGE. The community is to deposit approximately 12TB of data into RZG, which will be replicated by B2SAFE into PSNC. B2STAGE will also be instrumental to establishing the forthcoming collaboration with the EGI and PRACE research infrastructures. The aim of these collaborations is to create the framework and foster cross-infrastructure usage. B2STAGE will be the software to bridge between these infrastructures. And the new projects starting from early 2016 as part of the two EUDAT Calls of Collaboration will use B2STAGE to transfer their data into EUDAT.
  15. As per bullets
  16. As per bullet
  17. For more info please visit: http://eudat.eu/services/b2stage. The User documentation can be found at: http://eudat.eu/services/userdoc/b2stage