SlideShare una empresa de Scribd logo
1 de 51
OGCE Workflow Toolkit for Multi-Scale Science Applications Suresh Marru Pervasive Technology Institute Indiana University ODI Gateway/Pipeline Evaluation  NOAO Jan 21st 2010
TeraGrid
Data Capacitor ,[object Object]
IU Bloomington data center
NSF funded project to create a high throughput 535 TB Lustre storage solution
Short term storage for applications with high bandwidth needs
IU added a wide area version in 2008
Enhanced wide area security,[object Object]
Massive Data Storage System MDSS Architecture
[object Object]
Wide
Open,[object Object]
Gateways Advantages Increase access To instruments Increase capabilities To analyze data Improve workforce development For underserved populations Increase outreach Increase public awareness Public sees value in investments in large facilities
There are approximately 30 gateways using the TeraGrid
Example Gateway Accomplishments LEAD - access to radar data NVO – access to sky surveys OOI – access to sensor data PolarGrid – access to polar ice sheet data SIDGrid – analysis tools GridChem – developing multiscale coupling How would this have been done before gateways? How many details do we want each individual scientist to need to know?
TeraGrid Advantages and Challenges What’s different when the resource doesn’t belong just to me? Resource discovery Accounting Security Proposal-based requests for resources (peer-reviewed access) Code scaling and performance numbers Justification of resources Gateway citations Tremendous benefits at the high end, but even more work for the developers Potential impact on science is huge Small number of developers can impact thousands of scientists But need a way to train and fund those developers and provide them with appropriate tools
Example Gateway: LEAD
Weather is Local, High-Impact, Heterogeneous and Rapidly Evolving…Yet Our Technologies and Thinking are Static Rain and Snow Fog Rain and Snow Snow and Freezing Rain Intense Turbulence Severe Thunderstorms
LEAD Dynamic Adaptive Infrastructure  Storms Forming Forecast Model Streaming Observations Data Mining Instrument Steering Refine forecast grid
NSF Engineering Research Center for Collaborative Adaptive Sensing of the Atmosphere (CASA) UMass/Amherst, OU, CSU, UPRM Concept:  inexpensive, phased array Doppler radars on cell towers and buildings Dynamically adaptive sensing of multiple targets while simultaneously meeting multiple end-user needs
LEAD Workflow Requirements Run jobs on-demand on TeraGrid. Deadline driven workflows (severe weather tracking) Users ranging from 8th grade students to seasoned researchers. Run jobs on Multiple TeraGrid resources to decrease turn-around time. Must be able to integrate to Portal with very user friendly web interface.
Workflow Survey in 2003(http://www.extreme.indiana.edu/swf-survey/)
High Level LEAD Architecture Workflow graph Application  services Compute Engine User Portal Workflow Engine Fault Tolerance & scheduler Event Notification Bus Portal server MyLEAD  Agent service Data Management Service Data Catalog service Providence Collection service MyLEAD User Metadata catalog Data Storage
LEAD Pioneering Technology
LEAD Scientists and Educational Interactions Lowering the barrier for using complex end-to-end weather technologies Democratize Empower Facilitate End Users Developers Researcherss
Layered Architecture
Open Grid Computing Environments Generalize, Harden, Build Test
Flexible Layered Service Oriented Architecture User Interactions Other Clients XBaya GUI Web Portal XBaya Core Event Bus Middleware Services GFac Services Workflow Engine (ODE) XRegistry XMCCat Metadata Catalog Compute & Data Resources Computational Cloud  Local Lab Resources Computational Grids
OGCE Workflow Suite Generic Service Toolkit Tool to wrap command-line applications as web services Handles file staging & job submission and monitoring Extensible runtime for security, resource brokering & urgent computing Generic Factory service for on-demand creation of application services XRegistry Information repository for the OGCE workflow suite Register, search, retrieve & share XML documents User & hierarchical group based authorization XBaya GUI based tool to compose & monitor workflows Extensible support for compiler plug-ins like BPEL, Jython, SCUFL Dynamic Workflow Execution support to start, pause, resume, rewind of workflow executions Apache ODE Scientific Workflow Extensions XBaya GUI integration for BPEL Generation Asynchronous support for long running workflows Instrumented with fine grained monitoring Eventing System Supports both WS-Eventing and WS-Notification Standards Very scalable Persistent Message Box for clients behind firewalls and with intermittent network glitches.
Generic Application Service Factory
Application Wrapper Framework c Application Factory ,[object Object]
The Application Factory generates a web service for each application with I/O interfaces.
Registers WSDL for the service with a registry
Each service generates a stream of notifications that log the service actions back to the XMCCat Metadata Catalog, user monitoring, and provenance tracking toolsApp  Service Run program & publish events
Service Monitoring via Events 1 2 3 4 5 6 ,[object Object]
I am running your request
I have started to move your input files
I have all the files
I am running your application
The application is finished
I am moving the output to you file space
I am done
These are automatically generated by the service using a distributed event system(WS-Eventing / WS-Notification)
Topic based pub-sub system witha well known “channel”Application Service Instance Notification Channel x x publisher listener
Workflow Composer
Interoperable XBaya Workflow Architecture BPEL  1.1 BPEL  2.0 SCUFL Abstract DAG Model Composition and Monitoring Python Dynamic Enactor/Interpreter Jython Based Enactor GPEL  Engine Apache  ODE Engine Taverna Python Runtime Message Bus
WS-BPEL Business Process Execution Language for Web Services (WS-BPEL) De-facto standard for specifying web service based business processes and service compositions Basic activities Invoke, Receive, Assign.. Structured activities Sequence, Flow, ForEach,..
Workflow Composition, Execution & Monitoring     XBaya enables users to construct, share, execute and monitor sequence of tasks executing on their local workstations to high-end compute resources.
GPEL Grid Process Execution Language BPEL4WS based home grown research workflow engine Supports a subset of BPEL4WS 1.1  One of the very early adaptations of BPEL efforts Specifically designed for eScience Usage Long running workflow support Decoupled client
Benefits of Porting to Apache ODE

Más contenido relacionado

La actualidad más candente

Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelMoving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
DataWorks Summit
 
Real-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in ActionReal-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in Action
DataWorks Summit
 

La actualidad más candente (20)

r1501e
r1501er1501e
r1501e
 
Project DRAC: Creating an applications-aware network
Project DRAC: Creating an applications-aware networkProject DRAC: Creating an applications-aware network
Project DRAC: Creating an applications-aware network
 
Soa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng crSoa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng cr
 
Workload Automation for Cloud Migration and Machine Learning Platform
Workload Automation for Cloud Migration and Machine Learning PlatformWorkload Automation for Cloud Migration and Machine Learning Platform
Workload Automation for Cloud Migration and Machine Learning Platform
 
Cs6703 grid and cloud computing unit 1
Cs6703 grid and cloud computing unit 1Cs6703 grid and cloud computing unit 1
Cs6703 grid and cloud computing unit 1
 
Unit 4
Unit 4Unit 4
Unit 4
 
Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...
Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...
Comparison of Open-Source Data Stream Processing Engines: Spark Streaming, Fl...
 
Cloud Computing Course
Cloud Computing Course Cloud Computing Course
Cloud Computing Course
 
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureHadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
 
Hybrid Cloud Monitoring - Datatdog
Hybrid Cloud Monitoring - DatatdogHybrid Cloud Monitoring - Datatdog
Hybrid Cloud Monitoring - Datatdog
 
An overview of grid monitoring
An overview of grid monitoringAn overview of grid monitoring
An overview of grid monitoring
 
Enterprise Data and Analytics Architecture Overview for Electric Utility
Enterprise Data and Analytics Architecture Overview for Electric UtilityEnterprise Data and Analytics Architecture Overview for Electric Utility
Enterprise Data and Analytics Architecture Overview for Electric Utility
 
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelMoving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
How to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data AnalyticsHow to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
 
DRAFT - Enterprise Data and Analytics Architecture Overview for Electric Utility
DRAFT - Enterprise Data and Analytics Architecture Overview for Electric UtilityDRAFT - Enterprise Data and Analytics Architecture Overview for Electric Utility
DRAFT - Enterprise Data and Analytics Architecture Overview for Electric Utility
 
IRJET- A Survey on Remote Data Possession Verification Protocol in Cloud Storage
IRJET- A Survey on Remote Data Possession Verification Protocol in Cloud StorageIRJET- A Survey on Remote Data Possession Verification Protocol in Cloud Storage
IRJET- A Survey on Remote Data Possession Verification Protocol in Cloud Storage
 
Real-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in ActionReal-Time Robot Predictive Maintenance in Action
Real-Time Robot Predictive Maintenance in Action
 
V04405122126
V04405122126V04405122126
V04405122126
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
 

Similar a Ogce Workflow Suite

FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble
 
Educause Annual 2007
Educause Annual 2007Educause Annual 2007
Educause Annual 2007
Neil Matatall
 

Similar a Ogce Workflow Suite (20)

Scientific
Scientific Scientific
Scientific
 
grid mining
grid mininggrid mining
grid mining
 
XSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata TutorialXSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata Tutorial
 
Grid computing
Grid computingGrid computing
Grid computing
 
SomeSlides
SomeSlidesSomeSlides
SomeSlides
 
Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
As34269277
As34269277As34269277
As34269277
 
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
 
Indiana University's Advanced Science Gateway Support
Indiana University's Advanced Science Gateway SupportIndiana University's Advanced Science Gateway Support
Indiana University's Advanced Science Gateway Support
 
Privacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagePrivacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storage
 
Web Services in the Real World
Web Services in the Real WorldWeb Services in the Real World
Web Services in the Real World
 
Educause Annual 2007
Educause Annual 2007Educause Annual 2007
Educause Annual 2007
 
Atmosphere 2014: Switching from monolithic approach to modular cloud computin...
Atmosphere 2014: Switching from monolithic approach to modular cloud computin...Atmosphere 2014: Switching from monolithic approach to modular cloud computin...
Atmosphere 2014: Switching from monolithic approach to modular cloud computin...
 
Harbour IT & VMware - vForum 2010 Wrap
Harbour IT & VMware - vForum 2010 WrapHarbour IT & VMware - vForum 2010 Wrap
Harbour IT & VMware - vForum 2010 Wrap
 
070416 Egu Vienna Husar
070416 Egu Vienna Husar070416 Egu Vienna Husar
070416 Egu Vienna Husar
 
Stream analytics
Stream analyticsStream analytics
Stream analytics
 
Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...
Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...
Conceptualizing And Prototyping A Scalable Genomic Data Analysis Pipeline: Us...
 
A cloud environment for backup and data storage
A cloud environment for backup and data storageA cloud environment for backup and data storage
A cloud environment for backup and data storage
 
Data Science in the cloud with Microsoft Azure
Data Science in the cloud with Microsoft Azure Data Science in the cloud with Microsoft Azure
Data Science in the cloud with Microsoft Azure
 

Más de smarru

Gsoc airavata
Gsoc airavataGsoc airavata
Gsoc airavata
smarru
 
Apache Student Induction ApacheCon 2013
Apache Student Induction ApacheCon 2013Apache Student Induction ApacheCon 2013
Apache Student Induction ApacheCon 2013
smarru
 

Más de smarru (9)

Cyberinfrastructure Experiences with Apache Airavata
Cyberinfrastructure Experiences with Apache AiravataCyberinfrastructure Experiences with Apache Airavata
Cyberinfrastructure Experiences with Apache Airavata
 
Apache Airavata Credential Store
Apache Airavata Credential StoreApache Airavata Credential Store
Apache Airavata Credential Store
 
RESTLess Design with Apache Thrift: Experiences from Apache Airavata
RESTLess Design with Apache Thrift: Experiences from Apache AiravataRESTLess Design with Apache Thrift: Experiences from Apache Airavata
RESTLess Design with Apache Thrift: Experiences from Apache Airavata
 
Google Summer of Code at Apache Software Foundation
Google Summer of Code at Apache Software FoundationGoogle Summer of Code at Apache Software Foundation
Google Summer of Code at Apache Software Foundation
 
Gsoc airavata
Gsoc airavataGsoc airavata
Gsoc airavata
 
Learning Open Source through GSOC
Learning Open Source through GSOC Learning Open Source through GSOC
Learning Open Source through GSOC
 
Apache Student Induction ApacheCon 2013
Apache Student Induction ApacheCon 2013Apache Student Induction ApacheCon 2013
Apache Student Induction ApacheCon 2013
 
Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013
 
Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09Ogce Workflow Suite Tg09
Ogce Workflow Suite Tg09
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 

Ogce Workflow Suite

  • 1. OGCE Workflow Toolkit for Multi-Scale Science Applications Suresh Marru Pervasive Technology Institute Indiana University ODI Gateway/Pipeline Evaluation NOAO Jan 21st 2010
  • 3.
  • 5. NSF funded project to create a high throughput 535 TB Lustre storage solution
  • 6. Short term storage for applications with high bandwidth needs
  • 7. IU added a wide area version in 2008
  • 8.
  • 9. Massive Data Storage System MDSS Architecture
  • 10.
  • 11. Wide
  • 12.
  • 13. Gateways Advantages Increase access To instruments Increase capabilities To analyze data Improve workforce development For underserved populations Increase outreach Increase public awareness Public sees value in investments in large facilities
  • 14. There are approximately 30 gateways using the TeraGrid
  • 15. Example Gateway Accomplishments LEAD - access to radar data NVO – access to sky surveys OOI – access to sensor data PolarGrid – access to polar ice sheet data SIDGrid – analysis tools GridChem – developing multiscale coupling How would this have been done before gateways? How many details do we want each individual scientist to need to know?
  • 16. TeraGrid Advantages and Challenges What’s different when the resource doesn’t belong just to me? Resource discovery Accounting Security Proposal-based requests for resources (peer-reviewed access) Code scaling and performance numbers Justification of resources Gateway citations Tremendous benefits at the high end, but even more work for the developers Potential impact on science is huge Small number of developers can impact thousands of scientists But need a way to train and fund those developers and provide them with appropriate tools
  • 18. Weather is Local, High-Impact, Heterogeneous and Rapidly Evolving…Yet Our Technologies and Thinking are Static Rain and Snow Fog Rain and Snow Snow and Freezing Rain Intense Turbulence Severe Thunderstorms
  • 19. LEAD Dynamic Adaptive Infrastructure Storms Forming Forecast Model Streaming Observations Data Mining Instrument Steering Refine forecast grid
  • 20. NSF Engineering Research Center for Collaborative Adaptive Sensing of the Atmosphere (CASA) UMass/Amherst, OU, CSU, UPRM Concept: inexpensive, phased array Doppler radars on cell towers and buildings Dynamically adaptive sensing of multiple targets while simultaneously meeting multiple end-user needs
  • 21. LEAD Workflow Requirements Run jobs on-demand on TeraGrid. Deadline driven workflows (severe weather tracking) Users ranging from 8th grade students to seasoned researchers. Run jobs on Multiple TeraGrid resources to decrease turn-around time. Must be able to integrate to Portal with very user friendly web interface.
  • 22. Workflow Survey in 2003(http://www.extreme.indiana.edu/swf-survey/)
  • 23. High Level LEAD Architecture Workflow graph Application services Compute Engine User Portal Workflow Engine Fault Tolerance & scheduler Event Notification Bus Portal server MyLEAD Agent service Data Management Service Data Catalog service Providence Collection service MyLEAD User Metadata catalog Data Storage
  • 25. LEAD Scientists and Educational Interactions Lowering the barrier for using complex end-to-end weather technologies Democratize Empower Facilitate End Users Developers Researcherss
  • 26.
  • 28. Open Grid Computing Environments Generalize, Harden, Build Test
  • 29. Flexible Layered Service Oriented Architecture User Interactions Other Clients XBaya GUI Web Portal XBaya Core Event Bus Middleware Services GFac Services Workflow Engine (ODE) XRegistry XMCCat Metadata Catalog Compute & Data Resources Computational Cloud Local Lab Resources Computational Grids
  • 30. OGCE Workflow Suite Generic Service Toolkit Tool to wrap command-line applications as web services Handles file staging & job submission and monitoring Extensible runtime for security, resource brokering & urgent computing Generic Factory service for on-demand creation of application services XRegistry Information repository for the OGCE workflow suite Register, search, retrieve & share XML documents User & hierarchical group based authorization XBaya GUI based tool to compose & monitor workflows Extensible support for compiler plug-ins like BPEL, Jython, SCUFL Dynamic Workflow Execution support to start, pause, resume, rewind of workflow executions Apache ODE Scientific Workflow Extensions XBaya GUI integration for BPEL Generation Asynchronous support for long running workflows Instrumented with fine grained monitoring Eventing System Supports both WS-Eventing and WS-Notification Standards Very scalable Persistent Message Box for clients behind firewalls and with intermittent network glitches.
  • 32.
  • 33. The Application Factory generates a web service for each application with I/O interfaces.
  • 34. Registers WSDL for the service with a registry
  • 35. Each service generates a stream of notifications that log the service actions back to the XMCCat Metadata Catalog, user monitoring, and provenance tracking toolsApp Service Run program & publish events
  • 36.
  • 37. I am running your request
  • 38. I have started to move your input files
  • 39. I have all the files
  • 40. I am running your application
  • 42. I am moving the output to you file space
  • 44. These are automatically generated by the service using a distributed event system(WS-Eventing / WS-Notification)
  • 45. Topic based pub-sub system witha well known “channel”Application Service Instance Notification Channel x x publisher listener
  • 47. Interoperable XBaya Workflow Architecture BPEL 1.1 BPEL 2.0 SCUFL Abstract DAG Model Composition and Monitoring Python Dynamic Enactor/Interpreter Jython Based Enactor GPEL Engine Apache ODE Engine Taverna Python Runtime Message Bus
  • 48. WS-BPEL Business Process Execution Language for Web Services (WS-BPEL) De-facto standard for specifying web service based business processes and service compositions Basic activities Invoke, Receive, Assign.. Structured activities Sequence, Flow, ForEach,..
  • 49. Workflow Composition, Execution & Monitoring XBaya enables users to construct, share, execute and monitor sequence of tasks executing on their local workstations to high-end compute resources.
  • 50. GPEL Grid Process Execution Language BPEL4WS based home grown research workflow engine Supports a subset of BPEL4WS 1.1 One of the very early adaptations of BPEL efforts Specifically designed for eScience Usage Long running workflow support Decoupled client
  • 51. Benefits of Porting to Apache ODE
  • 52.
  • 53. Simple Recovery Architecture Portal BPEL Workflow Engine Application Performance Models Fault Tolerance/ Recovery Service Resource Reliability Models Application Service OVP/ RST/ MIG NWS, MDS BQP Notification Service Deadline & Success Probability 36 36
  • 54. OGCE Workflow Usage Flow Scientist/Application provider registers application description with Registry Service. Workflow Author constructs the workflow with multiple wrapped application services. Workflow is compiled and deployed to the ODE workflow Engine. Workflow inputs are captured by XBaya and workflow is launched to ODE. Workflow system and possibly some services publish notifications to the Message bus reporting the progress of the workflow. XBaya monitoring system listens to notifications and color the workflow components to present workflow progress.
  • 55. Packaged, Downloadable Software http://www.collab-ogce.org/ogce/index.php/Main_Page
  • 56. ODI Overview – Image Acquisition WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 57. Overview – Image Copy WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 58. Overview – Image Transfer to Data Capacitor WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 59. Overview – Image Ingestions into the Archive WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 60. Overview – Clean Up WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 61. Overview – Automated Tier 1 Processing WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 62. Overview – User Driven Tier 2 Processing WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 63. Overview – User Driven Tier 2 Processing WIYN Buffer ODI Data Capacitor Integration Server Data Capacitor Science Gateway End Users MDSS Archive TeraGrid Compute Resources
  • 64.
  • 65. Some apps have rich Client Gui’s, a challenge with asynchronous long running workflows
  • 67. Parametric sweep scheduling, monitoring iteration steps, graphical composition
  • 68.
  • 69.
  • 70. Input list of amino acid sequences
  • 72.
  • 73. Live Demo & Questions?

Notas del editor

  1. Scientific workflow requirements challenge WS-BPEL