SlideShare una empresa de Scribd logo
1 de 36
Descargar para leer sin conexión
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
• Click	
  to	
  edit	
  
Master	
  0tle	
  style	
  
WS-­‐VLAM	
  Workflow	
  Management	
  
System	
  and	
  its	
  Applica0ons	
  
Adam	
  Belloum	
  
Ins0tute	
  of	
  Informa0cs	
  	
  
University	
  of	
  Amsterdam	
  
a.s.z.belloum@uva.nl	
  
Lunch meeting, Netherlands eScience Center, Amsterdam 2013
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Outline	
  
•  Introduc0on	
  
–  	
  Life	
  cycle	
  of	
  e-­‐Science	
  Workflow	
  	
  
•  Different	
  approaches	
  to	
  workflow	
  scheduling	
  
–  Workflow	
  Process	
  Modeling	
  &	
  Management	
  In	
  Grid/
Cloud	
  
–  Workflow	
  and	
  Web	
  services	
  (intrusive/non	
  intrusive)	
  
•  Provenance	
  
•  Compu0ng	
  in	
  the	
  browser	
  
•  Conclusions	
  
	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Workflow	
  Management	
  Systems	
  
Workflow	
  management	
  system	
  coordinates	
  the	
  
execu/on	
  of	
  a	
  scien0fic	
  applica0ons	
  on	
  a	
  set	
  of	
  
compu/ng	
  distributed	
  resources	
  
	
  
Compu0ng	
  task	
  management	
  	
  
Grid/Cloud	
  middleware	
  	
  
Data	
  Security	
  
Data	
  provenance	
   Workflow	
  	
  
Engine	
  
	
  
	
  
	
  
Repositories	
  
	
  
Data Management
http://www.youtube.com/watch v=R6bTFrzaR_w&feature=player_embedded
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
	
  Life	
  Cycle	
  of	
  a	
  Scien0fic	
  Experiment	
  
(1) Problem
investigation:
•  Look for relevant problems
•  Browse available tools
•  Define the goal
•  Decompose into steps
(2) Experiment
Prototyping:
•  Design experiment workflows
•  Develop necessary components
(3) Experiment
Execution:
•  Execute experiment processes
•  Control the execution
•  Collect and analysis data
(4) Results
Publication:
•  Annotate data
•  Publish data
Shared
repositories
Collaborative e-Science experiments: from scientific workflow to knowledge sharing A.S.Z. Belloum, Vladimir
Korkhov, Spiros Koulouzis, Marcia A Inda, and Marian Bubak JULY/AUGUSTInternet Computing, IEEE, vol.15, no.4, pp.39-47,
July-Aug. 2011 doi: 10.1109/MIC.2011.87
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Model	
  of	
  computa0on	
  
•  Target:	
  stream-­‐based	
  Applica0ons	
  
–  Engine	
  co-­‐allocates	
  all	
  workflow	
  components	
  
•  Communica0on:	
  0me	
  coupled	
  
–  Assumes	
  components	
  are	
  running	
  
–  Simultaneously	
  	
  
–  Synchronized	
  p2p	
  
WS-­‐VLAM	
  Engine:	
  Architecture	
  (1st	
  genera0on)	
  
Service host(s) compute element(s)
GRAM
services
GT4 Java Container
RTSM Factory
Delegation service
Worker nodes
Pre/ws-GRAM/GT5
Client
Delegate
RTSM
Instance
Workflow
components
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
WS-­‐VLAM	
  Features	
  
•  Provide	
  streaming	
  facili0es	
  between	
  applica0ons	
  executed	
  on	
  resource	
  
geographically	
  distributed.	
  	
  
•  Composi/on	
  and	
  the	
  execu0on	
  of	
  hierarchical	
  workflows.	
  	
  
•  Remote	
  graphical	
  output.	
  	
  
•  Detach/a:ach	
  capability	
  for	
  long	
  running	
  workflows.	
  	
  
•  Provides	
  a	
  monitoring	
  facili0es	
  based	
  on	
  the	
  WS-­‐no0fica0on.	
  
•  Provides	
  workflow	
  farming	
  possibili0es.	
  	
  
More features: http://staff.science.uva.nl/~gvlam/wsvlam/demos/wsvlam-about.html
SigWin-Detector workflow has been developed in the VL-e project to detect ridges in
for instance a Gene Expression sequence or Human transcriptome map, BMC Research
Notes 2008, 1:63 doi:10.1186/1756-0500-1-63.
DNA curvature of the Escherichia Coli chromosome
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Easy	
  Deployment	
  	
  	
  	
  	
  	
  	
  czxc`	
  
Workflow composer (java Web Start)
•  WS-VLAM composer
•  VBrowser
•  Semantic tools
SAW: Semantic Annotation for Workflow
CLAMP: Connecting LAnguage for Modules & Programs
HAMMER: Hybrid-bAsed MatchMaker for e-Science
Resources
Sara: National super
computing center
Server host
Production Grid
Experimental
Environment
Storage
WSRF Services (2 services)
- WS-VLAM engine
- workflow component repository
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
WS-VLAM composer
•  Workflow engine may be
invoked form other
systems like
§  Taverna, Kepler, Pgrade
•  Workflow may be made
available to entire
community
§  using Web 2.0 approach
Workflow	
  Sharing	
  and	
  re-­‐usability	
  	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Comparing	
  to	
  other	
  Workflow	
  	
  
Management	
  Systems	
  	
  
•  WS-­‐VLAM	
  was	
  studied	
  by	
  Elts	
  and	
  Bungartz	
  in	
  
2010	
  from	
  the	
  Ins0tute	
  of	
  informa0cs	
  of	
  the	
  
Technical	
  university	
  of	
  Munich	
  as	
  a	
  poten0al	
  
pla]orm	
  for	
  a	
  PhD	
  work	
  
Grid-Workflow-Management-Systeme fur die
Ausfuhrung wissenschaftlicher Prozessablaufe
Ekaterina Elts, Hans-Joachim Bungartz
http://staff.science.uva.nl/~gvlam/wsvlam/
Publications/workflow_review_Elts.pdf
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
WS-­‐VLAM	
  Comparing	
  to	
  other	
  Workflow	
  	
  
Management	
  Systems	
  	
  
by Ekaterina Elts TUMunchen
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
WS-­‐VLAM	
  Comparing	
  to	
  other	
  Workflow	
  	
  
Management	
  Systems	
  	
  
by Ekaterina Elts TUMunchen
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
WS-­‐VLAM	
  Engine:	
  architecture	
  (2nd	
  genera0on)	
  
•  Target:	
  	
  loosely	
  couple	
  applica0ons	
  
– components	
  scheduled	
  depending	
  on	
  data	
  
– components	
  only	
  ac/vated	
  when	
  data	
  is	
  available	
  
– no	
  need	
  for	
  co-­‐alloca/on	
  
	
  
•  Communica0on:	
  0me	
  decouples	
  
– messaging	
  communica0on	
  system.	
  
– components	
  not	
  synchronized	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
WS-­‐VLAM	
  Engine:	
  Architecture	
  (2nd	
  genera0on)	
  
Data	
  driven	
  Workflow	
  coordina0on	
  
Reginald Cushing, Spiros Koulouzis, Adam S. Z. Belloum, Marian Bubak, Prediction-based Auto-scaling
of Scientific Workflows, MGC’2011, December 12th, 2011, Lisbon, Portugal
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Farming	
  with	
  WS-­‐VLAM	
  
•  Task	
  farming:	
  task	
  replica0on	
  
–  parameter	
  sweep	
  applica0on,	
  	
  
–  DNA	
  Sequencing,	
  	
  
–  Monte	
  Carlo,	
  	
  
–  …	
  	
  
•  	
  Implements	
  3	
  types	
  of	
  farming:	
  
–  Auto	
  Farming:	
  the	
  number	
  of	
  tasks/services	
  to	
  run	
  is	
  propor0onal	
  to	
  the	
  
load	
  	
  
–  One-­‐to-­‐One	
  Farming:	
  A	
  task	
  replicated	
  for	
  every	
  message	
  received.	
  
–  Fixed	
  Farming:	
  user	
  defined	
  farming.	
  
	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Farming	
  and	
  Auto-­‐scaling	
  of	
  Workflows	
  
Reginald Cushing, Spiros Koulouzis, Adam S. Z. Belloum, Marian Bubak, Prediction-based Auto-scaling
of Scientific Workflows, Proceedings of the 9th International Workshop on Middleware for Grids, Clouds
and e-Science, ACM/IFIP/USENIX December 12th, 2011, Lisbon, Portugal
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Workflow	
  as	
  a	
  Service	
  (WFaaS)	
  
Reduce	
  Scheduling	
  Overhead	
  
Workflow as a Service: An Approach to Workflow Farming, Reginald Cushing, Adam S. Z.
Belloum, V. Korkhov, D. Vasyunin, M.T. Bubak, C. Leguy ECMLS’12, June 18, 2012, Delft,
The Netherlands
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Resource	
  on-­‐demand	
  for	
  Applica0ons	
  with	
  Hard	
  
Deadline	
  (Urgent	
  Compu0ng)	
  
Resource on-demand using multiple cloud providers, Super-computing 2010, and SCALE 2012
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Outline	
  
(1)
Problem
investigation:
(2)
Experiment
Prototyping
(3)
Experiment
Execution:
(4)
Results
Publication:
Shared
repositories
•  Introduc0on	
  
– 	
  Life	
  cycle	
  of	
  e-­‐Science	
  Workflow	
  	
  
•  Different	
  approaches	
  to	
  workflow	
  scheduling	
  
– Workflow	
  Process	
  Modeling	
  &	
  Management	
  In	
  
Grid/Cloud	
  
– Workflow	
  and	
  Web	
  services	
  (intrusive/non	
  
intrusive)	
  
•  Provenance	
  
•  Compu0ng	
  in	
  the	
  browser	
  
•  Conclusions	
  
	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Web	
  Services	
  in	
  eScience	
  with	
  WS-­‐VLAM	
  
•  WS	
  offer	
  interoperability	
  and	
  flexibility	
  in	
  a	
  large	
  
scale	
  distributed	
  environment.	
  	
  
•  WS	
  can	
  be	
  combined	
  in	
  a	
  workflow	
  so	
  that	
  more	
  
complex	
  opera0ons	
  may	
  be	
  achieved,	
  but	
  any	
  
workflow	
  implementa0on	
  is	
  poten0ally	
  faced	
  with	
  a	
  
data	
  transport	
  problem	
  or	
  being	
  overload	
  	
  
Scale	
  up	
  the	
  number	
  of	
  Web	
  service	
  to	
  keep	
  up	
  with	
  the	
  
incoming	
  load	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Scaling	
  up	
  the	
  number	
  of	
  Web	
  services	
  	
  
•  Tasks/Jobs	
  can	
  be	
  queued	
  on	
  the	
  
runqueue.	
  	
  
•  The	
  service	
  submission	
  listens	
  on	
  the	
  
runqueue	
  and	
  picks	
  up	
  new	
  tasks	
  to	
  
submit	
  
•  Resources	
  such	
  as	
  Grid	
  or	
  Cloud	
  are	
  
abstracted	
  using	
  submieers	
  plugins	
  
–  Enabling	
  a	
  new	
  resource	
  is	
  a	
  maeer	
  of	
  
wri0ng	
  its	
  submieer	
  (Condor,	
  ibis,	
  …)	
  
	
  
Reginald Cushing, Spiros Koulouzis, Adam S. Z. Belloum, Marian Bubak, Dynamic Handling for
Cooperating Scientific Web Services, 7th IEEE International Conference on e-Science, December 2011,
Stockholm, Sweden
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Sequence	
  Alignment	
  Use	
  Case	
  
•  Workflow	
  with	
  2	
  pipelines.	
  The	
  
pipelines	
  perform	
  sequence	
  
alignments	
  using	
  data	
  from	
  
UniProtKB	
  
•  Each	
  pipeline	
  performs	
  22500	
  
alignments	
  i.e.	
  45100	
  total	
  
alignments	
  in	
  all	
  
•  All	
  modules	
  are	
  standard	
  web	
  services	
  which	
  are	
  hosted	
  in	
  the	
  modified	
  Axis2	
  container	
  	
  
•  The	
  alignments	
  where	
  performed	
  using	
  BioJava	
  api	
  
•  Source	
  and	
  sink	
  are	
  part	
  of	
  the	
  bootstrapping	
  sequence.	
  	
  
–  Source	
  submits	
  the	
  getSequenceId	
  service	
  	
  
–  while	
  sink	
  waits	
  for	
  output	
  from	
  the	
  htmlRenderer	
  
Reginald Cushing, Spiros Koulouzis, Adam S. Z. Belloum, Marian Bubak, Dynamic Handling for
Cooperating Scientific Web Services, 7th IEEE International Conference on e-Science, December 2011,
Stockholm, Sweden
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Scale	
  up	
  web	
  services	
  
Peaks	
  in	
  load(lej)	
  will	
  result	
  in	
  peaks	
  in	
  
instances(right).	
  	
  
	
  
The	
  fuzzy	
  controllers	
  scale	
  up	
  the	
  web	
  services	
  to	
  
meet	
  the	
  demands	
  
	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Enabling	
  web	
  services	
  to	
  consume	
  and	
  produce	
  
large	
  distributed	
  	
  
•  In	
  service	
  orchestra0on,	
  all	
  data	
  is	
  passed	
  to	
  the	
  workflow	
  engine	
  
before	
  delivered	
  to	
  a	
  consuming	
  WS	
  
•  Data	
  transfers	
  are	
  made	
  through	
  SOAP,	
  which	
  is	
  unfit	
  for	
  large	
  data	
  
transfers	
  
Enabling web services to consume and produce large distributed datasets Spiros
Koulouzis, Reginald Cushing, Konstantinos Karasavvas, Adam Belloum, Marian Bubak to be
published JAN/FEB, IEEE Internet Computing, 2012
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Enabling	
  web	
  services	
  to	
  consume	
  and	
  produce	
  
large	
  distributed	
  	
  
Enabling web services to consume and produce large distributed datasets Spiros
Koulouzis, Reginald Cushing, Konstantinos Karasavvas, Adam Belloum, Marian Bubak to be
published JAN/FEB, IEEE Internet Computing, 2012
Indexing Web Services for Information
Retrieval (NER) are tools that help
biologists to identify and retrieve information
•  Index 8.4GB of medline documents
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Outline	
  
(1)
Problem
investigation:
(2)
Experiment
Prototyping
(3)
Experiment
Execution:
(4)
Results
Publication:
Shared
repositories
•  Introduc0on	
  
– 	
  Life	
  cycle	
  of	
  e-­‐Science	
  Workflow	
  	
  
•  Different	
  approaches	
  to	
  workflow	
  scheduling	
  
– Workflow	
  Process	
  Modeling	
  &	
  Management	
  In	
  
Grid/Cloud	
  
– Workflow	
  and	
  Web	
  services	
  (intrusive/non	
  
intrusive)	
  
•  Provenance	
  
•  Compu0ng	
  in	
  the	
  browser	
  
•  Conclusions	
  
	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Provenance/	
  Reproducibility	
  	
  
•  “A	
  complete	
  provenance	
  record	
  for	
  a	
  data	
  object	
  
allows	
  the	
  possibility	
  to	
  reproduce	
  the	
  result	
  and	
  
reproducibility	
  is	
  a	
  cri0cal	
  component	
  of	
  the	
  
scien0fic	
  method”	
  
•  Provenance:	
  The	
  recording	
  of	
  metadata	
  and	
  
provenance	
  informa0on	
  during	
  the	
  various	
  stages	
  
of	
  the	
  workflow	
  lifecycle	
  
Workflows and e-Science: An overview of workflow system features
and capabilities Ewa Deelmana, Dennis Gannonb, Matthew Shields c, Ian Taylor, Future
Generation Computer Systems 25 (2009) 528540
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
History-­‐tracing	
  XML	
  (FH	
  Aachen)	
  
provides	
  data/process	
  provenance	
  
following	
  an	
  approach	
  that	
  	
  
•  maps	
  the	
  workflow	
  graph	
  to	
  a	
  
layered	
  structure	
  of	
  an	
  XML	
  
document.	
  	
  
•  This	
  allows	
  an	
  intui0ve	
  and	
  easy	
  
processable	
  representa0on	
  of	
  
the	
  workflow	
  execu0on	
  path	
  
	
  	
  
•  Workflow	
  components	
  can	
  be	
  
eventually,	
  electronically	
  signed.	
  
	
  
	
  
<patternMatch>
<events>
<PortResolved>provenance data</PortResolved>
<ConDone> provenance data </ConDone>
...
</events>
<fileReader2>
<events> ... </events>
<sign-fileReader2> ... </sign-fileReader2>
</fileReader2>
<sffToFasta>
Reference
</sffToFasta>
<sign-patternMatch> ... </sign-patternMatch>
</patternMatch>
M. Gerards, Adam S. Z. Belloum, F. Berritz, V. Snder, S. Skorupa, A History-tracing XML-base
Provenance Framework for workflows, WORKS 2010, New Orleans, USA, November 2010
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
[Biomedical engineering Cardiovascular
biomechanics group TUE])
wave propagation model of blood flow in large
vessels using an approximate velocity profile
function:
a biomedical study for which 3000 runs were
required to perform a global sensitivity analysis
of a blood pressure wave propagation in arteries
User Interface to compose workflow (top
right), monitor the execution of the farmed
workflows (top left), and monitor each run
separately (bottom left) data
Query interface for the provenance data
collected from 3000 simulations of the
“wave propagation model of blood flow in
large vessels using an approximate velocity
profile function”
BigGrid project 2009, presented EGI/
BigGrid technical forum 2010
Wave	
  Propaga0on	
  in	
  Blood	
  Flow	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Alignment	
  of	
  DNA	
  Sequence	
  (Blast)	
  
For Each workflow run
•  The provenance data is collected an stored
following the XML-tracing system
•  User interface allows to reproduce events that
occurred at runtime (replay mode)
•  User Interface can be customized (User can
select the events to track)
•  User Interface show resource usage
The aim of the application is the alignment
of DNA sequence data with a given
reference database.
on-going work UvA-AMC-fh-aachen
[Department of Clinical
Epidemiology, Biostatistics and
Bioinformatics (KEBB), AMC ]
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Sensi0vity	
  Analysis	
  of	
  Cardiovascular	
  Models	
  
Study covers: 164 patient-specific input parameters
(i.e. vessel diameter and length),
•  1494000 Monte Carlo runs are needed for this
first analysis.
•  Expected execution time on a PC 14.525 hours
(20 Months)
FP7 Project VPH-Share " Virtual Physiological
Human: Sharing for Healthcare”
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Protein	
  Folding	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Outline	
  
(1)
Problem
investigation:
(2)
Experiment
Prototyping
(3)
Experiment
Execution:
(4)
Results
Publication:
Shared
repositories
•  Introduc0on	
  
– 	
  Life	
  cycle	
  of	
  e-­‐Science	
  Workflow	
  	
  
•  Different	
  approaches	
  to	
  workflow	
  scheduling	
  
– Workflow	
  Process	
  Modeling	
  &	
  Management	
  In	
  
Grid/Cloud	
  
– Workflow	
  and	
  Web	
  services	
  (intrusive/non	
  
intrusive)	
  
•  Provenance	
  
•  Compu0ng	
  in	
  the	
  browser	
  
•  Conclusions	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Compu0ng	
  in	
  the	
  Browser	
  
(different	
  approach	
  to	
  Grid-­‐desktop)	
  
•  Objec/ves	
  
–  Distributed	
  compu0ng	
  using	
  web	
  browsers	
  
•  Features:	
  	
  
–  volunteer	
  compu0ng	
  instantly	
  (no	
  third	
  party	
  sojware	
  
installa0on)	
  
•  How	
  does	
  it	
  work:	
  	
  
–  Social	
  media	
  	
  mediates	
  the	
  trust	
  between	
  the	
  user	
  and	
  the	
  
volunteers	
  asked	
  to	
  join	
  the	
  network.	
  
•  A	
  user	
  with	
  a	
  distributed	
  applica0on	
  uses	
  social	
  media	
  to	
  get	
  	
  colleagues	
  and	
  friends	
  to	
  
donate	
  CPU	
  
•  Colleagues	
  and	
  friends	
  	
  join	
  the	
  network	
  by	
  	
  simply	
  opening	
  	
  the	
  shared	
  URL.	
  	
  
•  	
  Compu/ng	
  starts	
  almost	
  instantly.	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Compu0ng	
  in	
  the	
  Browser	
  
Distributed computing on an Ensemble of Browsers, R. Cushing, G.a Putra, S. Koulouzis, A.S.Z Belloum,
M.T. Bubak, C. de Laat IEEE Internet Computing, PrePress 10.1109/MIC.2013.3, January 2013
Application
•  Computing 33,000 bio-informatics tasks on
the global cluster of browsers
•  Announcing the experiment using social
media: via social media tools: twitter,
FaceBook, Linkedin, and project mailing lists.
•  Volunteers were asked to open the Weevil web
page http://elab.lab.uvalight.net/~weevil/
and agree to donate their CPU for 3 hours on
Friday December 2011 from 12:00-14:00
0.4
0.6
0.8
1
1.2
1.4
00:00 01:00 02:00 03:00 04:00 05:00
0.0e+00
5.0e+03
1.0e+04
1.5e+04
2.0e+04
2.5e+04
3.0e+04
3.5e+04
4.0e+04
EstimatedGFLOPS
CompletedTasks
Time HH:MM
Tasks
Aggregated Power (GFLOPS)
New Web Technologies make JavaScript engines
more powerful:
•  Web workers
•  web socket
•  WebGL
•  WebCL
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
Conclusions	
  
•  WS-­‐VLAM	
  has	
  interes0ng	
  features	
  (farming,	
  automa0c	
  
scaling,	
  hierarchical	
  workflow,	
  provenance,	
  …)	
  which	
  proved	
  
to	
  be	
  interes0ng	
  for	
  a	
  number	
  scien0fic	
  applica0ons	
  	
  
•  WS-­‐VLAM	
  harnesses	
  various	
  type	
  of	
  compu0ng	
  resources	
  
(desktops,	
  Grid,	
  and	
  Cloud	
  resources)	
  	
  
•  WS-­‐VLAM	
  has	
  modular	
  design	
  which	
  make	
  easy	
  to	
  adapt/
extend	
  	
  
• Clck	
  to	
  edit	
  Master	
  0tle	
  style	
  
http://www.science.n/~gvlam/wsvlam/
http://www.commit-nl.nl/

Más contenido relacionado

Similar a Adam e science_2013_1

Simulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresSimulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresCloudLightning
 
Opal: Simple Web Services Wrappers for Scientific Applications
Opal: Simple Web Services Wrappers for Scientific ApplicationsOpal: Simple Web Services Wrappers for Scientific Applications
Opal: Simple Web Services Wrappers for Scientific ApplicationsSriram Krishnan
 
Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud lyingcom
 
Build Java Web Application Using Apache Struts
Build Java Web Application Using Apache Struts Build Java Web Application Using Apache Struts
Build Java Web Application Using Apache Struts weili_at_slideshare
 
Introduction To Apache Mesos
Introduction To Apache MesosIntroduction To Apache Mesos
Introduction To Apache MesosTimothy St. Clair
 
Lightening the burden of cloud resources administration: from VMs to Functions
Lightening the burden of cloud resources administration: from VMs to FunctionsLightening the burden of cloud resources administration: from VMs to Functions
Lightening the burden of cloud resources administration: from VMs to FunctionsEUBrasilCloudFORUM .
 
Cloudsim & greencloud
Cloudsim & greencloud Cloudsim & greencloud
Cloudsim & greencloud nedamaleki87
 
CloudLightning Simulator
CloudLightning SimulatorCloudLightning Simulator
CloudLightning SimulatorCloudLightning
 
Creating a Centralized Consumer Profile Management Service with WebSphere Dat...
Creating a Centralized Consumer Profile Management Service with WebSphere Dat...Creating a Centralized Consumer Profile Management Service with WebSphere Dat...
Creating a Centralized Consumer Profile Management Service with WebSphere Dat...Prolifics
 
DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Dynamic resource allocation using vir...
DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Dynamic resource allocation using vir...DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Dynamic resource allocation using vir...
DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Dynamic resource allocation using vir...IEEEGLOBALSOFTTECHNOLOGIES
 
Software Architecture for Cloud Infrastructure
Software Architecture for Cloud InfrastructureSoftware Architecture for Cloud Infrastructure
Software Architecture for Cloud InfrastructureTapio Rautonen
 
AWS Canberra WWPS Summit 2013 - AWS for Web Applications
AWS Canberra WWPS Summit 2013 - AWS for Web ApplicationsAWS Canberra WWPS Summit 2013 - AWS for Web Applications
AWS Canberra WWPS Summit 2013 - AWS for Web ApplicationsAmazon Web Services
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxjKool
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxDataStax
 
Cloudsim & Green Cloud
Cloudsim & Green CloudCloudsim & Green Cloud
Cloudsim & Green CloudNeda Maleki
 
Hia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iibHia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iibAndrew Coleman
 
Performance architecture for cloud connect
Performance architecture for cloud connectPerformance architecture for cloud connect
Performance architecture for cloud connectAdrian Cockcroft
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLNordic APIs
 
Chap 2 classification of parralel architecture and introduction to parllel p...
Chap 2  classification of parralel architecture and introduction to parllel p...Chap 2  classification of parralel architecture and introduction to parllel p...
Chap 2 classification of parralel architecture and introduction to parllel p...Malobe Lottin Cyrille Marcel
 
Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901WeCloudData
 

Similar a Adam e science_2013_1 (20)

Simulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresSimulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud Infrastructures
 
Opal: Simple Web Services Wrappers for Scientific Applications
Opal: Simple Web Services Wrappers for Scientific ApplicationsOpal: Simple Web Services Wrappers for Scientific Applications
Opal: Simple Web Services Wrappers for Scientific Applications
 
Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud Opportunities and Challenges for Running Scientific Workflows on the Cloud
Opportunities and Challenges for Running Scientific Workflows on the Cloud
 
Build Java Web Application Using Apache Struts
Build Java Web Application Using Apache Struts Build Java Web Application Using Apache Struts
Build Java Web Application Using Apache Struts
 
Introduction To Apache Mesos
Introduction To Apache MesosIntroduction To Apache Mesos
Introduction To Apache Mesos
 
Lightening the burden of cloud resources administration: from VMs to Functions
Lightening the burden of cloud resources administration: from VMs to FunctionsLightening the burden of cloud resources administration: from VMs to Functions
Lightening the burden of cloud resources administration: from VMs to Functions
 
Cloudsim & greencloud
Cloudsim & greencloud Cloudsim & greencloud
Cloudsim & greencloud
 
CloudLightning Simulator
CloudLightning SimulatorCloudLightning Simulator
CloudLightning Simulator
 
Creating a Centralized Consumer Profile Management Service with WebSphere Dat...
Creating a Centralized Consumer Profile Management Service with WebSphere Dat...Creating a Centralized Consumer Profile Management Service with WebSphere Dat...
Creating a Centralized Consumer Profile Management Service with WebSphere Dat...
 
DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Dynamic resource allocation using vir...
DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Dynamic resource allocation using vir...DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Dynamic resource allocation using vir...
DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Dynamic resource allocation using vir...
 
Software Architecture for Cloud Infrastructure
Software Architecture for Cloud InfrastructureSoftware Architecture for Cloud Infrastructure
Software Architecture for Cloud Infrastructure
 
AWS Canberra WWPS Summit 2013 - AWS for Web Applications
AWS Canberra WWPS Summit 2013 - AWS for Web ApplicationsAWS Canberra WWPS Summit 2013 - AWS for Web Applications
AWS Canberra WWPS Summit 2013 - AWS for Web Applications
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStax
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStax
 
Cloudsim & Green Cloud
Cloudsim & Green CloudCloudsim & Green Cloud
Cloudsim & Green Cloud
 
Hia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iibHia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iib
 
Performance architecture for cloud connect
Performance architecture for cloud connectPerformance architecture for cloud connect
Performance architecture for cloud connect
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
 
Chap 2 classification of parralel architecture and introduction to parllel p...
Chap 2  classification of parralel architecture and introduction to parllel p...Chap 2  classification of parralel architecture and introduction to parllel p...
Chap 2 classification of parralel architecture and introduction to parllel p...
 
Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Último (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

Adam e science_2013_1

  • 1. • Clck  to  edit  Master  0tle  style   • Click  to  edit   Master  0tle  style   WS-­‐VLAM  Workflow  Management   System  and  its  Applica0ons   Adam  Belloum   Ins0tute  of  Informa0cs     University  of  Amsterdam   a.s.z.belloum@uva.nl   Lunch meeting, Netherlands eScience Center, Amsterdam 2013
  • 2. • Clck  to  edit  Master  0tle  style   Outline   •  Introduc0on   –   Life  cycle  of  e-­‐Science  Workflow     •  Different  approaches  to  workflow  scheduling   –  Workflow  Process  Modeling  &  Management  In  Grid/ Cloud   –  Workflow  and  Web  services  (intrusive/non  intrusive)   •  Provenance   •  Compu0ng  in  the  browser   •  Conclusions    
  • 3. • Clck  to  edit  Master  0tle  style   Workflow  Management  Systems   Workflow  management  system  coordinates  the   execu/on  of  a  scien0fic  applica0ons  on  a  set  of   compu/ng  distributed  resources     Compu0ng  task  management     Grid/Cloud  middleware     Data  Security   Data  provenance   Workflow     Engine         Repositories     Data Management http://www.youtube.com/watch v=R6bTFrzaR_w&feature=player_embedded
  • 4. • Clck  to  edit  Master  0tle  style    Life  Cycle  of  a  Scien0fic  Experiment   (1) Problem investigation: •  Look for relevant problems •  Browse available tools •  Define the goal •  Decompose into steps (2) Experiment Prototyping: •  Design experiment workflows •  Develop necessary components (3) Experiment Execution: •  Execute experiment processes •  Control the execution •  Collect and analysis data (4) Results Publication: •  Annotate data •  Publish data Shared repositories Collaborative e-Science experiments: from scientific workflow to knowledge sharing A.S.Z. Belloum, Vladimir Korkhov, Spiros Koulouzis, Marcia A Inda, and Marian Bubak JULY/AUGUSTInternet Computing, IEEE, vol.15, no.4, pp.39-47, July-Aug. 2011 doi: 10.1109/MIC.2011.87
  • 5. • Clck  to  edit  Master  0tle  style   Model  of  computa0on   •  Target:  stream-­‐based  Applica0ons   –  Engine  co-­‐allocates  all  workflow  components   •  Communica0on:  0me  coupled   –  Assumes  components  are  running   –  Simultaneously     –  Synchronized  p2p   WS-­‐VLAM  Engine:  Architecture  (1st  genera0on)   Service host(s) compute element(s) GRAM services GT4 Java Container RTSM Factory Delegation service Worker nodes Pre/ws-GRAM/GT5 Client Delegate RTSM Instance Workflow components
  • 6. • Clck  to  edit  Master  0tle  style   WS-­‐VLAM  Features   •  Provide  streaming  facili0es  between  applica0ons  executed  on  resource   geographically  distributed.     •  Composi/on  and  the  execu0on  of  hierarchical  workflows.     •  Remote  graphical  output.     •  Detach/a:ach  capability  for  long  running  workflows.     •  Provides  a  monitoring  facili0es  based  on  the  WS-­‐no0fica0on.   •  Provides  workflow  farming  possibili0es.     More features: http://staff.science.uva.nl/~gvlam/wsvlam/demos/wsvlam-about.html SigWin-Detector workflow has been developed in the VL-e project to detect ridges in for instance a Gene Expression sequence or Human transcriptome map, BMC Research Notes 2008, 1:63 doi:10.1186/1756-0500-1-63. DNA curvature of the Escherichia Coli chromosome
  • 7. • Clck  to  edit  Master  0tle  style   Easy  Deployment              czxc`   Workflow composer (java Web Start) •  WS-VLAM composer •  VBrowser •  Semantic tools SAW: Semantic Annotation for Workflow CLAMP: Connecting LAnguage for Modules & Programs HAMMER: Hybrid-bAsed MatchMaker for e-Science Resources Sara: National super computing center Server host Production Grid Experimental Environment Storage WSRF Services (2 services) - WS-VLAM engine - workflow component repository
  • 8. • Clck  to  edit  Master  0tle  style   WS-VLAM composer •  Workflow engine may be invoked form other systems like §  Taverna, Kepler, Pgrade •  Workflow may be made available to entire community §  using Web 2.0 approach Workflow  Sharing  and  re-­‐usability    
  • 9. • Clck  to  edit  Master  0tle  style   Comparing  to  other  Workflow     Management  Systems     •  WS-­‐VLAM  was  studied  by  Elts  and  Bungartz  in   2010  from  the  Ins0tute  of  informa0cs  of  the   Technical  university  of  Munich  as  a  poten0al   pla]orm  for  a  PhD  work   Grid-Workflow-Management-Systeme fur die Ausfuhrung wissenschaftlicher Prozessablaufe Ekaterina Elts, Hans-Joachim Bungartz http://staff.science.uva.nl/~gvlam/wsvlam/ Publications/workflow_review_Elts.pdf
  • 10. • Clck  to  edit  Master  0tle  style   WS-­‐VLAM  Comparing  to  other  Workflow     Management  Systems     by Ekaterina Elts TUMunchen
  • 11. • Clck  to  edit  Master  0tle  style   WS-­‐VLAM  Comparing  to  other  Workflow     Management  Systems     by Ekaterina Elts TUMunchen
  • 12. • Clck  to  edit  Master  0tle  style   WS-­‐VLAM  Engine:  architecture  (2nd  genera0on)   •  Target:    loosely  couple  applica0ons   – components  scheduled  depending  on  data   – components  only  ac/vated  when  data  is  available   – no  need  for  co-­‐alloca/on     •  Communica0on:  0me  decouples   – messaging  communica0on  system.   – components  not  synchronized  
  • 13. • Clck  to  edit  Master  0tle  style   WS-­‐VLAM  Engine:  Architecture  (2nd  genera0on)   Data  driven  Workflow  coordina0on   Reginald Cushing, Spiros Koulouzis, Adam S. Z. Belloum, Marian Bubak, Prediction-based Auto-scaling of Scientific Workflows, MGC’2011, December 12th, 2011, Lisbon, Portugal
  • 14. • Clck  to  edit  Master  0tle  style   Farming  with  WS-­‐VLAM   •  Task  farming:  task  replica0on   –  parameter  sweep  applica0on,     –  DNA  Sequencing,     –  Monte  Carlo,     –  …     •   Implements  3  types  of  farming:   –  Auto  Farming:  the  number  of  tasks/services  to  run  is  propor0onal  to  the   load     –  One-­‐to-­‐One  Farming:  A  task  replicated  for  every  message  received.   –  Fixed  Farming:  user  defined  farming.    
  • 15. • Clck  to  edit  Master  0tle  style   Farming  and  Auto-­‐scaling  of  Workflows   Reginald Cushing, Spiros Koulouzis, Adam S. Z. Belloum, Marian Bubak, Prediction-based Auto-scaling of Scientific Workflows, Proceedings of the 9th International Workshop on Middleware for Grids, Clouds and e-Science, ACM/IFIP/USENIX December 12th, 2011, Lisbon, Portugal
  • 16. • Clck  to  edit  Master  0tle  style   Workflow  as  a  Service  (WFaaS)   Reduce  Scheduling  Overhead   Workflow as a Service: An Approach to Workflow Farming, Reginald Cushing, Adam S. Z. Belloum, V. Korkhov, D. Vasyunin, M.T. Bubak, C. Leguy ECMLS’12, June 18, 2012, Delft, The Netherlands
  • 17. • Clck  to  edit  Master  0tle  style   Resource  on-­‐demand  for  Applica0ons  with  Hard   Deadline  (Urgent  Compu0ng)   Resource on-demand using multiple cloud providers, Super-computing 2010, and SCALE 2012
  • 18. • Clck  to  edit  Master  0tle  style   Outline   (1) Problem investigation: (2) Experiment Prototyping (3) Experiment Execution: (4) Results Publication: Shared repositories •  Introduc0on   –   Life  cycle  of  e-­‐Science  Workflow     •  Different  approaches  to  workflow  scheduling   – Workflow  Process  Modeling  &  Management  In   Grid/Cloud   – Workflow  and  Web  services  (intrusive/non   intrusive)   •  Provenance   •  Compu0ng  in  the  browser   •  Conclusions    
  • 19. • Clck  to  edit  Master  0tle  style   Web  Services  in  eScience  with  WS-­‐VLAM   •  WS  offer  interoperability  and  flexibility  in  a  large   scale  distributed  environment.     •  WS  can  be  combined  in  a  workflow  so  that  more   complex  opera0ons  may  be  achieved,  but  any   workflow  implementa0on  is  poten0ally  faced  with  a   data  transport  problem  or  being  overload     Scale  up  the  number  of  Web  service  to  keep  up  with  the   incoming  load  
  • 20. • Clck  to  edit  Master  0tle  style   Scaling  up  the  number  of  Web  services     •  Tasks/Jobs  can  be  queued  on  the   runqueue.     •  The  service  submission  listens  on  the   runqueue  and  picks  up  new  tasks  to   submit   •  Resources  such  as  Grid  or  Cloud  are   abstracted  using  submieers  plugins   –  Enabling  a  new  resource  is  a  maeer  of   wri0ng  its  submieer  (Condor,  ibis,  …)     Reginald Cushing, Spiros Koulouzis, Adam S. Z. Belloum, Marian Bubak, Dynamic Handling for Cooperating Scientific Web Services, 7th IEEE International Conference on e-Science, December 2011, Stockholm, Sweden
  • 21. • Clck  to  edit  Master  0tle  style   Sequence  Alignment  Use  Case   •  Workflow  with  2  pipelines.  The   pipelines  perform  sequence   alignments  using  data  from   UniProtKB   •  Each  pipeline  performs  22500   alignments  i.e.  45100  total   alignments  in  all   •  All  modules  are  standard  web  services  which  are  hosted  in  the  modified  Axis2  container     •  The  alignments  where  performed  using  BioJava  api   •  Source  and  sink  are  part  of  the  bootstrapping  sequence.     –  Source  submits  the  getSequenceId  service     –  while  sink  waits  for  output  from  the  htmlRenderer   Reginald Cushing, Spiros Koulouzis, Adam S. Z. Belloum, Marian Bubak, Dynamic Handling for Cooperating Scientific Web Services, 7th IEEE International Conference on e-Science, December 2011, Stockholm, Sweden
  • 22. • Clck  to  edit  Master  0tle  style   Scale  up  web  services   Peaks  in  load(lej)  will  result  in  peaks  in   instances(right).       The  fuzzy  controllers  scale  up  the  web  services  to   meet  the  demands    
  • 23. • Clck  to  edit  Master  0tle  style   Enabling  web  services  to  consume  and  produce   large  distributed     •  In  service  orchestra0on,  all  data  is  passed  to  the  workflow  engine   before  delivered  to  a  consuming  WS   •  Data  transfers  are  made  through  SOAP,  which  is  unfit  for  large  data   transfers   Enabling web services to consume and produce large distributed datasets Spiros Koulouzis, Reginald Cushing, Konstantinos Karasavvas, Adam Belloum, Marian Bubak to be published JAN/FEB, IEEE Internet Computing, 2012
  • 24. • Clck  to  edit  Master  0tle  style   Enabling  web  services  to  consume  and  produce   large  distributed     Enabling web services to consume and produce large distributed datasets Spiros Koulouzis, Reginald Cushing, Konstantinos Karasavvas, Adam Belloum, Marian Bubak to be published JAN/FEB, IEEE Internet Computing, 2012 Indexing Web Services for Information Retrieval (NER) are tools that help biologists to identify and retrieve information •  Index 8.4GB of medline documents
  • 25. • Clck  to  edit  Master  0tle  style   Outline   (1) Problem investigation: (2) Experiment Prototyping (3) Experiment Execution: (4) Results Publication: Shared repositories •  Introduc0on   –   Life  cycle  of  e-­‐Science  Workflow     •  Different  approaches  to  workflow  scheduling   – Workflow  Process  Modeling  &  Management  In   Grid/Cloud   – Workflow  and  Web  services  (intrusive/non   intrusive)   •  Provenance   •  Compu0ng  in  the  browser   •  Conclusions    
  • 26. • Clck  to  edit  Master  0tle  style   Provenance/  Reproducibility     •  “A  complete  provenance  record  for  a  data  object   allows  the  possibility  to  reproduce  the  result  and   reproducibility  is  a  cri0cal  component  of  the   scien0fic  method”   •  Provenance:  The  recording  of  metadata  and   provenance  informa0on  during  the  various  stages   of  the  workflow  lifecycle   Workflows and e-Science: An overview of workflow system features and capabilities Ewa Deelmana, Dennis Gannonb, Matthew Shields c, Ian Taylor, Future Generation Computer Systems 25 (2009) 528540
  • 27. • Clck  to  edit  Master  0tle  style   History-­‐tracing  XML  (FH  Aachen)   provides  data/process  provenance   following  an  approach  that     •  maps  the  workflow  graph  to  a   layered  structure  of  an  XML   document.     •  This  allows  an  intui0ve  and  easy   processable  representa0on  of   the  workflow  execu0on  path       •  Workflow  components  can  be   eventually,  electronically  signed.       <patternMatch> <events> <PortResolved>provenance data</PortResolved> <ConDone> provenance data </ConDone> ... </events> <fileReader2> <events> ... </events> <sign-fileReader2> ... </sign-fileReader2> </fileReader2> <sffToFasta> Reference </sffToFasta> <sign-patternMatch> ... </sign-patternMatch> </patternMatch> M. Gerards, Adam S. Z. Belloum, F. Berritz, V. Snder, S. Skorupa, A History-tracing XML-base Provenance Framework for workflows, WORKS 2010, New Orleans, USA, November 2010
  • 28. • Clck  to  edit  Master  0tle  style   [Biomedical engineering Cardiovascular biomechanics group TUE]) wave propagation model of blood flow in large vessels using an approximate velocity profile function: a biomedical study for which 3000 runs were required to perform a global sensitivity analysis of a blood pressure wave propagation in arteries User Interface to compose workflow (top right), monitor the execution of the farmed workflows (top left), and monitor each run separately (bottom left) data Query interface for the provenance data collected from 3000 simulations of the “wave propagation model of blood flow in large vessels using an approximate velocity profile function” BigGrid project 2009, presented EGI/ BigGrid technical forum 2010 Wave  Propaga0on  in  Blood  Flow  
  • 29. • Clck  to  edit  Master  0tle  style   Alignment  of  DNA  Sequence  (Blast)   For Each workflow run •  The provenance data is collected an stored following the XML-tracing system •  User interface allows to reproduce events that occurred at runtime (replay mode) •  User Interface can be customized (User can select the events to track) •  User Interface show resource usage The aim of the application is the alignment of DNA sequence data with a given reference database. on-going work UvA-AMC-fh-aachen [Department of Clinical Epidemiology, Biostatistics and Bioinformatics (KEBB), AMC ]
  • 30. • Clck  to  edit  Master  0tle  style   Sensi0vity  Analysis  of  Cardiovascular  Models   Study covers: 164 patient-specific input parameters (i.e. vessel diameter and length), •  1494000 Monte Carlo runs are needed for this first analysis. •  Expected execution time on a PC 14.525 hours (20 Months) FP7 Project VPH-Share " Virtual Physiological Human: Sharing for Healthcare”
  • 31. • Clck  to  edit  Master  0tle  style   Protein  Folding  
  • 32. • Clck  to  edit  Master  0tle  style   Outline   (1) Problem investigation: (2) Experiment Prototyping (3) Experiment Execution: (4) Results Publication: Shared repositories •  Introduc0on   –   Life  cycle  of  e-­‐Science  Workflow     •  Different  approaches  to  workflow  scheduling   – Workflow  Process  Modeling  &  Management  In   Grid/Cloud   – Workflow  and  Web  services  (intrusive/non   intrusive)   •  Provenance   •  Compu0ng  in  the  browser   •  Conclusions  
  • 33. • Clck  to  edit  Master  0tle  style   Compu0ng  in  the  Browser   (different  approach  to  Grid-­‐desktop)   •  Objec/ves   –  Distributed  compu0ng  using  web  browsers   •  Features:     –  volunteer  compu0ng  instantly  (no  third  party  sojware   installa0on)   •  How  does  it  work:     –  Social  media    mediates  the  trust  between  the  user  and  the   volunteers  asked  to  join  the  network.   •  A  user  with  a  distributed  applica0on  uses  social  media  to  get    colleagues  and  friends  to   donate  CPU   •  Colleagues  and  friends    join  the  network  by    simply  opening    the  shared  URL.     •   Compu/ng  starts  almost  instantly.  
  • 34. • Clck  to  edit  Master  0tle  style   Compu0ng  in  the  Browser   Distributed computing on an Ensemble of Browsers, R. Cushing, G.a Putra, S. Koulouzis, A.S.Z Belloum, M.T. Bubak, C. de Laat IEEE Internet Computing, PrePress 10.1109/MIC.2013.3, January 2013 Application •  Computing 33,000 bio-informatics tasks on the global cluster of browsers •  Announcing the experiment using social media: via social media tools: twitter, FaceBook, Linkedin, and project mailing lists. •  Volunteers were asked to open the Weevil web page http://elab.lab.uvalight.net/~weevil/ and agree to donate their CPU for 3 hours on Friday December 2011 from 12:00-14:00 0.4 0.6 0.8 1 1.2 1.4 00:00 01:00 02:00 03:00 04:00 05:00 0.0e+00 5.0e+03 1.0e+04 1.5e+04 2.0e+04 2.5e+04 3.0e+04 3.5e+04 4.0e+04 EstimatedGFLOPS CompletedTasks Time HH:MM Tasks Aggregated Power (GFLOPS) New Web Technologies make JavaScript engines more powerful: •  Web workers •  web socket •  WebGL •  WebCL
  • 35. • Clck  to  edit  Master  0tle  style   Conclusions   •  WS-­‐VLAM  has  interes0ng  features  (farming,  automa0c   scaling,  hierarchical  workflow,  provenance,  …)  which  proved   to  be  interes0ng  for  a  number  scien0fic  applica0ons     •  WS-­‐VLAM  harnesses  various  type  of  compu0ng  resources   (desktops,  Grid,  and  Cloud  resources)     •  WS-­‐VLAM  has  modular  design  which  make  easy  to  adapt/ extend    
  • 36. • Clck  to  edit  Master  0tle  style   http://www.science.n/~gvlam/wsvlam/ http://www.commit-nl.nl/