SlideShare una empresa de Scribd logo
1 de 38
Science Gateway Group
         Overview
Marlon Pierce, Suresh Marru, Raminder
                 Singh
    Presentation to PTI CREST Lab
            May 2nd, 2012
IU Science Gateway Group Overview
• IU Science Gateway Group members
  – Marlon Pierce: Group Lead
  – Suresh Marru: Principal Software Architect
  – Raminder Singh, Yu Ma, Jun Wang, Lahiru Gunathilake,
    (Chathura Herath): Senior team members
  – Five interns and research assistants
• NSF SDCI funding of Open Gateway Computing
  Environments project
  – TACC (M. Dahan), SDSC (N. Wilkins-Diehr), SDSU (M.
    Thomas), NCSA (S. Pamidighantam), UIUC (S. Wang),
    Purdue (C. Song), UTHSCSA (E. Brookes)
– XSEDE Extended Collaboration Support Services
Science Gateway Group Focus Areas
• Open source, open community software for
  cyberinfrastructure
  – Apache Rave: http://rave.apache.org
  – (portal software)
  – Apache Airavata: http://www.airavata.org
    (workflow software)
• Extended collaborations with application
  scientists
  – Astronomy, astrophysics, biophysics, chemistry,
    nuclear physics, bio informatics…
Science Gateways outreach &
    democratize Research and Education


                                                       Research &
                      Lowering the barrier for using   Education
                          complex end-to-end           Community
    Developers         computational technologies
   Researchers                Democratize
Cyberinfrastructure             Empower
                                Facilitate
Possibilities for Collaboration
• Scientific workflows exhibit a number of
  distributed execution patterns.
  – Not just DAGs.
  – Workflow start as an abstraction, but need system,
    application, library level interactions.
  – We are trying to generalize our execution framework
    over a number of applications.
  – This is parallel, complementary to work in CREST.
• Collaborations can be mediated through Apache,
  students’ independent study, targeted papers.
Apache Airavata

Open Community Software for
    Scientific Workflows
        airavara.org
Apache Airavata
• Science Gateway software framework to
  – Compose, manage, execute, and monitor
    computational workflows on Grids and Clouds
  – Web service abstractions to legacy command line-
    driven scientific applications
  – Modular software framework to use as individual
    components or as an integrated solution.
• More Information
  – Airavata Web Site: airavata.org
  – Developer Mailing Lists: airavata-
    dev@incubator.apache.org
Apache Airavata High Level Overview
A Classic Scientific Workflow
• Workflows are composite applications built out of independent
  parts.
   – Executables wrapped as network accessible services
• Example: codes A, B, and C need to be executed in a specific
  sequence.
   – A, B, C: codes compiled and executable on a cluster, supercomputer,
     etc. through schedulers.
        • A, B, and C do not need to be co-located
        • A, B, and C may be sequential or parallel
        • A, B and C may have data or control dependencies
   – Data may need to be staged in and out
• Some variations on ABC:
   –   Conditional execution branches, interactivity
   –   Dynamic execution resource binding
   –   Iterations (Do-while, For-Each) over all or parts of the sequence
   –   Triggers, events, data streams
Linked Environment for Atmospheric Discovery



                                                   Storms Forming



                                                                        Forecast Model
  Streaming
  Observations                    Data Mining



            Instrument Steering

                                         Refine forecast grid


Envisioned by a multi-disciplinary team from
OU, IU, NCSA, Unidata, UAH, Howard,                                 On-Demand
Millersville, Colorado State, RENCI                                 Grid Computing
Open Grid/Gateway Computing
                   Environments                                     Atmospheric      LEAD, OLAM
                                                                      Science

  LEAD                                                               Molecular
                                                                                      GridChem,
                                                                                     ParamChem,
                                                                     Chemistry
                                                                                      OREChem

GridChem
                                   OGCE                              Bio Physics
                                                                                     Ultrascan
                                 Re-engineer,               OVP/
 TeraGrid                         Generalize,               RST/
                                                            MIG
User Portal                     Build, Test and                                      BioVLAB, mCpG
                                                                   Bio Informatics
                                   Release

OGCE NMI &                                                           Astronomy       ODI, DES-SimWG
SDCI Funding


                                                                   Nuclear Physics      LCCI


           XSEDE ECSS Gateway
                Projects             Projects in the pipe
                                     QuakeSim, VLAB,
                                     Einstein Gateway
                                                                                                      11
Realizing the Universe for the Dark Energy Survey (DES) Using XSEDE Support
   Principal Investigators: Prof. August Evrard (University of Michigan), Prof. Andrey Kravtsov (University of Chicago)


                                                                        Background & Explanation: The Dark
                                                                        Energy Survey (DES) is an upcoming
                                                                        international experiment that aims to
                                                                        constrain the properties of dark energy
                                                                        and dark matter in the universe using a
                                                                        deep, 5000-square degree survey of
                                                                        cosmic structure traced by galaxies.

                                                                        To support this science, the DES
Fig. 1 The density of dark matter in a thin radial slice as seen by a   Simulation Working Group is generating
synthetic observer located in the 8 billion light-year computational
volume. Image courtesy Matthew Becker, University of Chicago.           expectations for galaxy yields in various
                                                                        cosmologies. Analysis of these simulated
                                                                        catalogs offers a quality assurance
                                                                        capability    for    cosmological     and
                                                                        astrophysical analysis of upcoming DES
                                                                        telescope data.

                                                                        Our large, multi-staged computations are
                                                                        a natural fit for workflow control atop
                                                                        XSEDE resources.
Fig. 2: A synthetic 2x3 arcmin DES sky image showing galaxies, stars,
and observational artifacts. Courtesy Huan Lin, FNAL.
DES           Component Description
Application
CAMB          Code for Anisotropies in the Microwave Background is a serial
              FORTRAN code that computes the power spectrum of dark matter
              (simulation initial conditions). Output is a small ASCII file describing
              the power spectrum.
2LPTic        Second-order Lagrangian Perturbation Theory initial conditions
              code is an MPI-based C code that computes the initial conditions for
              the simulation from parameters and an input power spectrum
              generated by CAMB. Output is a set of binary files that vary in size
              from ~80-250 GB depending on the simulation resolution.
LGadget       MPI based C code that uses a TreePM algorithm to evolve a
              gravitational N-body system. The outputs of this step are system
              state snapshot files, as well as light-cone files, and some properties
              of the matter distribution, including the power spectrum at various
              timesteps. The total output from LGadget depends on resolution
              and the number of system snapshots stored, and approaches 10~TB
              for large DES simulation boxes.
ADDGALS       Creates a synthetic galaxy catalog for science analysis
Case Study: Dark Energy Survey
•   Long running code: Based on simulation box
    size L-gadget can run for 3 to 5 days using
    more than 1024 cores on TACC Ranger.
•   Do-While Construct: Restart service support is
    needed to work around queue time
    restrictions. Do-while construct was
    developed to address the need.
•   Data size and file transfer challenges: L-
    gadget produces 10~TB for large DES
    simulation boxes in system scratch so data
    need to moved to persistent storage ASAP
•   File system issues: More than 10,000
    lightcone files are doing continues file I/O.
    Ranger has one Luster metadata server to
    serve 300 I/O nodes. Sometime metadata
    server can’t fine these lightcone files, which
    make simulations to stop. We have wasted
                                                     Figure: Processing steps to build a synthetic galaxy catalog.
    ~50k SU this month struggling with I/O issues    Xbaya workflow currently controls the top-most element (N-
    and to get recommendation to use MPI I/O         body simulations) which consists of methods to sample a
                                                     cosmological power spectrum (ps), generating an initial set
                                                     of particles (ic) and evolving the particles forward in time
                                                     with Gadget (N-body). The remaining methods are run
                                                     manually on distributed resources.
Case Study: ParamChem
• ParamChem researchers try to optimize the geometry of
  new molecules which may or may not converge with in a
  given time or number of steps.
• Factors that include the mathematical convergence
  issues in solutions for partial integro-differential
  equations to potential shallowness of an energy
  landscape.
• The intermediate outputs from model iterations can be
  used to determine convergence.

                            Complex graph executions with support
                            for long running and interactive
                            executions to address non-deterministic
                            convergence problems.
Case Study: LCCI
• The Leadership Class Configuration Interaction (LCCI) project is targeted to
  accurately predict properties of nuclei for astrophysical and fusion energy
  processes.
    – James Vary’s group at Iowa State
• One of the PetaScale Apps
    – Use DOE INCITE and NSF Blue Waters awarded resources
    – Currently using 55 million processor hours on ORNL Cray XK6 machine and Argonne
      Blue Gene/P.
• Workflow Goals
    – Democratizing science: Reduce the learning curve associated with running simulations
    – Controlled access: avoid inappropriate use of super computing resources
    – Reproducibility of results
    – Avoiding waste: needless duplication; minimize erroneous use of codes and erroneous
      exchanging of intermediate files
    – Sustainability: Ensure long-term preservation of applications, configurations and results
    – Provenance: Provide the ability to track down the provenance of results as well as reuse
      previously completed results where applicable without recalculating
    – Parametric sweeps: Allow components to run over a range of dataset such that
      applications may produce richer simulations
Example Workflow: Nuclear Physics




  Courtesy of collaboration with Prof. James Vary and team, Iowa State
Next Generation Workflow Systems
• Scientific workflow systems and compiled
  workflow languages have focused on modeling,
  scheduling, data movement, dynamic service
  creation and monitoring of workflows.
• Building on these foundations we extend to a
  interactive and flexible workflow systems.
  – interactive ways of steering the workflow execution
  – interpreted workflow execution model
  – high level instruction set supporting diverse execution
    patterns
  – flexibility to execute individual workflow activity and
    wait for further analysis.
Various State Changes can tap into
           lower layers
                       Running Workflow

                                          Ready
                                          Node


             Wai ng                                           Load from
              Node                                           provenance
                                 Running
                                  Node


                      Failed                      Finished
                      Node                          Node

                                                                          Finished
                                                                          Workflow

   Ready
  Workflow

                                                                          Stopped
                                                                          Workflow
                       Paused
                      Workflow
                                                   Failed
                                                  Workflow
Uncertainties in Workflows
• Mathematical uncertainty:
   – PDE’s may not converge for certain conditions
   – Statistical techniques lead to nondeterministic results, propagation of
     uncertainties.
   – CLoser observation at computational output ensure acceptability of results.
• Domain uncertainty:
   – Optimization execution patterns: Scenarios of running against range of
     parameter values in an attempt to find the most appropriate input set.
   – Initial execution providing estimate of the accuracy of the inputs and facilitating
     further refinement.
   – Outputs are diverse and nondeterministic
• Resource uncertainty:
   – Failures in distributed systems are norm than an exception
   – Transient failures can be retried if computation is side-effect free/Idempotent.
   – Persistent failures require migration
Next Steps
• Workflow start as an abstraction, but need
  system, application, library level interactions.
  – We are trying to generalize our execution
    framework over a number of applications.
  – This is parallel, complementary to work in CREST.
• Collaborations can be mediated through
  Apache, students’ independent study,
  targeted papers.
Backup Slides
Apache Rave
Apache Rave
• Open Community Software for Enterprise Social
  Networking, Shareable Web Components, and
  Science Gateways
• Founding members:
  •   Mitre Software
  •   SURFnet
  •   Hippo Software
  •   Indiana University
• More information
  • Project Website: http://rave.apache.org/
  • Mailing List: dev@rave.apache.org


                                                  1
Gadget Dashboard
                     View




Gadget Store
   View
Extending Rave for Science Gateways
• Rave is designed to be extended.
  – Good design (interfaces, easily pluggable
    implementations) and code organization are required.
  – It helps to have a diverse, distributed developer
    community
     • How can you work on it if we can’t work on it?
• Rave is also packaged so that you can extend it
  without touching the source tree.
• GCE11 paper presented 3 case studies for Science
  Gateways
Apache Software Foundation and
      Cyberinfrastructure
Why Apache for Gateway Software?
• Apache Software Foundation is a neutral playing field
   – 501(c)(3) non-profit organization.
   – Designed to encourage competitors to collaborate on
     foundational software.
   – Includes a legal cell for legal issues.
• Foundation itself is sustainable
   – Incorporated in 1999
   – Multiple sponsors (Yahoo, Microsoft, Google, AMD, Facebook,
     IBM, …)
• Proven governance models
   – Projects are run by Program Management Committees.
   – New projects must go through incubation.
• Provides the social infrastructure for building communities.
• Opportunities to collaborate with other Apache projects
  outside the usual CI world.
The Apache Way
• Projects start as incubators with 1 champion and several mentors.
   – Making good choices is very important
• Graduation ultimately is judged by the Apache community.
   – +1/-1 votes on the incubator list
• Good, open engineering practices required
   – DEV mailing list design discussions, issue tracking
   – Jira contributions
   – Important decisions are voted on
• Properly packaged code
   –   Build out of the box
   –   Releases are signed
   –   Licenses, disclaimers, notices, change logs, etc.
   –   Releases are voted
• Developer diversity
   – Three or more unconnected developers
   – Price is giving up sole ownership, replace with meritocracy
Apache and Science Gateways
• Apache rewards projects for cross-pollination.
  – Connecting with complementary Apache projects
    strengthens both sides.
  – New requirements, new development methods
• Apache methods foster sustainability
  – Building communities of developers, not just users
  – Key merit criterion
• Apache methods provide governance
  – Incubators learn best practices from mentors
  – Releases are peer-reviewed
Apache Contributions Aren’t Just
             Software
• Apache committers and PMC members aren’t just
  code writers.
• Successful communities also include
  –   Important users
  –   Project evangelists
  –   Content providers: documentation, tutorials
  –   Testers, requirements providers, and constructive
      complainers
       • Using Jira and mailing lists
  – Anything else that needs doing.
Case Study: LEAD
• To create an Integrated, Scalable Geosciences
  Framework, LEAD among things resulted in a
  developing a flexible Scientific workflow system.
• The initial goal was to realize WOORDS: Workflow
  Orchestration for On-Demand Real-Time
  Dynamically-Adaptive System.
• The system enables execution of legacy scientific
  codes and facilities sophisticated coupling while
  interacting with data and provenance sub-systems.
Case Study: One Degree Imager
• A single investigation requires multiple night observations
• Each night takes hours of observations with multiple exposures
• An exposure is divided into 64 Orthogonal Transfer Array (OTAs)
• Each OTA is an 8x8 collection of 512x512 pixel CCD images.
• Reducing these data sets require workflow planning taking advantage of
  system architectures.
• Currently we take advantage of threaded parallelism at node level
  branching out to multiple node executions.
Pipeline Parallelization
Campaign                         TOP
                                             ...
Night/Filter               FTR         FTR
                                             ...

Exposures            EXP         EXP
                                             ...


   OTAs        OTA         OTA
Illustrating Interactivity
                       Asynchronous       Applica on
                        refinements        Steering


Orchestra on level Interac ons                 Job Level Interac ons

 Parametric   Provenance   Workflow            Job launch,    Checkpoint/
  Sweeps                   Steering              gliding        Restart




                                                          Model
               Mathema cal      Domain   Resource
                                                        Refinement



                                Uncertain es
Execution Patterns
Parametric Sweeps
                                                                                 Start                          Start
                                                Level 0
                                                  4 instances X 4 à 16 outputs




                                                Level 1
                                                2instances X (4x4)à 32 outputs




                                    Level 2
                                    1 instance X (32x32)à 1024outputs


                    A                                                             B                         C    Pruned Computation




   Dot vs Cartisian
                                                                                         Start!                             Start
                                                Level%
                                                     0!




                                                                                                       !!
                                                                                                  !!
                                                                                  !!
                                                !!!!4x4!instances!!16!outputs!
               !            !               !

   !       !            !               !



                                                Level 1
       !                                          2x16 instances! 32 outputs
                                !




                    !
                                                Level 2
                                                      !
                                                  1x256 instances! 256 outputs


                   A!                                                                     B!                              C!
Interactivity Contd.
• Deviations during workflow execution that do not
  affect the structure of the workflow
  – dynamic change workflow inputs, workflow rerun.
    interpreted workflow execution model.
  – dynamic change in point of execution, workflow smart
    rerun.
  – Fault handling and exception models.
• Deviations that change the workflow DAG during
  runtime
  – Reconfiguration of activity.
  – Dynamic addition of activities to the workflow.
  – Dynamic remove or replace of activity to the workflow

Más contenido relacionado

La actualidad más candente

OREChem Services and Workflows
OREChem Services and WorkflowsOREChem Services and Workflows
OREChem Services and Workflowsmarpierc
 
OGCE Project Overview
OGCE Project OverviewOGCE Project Overview
OGCE Project Overviewmarpierc
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!Ian Foster
 
Data Automation at Light Sources
Data Automation at Light SourcesData Automation at Light Sources
Data Automation at Light SourcesIan Foster
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilitiesIan Foster
 
Accelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy ScienceAccelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy ScienceIan Foster
 
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...Larry Smarr
 
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy SciencesDiscovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy SciencesIan Foster
 
Virtual Science in the Cloud
Virtual Science in the CloudVirtual Science in the Cloud
Virtual Science in the Cloudthetfoot
 
grid mining
grid mininggrid mining
grid miningARNOLD
 
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationThe Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationIan Foster
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the ContinuumIan Foster
 
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...Ian Foster
 
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...Eran Chinthaka Withana
 
C2MON - A highly scalable monitoring platform for Big Data scenarios @CERN by...
C2MON - A highly scalable monitoring platform for Big Data scenarios @CERN by...C2MON - A highly scalable monitoring platform for Big Data scenarios @CERN by...
C2MON - A highly scalable monitoring platform for Big Data scenarios @CERN by...J On The Beach
 
Distributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine LearningDistributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine LearningLarry Smarr
 
The Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of ScienceThe Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of ScienceRobert Grossman
 
Application of Lotka-Volterra model to analyse Cloud behavior and optimise re...
Application of Lotka-Volterra model to analyse Cloud behavior and optimise re...Application of Lotka-Volterra model to analyse Cloud behavior and optimise re...
Application of Lotka-Volterra model to analyse Cloud behavior and optimise re...IJSRP Journal
 
CHASE-CI: A Distributed Big Data Machine Learning Platform
CHASE-CI: A Distributed Big Data Machine Learning PlatformCHASE-CI: A Distributed Big Data Machine Learning Platform
CHASE-CI: A Distributed Big Data Machine Learning PlatformLarry Smarr
 

La actualidad más candente (20)

OREChem Services and Workflows
OREChem Services and WorkflowsOREChem Services and Workflows
OREChem Services and Workflows
 
OGCE Project Overview
OGCE Project OverviewOGCE Project Overview
OGCE Project Overview
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!
 
Data Automation at Light Sources
Data Automation at Light SourcesData Automation at Light Sources
Data Automation at Light Sources
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilities
 
Accelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy ScienceAccelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy Science
 
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
 
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy SciencesDiscovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
 
Virtual Science in the Cloud
Virtual Science in the CloudVirtual Science in the Cloud
Virtual Science in the Cloud
 
grid mining
grid mininggrid mining
grid mining
 
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationThe Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the Continuum
 
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
 
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
 
prj exam
prj examprj exam
prj exam
 
C2MON - A highly scalable monitoring platform for Big Data scenarios @CERN by...
C2MON - A highly scalable monitoring platform for Big Data scenarios @CERN by...C2MON - A highly scalable monitoring platform for Big Data scenarios @CERN by...
C2MON - A highly scalable monitoring platform for Big Data scenarios @CERN by...
 
Distributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine LearningDistributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine Learning
 
The Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of ScienceThe Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of Science
 
Application of Lotka-Volterra model to analyse Cloud behavior and optimise re...
Application of Lotka-Volterra model to analyse Cloud behavior and optimise re...Application of Lotka-Volterra model to analyse Cloud behavior and optimise re...
Application of Lotka-Volterra model to analyse Cloud behavior and optimise re...
 
CHASE-CI: A Distributed Big Data Machine Learning Platform
CHASE-CI: A Distributed Big Data Machine Learning PlatformCHASE-CI: A Distributed Big Data Machine Learning Platform
CHASE-CI: A Distributed Big Data Machine Learning Platform
 

Destacado

XSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata TutorialXSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata Tutorialmarpierc
 
OGCE TeraGrid 2010 Science Gateway Tutorial Intro
OGCE TeraGrid 2010 Science Gateway Tutorial IntroOGCE TeraGrid 2010 Science Gateway Tutorial Intro
OGCE TeraGrid 2010 Science Gateway Tutorial Intromarpierc
 
GCE11 Apache Rave Presentation
GCE11 Apache Rave PresentationGCE11 Apache Rave Presentation
GCE11 Apache Rave Presentationmarpierc
 
IWSG2014: Developing Science Gateways Using Apache Airavata
IWSG2014: Developing Science Gateways Using Apache AiravataIWSG2014: Developing Science Gateways Using Apache Airavata
IWSG2014: Developing Science Gateways Using Apache Airavatamarpierc
 
OGCE SciDAC2010 Tutorial
OGCE SciDAC2010 TutorialOGCE SciDAC2010 Tutorial
OGCE SciDAC2010 Tutorialmarpierc
 
TG11 ORPS Poster
TG11 ORPS PosterTG11 ORPS Poster
TG11 ORPS Postermarpierc
 
Experiences with the Apache Software Foundation
Experiences with the Apache Software Foundation Experiences with the Apache Software Foundation
Experiences with the Apache Software Foundation marpierc
 
ACES QuakeSim 2011
ACES QuakeSim 2011ACES QuakeSim 2011
ACES QuakeSim 2011marpierc
 
OGCE Overview for SciDAC 2009
OGCE Overview for SciDAC 2009OGCE Overview for SciDAC 2009
OGCE Overview for SciDAC 2009marpierc
 
Knowledge Creator 090528 Social Media
Knowledge Creator 090528 Social MediaKnowledge Creator 090528 Social Media
Knowledge Creator 090528 Social MediaSungKwon Chun
 
OGCE TG09 Tech Track Presentation
OGCE TG09 Tech Track PresentationOGCE TG09 Tech Track Presentation
OGCE TG09 Tech Track Presentationmarpierc
 
GTLAB Installation Tutorial for SciDAC 2009
GTLAB Installation Tutorial for SciDAC 2009GTLAB Installation Tutorial for SciDAC 2009
GTLAB Installation Tutorial for SciDAC 2009marpierc
 

Destacado (13)

XSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata TutorialXSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata Tutorial
 
OGCE TeraGrid 2010 Science Gateway Tutorial Intro
OGCE TeraGrid 2010 Science Gateway Tutorial IntroOGCE TeraGrid 2010 Science Gateway Tutorial Intro
OGCE TeraGrid 2010 Science Gateway Tutorial Intro
 
GCE11 Apache Rave Presentation
GCE11 Apache Rave PresentationGCE11 Apache Rave Presentation
GCE11 Apache Rave Presentation
 
IWSG2014: Developing Science Gateways Using Apache Airavata
IWSG2014: Developing Science Gateways Using Apache AiravataIWSG2014: Developing Science Gateways Using Apache Airavata
IWSG2014: Developing Science Gateways Using Apache Airavata
 
OGCE SciDAC2010 Tutorial
OGCE SciDAC2010 TutorialOGCE SciDAC2010 Tutorial
OGCE SciDAC2010 Tutorial
 
TG11 ORPS Poster
TG11 ORPS PosterTG11 ORPS Poster
TG11 ORPS Poster
 
Experiences with the Apache Software Foundation
Experiences with the Apache Software Foundation Experiences with the Apache Software Foundation
Experiences with the Apache Software Foundation
 
ACES QuakeSim 2011
ACES QuakeSim 2011ACES QuakeSim 2011
ACES QuakeSim 2011
 
OGCE Overview for SciDAC 2009
OGCE Overview for SciDAC 2009OGCE Overview for SciDAC 2009
OGCE Overview for SciDAC 2009
 
Knowledge Creator 090528 Social Media
Knowledge Creator 090528 Social MediaKnowledge Creator 090528 Social Media
Knowledge Creator 090528 Social Media
 
OGCE TG09 Tech Track Presentation
OGCE TG09 Tech Track PresentationOGCE TG09 Tech Track Presentation
OGCE TG09 Tech Track Presentation
 
GTLAB Installation Tutorial for SciDAC 2009
GTLAB Installation Tutorial for SciDAC 2009GTLAB Installation Tutorial for SciDAC 2009
GTLAB Installation Tutorial for SciDAC 2009
 
El precio»
El precio»El precio»
El precio»
 

Similar a Sgg crest-presentation-final

High Performance Cyberinfrastructure Required for Data Intensive Scientific R...
High Performance Cyberinfrastructure Required for Data Intensive Scientific R...High Performance Cyberinfrastructure Required for Data Intensive Scientific R...
High Performance Cyberinfrastructure Required for Data Intensive Scientific R...Larry Smarr
 
Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013smarru
 
Data-intensive profile for the VAMDC
Data-intensive profile for the VAMDCData-intensive profile for the VAMDC
Data-intensive profile for the VAMDCAstroAtom
 
Creating a Planetary Scale OptIPuter
Creating a Planetary Scale OptIPuterCreating a Planetary Scale OptIPuter
Creating a Planetary Scale OptIPuterLarry Smarr
 
Emc 2013 Big Data in Astronomy
Emc 2013 Big Data in AstronomyEmc 2013 Big Data in Astronomy
Emc 2013 Big Data in AstronomyFabio Porto
 
Comaskey_William_Poster_SULI_FALL_2014
Comaskey_William_Poster_SULI_FALL_2014Comaskey_William_Poster_SULI_FALL_2014
Comaskey_William_Poster_SULI_FALL_2014William Comaskey
 
Applying Photonics to User Needs: The Application Challenge
Applying Photonics to User Needs: The Application ChallengeApplying Photonics to User Needs: The Application Challenge
Applying Photonics to User Needs: The Application ChallengeLarry Smarr
 
21st Century e-Knowledge Requires a High Performance e-Infrastructure
21st Century e-Knowledge Requires a High Performance e-Infrastructure21st Century e-Knowledge Requires a High Performance e-Infrastructure
21st Century e-Knowledge Requires a High Performance e-InfrastructureLarry Smarr
 
UC Capabilities Supporting High-Performance Collaboration and Data-Intensive ...
UC Capabilities Supporting High-Performance Collaboration and Data-Intensive ...UC Capabilities Supporting High-Performance Collaboration and Data-Intensive ...
UC Capabilities Supporting High-Performance Collaboration and Data-Intensive ...Larry Smarr
 
Cyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean ObservatoriesCyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean ObservatoriesLarry Smarr
 
The Pacific Research Platform
 Two Years In
The Pacific Research Platform
 Two Years InThe Pacific Research Platform
 Two Years In
The Pacific Research Platform
 Two Years InLarry Smarr
 
The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...
The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...
The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...Larry Smarr
 
e-Science for the Science Kilometre Array
e-Science for the Science Kilometre Arraye-Science for the Science Kilometre Array
e-Science for the Science Kilometre ArrayJoint ALMA Observatory
 
Shrinking the Planet—How Dedicated Optical Networks are Transforming Computat...
Shrinking the Planet—How Dedicated Optical Networks are Transforming Computat...Shrinking the Planet—How Dedicated Optical Networks are Transforming Computat...
Shrinking the Planet—How Dedicated Optical Networks are Transforming Computat...Larry Smarr
 
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...Larry Smarr
 
A weak lensing_mass_reconstruction_of _the_large_scale_filament_massive_galax...
A weak lensing_mass_reconstruction_of _the_large_scale_filament_massive_galax...A weak lensing_mass_reconstruction_of _the_large_scale_filament_massive_galax...
A weak lensing_mass_reconstruction_of _the_large_scale_filament_massive_galax...Sérgio Sacani
 
Uses of the OptIPortal
Uses of the OptIPortalUses of the OptIPortal
Uses of the OptIPortalLarry Smarr
 
IRJET- Deep Convolution Neural Networks for Galaxy Morphology Classification
IRJET- Deep Convolution Neural Networks for Galaxy Morphology ClassificationIRJET- Deep Convolution Neural Networks for Galaxy Morphology Classification
IRJET- Deep Convolution Neural Networks for Galaxy Morphology ClassificationIRJET Journal
 
Report to the NAC
Report to the NACReport to the NAC
Report to the NACLarry Smarr
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Larry Smarr
 

Similar a Sgg crest-presentation-final (20)

High Performance Cyberinfrastructure Required for Data Intensive Scientific R...
High Performance Cyberinfrastructure Required for Data Intensive Scientific R...High Performance Cyberinfrastructure Required for Data Intensive Scientific R...
High Performance Cyberinfrastructure Required for Data Intensive Scientific R...
 
Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013Apache Airavata ApacheCon2013
Apache Airavata ApacheCon2013
 
Data-intensive profile for the VAMDC
Data-intensive profile for the VAMDCData-intensive profile for the VAMDC
Data-intensive profile for the VAMDC
 
Creating a Planetary Scale OptIPuter
Creating a Planetary Scale OptIPuterCreating a Planetary Scale OptIPuter
Creating a Planetary Scale OptIPuter
 
Emc 2013 Big Data in Astronomy
Emc 2013 Big Data in AstronomyEmc 2013 Big Data in Astronomy
Emc 2013 Big Data in Astronomy
 
Comaskey_William_Poster_SULI_FALL_2014
Comaskey_William_Poster_SULI_FALL_2014Comaskey_William_Poster_SULI_FALL_2014
Comaskey_William_Poster_SULI_FALL_2014
 
Applying Photonics to User Needs: The Application Challenge
Applying Photonics to User Needs: The Application ChallengeApplying Photonics to User Needs: The Application Challenge
Applying Photonics to User Needs: The Application Challenge
 
21st Century e-Knowledge Requires a High Performance e-Infrastructure
21st Century e-Knowledge Requires a High Performance e-Infrastructure21st Century e-Knowledge Requires a High Performance e-Infrastructure
21st Century e-Knowledge Requires a High Performance e-Infrastructure
 
UC Capabilities Supporting High-Performance Collaboration and Data-Intensive ...
UC Capabilities Supporting High-Performance Collaboration and Data-Intensive ...UC Capabilities Supporting High-Performance Collaboration and Data-Intensive ...
UC Capabilities Supporting High-Performance Collaboration and Data-Intensive ...
 
Cyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean ObservatoriesCyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean Observatories
 
The Pacific Research Platform
 Two Years In
The Pacific Research Platform
 Two Years InThe Pacific Research Platform
 Two Years In
The Pacific Research Platform
 Two Years In
 
The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...
The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...
The OptiPuter, Quartzite, and Starlight Projects: A Campus to Global-Scale Te...
 
e-Science for the Science Kilometre Array
e-Science for the Science Kilometre Arraye-Science for the Science Kilometre Array
e-Science for the Science Kilometre Array
 
Shrinking the Planet—How Dedicated Optical Networks are Transforming Computat...
Shrinking the Planet—How Dedicated Optical Networks are Transforming Computat...Shrinking the Planet—How Dedicated Optical Networks are Transforming Computat...
Shrinking the Planet—How Dedicated Optical Networks are Transforming Computat...
 
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
High Performance Cyberinfrastructure Enables Data-Driven Science in the Glob...
 
A weak lensing_mass_reconstruction_of _the_large_scale_filament_massive_galax...
A weak lensing_mass_reconstruction_of _the_large_scale_filament_massive_galax...A weak lensing_mass_reconstruction_of _the_large_scale_filament_massive_galax...
A weak lensing_mass_reconstruction_of _the_large_scale_filament_massive_galax...
 
Uses of the OptIPortal
Uses of the OptIPortalUses of the OptIPortal
Uses of the OptIPortal
 
IRJET- Deep Convolution Neural Networks for Galaxy Morphology Classification
IRJET- Deep Convolution Neural Networks for Galaxy Morphology ClassificationIRJET- Deep Convolution Neural Networks for Galaxy Morphology Classification
IRJET- Deep Convolution Neural Networks for Galaxy Morphology Classification
 
Report to the NAC
Report to the NACReport to the NAC
Report to the NAC
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
 

Más de marpierc

SC11 Science Gateway Group Overview
SC11 Science Gateway Group OverviewSC11 Science Gateway Group Overview
SC11 Science Gateway Group Overviewmarpierc
 
PNNL April 2011 ogce
PNNL April 2011 ogcePNNL April 2011 ogce
PNNL April 2011 ogcemarpierc
 
OGCE RT Rroject Review
OGCE RT Rroject ReviewOGCE RT Rroject Review
OGCE RT Rroject Reviewmarpierc
 
Building Science Gateways with Gadgets and OpenSocial
Building Science Gateways with Gadgets and OpenSocialBuilding Science Gateways with Gadgets and OpenSocial
Building Science Gateways with Gadgets and OpenSocialmarpierc
 
OGCE Review for Indiana University Research Technologies
OGCE Review for Indiana University Research TechnologiesOGCE Review for Indiana University Research Technologies
OGCE Review for Indiana University Research Technologiesmarpierc
 
Ogce about-sc10
Ogce about-sc10Ogce about-sc10
Ogce about-sc10marpierc
 
GTLAB Overview
GTLAB OverviewGTLAB Overview
GTLAB Overviewmarpierc
 

Más de marpierc (8)

SC11 Science Gateway Group Overview
SC11 Science Gateway Group OverviewSC11 Science Gateway Group Overview
SC11 Science Gateway Group Overview
 
PNNL April 2011 ogce
PNNL April 2011 ogcePNNL April 2011 ogce
PNNL April 2011 ogce
 
OGCE RT Rroject Review
OGCE RT Rroject ReviewOGCE RT Rroject Review
OGCE RT Rroject Review
 
OGCE SC10
OGCE SC10OGCE SC10
OGCE SC10
 
Building Science Gateways with Gadgets and OpenSocial
Building Science Gateways with Gadgets and OpenSocialBuilding Science Gateways with Gadgets and OpenSocial
Building Science Gateways with Gadgets and OpenSocial
 
OGCE Review for Indiana University Research Technologies
OGCE Review for Indiana University Research TechnologiesOGCE Review for Indiana University Research Technologies
OGCE Review for Indiana University Research Technologies
 
Ogce about-sc10
Ogce about-sc10Ogce about-sc10
Ogce about-sc10
 
GTLAB Overview
GTLAB OverviewGTLAB Overview
GTLAB Overview
 

Último

Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 

Último (20)

Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 

Sgg crest-presentation-final

  • 1. Science Gateway Group Overview Marlon Pierce, Suresh Marru, Raminder Singh Presentation to PTI CREST Lab May 2nd, 2012
  • 2. IU Science Gateway Group Overview • IU Science Gateway Group members – Marlon Pierce: Group Lead – Suresh Marru: Principal Software Architect – Raminder Singh, Yu Ma, Jun Wang, Lahiru Gunathilake, (Chathura Herath): Senior team members – Five interns and research assistants • NSF SDCI funding of Open Gateway Computing Environments project – TACC (M. Dahan), SDSC (N. Wilkins-Diehr), SDSU (M. Thomas), NCSA (S. Pamidighantam), UIUC (S. Wang), Purdue (C. Song), UTHSCSA (E. Brookes) – XSEDE Extended Collaboration Support Services
  • 3. Science Gateway Group Focus Areas • Open source, open community software for cyberinfrastructure – Apache Rave: http://rave.apache.org – (portal software) – Apache Airavata: http://www.airavata.org (workflow software) • Extended collaborations with application scientists – Astronomy, astrophysics, biophysics, chemistry, nuclear physics, bio informatics…
  • 4. Science Gateways outreach & democratize Research and Education Research & Lowering the barrier for using Education complex end-to-end Community Developers computational technologies Researchers Democratize Cyberinfrastructure Empower Facilitate
  • 5. Possibilities for Collaboration • Scientific workflows exhibit a number of distributed execution patterns. – Not just DAGs. – Workflow start as an abstraction, but need system, application, library level interactions. – We are trying to generalize our execution framework over a number of applications. – This is parallel, complementary to work in CREST. • Collaborations can be mediated through Apache, students’ independent study, targeted papers.
  • 6. Apache Airavata Open Community Software for Scientific Workflows airavara.org
  • 7. Apache Airavata • Science Gateway software framework to – Compose, manage, execute, and monitor computational workflows on Grids and Clouds – Web service abstractions to legacy command line- driven scientific applications – Modular software framework to use as individual components or as an integrated solution. • More Information – Airavata Web Site: airavata.org – Developer Mailing Lists: airavata- dev@incubator.apache.org
  • 8. Apache Airavata High Level Overview
  • 9. A Classic Scientific Workflow • Workflows are composite applications built out of independent parts. – Executables wrapped as network accessible services • Example: codes A, B, and C need to be executed in a specific sequence. – A, B, C: codes compiled and executable on a cluster, supercomputer, etc. through schedulers. • A, B, and C do not need to be co-located • A, B, and C may be sequential or parallel • A, B and C may have data or control dependencies – Data may need to be staged in and out • Some variations on ABC: – Conditional execution branches, interactivity – Dynamic execution resource binding – Iterations (Do-while, For-Each) over all or parts of the sequence – Triggers, events, data streams
  • 10. Linked Environment for Atmospheric Discovery Storms Forming Forecast Model Streaming Observations Data Mining Instrument Steering Refine forecast grid Envisioned by a multi-disciplinary team from OU, IU, NCSA, Unidata, UAH, Howard, On-Demand Millersville, Colorado State, RENCI Grid Computing
  • 11. Open Grid/Gateway Computing Environments Atmospheric LEAD, OLAM Science LEAD Molecular GridChem, ParamChem, Chemistry OREChem GridChem OGCE Bio Physics Ultrascan Re-engineer, OVP/ TeraGrid Generalize, RST/ MIG User Portal Build, Test and BioVLAB, mCpG Bio Informatics Release OGCE NMI & Astronomy ODI, DES-SimWG SDCI Funding Nuclear Physics LCCI XSEDE ECSS Gateway Projects Projects in the pipe QuakeSim, VLAB, Einstein Gateway 11
  • 12. Realizing the Universe for the Dark Energy Survey (DES) Using XSEDE Support Principal Investigators: Prof. August Evrard (University of Michigan), Prof. Andrey Kravtsov (University of Chicago) Background & Explanation: The Dark Energy Survey (DES) is an upcoming international experiment that aims to constrain the properties of dark energy and dark matter in the universe using a deep, 5000-square degree survey of cosmic structure traced by galaxies. To support this science, the DES Fig. 1 The density of dark matter in a thin radial slice as seen by a Simulation Working Group is generating synthetic observer located in the 8 billion light-year computational volume. Image courtesy Matthew Becker, University of Chicago. expectations for galaxy yields in various cosmologies. Analysis of these simulated catalogs offers a quality assurance capability for cosmological and astrophysical analysis of upcoming DES telescope data. Our large, multi-staged computations are a natural fit for workflow control atop XSEDE resources. Fig. 2: A synthetic 2x3 arcmin DES sky image showing galaxies, stars, and observational artifacts. Courtesy Huan Lin, FNAL.
  • 13. DES Component Description Application CAMB Code for Anisotropies in the Microwave Background is a serial FORTRAN code that computes the power spectrum of dark matter (simulation initial conditions). Output is a small ASCII file describing the power spectrum. 2LPTic Second-order Lagrangian Perturbation Theory initial conditions code is an MPI-based C code that computes the initial conditions for the simulation from parameters and an input power spectrum generated by CAMB. Output is a set of binary files that vary in size from ~80-250 GB depending on the simulation resolution. LGadget MPI based C code that uses a TreePM algorithm to evolve a gravitational N-body system. The outputs of this step are system state snapshot files, as well as light-cone files, and some properties of the matter distribution, including the power spectrum at various timesteps. The total output from LGadget depends on resolution and the number of system snapshots stored, and approaches 10~TB for large DES simulation boxes. ADDGALS Creates a synthetic galaxy catalog for science analysis
  • 14. Case Study: Dark Energy Survey • Long running code: Based on simulation box size L-gadget can run for 3 to 5 days using more than 1024 cores on TACC Ranger. • Do-While Construct: Restart service support is needed to work around queue time restrictions. Do-while construct was developed to address the need. • Data size and file transfer challenges: L- gadget produces 10~TB for large DES simulation boxes in system scratch so data need to moved to persistent storage ASAP • File system issues: More than 10,000 lightcone files are doing continues file I/O. Ranger has one Luster metadata server to serve 300 I/O nodes. Sometime metadata server can’t fine these lightcone files, which make simulations to stop. We have wasted Figure: Processing steps to build a synthetic galaxy catalog. ~50k SU this month struggling with I/O issues Xbaya workflow currently controls the top-most element (N- and to get recommendation to use MPI I/O body simulations) which consists of methods to sample a cosmological power spectrum (ps), generating an initial set of particles (ic) and evolving the particles forward in time with Gadget (N-body). The remaining methods are run manually on distributed resources.
  • 15.
  • 16. Case Study: ParamChem • ParamChem researchers try to optimize the geometry of new molecules which may or may not converge with in a given time or number of steps. • Factors that include the mathematical convergence issues in solutions for partial integro-differential equations to potential shallowness of an energy landscape. • The intermediate outputs from model iterations can be used to determine convergence. Complex graph executions with support for long running and interactive executions to address non-deterministic convergence problems.
  • 17. Case Study: LCCI • The Leadership Class Configuration Interaction (LCCI) project is targeted to accurately predict properties of nuclei for astrophysical and fusion energy processes. – James Vary’s group at Iowa State • One of the PetaScale Apps – Use DOE INCITE and NSF Blue Waters awarded resources – Currently using 55 million processor hours on ORNL Cray XK6 machine and Argonne Blue Gene/P. • Workflow Goals – Democratizing science: Reduce the learning curve associated with running simulations – Controlled access: avoid inappropriate use of super computing resources – Reproducibility of results – Avoiding waste: needless duplication; minimize erroneous use of codes and erroneous exchanging of intermediate files – Sustainability: Ensure long-term preservation of applications, configurations and results – Provenance: Provide the ability to track down the provenance of results as well as reuse previously completed results where applicable without recalculating – Parametric sweeps: Allow components to run over a range of dataset such that applications may produce richer simulations
  • 18. Example Workflow: Nuclear Physics Courtesy of collaboration with Prof. James Vary and team, Iowa State
  • 19. Next Generation Workflow Systems • Scientific workflow systems and compiled workflow languages have focused on modeling, scheduling, data movement, dynamic service creation and monitoring of workflows. • Building on these foundations we extend to a interactive and flexible workflow systems. – interactive ways of steering the workflow execution – interpreted workflow execution model – high level instruction set supporting diverse execution patterns – flexibility to execute individual workflow activity and wait for further analysis.
  • 20. Various State Changes can tap into lower layers Running Workflow Ready Node Wai ng Load from Node provenance Running Node Failed Finished Node Node Finished Workflow Ready Workflow Stopped Workflow Paused Workflow Failed Workflow
  • 21. Uncertainties in Workflows • Mathematical uncertainty: – PDE’s may not converge for certain conditions – Statistical techniques lead to nondeterministic results, propagation of uncertainties. – CLoser observation at computational output ensure acceptability of results. • Domain uncertainty: – Optimization execution patterns: Scenarios of running against range of parameter values in an attempt to find the most appropriate input set. – Initial execution providing estimate of the accuracy of the inputs and facilitating further refinement. – Outputs are diverse and nondeterministic • Resource uncertainty: – Failures in distributed systems are norm than an exception – Transient failures can be retried if computation is side-effect free/Idempotent. – Persistent failures require migration
  • 22. Next Steps • Workflow start as an abstraction, but need system, application, library level interactions. – We are trying to generalize our execution framework over a number of applications. – This is parallel, complementary to work in CREST. • Collaborations can be mediated through Apache, students’ independent study, targeted papers.
  • 25. Apache Rave • Open Community Software for Enterprise Social Networking, Shareable Web Components, and Science Gateways • Founding members: • Mitre Software • SURFnet • Hippo Software • Indiana University • More information • Project Website: http://rave.apache.org/ • Mailing List: dev@rave.apache.org 1
  • 26. Gadget Dashboard View Gadget Store View
  • 27. Extending Rave for Science Gateways • Rave is designed to be extended. – Good design (interfaces, easily pluggable implementations) and code organization are required. – It helps to have a diverse, distributed developer community • How can you work on it if we can’t work on it? • Rave is also packaged so that you can extend it without touching the source tree. • GCE11 paper presented 3 case studies for Science Gateways
  • 28. Apache Software Foundation and Cyberinfrastructure
  • 29. Why Apache for Gateway Software? • Apache Software Foundation is a neutral playing field – 501(c)(3) non-profit organization. – Designed to encourage competitors to collaborate on foundational software. – Includes a legal cell for legal issues. • Foundation itself is sustainable – Incorporated in 1999 – Multiple sponsors (Yahoo, Microsoft, Google, AMD, Facebook, IBM, …) • Proven governance models – Projects are run by Program Management Committees. – New projects must go through incubation. • Provides the social infrastructure for building communities. • Opportunities to collaborate with other Apache projects outside the usual CI world.
  • 30. The Apache Way • Projects start as incubators with 1 champion and several mentors. – Making good choices is very important • Graduation ultimately is judged by the Apache community. – +1/-1 votes on the incubator list • Good, open engineering practices required – DEV mailing list design discussions, issue tracking – Jira contributions – Important decisions are voted on • Properly packaged code – Build out of the box – Releases are signed – Licenses, disclaimers, notices, change logs, etc. – Releases are voted • Developer diversity – Three or more unconnected developers – Price is giving up sole ownership, replace with meritocracy
  • 31. Apache and Science Gateways • Apache rewards projects for cross-pollination. – Connecting with complementary Apache projects strengthens both sides. – New requirements, new development methods • Apache methods foster sustainability – Building communities of developers, not just users – Key merit criterion • Apache methods provide governance – Incubators learn best practices from mentors – Releases are peer-reviewed
  • 32. Apache Contributions Aren’t Just Software • Apache committers and PMC members aren’t just code writers. • Successful communities also include – Important users – Project evangelists – Content providers: documentation, tutorials – Testers, requirements providers, and constructive complainers • Using Jira and mailing lists – Anything else that needs doing.
  • 33. Case Study: LEAD • To create an Integrated, Scalable Geosciences Framework, LEAD among things resulted in a developing a flexible Scientific workflow system. • The initial goal was to realize WOORDS: Workflow Orchestration for On-Demand Real-Time Dynamically-Adaptive System. • The system enables execution of legacy scientific codes and facilities sophisticated coupling while interacting with data and provenance sub-systems.
  • 34. Case Study: One Degree Imager • A single investigation requires multiple night observations • Each night takes hours of observations with multiple exposures • An exposure is divided into 64 Orthogonal Transfer Array (OTAs) • Each OTA is an 8x8 collection of 512x512 pixel CCD images. • Reducing these data sets require workflow planning taking advantage of system architectures. • Currently we take advantage of threaded parallelism at node level branching out to multiple node executions.
  • 35. Pipeline Parallelization Campaign TOP ... Night/Filter FTR FTR ... Exposures EXP EXP ... OTAs OTA OTA
  • 36. Illustrating Interactivity Asynchronous Applica on refinements Steering Orchestra on level Interac ons Job Level Interac ons Parametric Provenance Workflow Job launch, Checkpoint/ Sweeps Steering gliding Restart Model Mathema cal Domain Resource Refinement Uncertain es
  • 37. Execution Patterns Parametric Sweeps Start Start Level 0 4 instances X 4 à 16 outputs Level 1 2instances X (4x4)à 32 outputs Level 2 1 instance X (32x32)à 1024outputs A B C Pruned Computation Dot vs Cartisian Start! Start Level% 0! !! !! !! !!!!4x4!instances!!16!outputs! ! ! ! ! ! ! ! Level 1 ! 2x16 instances! 32 outputs ! ! Level 2 ! 1x256 instances! 256 outputs A! B! C!
  • 38. Interactivity Contd. • Deviations during workflow execution that do not affect the structure of the workflow – dynamic change workflow inputs, workflow rerun. interpreted workflow execution model. – dynamic change in point of execution, workflow smart rerun. – Fault handling and exception models. • Deviations that change the workflow DAG during runtime – Reconfiguration of activity. – Dynamic addition of activities to the workflow. – Dynamic remove or replace of activity to the workflow