SlideShare una empresa de Scribd logo
1 de 49
Overview of Cloud Computing and Workflow
Research in NGSP Group

Dr. Dong YUAN
Research Fellow
Swinburne University of Technology
Melbourne, Australia
Outline


> SUCCESS Centre and NGSP Group
> Background: Big Data, Cloud Computing and Workflow
> Research Topics
   – Data Management in Cloud Computing
   – Performance Management in Scientific Workflows
   – Security and Privacy Protection in the Cloud
   – SwinDeW-C Cloud Workflow System
The Centre of SUCCESS


> SUCCESS: Swinburne University Centre for Computing
  and Engineering Software Systems
    – SUCCESS is the “NO.1” Software Engineering Centre in
      Australia
    – SUCCESS is one of the 7 Tire 1 Centres at Swinburne
      University of Technology (Times World Ranking: 351- 400,
      Academic Ranking of World Universities: 301- 400)
> The ambition of the Centre is to become the top centre
  for software research in the Southern Hemisphere
  within the next five years.


                                                                 3
SUCCESS


> Research Focus Areas
    – Knowledge and Data Intensive Systems
    – Nature of Software
    – Next Generation Software Platforms
    – SE Education and IBL/RBL
    – Software Analysis and Testing
    – Software R&D Group

> http://www.swinburne.edu.au/ict/success/research-
  expertise/

                                                      4
NGSP (Small) Group Overview
     > We conduct research into cloud computing and workflow
       technologies for complex software systems and services.
     > Members:
                                                     Others:
                    Researchers:                     Prof John Grundy
Leader:             Dr Xiao Liu (Postdoc, China)     Prof Chengfei Liu
Prof Yun Yang       Dr Dong Yuan (Postdoc)
                                                      Visitors:
(PC Member for      Gaofeng Zhang
                                                      Prof Lee Osterweil
ICSE 07/08, FSE09   Wenhao Li
                                                      Prof Lori Clarke
ICSE 10/11/12)      Dahai Cao
                                                      Prof Ivan Stojmenovic
                    Jofry Hadi SUTANTO
                                                      Prof Paola Inverardi
                    Antonio Giardina
                                                      Prof Amit Sheth
                                                      Prof Wil van der Aalst
                                                      Prof Hai Jin      5
                                                      Prof Hai Zhuge
R&D Projects – Grants

> Primary projects:
    – (Cloud) workflow technology: Scheduling and temporal analysis in cloud
      workflows
         • ARC LP0990393 (Y Yang, R Kotagiri, J Chen, C Liu)
    – Cloud computing: Intermediate data management in cloud computing
         • ARC DP110101340 (Y Yang, J Chen, J Grundy)

> Secondary project:
    – Management control systems for effective information sharing and
      security in government organisations
         • ARC LP110100228 (S Cugenasen, Y Yang)

                                                                           6
R&D Projects – Overview

> SwinDeW workflow family including SwinDeW-C
   – Architectures / Models (D Cao)
   – Scheduling / Data and service management (D Yuan, X Liu)
   – Verification / Exception handling (X Liu)

> Cloud computing:
   – Data management (D Yuan, X Liu, W Li)
   – Privacy and Security (G Zhang, X Zhang, C Liu)




                                                                7
Some Recent ERA A* Ranked Publications

> J. Chen and Y. Yang, Temporal Dependency based Checkpoint Selection for Dynamic
  Verification of Temporal Constraints in Scientific Workflow Systems. ACM Transactions on
  Software Engineering and Methodology, 20(3), 2011
> X. Liu, Y. Yang, Y. Jiang and J. Chen, Preventing Temporal Violations in Scientific
  Workflows: Where and How. IEEE Transactions on Software Engineering, 37(6):805-
  825, Nov./Dec. 2011.
> D. Yuan, Y. Yang, X. Liu and J. Chen, On‑demand Minimum Cost Benchmarking for
  Intermediate Datasets Storage in Scientific Cloud Workflow Systems. Journal of Parallel
  and Distributed Computing, 71:(316-332), 2011
> J. Chen and Y. Yang, Localising Temporal Constraints in Scientific Workflows. Journal of
  Computer and System Sciences, Elsevier, 76(6):464-474, Sept. 2010
> G. Zhang, Y. Yang and J. Chen, A Historical Probability based Noise Generation Strategy for
  Privacy Protection in Cloud Computing. Journal of Computer and System Sciences,
  Elsevier, published online, Dec. 2011.
> Another 8 A* papers are currently under review…


                                                                                             8
Part 1: Outline


> SUCCESS Centre and NGSP Group
> Background: Big Data, Cloud Computing and Workflow
> Research Topics
   – Data Management in Cloud Computing
   – Performance Management in Scientific Workflows
   – Security and Privacy Protection in the Cloud
   – SwinDeW-C Cloud Workflow System
Big Data
> Data explosion
    – TB (1012), PB(1015), exabyte (EB, 1018), zettabyte (ZB, 1021), yottabyte (YB,1024)
    – The total amount of global data in 2010:
                                                     1.2 ZB
    – Google processes ? data everyday in 2009:
    – Every day, Facebook 10T, Twitter 7T, Youtube 4.5TPB
                                                   24

> Moore's law vs. data explosion speed
    – Application data double every year over the next decade and further -
      [Szalay et al. Nature, 2006]
> Buzzwords: data storage, data processing, parallel, distributed,
  virtualisation, commodity machines, energy consumption, data
  centres, utility computing, software (everything) as a service
                                                                                  10
Example: Pulsar Searching


    > Astrophysics: pulsar searching
    > Pulsars: the collapsed cores of stars that were once more massive than 6-10 times
      the mass of the Sun
    > http://astronomy.swin.edu.au/cosmos/P/Pulsar
    > Parkes Radio Telescope (http://www.parkes.atnf.csiro.au/)
    >    Swinburne Astrophysics group (http://astronomy.swinburne.edu.au/) has been
        conducting pulsar searching surveys (http://astronomy.swin.edu.au/pulsar/) based
        on the observation data from Parkes Radio Telescope.
    > Typical scientific workflow which involves a large number of data and computation
      intensive activities. For a single searching process, the average data volume (not
               left: Image of the Crab Nebula taken with
      including the raw stream data from the telescope) is over 4 terabytes and the
               the Palomar telescope
      average execution time is about 23 hours on Swinburne high performance
      supercomputing facility (http://astronomy.swinburne.edu.au/supercomputing/).
                right: A close up of the Crab Pulsar from
                the Hubble Space Telescope
                Credit: Jeff Hester and Paul Scowen                                        11
                (Arizona State University) and NASA
Pulsar Searching Workflow




                            Dr. Willem
                            van Straten




                                    12
Benefits of Clouds
> No upfront infrastructure investment
     – No procuring hardware, setup, hosting, power, etc..
> On demand access
     – Lease what you need and when you need..
> Efficient Resource Allocation
     – Globally shared infrastructure …
> Nice Pricing
     – Based on Usage, QoS, Supply and Demand, Loyalty, …
> Application Acceleration
     – Parallelism for large-scale data analysis…
> Highly Availability, Scalable, and Energy Efficient
> Supports Creation of 3rd Party Services & Seamless offering
     – Builds on infrastructure and follows similar Business model as Cloud
                                                                              13
SwinDeW Workflow Series

SwinDeW – Swinburne Decentralised Workflow
  - foundation prototype based on p2p
   – SwinDeW – past
   – SwinDeW-S (for Services) – past
   – SwinDeW-B (for BPEL4WS) – past
   – SwinDeW-G (for Grid) – past
   – SwinDeW-A (for Agents) – past
   – SwinDeW-V (for Verification) – current
   – SwinDeW-C (for Cloud) – current
Part 1: Outline


> SUCCESS Centre and NGSP Group
> Background: Big Data, Cloud Computing and Workflow
> Research Topics
   – Data Management in Cloud Computing
   – Performance Management in Scientific Workflows
   – Security and Privacy Protection in the Cloud
   – SwinDeW-C Cloud Workflow System
Research Topics



     Data Management in Cloud
     Computing

     Dr. Dong Yuan

     http://www.ict.swin.edu.au/personal/dyuan/


                                                  16
Data Management in Cloud Computing

> Scientific applications in cloud computing
    – Computation and data intensive applications
    – Excessive computation and storage resources
    – Pay-as-you-go model

> Three aspects of data management in the cloud
    – Data storage
    – Data placement
    – Data replication
Data Storage


> Developing smart data storage strategies for reducing
  the cost of storing big data in the cloud
    – Data regeneration (computation and storage
      trade-off)
    – Data de-duplication
    – Data compression
> Researcher: Dong Yuan
Publications
>   D. Yuan, Y. Yang, X. Liu, J. Chen, On‑demand Minimum Cost Benchmarking for
    Intermediate Datasets Storage in Scientific Cloud Workflow Systems, Journal of
    Parallel and Distributed Computing, Elsevier, vol. 71(2), pp. 316-332, 2011.
>   D. Yuan, Y. Yang, X. Liu, G. Zhang, J. Chen, A Data Dependency Based Strategy
    for Intermediate Data Storage in Scientific Cloud Workflow Systems, Concurrency
    and Computation: Practice and Experience, Wiley, 24(9), pp. 956-976, Jun. 2012.
>   D. Yuan, Y. Yang, X. Liu, J. Chen, A Cost-Effective Strategy for Intermediate Data
    Storage in Scientific Cloud Workflow Systems, Proc. of 24th IEEE International
    Parallel & Distributed Processing Symposium (IPDPS10), Atlanta, USA, Apr. 2010.
>   D. Yuan, Y. Yang, X. Liu and J. Chen, A Local-Optimisation based Strategy for
    Cost-Effective Datasets Storage of Scientific Applications in the Cloud, Proc. of 4th
    IEEE International Conference on Cloud Computing (Cloud2011), Washington DC,
    USA, July 4-9, 2011.
Data Placement


> Smart data placement strategies to reduce
  application cost
   – Data correlation based strategy to reduce
     bandwidth cost
   – Data usage based strategy to reduce storage cost
> Researchers: Dong Yuan, Jofry Hadi SUTANTO,
  Antonio Giardina
Publications

>   D. Yuan, Y. Yang, X. Liu, J. Chen, A Data Placement Strategy in
    Scientific Cloud Workflows, Future Generation Computer Systems,
    Elsevier, vol. 26(8), pp. 1200-1214, 2010.
Data Replication


> To cost-effectively assure data reliability in the cloud
    – Dynamic replication strategy
    – Proactively checking based replication strategy
> Researchers: Wenhao Li, Dong Yuan
Publications

>   W. Li, Y. Yang and D. Yuan, A Novel Cost-effective Dynamic Data
    Replication Strategy for Reliability in Cloud Data Centres. Proc. of
    International Conference on Cloud and Green Computing (CGC2011),
    pages 496-502, Sydney, Australia, Dec. 2011.
>   W. Li, Y. Yang, J. Chen and D. Yuan, A Cost-Effective Mechanism for
    Cloud Data Reliability Management based on Proactive Replica
    Checking. Proc. of 12th IEEE/ACM International Symposium on Cluster,
    Cloud and Grid Computing (CCGrid2012), pages 564-571, Ottawa,
    Canada, May 2012.
Research Topics



     Performance Management in
     Scientific Workflows

     Dr. Xiao Liu
     http://www.ict.swin.edu.au/personal/xliu/
Workflow QoS


> QoS dimensions
   – time, cost, fidelity, reliability, security …
> QoS of Cloud Services
> Workflow QoS
   – the overall QoS for a collection of cloud services
   – but not simply add up!




                                                          25
Temporal QoS

> System performance
    – Response time
    – Throughput
> Temporal constraints
    – Global constraints: deadlines
    – Local constraints: milestones, individual activity durations
> Satisfactory temporal QoS
    – High performance: fast response, high throughput
    – On-time completion: low temporal violation rate


                                                                     26
Problem Analysis

> Setting temporal constraints
    – Prerequisite: effective forecasting of activity durations
> Monitoring temporal consistency state
    – Monitor workflow execution state
    – Detect potential temporal violations
> Temporal violation handling
    – Where to conduct violation handling
    – What strategies to be used




                                                                  27
Temporal Framework




                     28
Forecasting Activity Durations

> Statistical time-series pattern based forecasting strategies
> Selected Publications:
     – X. Liu, Z. Ni, D. Yuan, Y. Jiang, Z. Wu, J. Chen, Y. Yang, A Novel
       Statistical Time-Series Pattern based Interval Forecasting Strategy
       for Activity Durations in Workflow Systems, Journal of Systems and
       Software (JSS), vol. 84, no. 3, Pages 354-376, March 2011.
     – X. Liu, J. Chen, K. Liu and Y. Yang, Forecasting Duration Intervals of
       Scientific Workflow Activities based on Time-Series Patterns, Proc.
       of 4th IEEE International Conference on e-Science (e-Science08),
       pages 23-30, Indianapolis, USA, Dec. 2008.




                                                                                29
Setting Temporal Constraints

> Probability based temporal consistency model
> Time analysis based on Stochastic Petri Nets
> Selected Publications:
     – X. Liu, Z. Ni, J. Chen, Y. Yang, A Probabilistic Strategy for Temporal
       Constraint Management in Scientific Workflow Systems,
       Concurrency and Computation: Practice and Experience (CCPE),
       Wiley, 23(16):1893-1919, Nov. 2011 .
     – X. Liu, J. Chen and Y. Yang, A Probabilistic Strategy for Setting
       Temporal Constraints in Scientific Workflows, Proc. 6th International
       Conference on Business Process Management (BPM2008), Lecture
       Notes in Computer Science, Vol. 5240, pages 180-195, Milan, Italy,
       Sept. 2008.



                                                                                30
Temporal Consistency Monitoring

>   Minimum (Probability) Time Redundancy based Checkpoint Selection
    Strategy
>   Temporal Dependency based Checkpoint Selection Strategy
>   Selected Publications:
     – X. Liu, Y. Yang, Y. Jiang and J. Chen, Preventing Temporal
       Violations in Scientific Workflows: Where and How. IEEE
       Transactions on Software Engineering, 37(6):805-825, Nov./Dec.
       2011.
     – J. Chen and Y. Yang, Temporal Dependency based Checkpoint
       Selection for Dynamic Verification of Temporal Constraints in
       Scientific Workflow Systems. ACM Transactions on Software
       Engineering and Methodology, 20(3), 2011
Violation Handling
> Violation Handling Point Selection
> (Probability) Time deficit allocation
> Workflow local rescheduling strategy – ACO, GA, PSO
> Selected Publications:
     – X. Liu, Z. Ni, Z. Wu, D. Yuan, J. Chen and Y. Yang, A Novel General Framework
       for Automatic and Cost-Effective Handling of Recoverable Temporal Violations in
       Scientific Workflow Systems, Journal of Systems and Software, vol. 84, no. 3, pp.
       492-509, 2011




                                                                                       32
Research Topics



     Security and Privacy Protection
     in the Cloud

     Gaofeng Zhang

     gzhang@swin.edu.au


                                       33
Background
> Data Security vs. Data Privacy
> Privacy in cloud computing
    – Massive data store and compute in open cloud environment
    – Customers cannot control inside cloud
    The severity of privacy risk in cloud computing
       One specific privacy risk in cloud computing
    – Indirectly private information (collectively information)
    – Normal service processes and functions (not disruption)
    The approach: noise obfuscation for privacy protection
Privacy Protection in Cloud



   > Roles in the view of privacy in regular IT system
        – Privacy owner, Privacy user and Privacy theft


                                                           Keep safe
                     Privacy user                          between Privacy
                                                           owner and
                                                           Privacy
                                                           user!

                                           Privacy theft
     Privacy owner
Privacy Protection in Cloud

> Roles in the view of privacy in Cloud
     – Privacy owner, privacy user and privacy theft



                                                          Virtualisation
                                                          disable the
                        Privacy user
                                                          “keeping safe
                                                          between Privacy
                                                          owner and Privacy
                                                          user!”
                                          Privacy theft
       Privacy owner
Noise Obfuscation(1)
> Background
     – Massive data stores and computes in open cloud environments.
     – Customers cannot control inside cloud.
> Main idea: “Dilute” real private information with noise information
     – Not noise signal!
Noise Obfuscation(2)
> A Motivating example:
    – One customer, who often travels to one city in Australia, like ‘Sydney’, checks the
      weather report regularly from a weather service in cloud environments before
      departure. The frequent appearance of service requests about the weather report for
      ‘Sydney’ can reveal the privacy that the customer usually goes to ‘Sydney’. But if a
      system aids the customer to inject other requests like ‘Perth’ or ‘Darwin’ into the
      ‘Sydney’ queue, the service provider cannot distinguish which ones are real and
      which ones are ‘noise’ as it just sees a similar style of service request. These
      requests should be responded and cannot reveal the location privacy of the
      customer. In such cases, the privacy can be protected by noise obfuscation in
      general.




                               From ‘data’ privacy to ‘process’ privacy!
Research Topics
> Noise Generation
     – Historical probability based noise generation strategy
     – Time-series pattern based noise generation strategy
     – Association probability based noise generation strategy
     – ……
> Noise Utilisation
     – Trust model and injection strategy for noise obfuscation
     – ……
> Noise Cooperation Mechanism
     – Privacy protection framework under noise obfuscation
Publications

>   G. Zhang, Y. Yang and J. Chen, A Historical Probability based Noise Generation
    Strategy for Privacy Protection in Cloud Computing. Journal of Computer and
    System Sciences, Elsevier, 78(5):1374-1381, Sept. 2012.
>   G. Zhang, Y. Yang, D. Yuan and J. Chen, A Trust-based Noise Injection
    Strategy for Privacy Protection in Cloud Computing. Software: Practice and
    Experience , Wiley, 42(4):431-445, Apr. 2012.
>   G. Zhang, Y. Yang, X. Liu and J. Chen, A Time-series Pattern based Noise
    Generation Strategy for Privacy Protection in Cloud Computing. Proc. of 12th
    IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing
    (CCGrid2012), pages 458-465, Ottawa, Canada, May 2012.
>   G. Zhang, X. Zhang, Y. Yang, C. Liu and J. Chen, An Association Probability
    based Noise Generation Strategy for Privacy Protection in Cloud Computing.
    Proc. 10th International Conference on Service Oriented Computing
    (ICSoC2012), pages 639-647, Shanghai, China, Nov. 2012. (accepted on
    13/7/2012)
Research Topics


     Cloud Workflow System
     Design and Development


     Dahai Cao
     dcao@swin.edu.au



                              41
SwinCloud – Cloud Computing Testbed

> SwinCloud




                                      42
General cloud workflow reference model
Prototype Ⅰ: SwinDeW-C (Peer-to-Peer)

> SwinDeW-C




                                        44
PrototypeⅡ: SwinFlow-Cloud (Centralised)
Cloud workflow implementation
> Client system
    – Process definition tools
    – Rule editor
    – Organisation modelling tools
    – Office calendar management tools
    – Authority group tools
    – User management tools
    – Form designing tools
    – Tool agent definition tools
    – Simulation tools
New Progress

> Successfully deploy on the Amazon Cloud




> Eucalyptus: the cloud infrastructure platform
A Book
End
> Questions?

Más contenido relacionado

La actualidad más candente

A review on data mining
A  review on data miningA  review on data mining
A review on data miningEr. Nancy
 
Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Robert Oostenveld
 
Performance analysis of data mining algorithms with neural network
Performance analysis of data mining algorithms with neural networkPerformance analysis of data mining algorithms with neural network
Performance analysis of data mining algorithms with neural networkIAEME Publication
 
XLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and MyriaXLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and MyriaUniversity of Washington
 
The Brain Imaging Data Structure and its use for fNIRS
The Brain Imaging Data Structure and its use for fNIRSThe Brain Imaging Data Structure and its use for fNIRS
The Brain Imaging Data Structure and its use for fNIRSRobert Oostenveld
 
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureWhy Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureXiaogang (Marshall) Ma
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
Virtual Appliances, Cloud Computing, and Reproducible Research
Virtual Appliances, Cloud Computing, and Reproducible ResearchVirtual Appliances, Cloud Computing, and Reproducible Research
Virtual Appliances, Cloud Computing, and Reproducible ResearchUniversity of Washington
 
cloudComputing_ProjectJunior
cloudComputing_ProjectJuniorcloudComputing_ProjectJunior
cloudComputing_ProjectJuniorDominic Searson
 
Big Data and Advanced Data Intensive Computing
Big Data and Advanced Data Intensive ComputingBig Data and Advanced Data Intensive Computing
Big Data and Advanced Data Intensive ComputingJongwook Woo
 
HPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson's
HPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson'sHPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson's
HPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson'sinside-BigData.com
 
2011 11 pre_cs50_accelerating_sciencegrid_ianstokesrees
2011 11 pre_cs50_accelerating_sciencegrid_ianstokesrees2011 11 pre_cs50_accelerating_sciencegrid_ianstokesrees
2011 11 pre_cs50_accelerating_sciencegrid_ianstokesreesBoston Consulting Group
 
Preservation, Publishing, and People: A SEAD View
Preservation, Publishing, and  People: A SEAD ViewPreservation, Publishing, and  People: A SEAD View
Preservation, Publishing, and People: A SEAD ViewInna Kouper
 
CODATA International Training Workshop in Big Data for Science for Researcher...
CODATA International Training Workshop in Big Data for Science for Researcher...CODATA International Training Workshop in Big Data for Science for Researcher...
CODATA International Training Workshop in Big Data for Science for Researcher...Johann van Wyk
 
Donders neuroimage toolkit - open science and good practices
Donders neuroimage toolkit -  open science and good practicesDonders neuroimage toolkit -  open science and good practices
Donders neuroimage toolkit - open science and good practicesRobert Oostenveld
 
Big Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data setsBig Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data setsBoston Consulting Group
 
Towards Supporting Data-Intensive Research
Towards Supporting Data-Intensive ResearchTowards Supporting Data-Intensive Research
Towards Supporting Data-Intensive ResearchJano van Hemert
 
Web analytics webinar
Web analytics webinarWeb analytics webinar
Web analytics webinarJim Jansen
 

La actualidad más candente (20)

A review on data mining
A  review on data miningA  review on data mining
A review on data mining
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data
 
Performance analysis of data mining algorithms with neural network
Performance analysis of data mining algorithms with neural networkPerformance analysis of data mining algorithms with neural network
Performance analysis of data mining algorithms with neural network
 
XLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and MyriaXLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and Myria
 
The Brain Imaging Data Structure and its use for fNIRS
The Brain Imaging Data Structure and its use for fNIRSThe Brain Imaging Data Structure and its use for fNIRS
The Brain Imaging Data Structure and its use for fNIRS
 
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award LectureWhy Data Science Matters - 2014 WDS Data Stewardship Award Lecture
Why Data Science Matters - 2014 WDS Data Stewardship Award Lecture
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Virtual Appliances, Cloud Computing, and Reproducible Research
Virtual Appliances, Cloud Computing, and Reproducible ResearchVirtual Appliances, Cloud Computing, and Reproducible Research
Virtual Appliances, Cloud Computing, and Reproducible Research
 
cloudComputing_ProjectJunior
cloudComputing_ProjectJuniorcloudComputing_ProjectJunior
cloudComputing_ProjectJunior
 
Big Data and Advanced Data Intensive Computing
Big Data and Advanced Data Intensive ComputingBig Data and Advanced Data Intensive Computing
Big Data and Advanced Data Intensive Computing
 
Andreas Hense: Climate data for our future – acquired, analysed, archived
Andreas Hense: Climate data for our future – acquired, analysed, archivedAndreas Hense: Climate data for our future – acquired, analysed, archived
Andreas Hense: Climate data for our future – acquired, analysed, archived
 
HPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson's
HPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson'sHPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson's
HPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson's
 
2011 11 pre_cs50_accelerating_sciencegrid_ianstokesrees
2011 11 pre_cs50_accelerating_sciencegrid_ianstokesrees2011 11 pre_cs50_accelerating_sciencegrid_ianstokesrees
2011 11 pre_cs50_accelerating_sciencegrid_ianstokesrees
 
Preservation, Publishing, and People: A SEAD View
Preservation, Publishing, and  People: A SEAD ViewPreservation, Publishing, and  People: A SEAD View
Preservation, Publishing, and People: A SEAD View
 
CODATA International Training Workshop in Big Data for Science for Researcher...
CODATA International Training Workshop in Big Data for Science for Researcher...CODATA International Training Workshop in Big Data for Science for Researcher...
CODATA International Training Workshop in Big Data for Science for Researcher...
 
Donders neuroimage toolkit - open science and good practices
Donders neuroimage toolkit -  open science and good practicesDonders neuroimage toolkit -  open science and good practices
Donders neuroimage toolkit - open science and good practices
 
Big Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data setsBig Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data sets
 
Towards Supporting Data-Intensive Research
Towards Supporting Data-Intensive ResearchTowards Supporting Data-Intensive Research
Towards Supporting Data-Intensive Research
 
Web analytics webinar
Web analytics webinarWeb analytics webinar
Web analytics webinar
 

Destacado

IDs书友会 - 主题4 - K-Evolution · 活动策划及推广
IDs书友会 - 主题4 - K-Evolution · 活动策划及推广IDs书友会 - 主题4 - K-Evolution · 活动策划及推广
IDs书友会 - 主题4 - K-Evolution · 活动策划及推广IDs Club 澳洲互联网俱乐部
 
IDs书友会 - 主题4 - 开场白 · 线下活动品牌策划及推广
IDs书友会 - 主题4 -  开场白 · 线下活动品牌策划及推广IDs书友会 - 主题4 -  开场白 · 线下活动品牌策划及推广
IDs书友会 - 主题4 - 开场白 · 线下活动品牌策划及推广IDs Club 澳洲互联网俱乐部
 
IDs书友会 - 主题4 - 特别嘉宾 - 彭韬 · 面包旅行
IDs书友会 - 主题4 - 特别嘉宾 - 彭韬 · 面包旅行IDs书友会 - 主题4 - 特别嘉宾 - 彭韬 · 面包旅行
IDs书友会 - 主题4 - 特别嘉宾 - 彭韬 · 面包旅行IDs Club 澳洲互联网俱乐部
 
bài giảng lập trình hướng đối tượng
bài giảng lập trình hướng đối tượngbài giảng lập trình hướng đối tượng
bài giảng lập trình hướng đối tượngMountain Nguyen
 
IDs澳洲互联网俱乐部 - 主题6 - 微信周边产品和创业项目
IDs澳洲互联网俱乐部 - 主题6 - 微信周边产品和创业项目 IDs澳洲互联网俱乐部 - 主题6 - 微信周边产品和创业项目
IDs澳洲互联网俱乐部 - 主题6 - 微信周边产品和创业项目 IDs Club 澳洲互联网俱乐部
 

Destacado (6)

IDs书友会 - 主题4 - K-Evolution · 活动策划及推广
IDs书友会 - 主题4 - K-Evolution · 活动策划及推广IDs书友会 - 主题4 - K-Evolution · 活动策划及推广
IDs书友会 - 主题4 - K-Evolution · 活动策划及推广
 
IDs书友会 - 主题4 - 开场白 · 线下活动品牌策划及推广
IDs书友会 - 主题4 -  开场白 · 线下活动品牌策划及推广IDs书友会 - 主题4 -  开场白 · 线下活动品牌策划及推广
IDs书友会 - 主题4 - 开场白 · 线下活动品牌策划及推广
 
IDs书友会 - 主题3 - 开场白 · 商业应用云实战
IDs书友会 - 主题3 - 开场白 · 商业应用云实战IDs书友会 - 主题3 - 开场白 · 商业应用云实战
IDs书友会 - 主题3 - 开场白 · 商业应用云实战
 
IDs书友会 - 主题4 - 特别嘉宾 - 彭韬 · 面包旅行
IDs书友会 - 主题4 - 特别嘉宾 - 彭韬 · 面包旅行IDs书友会 - 主题4 - 特别嘉宾 - 彭韬 · 面包旅行
IDs书友会 - 主题4 - 特别嘉宾 - 彭韬 · 面包旅行
 
bài giảng lập trình hướng đối tượng
bài giảng lập trình hướng đối tượngbài giảng lập trình hướng đối tượng
bài giảng lập trình hướng đối tượng
 
IDs澳洲互联网俱乐部 - 主题6 - 微信周边产品和创业项目
IDs澳洲互联网俱乐部 - 主题6 - 微信周边产品和创业项目 IDs澳洲互联网俱乐部 - 主题6 - 微信周边产品和创业项目
IDs澳洲互联网俱乐部 - 主题6 - 微信周边产品和创业项目
 

Similar a IDs书友会 - 主题1 - Swinburne Next Generation Research

[1.9] Data Archiving and Publishing - Annemiek van der Kuil [3TU.Datacentrum...
[1.9] Data Archiving and Publishing - Annemiek van der Kuil  [3TU.Datacentrum...[1.9] Data Archiving and Publishing - Annemiek van der Kuil  [3TU.Datacentrum...
[1.9] Data Archiving and Publishing - Annemiek van der Kuil [3TU.Datacentrum...3TU.Datacentrum
 
Your Research Data Management with the support of 3TU.Datacentrum
Your Research Data Management with the support of 3TU.DatacentrumYour Research Data Management with the support of 3TU.Datacentrum
Your Research Data Management with the support of 3TU.DatacentrumAnnemiekvdKuil
 
Working towards Sustainable Software for Science (an NSF and community view)
Working towards Sustainable Software for Science (an NSF and community view)Working towards Sustainable Software for Science (an NSF and community view)
Working towards Sustainable Software for Science (an NSF and community view)Daniel S. Katz
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceCarole Goble
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012IUPUI
 
Big Data, Beyond the Data Center
Big Data, Beyond the Data CenterBig Data, Beyond the Data Center
Big Data, Beyond the Data CenterGilles Fedak
 
Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)Robert Grossman
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceLizLyon
 
Opening ndm2012 sc12
Opening ndm2012 sc12Opening ndm2012 sc12
Opening ndm2012 sc12balmanme
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Ola Spjuth
 
Accelerating data-intensive science by outsourcing the mundane
Accelerating data-intensive science by outsourcing the mundaneAccelerating data-intensive science by outsourcing the mundane
Accelerating data-intensive science by outsourcing the mundaneIan Foster
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryAnita de Waard
 
Curation of Research Data
Curation of Research DataCuration of Research Data
Curation of Research DataMichael Day
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data ManagementJamie Bisset
 
NIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWGNIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWGGeoffrey Fox
 
Science Engagement: A Non-Technical Approach to the Technical Divide
Science Engagement: A Non-Technical Approach to the Technical DivideScience Engagement: A Non-Technical Approach to the Technical Divide
Science Engagement: A Non-Technical Approach to the Technical DivideCybera Inc.
 

Similar a IDs书友会 - 主题1 - Swinburne Next Generation Research (20)

[1.9] Data Archiving and Publishing - Annemiek van der Kuil [3TU.Datacentrum...
[1.9] Data Archiving and Publishing - Annemiek van der Kuil  [3TU.Datacentrum...[1.9] Data Archiving and Publishing - Annemiek van der Kuil  [3TU.Datacentrum...
[1.9] Data Archiving and Publishing - Annemiek van der Kuil [3TU.Datacentrum...
 
Your Research Data Management with the support of 3TU.Datacentrum
Your Research Data Management with the support of 3TU.DatacentrumYour Research Data Management with the support of 3TU.Datacentrum
Your Research Data Management with the support of 3TU.Datacentrum
 
Working towards Sustainable Software for Science (an NSF and community view)
Working towards Sustainable Software for Science (an NSF and community view)Working towards Sustainable Software for Science (an NSF and community view)
Working towards Sustainable Software for Science (an NSF and community view)
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better Science
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012
 
Big Data, Beyond the Data Center
Big Data, Beyond the Data CenterBig Data, Beyond the Data Center
Big Data, Beyond the Data Center
 
Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalface
 
Opening ndm2012 sc12
Opening ndm2012 sc12Opening ndm2012 sc12
Opening ndm2012 sc12
 
Summary of 3DPAS
Summary of 3DPASSummary of 3DPAS
Summary of 3DPAS
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...
 
Accelerating data-intensive science by outsourcing the mundane
Accelerating data-intensive science by outsourcing the mundaneAccelerating data-intensive science by outsourcing the mundane
Accelerating data-intensive science by outsourcing the mundane
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost Recovery
 
SomeSlides
SomeSlidesSomeSlides
SomeSlides
 
Curation of Research Data
Curation of Research DataCuration of Research Data
Curation of Research Data
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
NIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWGNIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWG
 
Science Engagement: A Non-Technical Approach to the Technical Divide
Science Engagement: A Non-Technical Approach to the Technical DivideScience Engagement: A Non-Technical Approach to the Technical Divide
Science Engagement: A Non-Technical Approach to the Technical Divide
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 

Último

Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 

Último (20)

Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

IDs书友会 - 主题1 - Swinburne Next Generation Research

  • 1. Overview of Cloud Computing and Workflow Research in NGSP Group Dr. Dong YUAN Research Fellow Swinburne University of Technology Melbourne, Australia
  • 2. Outline > SUCCESS Centre and NGSP Group > Background: Big Data, Cloud Computing and Workflow > Research Topics – Data Management in Cloud Computing – Performance Management in Scientific Workflows – Security and Privacy Protection in the Cloud – SwinDeW-C Cloud Workflow System
  • 3. The Centre of SUCCESS > SUCCESS: Swinburne University Centre for Computing and Engineering Software Systems – SUCCESS is the “NO.1” Software Engineering Centre in Australia – SUCCESS is one of the 7 Tire 1 Centres at Swinburne University of Technology (Times World Ranking: 351- 400, Academic Ranking of World Universities: 301- 400) > The ambition of the Centre is to become the top centre for software research in the Southern Hemisphere within the next five years. 3
  • 4. SUCCESS > Research Focus Areas – Knowledge and Data Intensive Systems – Nature of Software – Next Generation Software Platforms – SE Education and IBL/RBL – Software Analysis and Testing – Software R&D Group > http://www.swinburne.edu.au/ict/success/research- expertise/ 4
  • 5. NGSP (Small) Group Overview > We conduct research into cloud computing and workflow technologies for complex software systems and services. > Members: Others: Researchers: Prof John Grundy Leader: Dr Xiao Liu (Postdoc, China) Prof Chengfei Liu Prof Yun Yang Dr Dong Yuan (Postdoc) Visitors: (PC Member for Gaofeng Zhang Prof Lee Osterweil ICSE 07/08, FSE09 Wenhao Li Prof Lori Clarke ICSE 10/11/12) Dahai Cao Prof Ivan Stojmenovic Jofry Hadi SUTANTO Prof Paola Inverardi Antonio Giardina Prof Amit Sheth Prof Wil van der Aalst Prof Hai Jin 5 Prof Hai Zhuge
  • 6. R&D Projects – Grants > Primary projects: – (Cloud) workflow technology: Scheduling and temporal analysis in cloud workflows • ARC LP0990393 (Y Yang, R Kotagiri, J Chen, C Liu) – Cloud computing: Intermediate data management in cloud computing • ARC DP110101340 (Y Yang, J Chen, J Grundy) > Secondary project: – Management control systems for effective information sharing and security in government organisations • ARC LP110100228 (S Cugenasen, Y Yang) 6
  • 7. R&D Projects – Overview > SwinDeW workflow family including SwinDeW-C – Architectures / Models (D Cao) – Scheduling / Data and service management (D Yuan, X Liu) – Verification / Exception handling (X Liu) > Cloud computing: – Data management (D Yuan, X Liu, W Li) – Privacy and Security (G Zhang, X Zhang, C Liu) 7
  • 8. Some Recent ERA A* Ranked Publications > J. Chen and Y. Yang, Temporal Dependency based Checkpoint Selection for Dynamic Verification of Temporal Constraints in Scientific Workflow Systems. ACM Transactions on Software Engineering and Methodology, 20(3), 2011 > X. Liu, Y. Yang, Y. Jiang and J. Chen, Preventing Temporal Violations in Scientific Workflows: Where and How. IEEE Transactions on Software Engineering, 37(6):805- 825, Nov./Dec. 2011. > D. Yuan, Y. Yang, X. Liu and J. Chen, On‑demand Minimum Cost Benchmarking for Intermediate Datasets Storage in Scientific Cloud Workflow Systems. Journal of Parallel and Distributed Computing, 71:(316-332), 2011 > J. Chen and Y. Yang, Localising Temporal Constraints in Scientific Workflows. Journal of Computer and System Sciences, Elsevier, 76(6):464-474, Sept. 2010 > G. Zhang, Y. Yang and J. Chen, A Historical Probability based Noise Generation Strategy for Privacy Protection in Cloud Computing. Journal of Computer and System Sciences, Elsevier, published online, Dec. 2011. > Another 8 A* papers are currently under review… 8
  • 9. Part 1: Outline > SUCCESS Centre and NGSP Group > Background: Big Data, Cloud Computing and Workflow > Research Topics – Data Management in Cloud Computing – Performance Management in Scientific Workflows – Security and Privacy Protection in the Cloud – SwinDeW-C Cloud Workflow System
  • 10. Big Data > Data explosion – TB (1012), PB(1015), exabyte (EB, 1018), zettabyte (ZB, 1021), yottabyte (YB,1024) – The total amount of global data in 2010: 1.2 ZB – Google processes ? data everyday in 2009: – Every day, Facebook 10T, Twitter 7T, Youtube 4.5TPB 24 > Moore's law vs. data explosion speed – Application data double every year over the next decade and further - [Szalay et al. Nature, 2006] > Buzzwords: data storage, data processing, parallel, distributed, virtualisation, commodity machines, energy consumption, data centres, utility computing, software (everything) as a service 10
  • 11. Example: Pulsar Searching > Astrophysics: pulsar searching > Pulsars: the collapsed cores of stars that were once more massive than 6-10 times the mass of the Sun > http://astronomy.swin.edu.au/cosmos/P/Pulsar > Parkes Radio Telescope (http://www.parkes.atnf.csiro.au/) > Swinburne Astrophysics group (http://astronomy.swinburne.edu.au/) has been conducting pulsar searching surveys (http://astronomy.swin.edu.au/pulsar/) based on the observation data from Parkes Radio Telescope. > Typical scientific workflow which involves a large number of data and computation intensive activities. For a single searching process, the average data volume (not left: Image of the Crab Nebula taken with including the raw stream data from the telescope) is over 4 terabytes and the the Palomar telescope average execution time is about 23 hours on Swinburne high performance supercomputing facility (http://astronomy.swinburne.edu.au/supercomputing/). right: A close up of the Crab Pulsar from the Hubble Space Telescope Credit: Jeff Hester and Paul Scowen 11 (Arizona State University) and NASA
  • 12. Pulsar Searching Workflow Dr. Willem van Straten 12
  • 13. Benefits of Clouds > No upfront infrastructure investment – No procuring hardware, setup, hosting, power, etc.. > On demand access – Lease what you need and when you need.. > Efficient Resource Allocation – Globally shared infrastructure … > Nice Pricing – Based on Usage, QoS, Supply and Demand, Loyalty, … > Application Acceleration – Parallelism for large-scale data analysis… > Highly Availability, Scalable, and Energy Efficient > Supports Creation of 3rd Party Services & Seamless offering – Builds on infrastructure and follows similar Business model as Cloud 13
  • 14. SwinDeW Workflow Series SwinDeW – Swinburne Decentralised Workflow - foundation prototype based on p2p – SwinDeW – past – SwinDeW-S (for Services) – past – SwinDeW-B (for BPEL4WS) – past – SwinDeW-G (for Grid) – past – SwinDeW-A (for Agents) – past – SwinDeW-V (for Verification) – current – SwinDeW-C (for Cloud) – current
  • 15. Part 1: Outline > SUCCESS Centre and NGSP Group > Background: Big Data, Cloud Computing and Workflow > Research Topics – Data Management in Cloud Computing – Performance Management in Scientific Workflows – Security and Privacy Protection in the Cloud – SwinDeW-C Cloud Workflow System
  • 16. Research Topics Data Management in Cloud Computing Dr. Dong Yuan http://www.ict.swin.edu.au/personal/dyuan/ 16
  • 17. Data Management in Cloud Computing > Scientific applications in cloud computing – Computation and data intensive applications – Excessive computation and storage resources – Pay-as-you-go model > Three aspects of data management in the cloud – Data storage – Data placement – Data replication
  • 18. Data Storage > Developing smart data storage strategies for reducing the cost of storing big data in the cloud – Data regeneration (computation and storage trade-off) – Data de-duplication – Data compression > Researcher: Dong Yuan
  • 19. Publications > D. Yuan, Y. Yang, X. Liu, J. Chen, On‑demand Minimum Cost Benchmarking for Intermediate Datasets Storage in Scientific Cloud Workflow Systems, Journal of Parallel and Distributed Computing, Elsevier, vol. 71(2), pp. 316-332, 2011. > D. Yuan, Y. Yang, X. Liu, G. Zhang, J. Chen, A Data Dependency Based Strategy for Intermediate Data Storage in Scientific Cloud Workflow Systems, Concurrency and Computation: Practice and Experience, Wiley, 24(9), pp. 956-976, Jun. 2012. > D. Yuan, Y. Yang, X. Liu, J. Chen, A Cost-Effective Strategy for Intermediate Data Storage in Scientific Cloud Workflow Systems, Proc. of 24th IEEE International Parallel & Distributed Processing Symposium (IPDPS10), Atlanta, USA, Apr. 2010. > D. Yuan, Y. Yang, X. Liu and J. Chen, A Local-Optimisation based Strategy for Cost-Effective Datasets Storage of Scientific Applications in the Cloud, Proc. of 4th IEEE International Conference on Cloud Computing (Cloud2011), Washington DC, USA, July 4-9, 2011.
  • 20. Data Placement > Smart data placement strategies to reduce application cost – Data correlation based strategy to reduce bandwidth cost – Data usage based strategy to reduce storage cost > Researchers: Dong Yuan, Jofry Hadi SUTANTO, Antonio Giardina
  • 21. Publications > D. Yuan, Y. Yang, X. Liu, J. Chen, A Data Placement Strategy in Scientific Cloud Workflows, Future Generation Computer Systems, Elsevier, vol. 26(8), pp. 1200-1214, 2010.
  • 22. Data Replication > To cost-effectively assure data reliability in the cloud – Dynamic replication strategy – Proactively checking based replication strategy > Researchers: Wenhao Li, Dong Yuan
  • 23. Publications > W. Li, Y. Yang and D. Yuan, A Novel Cost-effective Dynamic Data Replication Strategy for Reliability in Cloud Data Centres. Proc. of International Conference on Cloud and Green Computing (CGC2011), pages 496-502, Sydney, Australia, Dec. 2011. > W. Li, Y. Yang, J. Chen and D. Yuan, A Cost-Effective Mechanism for Cloud Data Reliability Management based on Proactive Replica Checking. Proc. of 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid2012), pages 564-571, Ottawa, Canada, May 2012.
  • 24. Research Topics Performance Management in Scientific Workflows Dr. Xiao Liu http://www.ict.swin.edu.au/personal/xliu/
  • 25. Workflow QoS > QoS dimensions – time, cost, fidelity, reliability, security … > QoS of Cloud Services > Workflow QoS – the overall QoS for a collection of cloud services – but not simply add up! 25
  • 26. Temporal QoS > System performance – Response time – Throughput > Temporal constraints – Global constraints: deadlines – Local constraints: milestones, individual activity durations > Satisfactory temporal QoS – High performance: fast response, high throughput – On-time completion: low temporal violation rate 26
  • 27. Problem Analysis > Setting temporal constraints – Prerequisite: effective forecasting of activity durations > Monitoring temporal consistency state – Monitor workflow execution state – Detect potential temporal violations > Temporal violation handling – Where to conduct violation handling – What strategies to be used 27
  • 29. Forecasting Activity Durations > Statistical time-series pattern based forecasting strategies > Selected Publications: – X. Liu, Z. Ni, D. Yuan, Y. Jiang, Z. Wu, J. Chen, Y. Yang, A Novel Statistical Time-Series Pattern based Interval Forecasting Strategy for Activity Durations in Workflow Systems, Journal of Systems and Software (JSS), vol. 84, no. 3, Pages 354-376, March 2011. – X. Liu, J. Chen, K. Liu and Y. Yang, Forecasting Duration Intervals of Scientific Workflow Activities based on Time-Series Patterns, Proc. of 4th IEEE International Conference on e-Science (e-Science08), pages 23-30, Indianapolis, USA, Dec. 2008. 29
  • 30. Setting Temporal Constraints > Probability based temporal consistency model > Time analysis based on Stochastic Petri Nets > Selected Publications: – X. Liu, Z. Ni, J. Chen, Y. Yang, A Probabilistic Strategy for Temporal Constraint Management in Scientific Workflow Systems, Concurrency and Computation: Practice and Experience (CCPE), Wiley, 23(16):1893-1919, Nov. 2011 . – X. Liu, J. Chen and Y. Yang, A Probabilistic Strategy for Setting Temporal Constraints in Scientific Workflows, Proc. 6th International Conference on Business Process Management (BPM2008), Lecture Notes in Computer Science, Vol. 5240, pages 180-195, Milan, Italy, Sept. 2008. 30
  • 31. Temporal Consistency Monitoring > Minimum (Probability) Time Redundancy based Checkpoint Selection Strategy > Temporal Dependency based Checkpoint Selection Strategy > Selected Publications: – X. Liu, Y. Yang, Y. Jiang and J. Chen, Preventing Temporal Violations in Scientific Workflows: Where and How. IEEE Transactions on Software Engineering, 37(6):805-825, Nov./Dec. 2011. – J. Chen and Y. Yang, Temporal Dependency based Checkpoint Selection for Dynamic Verification of Temporal Constraints in Scientific Workflow Systems. ACM Transactions on Software Engineering and Methodology, 20(3), 2011
  • 32. Violation Handling > Violation Handling Point Selection > (Probability) Time deficit allocation > Workflow local rescheduling strategy – ACO, GA, PSO > Selected Publications: – X. Liu, Z. Ni, Z. Wu, D. Yuan, J. Chen and Y. Yang, A Novel General Framework for Automatic and Cost-Effective Handling of Recoverable Temporal Violations in Scientific Workflow Systems, Journal of Systems and Software, vol. 84, no. 3, pp. 492-509, 2011 32
  • 33. Research Topics Security and Privacy Protection in the Cloud Gaofeng Zhang gzhang@swin.edu.au 33
  • 34. Background > Data Security vs. Data Privacy > Privacy in cloud computing – Massive data store and compute in open cloud environment – Customers cannot control inside cloud The severity of privacy risk in cloud computing  One specific privacy risk in cloud computing – Indirectly private information (collectively information) – Normal service processes and functions (not disruption) The approach: noise obfuscation for privacy protection
  • 35. Privacy Protection in Cloud > Roles in the view of privacy in regular IT system – Privacy owner, Privacy user and Privacy theft Keep safe Privacy user between Privacy owner and Privacy user! Privacy theft Privacy owner
  • 36. Privacy Protection in Cloud > Roles in the view of privacy in Cloud – Privacy owner, privacy user and privacy theft Virtualisation disable the Privacy user “keeping safe between Privacy owner and Privacy user!” Privacy theft Privacy owner
  • 37. Noise Obfuscation(1) > Background – Massive data stores and computes in open cloud environments. – Customers cannot control inside cloud. > Main idea: “Dilute” real private information with noise information – Not noise signal!
  • 38. Noise Obfuscation(2) > A Motivating example: – One customer, who often travels to one city in Australia, like ‘Sydney’, checks the weather report regularly from a weather service in cloud environments before departure. The frequent appearance of service requests about the weather report for ‘Sydney’ can reveal the privacy that the customer usually goes to ‘Sydney’. But if a system aids the customer to inject other requests like ‘Perth’ or ‘Darwin’ into the ‘Sydney’ queue, the service provider cannot distinguish which ones are real and which ones are ‘noise’ as it just sees a similar style of service request. These requests should be responded and cannot reveal the location privacy of the customer. In such cases, the privacy can be protected by noise obfuscation in general. From ‘data’ privacy to ‘process’ privacy!
  • 39. Research Topics > Noise Generation – Historical probability based noise generation strategy – Time-series pattern based noise generation strategy – Association probability based noise generation strategy – …… > Noise Utilisation – Trust model and injection strategy for noise obfuscation – …… > Noise Cooperation Mechanism – Privacy protection framework under noise obfuscation
  • 40. Publications > G. Zhang, Y. Yang and J. Chen, A Historical Probability based Noise Generation Strategy for Privacy Protection in Cloud Computing. Journal of Computer and System Sciences, Elsevier, 78(5):1374-1381, Sept. 2012. > G. Zhang, Y. Yang, D. Yuan and J. Chen, A Trust-based Noise Injection Strategy for Privacy Protection in Cloud Computing. Software: Practice and Experience , Wiley, 42(4):431-445, Apr. 2012. > G. Zhang, Y. Yang, X. Liu and J. Chen, A Time-series Pattern based Noise Generation Strategy for Privacy Protection in Cloud Computing. Proc. of 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid2012), pages 458-465, Ottawa, Canada, May 2012. > G. Zhang, X. Zhang, Y. Yang, C. Liu and J. Chen, An Association Probability based Noise Generation Strategy for Privacy Protection in Cloud Computing. Proc. 10th International Conference on Service Oriented Computing (ICSoC2012), pages 639-647, Shanghai, China, Nov. 2012. (accepted on 13/7/2012)
  • 41. Research Topics Cloud Workflow System Design and Development Dahai Cao dcao@swin.edu.au 41
  • 42. SwinCloud – Cloud Computing Testbed > SwinCloud 42
  • 43. General cloud workflow reference model
  • 44. Prototype Ⅰ: SwinDeW-C (Peer-to-Peer) > SwinDeW-C 44
  • 46. Cloud workflow implementation > Client system – Process definition tools – Rule editor – Organisation modelling tools – Office calendar management tools – Authority group tools – User management tools – Form designing tools – Tool agent definition tools – Simulation tools
  • 47. New Progress > Successfully deploy on the Amazon Cloud > Eucalyptus: the cloud infrastructure platform

Notas del editor

  1. Swinburne University of Technology
  2. Location example
  3. Location example
  4. Location example
  5. Road map of research