SlideShare una empresa de Scribd logo
1 de 19
Seattle Trip Report
Data Integration – Company Engagement – BigData
Denis C. Bauer | Research Scientist
19 November 2012

CMIS
About me
•   BSc (Germany) Bioinformatics + Hons (ITEE, UQ) “In Silico Protein Design” Machine Learning
•   PhD (IMB, UQ) “Quantitative models of Transcriptional regulation” Optimization
•   PostDoc (IMB, UQ) “Sorting the intranuclear proteom” Bayesian Networks
•   PostDoc (QBI, UQ) Bioinformatics for the Sequencing Facility Operation



                                          • Research Scientist (CSIRO)
                                                 “Data integration of ‘Omics data in CRC”
                                             •     Develop protocols for data generation
                                             •     Develop pipelines for analysis
                                             •     Research ways for data integration

                                                           pHealth (Garry Hannan)
Seattle: Future hub for life sciences?




Seattle Trip Report | Denis C. Bauer | Page 3
Primary Goal: Collaboration with
William Noble

    Bayesian Network                                 for   automatic
    grouping                         of genomic functional elements

    (TSS, gene) by learning                     simultaneously from
    measured                               genomic   features   (histone         Bill Noble

    modifications)




                                                                           Michael Hoffman


Seattle Trip Report | Denis C. Bauer | Page 4
Segway: predictions
 Histone Modifications
             H2M3     x0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x0

             H3M4     x0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x0

             H3M4     0x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x00000


 Bayesian Network



                                                                                                                                                         Train



 Segmentation & Classification




 Annotation




Presentation title | Presenter name | Page 5
Institute for Systems Biology: case study
for BigData
                                                TCGA has 20 different cancer
                                                types with up to 900 samples
                                                each.
                                                • Faster computers
                                                • Better approaches

Amazon: machine learning method for uncovering                                 Ilya Shmulevich
multivariate associations from large and diverse data sets.


Google: Use 10.000 – 600.000 cores and benefit from
Google expertise in compute and storage.




Seattle Trip Report | Denis C. Bauer | Page 6
ISB App Engine Presentation at Google IO 2012




http://popcorn.webmadecontent.org/4d3
  Seattle Trip Report | Denis C. Bauer | Page 7
Focusing on large scale and tactile interactive experiences that engross and
 envelope the visitor, Philip Worthington (1977-) created Shadow Monsters, a
 digital version of the traditional shadow puppet.



Seattle Trip Report | Denis C. Bauer | Page 8
Can CSIRO use outline-detection to do cool stuff ?




Seattle Trip Report | Denis C. Bauer | Page 9
Road Trip to Pacific Northwestern National Laboratory




Presentation title | Presenter name | Page 10
Road Trip to PNNL




Presentation title | Presenter name | Page 11
Road Trip to PNNL




Presentation title | Presenter name | Page 12
Road Trip to PNNL




Presentation title | Presenter name | Page 13
Road Trip to PNNL




Presentation title | Presenter name | Page 14
Road Trip to PNNL




Presentation title | Presenter name | Page 15
Enterprise-wide multidisciplinary
collaborations
PNNL predicts from sensor data if and when
radioactive material hits ground water.
Mathematical and visual prediction methods of
compute-intensive expert systems
Ian’s team develops a framework that allows
enterprise wide collaboration
     • Data sharing/annotation/provenance
     • Computational expert pipelines -> graphical
       programming -> domain experts
     • Developed for computer-grid infrastructure
                                                     Ian Gorton




Seattle Trip Report | Denis C. Bauer | Page 16
Commoditize parallelization
                                                                Computer Science & Engineering
                                                                University of Washington
Currently: Expert-system if !(embarrassingly parallel)
     • Deciding how to most efficiently bundle for parallel
       execution and how to resolve
     • The appropriate method can change with the actual load
       at runtime
Parallelization needs to become something the
compiler at run time works out for us
(just like we don’t write assembly code anymore)
     • SciDB
     • SKEWTUNE (better load for Hadoop)
     • HaLoop (Iterative parallele Data Processing)
                                                                 Magdalena Balazinska




Presentation title | Presenter name | Page 17
Commoditize parallelization (and
visualization)

HDInsight
             Hadoop on windows Server
             and Azure
             Integration with excel




PowerView
             Interactive graphics




Seattle Trip Report | Denis C. Bauer | Page 18
Collaboration options
                            • GS (Bill): Bayesian Network
                           • ISB (Ilya): Variant association
                         • CS (Magda): Iterative parallelization
                         • PNNL (Ian): Graphical programming
                                       Framework




Thank you
CMIS
Denis C. Bauer
Research Scientist
t +61 2 9325 3174
E Denis.Bauer@csiro.au
w www.csiro.au/cmis


CMIS

Más contenido relacionado

Similar a Trip Report Seattle

Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Databricks
 
Advanced Probabilistic Modeling Algorithms for Clustering ...
Advanced Probabilistic Modeling Algorithms for Clustering ...Advanced Probabilistic Modeling Algorithms for Clustering ...
Advanced Probabilistic Modeling Algorithms for Clustering ...
butest
 

Similar a Trip Report Seattle (20)

Rethinking how we provide science IT in an era of massive data but modest bud...
Rethinking how we provide science IT in an era of massive data but modest bud...Rethinking how we provide science IT in an era of massive data but modest bud...
Rethinking how we provide science IT in an era of massive data but modest bud...
 
HPC lab projects
HPC lab projectsHPC lab projects
HPC lab projects
 
Docker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce Hoff
 
Foundations for the Future of Science
Foundations for the Future of ScienceFoundations for the Future of Science
Foundations for the Future of Science
 
H2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User GroupH2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User Group
 
Describing Scholarly Contributions semantically with the Open Research Knowle...
Describing Scholarly Contributions semantically with the Open Research Knowle...Describing Scholarly Contributions semantically with the Open Research Knowle...
Describing Scholarly Contributions semantically with the Open Research Knowle...
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
 
Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)
 
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data HandlingScott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
 
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reports
 
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeWorkflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
Triplifier talk
Triplifier talkTriplifier talk
Triplifier talk
 
Opportunities for HPC in pharma R&D - main deck
Opportunities for HPC in pharma R&D - main deckOpportunities for HPC in pharma R&D - main deck
Opportunities for HPC in pharma R&D - main deck
 
Cifar
CifarCifar
Cifar
 
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
 
Advanced Probabilistic Modeling Algorithms for Clustering ...
Advanced Probabilistic Modeling Algorithms for Clustering ...Advanced Probabilistic Modeling Algorithms for Clustering ...
Advanced Probabilistic Modeling Algorithms for Clustering ...
 
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Analytics of analytics pipelines:from optimising re-execution to general Dat...Analytics of analytics pipelines:from optimising re-execution to general Dat...
Analytics of analytics pipelines: from optimising re-execution to general Dat...
 
Seth A. Faith - Building a PaaS for Forensic DNA analysis using AWS
 Seth A. Faith - Building a PaaS for Forensic DNA analysis using AWS Seth A. Faith - Building a PaaS for Forensic DNA analysis using AWS
Seth A. Faith - Building a PaaS for Forensic DNA analysis using AWS
 
Time to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudTime to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the Cloud
 

Más de Denis C. Bauer

Population-scale high-throughput sequencing data analysis
Population-scale high-throughput sequencing data analysisPopulation-scale high-throughput sequencing data analysis
Population-scale high-throughput sequencing data analysis
Denis C. Bauer
 

Más de Denis C. Bauer (20)

Cloud-native machine learning - Transforming bioinformatics research
Cloud-native machine learning - Transforming bioinformatics research Cloud-native machine learning - Transforming bioinformatics research
Cloud-native machine learning - Transforming bioinformatics research
 
Translating genomics into clinical practice - 2018 AWS summit keynote
Translating genomics into clinical practice - 2018 AWS summit keynoteTranslating genomics into clinical practice - 2018 AWS summit keynote
Translating genomics into clinical practice - 2018 AWS summit keynote
 
Going Server-less for Web-Services that need to Crunch Large Volumes of Data
Going Server-less for Web-Services that need to Crunch Large Volumes of DataGoing Server-less for Web-Services that need to Crunch Large Volumes of Data
Going Server-less for Web-Services that need to Crunch Large Volumes of Data
 
How novel compute technology transforms life science research
How novel compute technology transforms life science researchHow novel compute technology transforms life science research
How novel compute technology transforms life science research
 
How novel compute technology transforms life science research
How novel compute technology transforms life science researchHow novel compute technology transforms life science research
How novel compute technology transforms life science research
 
VariantSpark: applying Spark-based machine learning methods to genomic inform...
VariantSpark: applying Spark-based machine learning methods to genomic inform...VariantSpark: applying Spark-based machine learning methods to genomic inform...
VariantSpark: applying Spark-based machine learning methods to genomic inform...
 
Population-scale high-throughput sequencing data analysis
Population-scale high-throughput sequencing data analysisPopulation-scale high-throughput sequencing data analysis
Population-scale high-throughput sequencing data analysis
 
Allelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome SequencingAllelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome Sequencing
 
Centralizing sequence analysis
Centralizing sequence analysisCentralizing sequence analysis
Centralizing sequence analysis
 
Qbi Centre for Brain genomics (Informatics side)
Qbi Centre for Brain genomics (Informatics side)Qbi Centre for Brain genomics (Informatics side)
Qbi Centre for Brain genomics (Informatics side)
 
Differential gene expression
Differential gene expressionDifferential gene expression
Differential gene expression
 
Transcript detection in RNAseq
Transcript detection in RNAseqTranscript detection in RNAseq
Transcript detection in RNAseq
 
Functionally annotate genomic variants
Functionally annotate genomic variantsFunctionally annotate genomic variants
Functionally annotate genomic variants
 
Variant (SNPs/Indels) calling in DNA sequences, Part 2
Variant (SNPs/Indels) calling in DNA sequences, Part 2Variant (SNPs/Indels) calling in DNA sequences, Part 2
Variant (SNPs/Indels) calling in DNA sequences, Part 2
 
Variant (SNPs/Indels) calling in DNA sequences, Part 1
Variant (SNPs/Indels) calling in DNA sequences, Part 1 Variant (SNPs/Indels) calling in DNA sequences, Part 1
Variant (SNPs/Indels) calling in DNA sequences, Part 1
 
Introduction to second generation sequencing
Introduction to second generation sequencingIntroduction to second generation sequencing
Introduction to second generation sequencing
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
The missing data issue for HiSeq runs
The missing data issue for HiSeq runsThe missing data issue for HiSeq runs
The missing data issue for HiSeq runs
 
Deciphering the regulatory code in the genome
Deciphering the regulatory code in the genomeDeciphering the regulatory code in the genome
Deciphering the regulatory code in the genome
 
ReliF
ReliFReliF
ReliF
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

Trip Report Seattle

  • 1. Seattle Trip Report Data Integration – Company Engagement – BigData Denis C. Bauer | Research Scientist 19 November 2012 CMIS
  • 2. About me • BSc (Germany) Bioinformatics + Hons (ITEE, UQ) “In Silico Protein Design” Machine Learning • PhD (IMB, UQ) “Quantitative models of Transcriptional regulation” Optimization • PostDoc (IMB, UQ) “Sorting the intranuclear proteom” Bayesian Networks • PostDoc (QBI, UQ) Bioinformatics for the Sequencing Facility Operation • Research Scientist (CSIRO) “Data integration of ‘Omics data in CRC” • Develop protocols for data generation • Develop pipelines for analysis • Research ways for data integration pHealth (Garry Hannan)
  • 3. Seattle: Future hub for life sciences? Seattle Trip Report | Denis C. Bauer | Page 3
  • 4. Primary Goal: Collaboration with William Noble Bayesian Network for automatic grouping of genomic functional elements (TSS, gene) by learning simultaneously from measured genomic features (histone Bill Noble modifications) Michael Hoffman Seattle Trip Report | Denis C. Bauer | Page 4
  • 5. Segway: predictions Histone Modifications H2M3 x0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x0 H3M4 x0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x0 H3M4 0x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x000000x00x00xxxxx00xx0x00xxxx0x00x0xx00x00x000x000x00x00000 Bayesian Network Train Segmentation & Classification Annotation Presentation title | Presenter name | Page 5
  • 6. Institute for Systems Biology: case study for BigData TCGA has 20 different cancer types with up to 900 samples each. • Faster computers • Better approaches Amazon: machine learning method for uncovering Ilya Shmulevich multivariate associations from large and diverse data sets. Google: Use 10.000 – 600.000 cores and benefit from Google expertise in compute and storage. Seattle Trip Report | Denis C. Bauer | Page 6
  • 7. ISB App Engine Presentation at Google IO 2012 http://popcorn.webmadecontent.org/4d3 Seattle Trip Report | Denis C. Bauer | Page 7
  • 8. Focusing on large scale and tactile interactive experiences that engross and envelope the visitor, Philip Worthington (1977-) created Shadow Monsters, a digital version of the traditional shadow puppet. Seattle Trip Report | Denis C. Bauer | Page 8
  • 9. Can CSIRO use outline-detection to do cool stuff ? Seattle Trip Report | Denis C. Bauer | Page 9
  • 10. Road Trip to Pacific Northwestern National Laboratory Presentation title | Presenter name | Page 10
  • 11. Road Trip to PNNL Presentation title | Presenter name | Page 11
  • 12. Road Trip to PNNL Presentation title | Presenter name | Page 12
  • 13. Road Trip to PNNL Presentation title | Presenter name | Page 13
  • 14. Road Trip to PNNL Presentation title | Presenter name | Page 14
  • 15. Road Trip to PNNL Presentation title | Presenter name | Page 15
  • 16. Enterprise-wide multidisciplinary collaborations PNNL predicts from sensor data if and when radioactive material hits ground water. Mathematical and visual prediction methods of compute-intensive expert systems Ian’s team develops a framework that allows enterprise wide collaboration • Data sharing/annotation/provenance • Computational expert pipelines -> graphical programming -> domain experts • Developed for computer-grid infrastructure Ian Gorton Seattle Trip Report | Denis C. Bauer | Page 16
  • 17. Commoditize parallelization Computer Science & Engineering University of Washington Currently: Expert-system if !(embarrassingly parallel) • Deciding how to most efficiently bundle for parallel execution and how to resolve • The appropriate method can change with the actual load at runtime Parallelization needs to become something the compiler at run time works out for us (just like we don’t write assembly code anymore) • SciDB • SKEWTUNE (better load for Hadoop) • HaLoop (Iterative parallele Data Processing) Magdalena Balazinska Presentation title | Presenter name | Page 17
  • 18. Commoditize parallelization (and visualization) HDInsight Hadoop on windows Server and Azure Integration with excel PowerView Interactive graphics Seattle Trip Report | Denis C. Bauer | Page 18
  • 19. Collaboration options • GS (Bill): Bayesian Network • ISB (Ilya): Variant association • CS (Magda): Iterative parallelization • PNNL (Ian): Graphical programming Framework Thank you CMIS Denis C. Bauer Research Scientist t +61 2 9325 3174 E Denis.Bauer@csiro.au w www.csiro.au/cmis CMIS

Notas del editor

  1. http://www.snap2objects.com/2009/05/70-designers-that-shaped-the-world/http://www.snap2objects.com/2009/05/70-designers-that-shaped-the-world/