SlideShare a Scribd company logo
1 of 9
SAP	
  HANA	
  For	
  Genome	
  Data	
  
          Processing:	
  A	
  Deep	
  Dive	
  
Dr.	
  Ma'hieu-­‐P.	
  Schapranow	
          Emanuel	
  Ziegler	
  
 PI	
  In-­‐Memory	
  Technology	
       HANA	
  In-­‐Memory	
  Pla:orm	
  
           for	
  Life	
  Sciences	
     Genomics	
  and	
  Proteomics	
  
   Hasso	
  Pla9ner	
  Ins;tute	
                     SAP	
  AG	
  
Comparison	
  of	
  Costs	
  
                                                                                                            Comparison	
  of	
  Costs	
  for	
  Main	
  Memory	
  and	
  Genome	
  Analysis	
  
                                                                                                                                                 Costs	
  per	
  Megabyte	
  RAM	
                                                                           Costs	
  per	
  Megabase	
  Sequencing	
  

                         10000	
  



                          1000	
  



                            100	
  
Costs	
  in	
  USD	
  




                              10	
  



                                1	
  



                             0.1	
  



                           0.01	
  



                         0.001	
  
                                        1/1/01	
  
                                                     5/1/01	
  
                                                                  9/1/01	
  
                                                                               1/1/02	
  
                                                                                            5/1/02	
  
                                                                                                         9/1/02	
  
                                                                                                                      1/1/03	
  
                                                                                                                                   5/1/03	
  
                                                                                                                                                9/1/03	
  
                                                                                                                                                             1/1/04	
  
                                                                                                                                                                          5/1/04	
  
                                                                                                                                                                                       9/1/04	
  
                                                                                                                                                                                                    1/1/05	
  
                                                                                                                                                                                                                 5/1/05	
  
                                                                                                                                                                                                                              9/1/05	
  
                                                                                                                                                                                                                                           1/1/06	
  
                                                                                                                                                                                                                                                        5/1/06	
  
                                                                                                                                                                                                                                                                     9/1/06	
  
                                                                                                                                                                                                                                                                                  1/1/07	
  
                                                                                                                                                                                                                                                                                               5/1/07	
  
                                                                                                                                                                                                                                                                                                            9/1/07	
  
                                                                                                                                                                                                                                                                                                                         1/1/08	
  
                                                                                                                                                                                                                                                                                                                                      5/1/08	
  
                                                                                                                                                                                                                                                                                                                                                   9/1/08	
  
                                                                                                                                                                                                                                                                                                                                                                1/1/09	
  
                                                                                                                                                                                                                                                                                                                                                                             5/1/09	
  
                                                                                                                                                                                                                                                                                                                                                                                          9/1/09	
  
                                                                                                                                                                                                                                                                                                                                                                                                       1/1/10	
  
                                                                                                                                                                                                                                                                                                                                                                                                                    5/1/10	
  
                                                                                                                                                                                                                                                                                                                                                                                                                                 9/1/10	
  
                                                                                                                                                                                                                                                                                                                                                                                                                                              1/1/11	
  
                                                                                                                                                                                                                                                                                                                                                                                                                                                           5/1/11	
  
                                                                                                                                                                                                                                                                                                                                                                                                                                                                        9/1/11	
  
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     1/1/12	
  
                                                                       SAP	
  HANA	
  For	
  Genome	
  Data	
  Processing:	
  A	
  Deep	
  Dive,	
  E.	
  Ziegler	
  and	
  Dr.	
  M.-­‐P.	
  Schapranow	
                                                                                                                                                                                                                                                                                        2	
  
HANA	
  technology	
  for	
  alignment	
  
                                           Efficient	
  streaming	
  of	
  large	
  amounts	
  of	
  data	
  
                                           using	
  experience	
  with	
  high	
  throughput	
  of	
  big	
  data	
  


                                           Cache	
  efficient	
  index	
  structures	
  for	
  seed	
  lookups	
  
                                           using	
  knowledge	
  from	
  text	
  search	
  


                                           RaFng	
  of	
  seed	
  matches	
  
                                           based	
  on	
  search	
  engine	
  prac;ces	
  


                                           Hardware	
  accelerated	
  gapped	
  alignment	
  
                                           using	
  vectoriza;on	
  and	
  bit	
  parallelism	
  

    SAP	
  HANA	
  For	
  Genome	
  Data	
  Processing:	
  A	
  Deep	
  Dive,	
  E.	
  Ziegler	
  and	
  Dr.	
  M.-­‐P.	
  Schapranow	
     3	
  
Alignment	
  on	
  SAP	
  HANA	
  
                  Simulated	
  full	
  genome	
                                                                  Illumina	
  HiSeq	
  sequenced	
  exome	
  
                  100	
  bases	
  per	
  read,	
  single	
  ended	
                                              100	
  bases	
  per	
  read,	
  single	
  ended	
  
BWA-­‐SW	
  
SAP	
  HANA	
  




                                                                   Misaligned	
                                                                                       Misaligned	
  
                                                                   Unaligned	
                                                                                        Unaligned	
  

             0	
            0.2	
           0.4	
   0.6	
                       0.8	
             1.0	
   0	
                  0.2	
            0.4	
   0.6	
                 0.8	
     1.0	
  
                                            Percentage	
                                                                                        Percentage	
  
                     Misalignment	
  w.	
  r.	
  t.	
  Smith-­‐Waterman	
  score	
                                    Misalignment	
  w.	
  r.	
  t.	
  Smith-­‐Waterman	
  score	
  
                       of	
  reference	
  alignment	
  from	
  simula;on	
                                                of	
  other	
  alignment	
  algorithm	
  result	
  

                                      SAP	
  HANA	
  For	
  Genome	
  Data	
  Processing:	
  A	
  Deep	
  Dive,	
  E.	
  Ziegler	
  and	
  Dr.	
  M.-­‐P.	
  Schapranow	
                  4	
  
Genome	
  Data	
  Processing	
  
        Integrated	
  in	
  SAP	
  HANA	
  

                                                                                1,000	
  core	
  cluster	
  
                                                                                    ■  25	
  iden;cal	
  nodes	
  
                                                                                    ■  80	
  cores	
  
                                                                                    ■  1	
  TB	
  main	
  memory	
  
                                                                                    ■  2.40	
  GHz,	
  30	
  MB	
  Cache	
  




SAP	
  HANA	
  For	
  Genome	
  Data	
  Processing:	
  A	
  Deep	
  Dive,	
  E.	
  Ziegler	
  and	
  Dr.	
  M.-­‐P.	
  Schapranow	
     5	
  
Real-­‐;me	
  Combina;on	
  of	
  
                   Latest	
  Research	
  Results	
  
Genome	
  Browser	
  
■  Comparison	
  of	
  mul;ple	
  mapped	
  genomes	
  with	
  reference	
  
■  Explora;on	
  of	
  individual	
  genome	
  loca;ons	
  combined	
  with	
  latest	
  
     relevant	
  annota;ons	
  and	
  literature	
  e.g.	
  NCBI,	
  dbSNP,	
  UCSC,	
  Sanger	
  
	
  
	
  
	
  
	
  
	
  
InterpretaFon	
  of	
  Variants	
  
■  Variants	
  are	
  sorted,	
  e.g.	
  accordingly	
  to	
  known	
  associated	
  diseases	
  
■  All	
  variants	
  are	
  linked	
  to	
  genome	
  browser	
  
■  Mul;ple	
  pa;ents	
  can	
  be	
  compared	
  to	
  iden;fy	
  individual	
  disposi;ons	
  


            SAP	
  HANA	
  For	
  Genome	
  Data	
  Processing:	
  A	
  Deep	
  Dive,	
  E.	
  Ziegler	
  and	
  Dr.	
  M.-­‐P.	
  Schapranow	
     6	
  
Hardware	
  Advances	
  Support	
  
          Analysis	
  of	
  Genome	
  Data	
  

                                            Alignment	
  and	
                                     CombinaFon	
  with	
  Latest	
  
                                            Variant	
  Calling	
                                    Research	
  AnnotaFons	
  
 Bound	
  To	
                          CPU	
  Performance	
                                                Memory	
  Capacity	
  
 DuraFon	
                                            Hours	
                                                           Weeks	
  
SAP	
  &	
  HPI	
                                  Minutes	
                                                         Real-­‐;me	
  
                                                Mul;-­‐Core	
                                    Par;;oning	
  &	
  Compression	
  
In-­‐Memory	
  
                                                    	
                                                          	
  
Technology	
  
                                                    	
                                                          	
  




         SAP	
  HANA	
  For	
  Genome	
  Data	
  Processing:	
  A	
  Deep	
  Dive,	
  E.	
  Ziegler	
  and	
  Dr.	
  M.-­‐P.	
  Schapranow	
     7	
  
What	
  to	
  take	
  home?	
  
Sequencing	
  machines	
  become	
  faster,	
  
smaller,	
  cheaper,	
  and	
  generate	
  immense	
  
data	
  sets	
  in	
  heterogeneous	
  formats	
  
 	
  



 ■  In-­‐memory	
  technology	
  is	
  the	
  key	
  to	
  
    explore	
  and	
  analyze	
  these	
  big	
  data	
  sets	
  
 ■  Efficient	
  paralleliza;on	
  reduces	
  processing	
  ;me	
  
 ■  In-­‐memory	
  technology	
  enables	
  real-­‐;me	
  analysis	
  and	
  
    interac;ve	
  explora;on	
  of	
  genome	
  data	
  
                                          	
  
        “Let’s	
  idenFfy	
  genomic	
  roots	
  and	
  opFmal	
  treatments	
  
          before	
  the	
  paFent	
  wakes	
  up	
  from	
  anaesthesia!”	
  

                SAP	
  HANA	
  For	
  Genome	
  Data	
  Processing:	
  A	
  Deep	
  Dive,	
  E.	
  Ziegler	
  and	
  Dr.	
  M.-­‐P.	
  Schapranow	
     8	
  
Thank	
  you	
  for	
  your	
  interest!	
  
                Keep	
  in	
  contact	
  with	
  us.	
  



                                                                                                                        Dr. Matthieu-P. Schapranow
Emanuel Ziegler
                                                                                                                   schapranow@hpi.uni-potsdam.de
emanuel.ziegler@SAP.com                                                                                                                          http://j.mp/schapranow




SAP AG
                                                                                                        Hasso Plattner Institute
Emanuel Ziegler, TREX
                                                                                    Enterprise Platform & Integration Concepts
Dietmar-Hopp-Allee 16                                                                                   Matthieu-P. Schapranow
69190 Walldorf, Germany                                                                                   August-Bebel-Str. 88
                                                                                                      14482 Potsdam, Germany


             SAP	
  HANA	
  For	
  Genome	
  Data	
  Processing:	
  A	
  Deep	
  Dive,	
  E.	
  Ziegler	
  and	
  Dr.	
  M.-­‐P.	
  Schapranow	
                    9	
  

More Related Content

More from Matthieu Schapranow

More from Matthieu Schapranow (20)

Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in PracticePatient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
 
How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?
 
AI in Oncology
AI in OncologyAI in Oncology
AI in Oncology
 
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital HealthAnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
 
Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...
Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...
Algorithmen statt Ärzte: Algorithmen statt Ärzte: Ersetzt Big Data künftig ...
 
A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...
A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...
A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...
 
In-Memory Apps for Precision Medicine
In-Memory Apps for Precision MedicineIn-Memory Apps for Precision Medicine
In-Memory Apps for Precision Medicine
 
"When time matters..."
"When time matters...""When time matters..."
"When time matters..."
 
ICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart FailureICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
 
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
 
Analyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision MedicineAnalyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision Medicine
 
In-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems MedicineIn-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems Medicine
 
Analyze Genomes: In-memory Apps supporting Precision Medicine
Analyze Genomes: In-memory Apps supporting Precision MedicineAnalyze Genomes: In-memory Apps supporting Precision Medicine
Analyze Genomes: In-memory Apps supporting Precision Medicine
 
Analyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences ResearchAnalyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences Research
 
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
 
Analyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision MedicineAnalyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision Medicine
 
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
 
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
 
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
 
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
 

Recently uploaded

Recently uploaded (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

SAP HANA For Genome Data Processing: A Deep Dive

  • 1. SAP  HANA  For  Genome  Data   Processing:  A  Deep  Dive   Dr.  Ma'hieu-­‐P.  Schapranow   Emanuel  Ziegler   PI  In-­‐Memory  Technology   HANA  In-­‐Memory  Pla:orm   for  Life  Sciences   Genomics  and  Proteomics   Hasso  Pla9ner  Ins;tute   SAP  AG  
  • 2. Comparison  of  Costs   Comparison  of  Costs  for  Main  Memory  and  Genome  Analysis   Costs  per  Megabyte  RAM   Costs  per  Megabase  Sequencing   10000   1000   100   Costs  in  USD   10   1   0.1   0.01   0.001   1/1/01   5/1/01   9/1/01   1/1/02   5/1/02   9/1/02   1/1/03   5/1/03   9/1/03   1/1/04   5/1/04   9/1/04   1/1/05   5/1/05   9/1/05   1/1/06   5/1/06   9/1/06   1/1/07   5/1/07   9/1/07   1/1/08   5/1/08   9/1/08   1/1/09   5/1/09   9/1/09   1/1/10   5/1/10   9/1/10   1/1/11   5/1/11   9/1/11   1/1/12   SAP  HANA  For  Genome  Data  Processing:  A  Deep  Dive,  E.  Ziegler  and  Dr.  M.-­‐P.  Schapranow   2  
  • 3. HANA  technology  for  alignment   Efficient  streaming  of  large  amounts  of  data   using  experience  with  high  throughput  of  big  data   Cache  efficient  index  structures  for  seed  lookups   using  knowledge  from  text  search   RaFng  of  seed  matches   based  on  search  engine  prac;ces   Hardware  accelerated  gapped  alignment   using  vectoriza;on  and  bit  parallelism   SAP  HANA  For  Genome  Data  Processing:  A  Deep  Dive,  E.  Ziegler  and  Dr.  M.-­‐P.  Schapranow   3  
  • 4. Alignment  on  SAP  HANA   Simulated  full  genome   Illumina  HiSeq  sequenced  exome   100  bases  per  read,  single  ended   100  bases  per  read,  single  ended   BWA-­‐SW   SAP  HANA   Misaligned   Misaligned   Unaligned   Unaligned   0   0.2   0.4   0.6   0.8   1.0   0   0.2   0.4   0.6   0.8   1.0   Percentage   Percentage   Misalignment  w.  r.  t.  Smith-­‐Waterman  score   Misalignment  w.  r.  t.  Smith-­‐Waterman  score   of  reference  alignment  from  simula;on   of  other  alignment  algorithm  result   SAP  HANA  For  Genome  Data  Processing:  A  Deep  Dive,  E.  Ziegler  and  Dr.  M.-­‐P.  Schapranow   4  
  • 5. Genome  Data  Processing   Integrated  in  SAP  HANA   1,000  core  cluster   ■  25  iden;cal  nodes   ■  80  cores   ■  1  TB  main  memory   ■  2.40  GHz,  30  MB  Cache   SAP  HANA  For  Genome  Data  Processing:  A  Deep  Dive,  E.  Ziegler  and  Dr.  M.-­‐P.  Schapranow   5  
  • 6. Real-­‐;me  Combina;on  of   Latest  Research  Results   Genome  Browser   ■  Comparison  of  mul;ple  mapped  genomes  with  reference   ■  Explora;on  of  individual  genome  loca;ons  combined  with  latest   relevant  annota;ons  and  literature  e.g.  NCBI,  dbSNP,  UCSC,  Sanger             InterpretaFon  of  Variants   ■  Variants  are  sorted,  e.g.  accordingly  to  known  associated  diseases   ■  All  variants  are  linked  to  genome  browser   ■  Mul;ple  pa;ents  can  be  compared  to  iden;fy  individual  disposi;ons   SAP  HANA  For  Genome  Data  Processing:  A  Deep  Dive,  E.  Ziegler  and  Dr.  M.-­‐P.  Schapranow   6  
  • 7. Hardware  Advances  Support   Analysis  of  Genome  Data   Alignment  and   CombinaFon  with  Latest   Variant  Calling   Research  AnnotaFons   Bound  To   CPU  Performance   Memory  Capacity   DuraFon   Hours   Weeks   SAP  &  HPI   Minutes   Real-­‐;me   Mul;-­‐Core   Par;;oning  &  Compression   In-­‐Memory       Technology       SAP  HANA  For  Genome  Data  Processing:  A  Deep  Dive,  E.  Ziegler  and  Dr.  M.-­‐P.  Schapranow   7  
  • 8. What  to  take  home?   Sequencing  machines  become  faster,   smaller,  cheaper,  and  generate  immense   data  sets  in  heterogeneous  formats     ■  In-­‐memory  technology  is  the  key  to   explore  and  analyze  these  big  data  sets   ■  Efficient  paralleliza;on  reduces  processing  ;me   ■  In-­‐memory  technology  enables  real-­‐;me  analysis  and   interac;ve  explora;on  of  genome  data     “Let’s  idenFfy  genomic  roots  and  opFmal  treatments   before  the  paFent  wakes  up  from  anaesthesia!”   SAP  HANA  For  Genome  Data  Processing:  A  Deep  Dive,  E.  Ziegler  and  Dr.  M.-­‐P.  Schapranow   8  
  • 9. Thank  you  for  your  interest!   Keep  in  contact  with  us.   Dr. Matthieu-P. Schapranow Emanuel Ziegler schapranow@hpi.uni-potsdam.de emanuel.ziegler@SAP.com http://j.mp/schapranow SAP AG Hasso Plattner Institute Emanuel Ziegler, TREX Enterprise Platform & Integration Concepts Dietmar-Hopp-Allee 16 Matthieu-P. Schapranow 69190 Walldorf, Germany August-Bebel-Str. 88 14482 Potsdam, Germany SAP  HANA  For  Genome  Data  Processing:  A  Deep  Dive,  E.  Ziegler  and  Dr.  M.-­‐P.  Schapranow   9