SlideShare una empresa de Scribd logo
1 de 23
Sequencing Genomics: The New Big Data Driver IntermezzoTalk SURFnet7, Part of GigaPort3 Utrecht, Netherlands December 7, 2011 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor,  Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD http://lsmarr.calit2.net
Cost Per Megabase in Sequencing DNA  is Falling Much Faster Than Moore’s Law www.genome.gov/sequencingcosts/
Genomic Sequencing  is Driving Big Data November 30, 2011
BGI—The Beijing Genome Institute  is the World’s Largest Genomic Institute ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Next Generation Genome Sequencers Produce Large Data Sets Source: Chris Misleh, SOM/Calit2 UCSD
Needed: Interdisciplinary Teams Made From  Computer Science, Data Analytics, and Genomics We believe  the field of bioinformatics  for genetic analysis  will be one of the biggest areas of disruptive innovation  in life science tools  over the next few years,”  --Isaac Ro, an analyst at Goldman Sachs
Calit2 Brings Together  Computer Science and Bioinformatics  National Biomedical Computation  Resource  an NIH supported resource center
Single Nucleotide Polymophisms (SNPs): Human DNA Base Pairs May Differ At Some Points Person A Person B http://en.wikipedia.org/wiki/File:Dna-SNP.svg
Why We Study SNPs 99.9% of One’s Individual DNA Sequence will be Identical  to that of Another Person.  Of the 0.1% Difference,  Over 80% will be  Single Nucleotide Polymorphisms (SNPs). http://shop.perkinelmer.com/content/snps/genotyping.asp
Consumer Companies Provide Your SNPs www.23andme.com
Cost of Sequencing Human Genome  is Rapidly Becoming Affordable
The Rise of Individual and Societal Genomic Testing-Promise and Concerns www.technologyreview.com/biomedicine/25218/
Publically Sharing Your Genome and Medical Records: Is it Crazy or the Future?
From 10,000 Human Genomes Sequenced in 2011 to 1 Million by 2015 Out of Less Than 5,000 sq. ft.! 4 Million Newborns / Year in U.S.
But the Human Genome Contains  Less Than 1% of the Bodies Genes http://commonfund.nih.gov/hmp/ The Total Number of These Bacterial Cells is 10 Times the Number  of Human Cells in Your Body
The Human Microbiome is the Next Large NIH Drive  to Understand Human Health and Disease ,[object Object],[object Object],[object Object],“ Diversity of the Human Intestinal Microbial Flora”  Paul B. Eckburg, et al  Science  (10 June 2005) 395 Phylotypes
The New Science of Metagenomics “ The emerging field  of metagenomics,  where the DNA of entire communities of microbes  is studied simultaneously, presents the greatest opportunity --  perhaps since the invention of the microscope  –  to revolutionize understanding of the microbial world.” – National Research Council March 27, 2007 NRC Report: Metagenomic data should be made publicly available in international archives as rapidly as possible.
Community Cyberinfrastructure for Advanced  Microbial Ecology Research and Analysis http://camera.calit2.net/
Calit2 CAMERA:  0ver 4000 Registered Users  From Over 80 Countries
Calit2 Microbial Metagenomics Cluster- Next Generation Optically Linked Science Data Server 4000 Users From 90 Countries 512 Processors  ~5 Teraflops  ~ 200 Terabytes Storage  1GbE and 10GbE Switched/ Routed Core ~200TB Sun X4500 Storage 10GbE Source: Phil Papadopoulos, SDSC, Calit2
UCSD Planned Optical Networked Biomedical Researchers and Instruments ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Cellular & Molecular Medicine West  National Center for Microscopy & Imaging Biomedical Research  Center for  Molecular Genetics  Pharmaceutical Sciences Building Cellular & Molecular Medicine East CryoElectron Microscopy Facility  Radiology Imaging Lab  Bioengineering [email_address] San Diego Supercomputer Center
UCSD Campus Investment in Fiber  Enables Big Data Science Source:  Philip Papadopoulos, SDSC, UCSD OptIPortal Tiled Display Wall Campus Lab Cluster Digital Data Collections N x 10Gb/s Triton – Petascale  Data Analysis Gordon – HPD System Cluster Condo WAN 10Gb:  CENIC, NLR, I2 GLIF Scientific  Instruments DataOasis  (Central) Storage GreenLight Data Center
SURFnet – a Global SuperNetwork Connecting to the Global Lambda Integrated Facility Visualization courtesy of  Donna Cox, Bob Patterson, NCSA. www.glif.is

Más contenido relacionado

La actualidad más candente

Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Intel IT Center
 
Human genome project 2007
Human genome project 2007Human genome project 2007
Human genome project 2007
Hesham Gaber
 
Dna sequencing pp
Dna sequencing ppDna sequencing pp
Dna sequencing pp
libs6359
 

La actualidad más candente (18)

Building an Information Infrastructure to Support Genetic Sciences
Building an Information Infrastructure to Support Genetic SciencesBuilding an Information Infrastructure to Support Genetic Sciences
Building an Information Infrastructure to Support Genetic Sciences
 
Human genome project
Human genome projectHuman genome project
Human genome project
 
Using Supercomputers and Supernetworks to Explore the Ocean of Life
Using Supercomputers and Supernetworks to Explore the Ocean of LifeUsing Supercomputers and Supernetworks to Explore the Ocean of Life
Using Supercomputers and Supernetworks to Explore the Ocean of Life
 
PAPER 3.1 ~ HUMAN GENOME PROJECT
PAPER 3.1 ~  HUMAN GENOME PROJECTPAPER 3.1 ~  HUMAN GENOME PROJECT
PAPER 3.1 ~ HUMAN GENOME PROJECT
 
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
 
Human genome project
Human genome projectHuman genome project
Human genome project
 
Human genome project
Human genome projectHuman genome project
Human genome project
 
Genetic engineering
Genetic engineering Genetic engineering
Genetic engineering
 
Human genome project by kk sahu
Human genome project by kk sahuHuman genome project by kk sahu
Human genome project by kk sahu
 
Human genome project 2007
Human genome project 2007Human genome project 2007
Human genome project 2007
 
Human genome project
Human genome projectHuman genome project
Human genome project
 
Briefing to External Relations Staff
Briefing to External Relations StaffBriefing to External Relations Staff
Briefing to External Relations Staff
 
Using Supercomputers and Gene Sequencers to Discover Your Inner Microbiome
Using Supercomputers and Gene Sequencers to Discover Your Inner MicrobiomeUsing Supercomputers and Gene Sequencers to Discover Your Inner Microbiome
Using Supercomputers and Gene Sequencers to Discover Your Inner Microbiome
 
Dna sequencing pp
Dna sequencing ppDna sequencing pp
Dna sequencing pp
 
Microbial Metagenomics and Human Health
Microbial Metagenomics and Human HealthMicrobial Metagenomics and Human Health
Microbial Metagenomics and Human Health
 
Scott Schweikart, "Human Genome Editing: An Ethical Analysis and Arguments fo...
Scott Schweikart, "Human Genome Editing: An Ethical Analysis and Arguments fo...Scott Schweikart, "Human Genome Editing: An Ethical Analysis and Arguments fo...
Scott Schweikart, "Human Genome Editing: An Ethical Analysis and Arguments fo...
 
Microbial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New CyberinfrastructureMicrobial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New Cyberinfrastructure
 
Human genome project slides
Human genome project slidesHuman genome project slides
Human genome project slides
 

Destacado

Destacado (10)

A Gigabit in Every Home—The Emergence of True Broadband
A Gigabit in Every Home—The Emergence of True BroadbandA Gigabit in Every Home—The Emergence of True Broadband
A Gigabit in Every Home—The Emergence of True Broadband
 
Best pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics DataBest pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics Data
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
 
Managing & Processing Big Data for Cancer Genomics, an insight of Bioinformatics
Managing & Processing Big Data for Cancer Genomics, an insight of BioinformaticsManaging & Processing Big Data for Cancer Genomics, an insight of Bioinformatics
Managing & Processing Big Data for Cancer Genomics, an insight of Bioinformatics
 
Spark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleSpark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scale
 
Big Data and Genomics
Big Data and GenomicsBig Data and Genomics
Big Data and Genomics
 
Genome Big Data
Genome Big DataGenome Big Data
Genome Big Data
 
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud PilotsC-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
 
Genomics: Big Data Leading to Big Opportunities
Genomics: Big Data Leading to Big OpportunitiesGenomics: Big Data Leading to Big Opportunities
Genomics: Big Data Leading to Big Opportunities
 
Lightning fast genomics with Spark, Adam and Scala
Lightning fast genomics with Spark, Adam and ScalaLightning fast genomics with Spark, Adam and Scala
Lightning fast genomics with Spark, Adam and Scala
 

Similar a Sequencing Genomics: The New Big Data Driver

Cancer genome repository_berkeley
Cancer genome repository_berkeleyCancer genome repository_berkeley
Cancer genome repository_berkeley
Shyam Sarkar
 

Similar a Sequencing Genomics: The New Big Data Driver (20)

High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
 
Global Telepresence in Support of Global Public Health
Global Telepresence in Support of Global Public HealthGlobal Telepresence in Support of Global Public Health
Global Telepresence in Support of Global Public Health
 
High Performance Collaboration
High Performance CollaborationHigh Performance Collaboration
High Performance Collaboration
 
Collaborations Between Calit2, SIO, and the Venter Institute-a Beginning
Collaborations Between Calit2, SIO, and the Venter Institute-a BeginningCollaborations Between Calit2, SIO, and the Venter Institute-a Beginning
Collaborations Between Calit2, SIO, and the Venter Institute-a Beginning
 
The Singularity: Toward a Post-Human Reality
The Singularity: Toward a Post-Human RealityThe Singularity: Toward a Post-Human Reality
The Singularity: Toward a Post-Human Reality
 
Genomics at the Speed of Light: Understanding the Living Ocean
Genomics at the Speed of Light: Understanding the Living OceanGenomics at the Speed of Light: Understanding the Living Ocean
Genomics at the Speed of Light: Understanding the Living Ocean
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
 
Discovering Yourself with Computational Bioinformatics
Discovering Yourself with Computational BioinformaticsDiscovering Yourself with Computational Bioinformatics
Discovering Yourself with Computational Bioinformatics
 
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureBioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of Nature
 
Advancing the Metagenomics Revolution
Advancing the Metagenomics RevolutionAdvancing the Metagenomics Revolution
Advancing the Metagenomics Revolution
 
Living in a World of Nanobioinfotechnology
Living in a World of NanobioinfotechnologyLiving in a World of Nanobioinfotechnology
Living in a World of Nanobioinfotechnology
 
Driving Applications on the UCSD Big Data Freeway System
Driving Applications on the UCSD Big Data Freeway SystemDriving Applications on the UCSD Big Data Freeway System
Driving Applications on the UCSD Big Data Freeway System
 
Cancer genome repository_berkeley
Cancer genome repository_berkeleyCancer genome repository_berkeley
Cancer genome repository_berkeley
 
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
 
Calit2 - CSE's Living Laboratory for Applications
Calit2 - CSE's Living Laboratory for ApplicationsCalit2 - CSE's Living Laboratory for Applications
Calit2 - CSE's Living Laboratory for Applications
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics Researchers
 
Emerging Trends
Emerging TrendsEmerging Trends
Emerging Trends
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 
Machine Learning in Healthcare Diagnostics
Machine Learning in Healthcare DiagnosticsMachine Learning in Healthcare Diagnostics
Machine Learning in Healthcare Diagnostics
 

Más de Larry Smarr

Más de Larry Smarr (20)

My Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 YearsMy Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 Years
 
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
 
Global Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated Systems
 
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
 
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonThe Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
 
Panel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewPanel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An Overview
 
Panel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical NetworksPanel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical Networks
 
Global Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine BrownGlobal Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine Brown
 
Built around answering questions
Built around answering questionsBuilt around answering questions
Built around answering questions
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
 
Democratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharDemocratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish Parashar
 
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
 
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Frank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forwardFrank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forward
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 

Sequencing Genomics: The New Big Data Driver

  • 1. Sequencing Genomics: The New Big Data Driver IntermezzoTalk SURFnet7, Part of GigaPort3 Utrecht, Netherlands December 7, 2011 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD http://lsmarr.calit2.net
  • 2. Cost Per Megabase in Sequencing DNA is Falling Much Faster Than Moore’s Law www.genome.gov/sequencingcosts/
  • 3. Genomic Sequencing is Driving Big Data November 30, 2011
  • 4.
  • 5. Next Generation Genome Sequencers Produce Large Data Sets Source: Chris Misleh, SOM/Calit2 UCSD
  • 6. Needed: Interdisciplinary Teams Made From Computer Science, Data Analytics, and Genomics We believe the field of bioinformatics for genetic analysis will be one of the biggest areas of disruptive innovation in life science tools over the next few years,” --Isaac Ro, an analyst at Goldman Sachs
  • 7. Calit2 Brings Together Computer Science and Bioinformatics National Biomedical Computation Resource an NIH supported resource center
  • 8. Single Nucleotide Polymophisms (SNPs): Human DNA Base Pairs May Differ At Some Points Person A Person B http://en.wikipedia.org/wiki/File:Dna-SNP.svg
  • 9. Why We Study SNPs 99.9% of One’s Individual DNA Sequence will be Identical to that of Another Person. Of the 0.1% Difference, Over 80% will be Single Nucleotide Polymorphisms (SNPs). http://shop.perkinelmer.com/content/snps/genotyping.asp
  • 10. Consumer Companies Provide Your SNPs www.23andme.com
  • 11. Cost of Sequencing Human Genome is Rapidly Becoming Affordable
  • 12. The Rise of Individual and Societal Genomic Testing-Promise and Concerns www.technologyreview.com/biomedicine/25218/
  • 13. Publically Sharing Your Genome and Medical Records: Is it Crazy or the Future?
  • 14. From 10,000 Human Genomes Sequenced in 2011 to 1 Million by 2015 Out of Less Than 5,000 sq. ft.! 4 Million Newborns / Year in U.S.
  • 15. But the Human Genome Contains Less Than 1% of the Bodies Genes http://commonfund.nih.gov/hmp/ The Total Number of These Bacterial Cells is 10 Times the Number of Human Cells in Your Body
  • 16.
  • 17. The New Science of Metagenomics “ The emerging field of metagenomics, where the DNA of entire communities of microbes is studied simultaneously, presents the greatest opportunity -- perhaps since the invention of the microscope – to revolutionize understanding of the microbial world.” – National Research Council March 27, 2007 NRC Report: Metagenomic data should be made publicly available in international archives as rapidly as possible.
  • 18. Community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis http://camera.calit2.net/
  • 19. Calit2 CAMERA: 0ver 4000 Registered Users From Over 80 Countries
  • 20. Calit2 Microbial Metagenomics Cluster- Next Generation Optically Linked Science Data Server 4000 Users From 90 Countries 512 Processors ~5 Teraflops ~ 200 Terabytes Storage 1GbE and 10GbE Switched/ Routed Core ~200TB Sun X4500 Storage 10GbE Source: Phil Papadopoulos, SDSC, Calit2
  • 21.
  • 22. UCSD Campus Investment in Fiber Enables Big Data Science Source: Philip Papadopoulos, SDSC, UCSD OptIPortal Tiled Display Wall Campus Lab Cluster Digital Data Collections N x 10Gb/s Triton – Petascale Data Analysis Gordon – HPD System Cluster Condo WAN 10Gb: CENIC, NLR, I2 GLIF Scientific Instruments DataOasis (Central) Storage GreenLight Data Center
  • 23. SURFnet – a Global SuperNetwork Connecting to the Global Lambda Integrated Facility Visualization courtesy of Donna Cox, Bob Patterson, NCSA. www.glif.is

Notas del editor

  1. This is a production cluster with it’s own Force10 e1200 switch. It is connected to quartzite and is labeled as the “CAMERA Force10 E1200”. We built CAMERA this way because of technology deployed successfully in Quartzite