SlideShare una empresa de Scribd logo
1 de 25
Yaoyu E. Wang, Ph.D
Center for Cancer Computational Biology, DFCI
SPECSII webinar
June 05, 2013
- Transcriptome profiling represents a static gene expression
state of a biological sample across the genome
- Allows for direct genomic comparisons with multiple samples
to determine genes that exhibit differential expression in
different state (i.e. normal vs. tumor)
- Allows for hypothesis generation on molecular abnormalities
and mechanisms that may contribute to the tumor phenotype
- Provides information on molecular subtypes, the development
of prognostic and predictive molecular signatures
- Two main technologies:
a. Microarray
b. RNA-Sequencing (RNASeq) using next generation
sequencing
Affymetrix
GeneChip
scanner
Blencowe B J et al. Genes Dev. 2009;23:1379-1386
Illumina HiSeq
.bcl files
CASAVA processing
•Demultiplexing
•Fastq file generation
•Sequencing filtering
Raw files containing base calls
and quality scores
Illumina defined
quality filters
Split into Project and Sample Folders
Jones_Lab
ChIP_A ChIP-B
Marcus_Lab
RNA-SeqA RNA-SeqB RNA-SeqC
Williams_Lab
Exome1 Exome2
Fastq Files Fastq Files Fastq Files
Haas & Zody. Nature Biotechnology 28, 421–423 (2010)
Using known
annotations
And compare to
known annotations
•Differential Expression
•Differential Isoform Abundance
•RNA editing
•SNP, indel detection
Technology RNASeq Microarray
High run-to run reproducibility Yes Yes
Dynamic Range Comparable to
actual transcript abundance
>8000-fold
Hundred
fold
Able to detect alternative splice site
and novel isoforms
Yes No
De novo analysis of samples without
reference genome
Yes No
Multiplexing Samples in one run Yes No
Required amount of total RNA >100 ng ~1 ug
Re-analyzable data Yes No
Technology RNASeq Microarray
Heterogeneity of read coverage
across an expressed region
Yes No
Well understood sources of
experimental bias
No Yes
Data portable on a flush drive (~4G) No Yes
Data is analyzable by any PC No Yes
Cheaper cost per sample No(?) Yes(?)
RNA-Seq Experiment
GEO Database
White paper, Illumina
White paper, Illumina
Comparing Expression Profiles from Microarrays to RNASeq
n=7532 n=4537
Mooney M, PloSOne (2013)
10 Lymphoma (3T-cell, 7 B-cell)
4 Normal lymph node
Total RNA
PE100 run
50-100 million
mapped reads
Compare 15,092 annotated genes on chip
Mooney M, PloSOne (2013)
T
NB
r=0.6; p<10-15
c. elegans
Biological Replicates for
L2 andYA stages
AffyTilingArrays* Illumina RNASeq
Agarwal, BMC Genomics (2010)
* Covers whole c.elegans genome
Differential Expression genes between the L2 andYA stage
Agarwal, BMC Genomics (2010)
RNA-Seq and tiling arrays
Tiling Array
Microarray
Maximum
Sensitivity
RNASeq 11-plex
RNASeq 6-plex
Agarwal, BMC Genomics (2010)
Per Sample Microarray Illumina HiSeq
1 per Chip/Lane $670 $4,010.00
2 plex NA $2,097.50
4-plex NA $1,141.25
6-plex NA $822.50
8-plex NA $663.13
6-plex
11-plex
Per Sample Microarray Illumina HiSeq
1 per Chip/Lane $670 $4,010.00
2 plex NA $2,097.50
4-plex NA $1,141.25
6-plex NA $822.50
8-plex NA $663.13
Data Per Sample
Time to
download 1
Sample
Time to download
100 samples
Cost to Store on the
Cloud per Month
RNASeq 30-65GB 1 Hr 6 days $270
Microarray 30MB 5 second 8 minutes $0.30
http://www.ncbi.nlm.nih.gov/genbank/statistics
-Application withUser Interface RNA-Seq analysis (i.e. Galaxy) can only
handle very few samples
-Knowledge of Linux server, scripting language, programming language is
absolutely REQUIRED
-Lack of detailed understanding in NGS technology and data leads to
diverse bioinformatics tools with different characteristics
LawWC ,Voom!, Bionconductor (2013)
The answer isYes
- Transcriptome profiles generated by microarray and RNASeq
are in strongly concordance
- Microarray data generated in the last decades is durable
- RNASeq is it offers more a lot more biological information
than microarray that is re-analyzable
- NGS is getting cheaper
However, the devil is in the data
- NGS data is a lot more expensive to store and analyze
- Specialized computing infrastructure and personnel are
required to take advantage of the information from NGS data

Más contenido relacionado

La actualidad más candente

MASSIVELY PARELLEL SIGNATURE SEQUENCING
MASSIVELY PARELLEL SIGNATURE SEQUENCINGMASSIVELY PARELLEL SIGNATURE SEQUENCING
MASSIVELY PARELLEL SIGNATURE SEQUENCINGAashish Patel
 
Small Molecule Real Time Sequencing
Small Molecule Real Time SequencingSmall Molecule Real Time Sequencing
Small Molecule Real Time SequencingUSD Bioinformatics
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput SequencingMark Pallen
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAGRF_Ltd
 
Microarray (DNA and SNP microarray)
Microarray (DNA and SNP microarray)Microarray (DNA and SNP microarray)
Microarray (DNA and SNP microarray)Hamza Khan
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods Mrinal Vashisth
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingUzma Jabeen
 
PacBio SMRT - THIRD GENERATION SEQUENCING TECHNIQUE
PacBio SMRT - THIRD GENERATION SEQUENCING TECHNIQUEPacBio SMRT - THIRD GENERATION SEQUENCING TECHNIQUE
PacBio SMRT - THIRD GENERATION SEQUENCING TECHNIQUEMuunda Mudenda
 
Functional genomics, and tools
Functional genomics, and toolsFunctional genomics, and tools
Functional genomics, and toolsKAUSHAL SAHU
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomicsNikhil Aggarwal
 
New generation sequencing equipments
New generation sequencing equipmentsNew generation sequencing equipments
New generation sequencing equipmentsKalaivani P
 
Bioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisBioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisDespoina Kalfakakou
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsprateek kumar
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAthira RG
 
Single-cell RNA-seq tutorial
Single-cell RNA-seq tutorialSingle-cell RNA-seq tutorial
Single-cell RNA-seq tutorialAaron Diaz
 
Techniques in proteomics
Techniques in proteomicsTechniques in proteomics
Techniques in proteomicsN Poorin
 

La actualidad más candente (20)

MASSIVELY PARELLEL SIGNATURE SEQUENCING
MASSIVELY PARELLEL SIGNATURE SEQUENCINGMASSIVELY PARELLEL SIGNATURE SEQUENCING
MASSIVELY PARELLEL SIGNATURE SEQUENCING
 
Small Molecule Real Time Sequencing
Small Molecule Real Time SequencingSmall Molecule Real Time Sequencing
Small Molecule Real Time Sequencing
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput Sequencing
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Microarray (DNA and SNP microarray)
Microarray (DNA and SNP microarray)Microarray (DNA and SNP microarray)
Microarray (DNA and SNP microarray)
 
Gene expression profiling
Gene expression profilingGene expression profiling
Gene expression profiling
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
PacBio SMRT - THIRD GENERATION SEQUENCING TECHNIQUE
PacBio SMRT - THIRD GENERATION SEQUENCING TECHNIQUEPacBio SMRT - THIRD GENERATION SEQUENCING TECHNIQUE
PacBio SMRT - THIRD GENERATION SEQUENCING TECHNIQUE
 
ChIP-seq
ChIP-seqChIP-seq
ChIP-seq
 
Functional genomics, and tools
Functional genomics, and toolsFunctional genomics, and tools
Functional genomics, and tools
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
 
New generation sequencing equipments
New generation sequencing equipmentsNew generation sequencing equipments
New generation sequencing equipments
 
Bioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisBioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysis
 
Illumina Sequencing
Illumina SequencingIllumina Sequencing
Illumina Sequencing
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Single-cell RNA-seq tutorial
Single-cell RNA-seq tutorialSingle-cell RNA-seq tutorial
Single-cell RNA-seq tutorial
 
Techniques in proteomics
Techniques in proteomicsTechniques in proteomics
Techniques in proteomics
 

Similar a Comparison between RNASeq and Microarray for Gene Expression Analysis

FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochureAffymetrix
 
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Reid Robison
 
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...eventi-ITBbari
 
Whole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisWhole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisdrelamuruganvet
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGLong Pei
 
Microarrays;application
Microarrays;applicationMicroarrays;application
Microarrays;applicationFyzah Bashir
 
New Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overviewNew Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overviewPaolo Dametto
 
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsWhole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsThermo Fisher Scientific
 
Processing Amplicon Sequence Data for the Analysis of Microbial Communities
Processing Amplicon Sequence Data for the Analysis of Microbial CommunitiesProcessing Amplicon Sequence Data for the Analysis of Microbial Communities
Processing Amplicon Sequence Data for the Analysis of Microbial CommunitiesMartin Hartmann
 
Analytical performance of a novel next generation sequencing assay for Myeloi...
Analytical performance of a novel next generation sequencing assay for Myeloi...Analytical performance of a novel next generation sequencing assay for Myeloi...
Analytical performance of a novel next generation sequencing assay for Myeloi...Thermo Fisher Scientific
 
Unilag workshop complex genome analysis
Unilag workshop   complex genome analysisUnilag workshop   complex genome analysis
Unilag workshop complex genome analysisDr. Olusoji Adewumi
 
GMI proficiency testing- Progress report 2016
GMI proficiency testing- Progress report 2016GMI proficiency testing- Progress report 2016
GMI proficiency testing- Progress report 2016ExternalEvents
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNAUlises Urzua
 
140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposalGenomeInABottle
 
Wellstein poster embl meeting nov 2018
Wellstein poster embl meeting nov 2018Wellstein poster embl meeting nov 2018
Wellstein poster embl meeting nov 2018Anne Deslattes Mays
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...VHIR Vall d’Hebron Institut de Recerca
 
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSPROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSLubna MRL
 
Evolinc: Evolutionary analysis of LincRNA
Evolinc: Evolutionary analysis of LincRNAEvolinc: Evolutionary analysis of LincRNA
Evolinc: Evolutionary analysis of LincRNAUpendra Kumar Devisetty
 
Alvis Brazma, Array Express Gene Expression Atlas, fged_seattle_2013
Alvis Brazma, Array Express Gene Expression Atlas, fged_seattle_2013Alvis Brazma, Array Express Gene Expression Atlas, fged_seattle_2013
Alvis Brazma, Array Express Gene Expression Atlas, fged_seattle_2013Functional Genomics Data Society
 

Similar a Comparison between RNASeq and Microarray for Gene Expression Analysis (20)

FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochure
 
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
 
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
 
Whole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysisWhole genome sequencing of bacteria & analysis
Whole genome sequencing of bacteria & analysis
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEG
 
Microarrays;application
Microarrays;applicationMicroarrays;application
Microarrays;application
 
New Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overviewNew Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overview
 
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsWhole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
 
Dna microarray mehran- u of toronto
Dna microarray  mehran- u of torontoDna microarray  mehran- u of toronto
Dna microarray mehran- u of toronto
 
Processing Amplicon Sequence Data for the Analysis of Microbial Communities
Processing Amplicon Sequence Data for the Analysis of Microbial CommunitiesProcessing Amplicon Sequence Data for the Analysis of Microbial Communities
Processing Amplicon Sequence Data for the Analysis of Microbial Communities
 
Analytical performance of a novel next generation sequencing assay for Myeloi...
Analytical performance of a novel next generation sequencing assay for Myeloi...Analytical performance of a novel next generation sequencing assay for Myeloi...
Analytical performance of a novel next generation sequencing assay for Myeloi...
 
Unilag workshop complex genome analysis
Unilag workshop   complex genome analysisUnilag workshop   complex genome analysis
Unilag workshop complex genome analysis
 
GMI proficiency testing- Progress report 2016
GMI proficiency testing- Progress report 2016GMI proficiency testing- Progress report 2016
GMI proficiency testing- Progress report 2016
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
 
140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal
 
Wellstein poster embl meeting nov 2018
Wellstein poster embl meeting nov 2018Wellstein poster embl meeting nov 2018
Wellstein poster embl meeting nov 2018
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
 
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSPROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
 
Evolinc: Evolutionary analysis of LincRNA
Evolinc: Evolutionary analysis of LincRNAEvolinc: Evolutionary analysis of LincRNA
Evolinc: Evolutionary analysis of LincRNA
 
Alvis Brazma, Array Express Gene Expression Atlas, fged_seattle_2013
Alvis Brazma, Array Express Gene Expression Atlas, fged_seattle_2013Alvis Brazma, Array Express Gene Expression Atlas, fged_seattle_2013
Alvis Brazma, Array Express Gene Expression Atlas, fged_seattle_2013
 

Más de Yaoyu Wang

Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisYaoyu Wang
 
Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisYaoyu Wang
 
Request CCCB Services
Request CCCB ServicesRequest CCCB Services
Request CCCB ServicesYaoyu Wang
 
CCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformCCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformYaoyu Wang
 
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsBio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsYaoyu Wang
 
Rnaseq basics ngs_application1
Rnaseq basics ngs_application1Rnaseq basics ngs_application1
Rnaseq basics ngs_application1Yaoyu Wang
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment DesignYaoyu Wang
 

Más de Yaoyu Wang (7)

Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysis
 
Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysis
 
Request CCCB Services
Request CCCB ServicesRequest CCCB Services
Request CCCB Services
 
CCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformCCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud Platform
 
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsBio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
 
Rnaseq basics ngs_application1
Rnaseq basics ngs_application1Rnaseq basics ngs_application1
Rnaseq basics ngs_application1
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
 

Último

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Último (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Comparison between RNASeq and Microarray for Gene Expression Analysis

  • 1. Yaoyu E. Wang, Ph.D Center for Cancer Computational Biology, DFCI SPECSII webinar June 05, 2013
  • 2. - Transcriptome profiling represents a static gene expression state of a biological sample across the genome - Allows for direct genomic comparisons with multiple samples to determine genes that exhibit differential expression in different state (i.e. normal vs. tumor) - Allows for hypothesis generation on molecular abnormalities and mechanisms that may contribute to the tumor phenotype - Provides information on molecular subtypes, the development of prognostic and predictive molecular signatures - Two main technologies: a. Microarray b. RNA-Sequencing (RNASeq) using next generation sequencing
  • 4. Blencowe B J et al. Genes Dev. 2009;23:1379-1386 Illumina HiSeq
  • 5. .bcl files CASAVA processing •Demultiplexing •Fastq file generation •Sequencing filtering Raw files containing base calls and quality scores Illumina defined quality filters Split into Project and Sample Folders Jones_Lab ChIP_A ChIP-B Marcus_Lab RNA-SeqA RNA-SeqB RNA-SeqC Williams_Lab Exome1 Exome2 Fastq Files Fastq Files Fastq Files
  • 6. Haas & Zody. Nature Biotechnology 28, 421–423 (2010) Using known annotations And compare to known annotations •Differential Expression •Differential Isoform Abundance •RNA editing •SNP, indel detection
  • 7. Technology RNASeq Microarray High run-to run reproducibility Yes Yes Dynamic Range Comparable to actual transcript abundance >8000-fold Hundred fold Able to detect alternative splice site and novel isoforms Yes No De novo analysis of samples without reference genome Yes No Multiplexing Samples in one run Yes No Required amount of total RNA >100 ng ~1 ug Re-analyzable data Yes No
  • 8. Technology RNASeq Microarray Heterogeneity of read coverage across an expressed region Yes No Well understood sources of experimental bias No Yes Data portable on a flush drive (~4G) No Yes Data is analyzable by any PC No Yes Cheaper cost per sample No(?) Yes(?)
  • 12. Comparing Expression Profiles from Microarrays to RNASeq n=7532 n=4537
  • 13. Mooney M, PloSOne (2013) 10 Lymphoma (3T-cell, 7 B-cell) 4 Normal lymph node Total RNA PE100 run 50-100 million mapped reads Compare 15,092 annotated genes on chip
  • 14. Mooney M, PloSOne (2013) T NB r=0.6; p<10-15
  • 15. c. elegans Biological Replicates for L2 andYA stages AffyTilingArrays* Illumina RNASeq Agarwal, BMC Genomics (2010) * Covers whole c.elegans genome
  • 16. Differential Expression genes between the L2 andYA stage Agarwal, BMC Genomics (2010)
  • 17. RNA-Seq and tiling arrays Tiling Array Microarray Maximum Sensitivity RNASeq 11-plex RNASeq 6-plex Agarwal, BMC Genomics (2010)
  • 18.
  • 19. Per Sample Microarray Illumina HiSeq 1 per Chip/Lane $670 $4,010.00 2 plex NA $2,097.50 4-plex NA $1,141.25 6-plex NA $822.50 8-plex NA $663.13 6-plex 11-plex
  • 20. Per Sample Microarray Illumina HiSeq 1 per Chip/Lane $670 $4,010.00 2 plex NA $2,097.50 4-plex NA $1,141.25 6-plex NA $822.50 8-plex NA $663.13
  • 21.
  • 22. Data Per Sample Time to download 1 Sample Time to download 100 samples Cost to Store on the Cloud per Month RNASeq 30-65GB 1 Hr 6 days $270 Microarray 30MB 5 second 8 minutes $0.30 http://www.ncbi.nlm.nih.gov/genbank/statistics
  • 23. -Application withUser Interface RNA-Seq analysis (i.e. Galaxy) can only handle very few samples -Knowledge of Linux server, scripting language, programming language is absolutely REQUIRED -Lack of detailed understanding in NGS technology and data leads to diverse bioinformatics tools with different characteristics LawWC ,Voom!, Bionconductor (2013)
  • 24.
  • 25. The answer isYes - Transcriptome profiles generated by microarray and RNASeq are in strongly concordance - Microarray data generated in the last decades is durable - RNASeq is it offers more a lot more biological information than microarray that is re-analyzable - NGS is getting cheaper However, the devil is in the data - NGS data is a lot more expensive to store and analyze - Specialized computing infrastructure and personnel are required to take advantage of the information from NGS data

Notas del editor

  1. The basic concept behind the use of GeneChip arrays for gene expression is simple: labeled cDNA or cRNA targets derived from the mRNA of an experimental sample are hybridized to nucleic acid probes attached to the solid support. By monitoring the amount of dye label associated with each DNA location, it is possible to infer the abundance of each mRNA species represented. For transcriptome profiling, the input is usually about 1ug total RNA that are poly-A selected to ensure only mature mRNA is being assayed.
  2. Poly(A)+ mRNA is purified, fragmented, and then converted to a cDNA library with 5′ and 3′ adapter sequences. Short sequence reads are generated from the cDNA library. Normally, reads are mapped to previously annotate known transcripts and a pile un-mapped reads are kept. Reads that map to novel expressed sequences, including alternative exons and corresponding splice junction sequences
  3. Two RNA sample types MAQC brain and universal human Reference RNA were processed using 5 technical replicates on both microarray and RNA-Seq. Once teh data is generated, the microarray data was processed using MAQC. For RNA-Seq, the sample cDNA libraries were prepared with Illumina protocol and sequenced to a depth of ~30 million mapped reads.
  4. This is the scatter plot of technical replicates of the samples analyzed by RNA-Seq and microarray. The false positive rates are comparable between the two methods, and both methods have extremely high correlation between replicates (R&gt;0.99). The plots demonstrate that RNA-Seq identifies more genes and spans a wider dynamic range compared to the microarray.
  5. Scatterplot of fold change per gene as measured by RNASeq and microarray. Genes identified as differentially expressed by both platform are plotted in red, genes identified by RNASeq in blue, microarray in yellow and neither ins green. While the correlation between the two platforms in identifying differentially expressed genes is really high, this figure clearly indicates that a discrepancy between the platforms in the ability to identify genes as differentially expressed. The gene subset segmentation reveal that RNA-Seq counts identified significantly more differentially expressed genes. However, microarray does detect gene expression differences. Further valudation from a subset of 1000 genes for which PCR data is available, RNASeq data shows higher concordance with PCR results than microarray.
  6. A study by Mooney et al, use a paired RNA sequencing (RNA-Seq)/microarray analysis of a set of 4 normal canine lymph nodes and 10 canine lymphoma fine needle aspirates to identify technical biases and variation between the technologies. We use a paired RNA sequencing (RNA-Seq)/microarray analysis of a set of four normal canine lymph nodes and ten canine lymphoma fine needle aspirates to identify technical biases and variation between the technologies and compare the 15,092 annotated genes on chip.
  7. Both RNA-Seq and microarray observations provide present detection calls for 15,092 genes in each of the 14 samples. Thepercent present detection calls provided by the two technologies agreed with high frequency (73%) and were statistically associated(Table 3; p,10215, odds ratio .40). Among genes probed by both methods, percent present detection frequencies of 69% and 44%were obtained by RNA-Seq and microarray, respectively. Among genes called present using microarray over 97% were detectedusing RNA-Seq.Variation among expression profiles obtained using RNASeq is similar to that obtained using microarray after removing contributions of the first surrogate variable [42]. Each letter denotes a sample from a dog having a normal (N), B-cell (B), or T-cell (T) diagnosis as in the legend, with subscript ‘m’ run on the microarray platform and subscript ‘r’ run onthe NGS platform. a) Principal component scores b) Hierarchical clustering
  8. Here, we compare these two platforms using a matched sample of poly(A)-enriched RNA isolated from thesecond larval stage of C. elegansto Young adult (YA)for all genes Each point represents a gene from the composite model. RNA-Seq expression levels per gene were measured using RPKM, and tiling array levels were measured using the mean intensity of probes falling within composite exons. The Spearman&apos;s coefficient is 0.90, indicating that the platforms correlate well on identical samples. The disproportionate number of genes in the upper left likely represents cross-hybridization.
  9. Differential expression of genes between the L2 and YA stages. (a) Correlation of log2(YA/L2) ratios between RNA-Seq and tiling arrays.. Black: not significantly differentially expressed between samples.Blue: significantly differentially expressed (q ≤ 0.01). The ratio of expression levels is well-correlated, but RNA-Seq has a larger dynamic range. (b) Venn diagram of genes called differentially expressed by each platform. There is significant overlap (8,976) between the two platforms, but more genes were called differentially expressed by RNA-Seq (14,201) than by tiling arrays (10,283), likely reflecting its greater dynamic range. A total of 4,326 genes were not called differentially expressed by either technology. 
  10. ROC curve analysis. Black: tiling array. Red: RNA-Seq with all 32 million reads. It is evident that the RNA-Seq substantially outperforms the tiling array with consistently higher sensitivity at lower FPR. Remaining curves are for RNA-Seq with only a subset of reads utilized. At an FPR = 0.05, just 4 million reads (blue) are required to attain the same sensitivity as two tiling array replicates.