SlideShare una empresa de Scribd logo
1 de 10
Biology

Chemistry
Informatics

Evaluation of metabolomic
sample processing methods using
hierarchical cluster analysis

Cluster Analysis

Goal:
Use hierarchical cluster analysis (HCA) to evaluate
data variance structure
Topics:
1. Evaluate sample and variable similarities
2. Identify the effect of data transformation,
distance and linkage methods on data
similarities
Clustering data
Biology

Chemistry
Informatics

Cluster Analysis

Goal:

Use HCA to cluster samples (Use DATA: Pumpkin data 1.csv)

Visualize:
1. Sample (row) raw similarities as a heat map
2. Annotate heatmap with extraction and treatment type
3. Select cluster distance and linkage method to cluster the samples
4. Determine the effect of data transformations on the cluster structure
(view as a dendrogram)
Exercises:
1. What factor, extraction or treatment, has the greatest contribution to
the data variance structure?
2. Describe the effect of clustering raw data or sample correlations
Biology

Chemistry

Raw data matrix visualized as
a heatmap
samples

variables

Cluster Analysis

Informatics
Raw data matrix organized by HCA
Biology

Chemistry
Informatics

Cluster Analysis

•ACN:/IPA/water|fresh and
MeOH/CH3Cl/water|dried
display distinct patterns in
metabolites which are most
similar to each other
•Sample similarities are linked
to metabolite magnitudes
Biology

Chemistry

Clustering based on sample
correlations (spearman)

Informatics

Cluster Analysis

•100% MeOH/fresh is the
most dissimilar protocol from
all others
•ACN:/IPA/water and
MeOH/CH3Cl/water are most
similar to each other
•Sample similarities are
decoupled from metabolite
magnitudes
Clustering metabolites
Biology

Chemistry
Informatics

Goal 2: Use HCA to evaluate metabolite similarities

Cluster Analysis

Visualize:
1.Z-scaled and correlation based variable clustering
2.Use a dendrogram to extract variable clusters
3.Select two variables from the same cluster and visualize their
correlation
Exercise:
1.Do the clustered variables share biological functions?
2.Which type of correlation is most robust to outliers?
3.Are the correlations for the visualized variable independent of
extraction/treatment?
Z-scaled variable clusters
Biology

Chemistry

Cluster Analysis

Informatics
Correlation based variable clusters
Biology

Chemistry

Cluster Analysis

Informatics
Biology

Chemistry

Extraction of clusters of
correlated variables

Informatics

less similar

most similar cluster

Cluster Analysis

lowest common
branch height
more similar
Biology

Chemistry

Cluster Analysis

Informatics

Correlation among cluster
members (4)

Más contenido relacionado

La actualidad más candente

Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014Dmitry Grapov
 
Multivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysisMultivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysisDmitry Grapov
 
Metabolomic Data Analysis Case Studies
Metabolomic Data Analysis Case StudiesMetabolomic Data Analysis Case Studies
Metabolomic Data Analysis Case StudiesDmitry Grapov
 
General Concepts in QSAR for Using the QSAR Application Toolbox Part 3
General Concepts in QSAR for Using the QSAR Application Toolbox Part 3General Concepts in QSAR for Using the QSAR Application Toolbox Part 3
General Concepts in QSAR for Using the QSAR Application Toolbox Part 3International QSAR Foundation
 
Prote-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and VisualizationProte-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and VisualizationDmitry Grapov
 
Data Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological StudiesData Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological StudiesDmitry Grapov
 
Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Dmitry Grapov
 
3 data normalization (2014 lab tutorial)
3  data normalization (2014 lab tutorial)3  data normalization (2014 lab tutorial)
3 data normalization (2014 lab tutorial)Dmitry Grapov
 
Mapping to the Metabolomic Manifold
Mapping to the Metabolomic ManifoldMapping to the Metabolomic Manifold
Mapping to the Metabolomic ManifoldDmitry Grapov
 
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Dmitry Grapov
 
Advanced strategies for Metabolomics Data Analysis
Advanced strategies for Metabolomics Data AnalysisAdvanced strategies for Metabolomics Data Analysis
Advanced strategies for Metabolomics Data AnalysisDmitry Grapov
 
Strategies for Metabolomics Data Analysis
Strategies for Metabolomics Data AnalysisStrategies for Metabolomics Data Analysis
Strategies for Metabolomics Data AnalysisDmitry Grapov
 
Case Study: Overview of Metabolomic Data Normalization Strategies
Case Study: Overview of Metabolomic Data Normalization StrategiesCase Study: Overview of Metabolomic Data Normalization Strategies
Case Study: Overview of Metabolomic Data Normalization StrategiesDmitry Grapov
 
Metabolomic data analysis and visualization tools
Metabolomic data analysis and visualization toolsMetabolomic data analysis and visualization tools
Metabolomic data analysis and visualization toolsDmitry Grapov
 
Data analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination ModellingData analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination ModellingRichard Lewis
 
Automation of (Biological) Data Analysis and Report Generation
Automation of (Biological) Data Analysis and Report GenerationAutomation of (Biological) Data Analysis and Report Generation
Automation of (Biological) Data Analysis and Report GenerationDmitry Grapov
 
Omic Data Integration Strategies
Omic Data Integration StrategiesOmic Data Integration Strategies
Omic Data Integration StrategiesDmitry Grapov
 
Embase technology showcase mla v9
Embase technology showcase mla v9Embase technology showcase mla v9
Embase technology showcase mla v9Ann-Marie Roche
 

La actualidad más candente (20)

Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014Normalization of Large-Scale Metabolomic Studies 2014
Normalization of Large-Scale Metabolomic Studies 2014
 
Multivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysisMultivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysis
 
Metabolomic Data Analysis Case Studies
Metabolomic Data Analysis Case StudiesMetabolomic Data Analysis Case Studies
Metabolomic Data Analysis Case Studies
 
General Concepts in QSAR for Using the QSAR Application Toolbox Part 3
General Concepts in QSAR for Using the QSAR Application Toolbox Part 3General Concepts in QSAR for Using the QSAR Application Toolbox Part 3
General Concepts in QSAR for Using the QSAR Application Toolbox Part 3
 
7 network mapping i
7  network mapping i7  network mapping i
7 network mapping i
 
Prote-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and VisualizationProte-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and Visualization
 
Data Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological StudiesData Normalization Approaches for Large-scale Biological Studies
Data Normalization Approaches for Large-scale Biological Studies
 
Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Data analysis workflows part 2 2015
Data analysis workflows part 2 2015
 
3 data normalization (2014 lab tutorial)
3  data normalization (2014 lab tutorial)3  data normalization (2014 lab tutorial)
3 data normalization (2014 lab tutorial)
 
Mapping to the Metabolomic Manifold
Mapping to the Metabolomic ManifoldMapping to the Metabolomic Manifold
Mapping to the Metabolomic Manifold
 
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
 
Advanced strategies for Metabolomics Data Analysis
Advanced strategies for Metabolomics Data AnalysisAdvanced strategies for Metabolomics Data Analysis
Advanced strategies for Metabolomics Data Analysis
 
Strategies for Metabolomics Data Analysis
Strategies for Metabolomics Data AnalysisStrategies for Metabolomics Data Analysis
Strategies for Metabolomics Data Analysis
 
Case Study: Overview of Metabolomic Data Normalization Strategies
Case Study: Overview of Metabolomic Data Normalization StrategiesCase Study: Overview of Metabolomic Data Normalization Strategies
Case Study: Overview of Metabolomic Data Normalization Strategies
 
Metabolomic data analysis and visualization tools
Metabolomic data analysis and visualization toolsMetabolomic data analysis and visualization tools
Metabolomic data analysis and visualization tools
 
Data analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination ModellingData analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination Modelling
 
Automation of (Biological) Data Analysis and Report Generation
Automation of (Biological) Data Analysis and Report GenerationAutomation of (Biological) Data Analysis and Report Generation
Automation of (Biological) Data Analysis and Report Generation
 
Omic Data Integration Strategies
Omic Data Integration StrategiesOmic Data Integration Strategies
Omic Data Integration Strategies
 
Embase technology showcase mla v9
Embase technology showcase mla v9Embase technology showcase mla v9
Embase technology showcase mla v9
 
Data handling metabolomics
Data handling metabolomicsData handling metabolomics
Data handling metabolomics
 

Similar a 2 cluster analysis

ATTRIBUTE REDUCTION-BASED ENSEMBLE RULE CLASSIFIERS METHOD FOR DATASET CLASSI...
ATTRIBUTE REDUCTION-BASED ENSEMBLE RULE CLASSIFIERS METHOD FOR DATASET CLASSI...ATTRIBUTE REDUCTION-BASED ENSEMBLE RULE CLASSIFIERS METHOD FOR DATASET CLASSI...
ATTRIBUTE REDUCTION-BASED ENSEMBLE RULE CLASSIFIERS METHOD FOR DATASET CLASSI...csandit
 
Multivariate Analysis and Visualization of Proteomic Data
Multivariate Analysis and Visualization of Proteomic DataMultivariate Analysis and Visualization of Proteomic Data
Multivariate Analysis and Visualization of Proteomic DataUC Davis
 
New Feature Selection Model Based Ensemble Rule Classifiers Method for Datase...
New Feature Selection Model Based Ensemble Rule Classifiers Method for Datase...New Feature Selection Model Based Ensemble Rule Classifiers Method for Datase...
New Feature Selection Model Based Ensemble Rule Classifiers Method for Datase...ijaia
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
Metabolite Set Enrichment Analysis (ChemRICH)
Metabolite Set Enrichment Analysis (ChemRICH)Metabolite Set Enrichment Analysis (ChemRICH)
Metabolite Set Enrichment Analysis (ChemRICH)Dinesh Barupal
 
Analytical chemistry_Instrumentation_Introduction
Analytical chemistry_Instrumentation_IntroductionAnalytical chemistry_Instrumentation_Introduction
Analytical chemistry_Instrumentation_IntroductionBivek Timalsina
 
A Threshold Fuzzy Entropy Based Feature Selection: Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection:  Comparative StudyA Threshold Fuzzy Entropy Based Feature Selection:  Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection: Comparative StudyIJMER
 
An experimental study on hypothyroid using rotation forest
An experimental study on hypothyroid using rotation forestAn experimental study on hypothyroid using rotation forest
An experimental study on hypothyroid using rotation forestIJDKP
 
Advanced Strategies for Analysis of Metabolomic Data
Advanced Strategies for Analysis of Metabolomic DataAdvanced Strategies for Analysis of Metabolomic Data
Advanced Strategies for Analysis of Metabolomic DataDmitry Grapov
 
Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019Dinesh Barupal
 
Cluster spss week7
Cluster spss week7Cluster spss week7
Cluster spss week7Birat Sharma
 
Prediction Of Bioactivity From Chemical Structure
Prediction Of Bioactivity From Chemical StructurePrediction Of Bioactivity From Chemical Structure
Prediction Of Bioactivity From Chemical StructureJeremy Besnard
 
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)r-kor
 
JBEI Research Highlights - November 2018
JBEI Research Highlights - November 2018 JBEI Research Highlights - November 2018
JBEI Research Highlights - November 2018 Irina Silva
 
Chapter 1
Chapter 1Chapter 1
Chapter 1MEI MEI
 
Applying fuzzy ahp to evaluate the carbon foot print on the workplace in educ...
Applying fuzzy ahp to evaluate the carbon foot print on the workplace in educ...Applying fuzzy ahp to evaluate the carbon foot print on the workplace in educ...
Applying fuzzy ahp to evaluate the carbon foot print on the workplace in educ...eSAT Publishing House
 

Similar a 2 cluster analysis (20)

ATTRIBUTE REDUCTION-BASED ENSEMBLE RULE CLASSIFIERS METHOD FOR DATASET CLASSI...
ATTRIBUTE REDUCTION-BASED ENSEMBLE RULE CLASSIFIERS METHOD FOR DATASET CLASSI...ATTRIBUTE REDUCTION-BASED ENSEMBLE RULE CLASSIFIERS METHOD FOR DATASET CLASSI...
ATTRIBUTE REDUCTION-BASED ENSEMBLE RULE CLASSIFIERS METHOD FOR DATASET CLASSI...
 
Multivariate Analysis and Visualization of Proteomic Data
Multivariate Analysis and Visualization of Proteomic DataMultivariate Analysis and Visualization of Proteomic Data
Multivariate Analysis and Visualization of Proteomic Data
 
New Feature Selection Model Based Ensemble Rule Classifiers Method for Datase...
New Feature Selection Model Based Ensemble Rule Classifiers Method for Datase...New Feature Selection Model Based Ensemble Rule Classifiers Method for Datase...
New Feature Selection Model Based Ensemble Rule Classifiers Method for Datase...
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
Metabolite Set Enrichment Analysis (ChemRICH)
Metabolite Set Enrichment Analysis (ChemRICH)Metabolite Set Enrichment Analysis (ChemRICH)
Metabolite Set Enrichment Analysis (ChemRICH)
 
Analytical chemistry_Instrumentation_Introduction
Analytical chemistry_Instrumentation_IntroductionAnalytical chemistry_Instrumentation_Introduction
Analytical chemistry_Instrumentation_Introduction
 
A Threshold Fuzzy Entropy Based Feature Selection: Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection:  Comparative StudyA Threshold Fuzzy Entropy Based Feature Selection:  Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection: Comparative Study
 
An experimental study on hypothyroid using rotation forest
An experimental study on hypothyroid using rotation forestAn experimental study on hypothyroid using rotation forest
An experimental study on hypothyroid using rotation forest
 
Advanced Strategies for Analysis of Metabolomic Data
Advanced Strategies for Analysis of Metabolomic DataAdvanced Strategies for Analysis of Metabolomic Data
Advanced Strategies for Analysis of Metabolomic Data
 
Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019Metabolic Set Enrichment Analysis - chemrich - 2019
Metabolic Set Enrichment Analysis - chemrich - 2019
 
A new, automated retrosynthetic search engine: ARChem
A new, automated retrosynthetic search engine: ARChemA new, automated retrosynthetic search engine: ARChem
A new, automated retrosynthetic search engine: ARChem
 
Cluster spss week7
Cluster spss week7Cluster spss week7
Cluster spss week7
 
Prediction Of Bioactivity From Chemical Structure
Prediction Of Bioactivity From Chemical StructurePrediction Of Bioactivity From Chemical Structure
Prediction Of Bioactivity From Chemical Structure
 
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
 
JBEI Research Highlights - November 2018
JBEI Research Highlights - November 2018 JBEI Research Highlights - November 2018
JBEI Research Highlights - November 2018
 
Chapter 1
Chapter 1Chapter 1
Chapter 1
 
Note 2 writing a research
Note 2    writing a researchNote 2    writing a research
Note 2 writing a research
 
ERVA-NMR
ERVA-NMRERVA-NMR
ERVA-NMR
 
Applying fuzzy ahp to evaluate the carbon foot print on the workplace in educ...
Applying fuzzy ahp to evaluate the carbon foot print on the workplace in educ...Applying fuzzy ahp to evaluate the carbon foot print on the workplace in educ...
Applying fuzzy ahp to evaluate the carbon foot print on the workplace in educ...
 
OTTO-Report
OTTO-ReportOTTO-Report
OTTO-Report
 

Más de Dmitry Grapov

R programming for Data Science - A Beginner’s Guide
R programming for Data Science - A Beginner’s GuideR programming for Data Science - A Beginner’s Guide
R programming for Data Science - A Beginner’s GuideDmitry Grapov
 
Network mapping 101 course
Network mapping 101 courseNetwork mapping 101 course
Network mapping 101 courseDmitry Grapov
 
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...Dmitry Grapov
 
Dmitry Grapov Resume and CV
Dmitry Grapov Resume and CVDmitry Grapov Resume and CV
Dmitry Grapov Resume and CVDmitry Grapov
 
Machine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network AnalysisMachine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network AnalysisDmitry Grapov
 
Complex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine LearningComplex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine LearningDmitry Grapov
 
Data analysis workflows part 1 2015
Data analysis workflows part 1 2015Data analysis workflows part 1 2015
Data analysis workflows part 1 2015Dmitry Grapov
 
Gene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -TutorialGene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -TutorialDmitry Grapov
 
American Society of Mass Spectrommetry Conference 2014
American Society of Mass Spectrommetry Conference 2014American Society of Mass Spectrommetry Conference 2014
American Society of Mass Spectrommetry Conference 2014Dmitry Grapov
 

Más de Dmitry Grapov (10)

R programming for Data Science - A Beginner’s Guide
R programming for Data Science - A Beginner’s GuideR programming for Data Science - A Beginner’s Guide
R programming for Data Science - A Beginner’s Guide
 
Network mapping 101 course
Network mapping 101 courseNetwork mapping 101 course
Network mapping 101 course
 
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integratio...
 
Dmitry Grapov Resume and CV
Dmitry Grapov Resume and CVDmitry Grapov Resume and CV
Dmitry Grapov Resume and CV
 
Machine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network AnalysisMachine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network Analysis
 
Complex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine LearningComplex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine Learning
 
Data analysis workflows part 1 2015
Data analysis workflows part 1 2015Data analysis workflows part 1 2015
Data analysis workflows part 1 2015
 
Modeling poster
Modeling posterModeling poster
Modeling poster
 
Gene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -TutorialGene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -Tutorial
 
American Society of Mass Spectrommetry Conference 2014
American Society of Mass Spectrommetry Conference 2014American Society of Mass Spectrommetry Conference 2014
American Society of Mass Spectrommetry Conference 2014
 

2 cluster analysis

  • 1. Biology Chemistry Informatics Evaluation of metabolomic sample processing methods using hierarchical cluster analysis Cluster Analysis Goal: Use hierarchical cluster analysis (HCA) to evaluate data variance structure Topics: 1. Evaluate sample and variable similarities 2. Identify the effect of data transformation, distance and linkage methods on data similarities
  • 2. Clustering data Biology Chemistry Informatics Cluster Analysis Goal: Use HCA to cluster samples (Use DATA: Pumpkin data 1.csv) Visualize: 1. Sample (row) raw similarities as a heat map 2. Annotate heatmap with extraction and treatment type 3. Select cluster distance and linkage method to cluster the samples 4. Determine the effect of data transformations on the cluster structure (view as a dendrogram) Exercises: 1. What factor, extraction or treatment, has the greatest contribution to the data variance structure? 2. Describe the effect of clustering raw data or sample correlations
  • 3. Biology Chemistry Raw data matrix visualized as a heatmap samples variables Cluster Analysis Informatics
  • 4. Raw data matrix organized by HCA Biology Chemistry Informatics Cluster Analysis •ACN:/IPA/water|fresh and MeOH/CH3Cl/water|dried display distinct patterns in metabolites which are most similar to each other •Sample similarities are linked to metabolite magnitudes
  • 5. Biology Chemistry Clustering based on sample correlations (spearman) Informatics Cluster Analysis •100% MeOH/fresh is the most dissimilar protocol from all others •ACN:/IPA/water and MeOH/CH3Cl/water are most similar to each other •Sample similarities are decoupled from metabolite magnitudes
  • 6. Clustering metabolites Biology Chemistry Informatics Goal 2: Use HCA to evaluate metabolite similarities Cluster Analysis Visualize: 1.Z-scaled and correlation based variable clustering 2.Use a dendrogram to extract variable clusters 3.Select two variables from the same cluster and visualize their correlation Exercise: 1.Do the clustered variables share biological functions? 2.Which type of correlation is most robust to outliers? 3.Are the correlations for the visualized variable independent of extraction/treatment?
  • 8. Correlation based variable clusters Biology Chemistry Cluster Analysis Informatics
  • 9. Biology Chemistry Extraction of clusters of correlated variables Informatics less similar most similar cluster Cluster Analysis lowest common branch height more similar