SlideShare una empresa de Scribd logo
1 de 39
Heading for full solution to Now Generation Informatics BGI-Shenzhen Sep 19, 2011
Nothing in biology makes sense except in light of evolutionTheodosius Dobzhansky “Tree” type of  thinking of Genomics They are different, they are also related
What is the scope of bioinformatics? Bioinformatics is to understand the tree of life. Bioinformatics will: Draw trees (basic information) Map information on trees (association/cause-effect) Show the trees (visualizations, databases, clouds)
Mission 1: Tree of Species A set of different genes (sequence) made different forms of life
Mission 1: Tree of Species Draw De novo genome assembly Multiple sequence mapping and alignment Phylogenic tree construction Map In-depth Annotation Comparative genomicss Show Genome browsers
Dinner    “taste good, sequence it!” Peking Duck cucumber Cabbage kung pao chicken Mapotoufu oyster
Factory Silk and silkworm Oil and castor bean “Useful, sequence it!” Cloth and cotton
Zoo “look cute, sequence it!” Panda Polar bear and Penguin Antelope
Misson 2: Tree of Individuals A set of different variations (sequence) made different individuals/cells of Human
 An Evolutionary perspective ,[object Object]
These oldest alleles are common in all populations worldwide.
Approximately 90% of the variability in allele frequencies is of this sort.From Mary-Claire King
International project to construct a next generation baseline data set for human genetics Sequence level HapMap, an order of magnitude deeper  Consortium with multiple centres, platforms, funders Aims Find >95% accessible SNPs at allele frequencies above 1%, down towards 0.1% in coding regions Genotype them and place on haplotype backgrounds Also discover and characterize indels, structural variants
An Evolutionary perspective ,[object Object]
Somatic/LCL substitution rate = 7-12x higher than germline rate
Male mutation rate ~7x higher than female mutation rateFrom 1000G Project From Mary-Claire King Development of agriculture in the past 10,000 years and of urbanization and industrialization in the past 700 years has led to rapid populations growth and therefore to the appearance of vast numbers of new alleles, each individually rare and specific to one population or even to one family.
What’s the whole picture of genetic variants ? Billion Genomes  Project Personal genomics with  phenotype information Allele Frequency 50%  5% 0.5% 0.05% Rarer Alleles Stronger Effects Common Alleles Less Effects Very Rare Alleles Strongest Effects Eg:    CFTR delta 508 PCSK9 C679X Eg: MC4R, ABCA1 1q21.1 in SCZ Common/rare Disease Mendelian Disease
If selection goes another direction…lesson from the domesticated animal/plant The history of silkworm domestication D Domesticated W wild  Silkworm domestication history Silkworm phylogenetic tree ,[object Object]
  domestication event lead to a 90% reduction in effective population size during the initial bottleneckPublished in Science 16 Oct.
from Andersson and Georges, Nature Reviews of Genetic5: 202-212 (2004) selective sweep: inheritance of regions around adaptive alleles extent of selective sweep for domestication in MAIZE: tb1 locus (60 to 90-kb) (Clark et al. 2004), Y1 locus (about 600-kb) (Palaisa et al. 2004)
Domestication Genome variation during silkworm domestication 354 candidate domesticated genes 159 tissue-specific expressed (silk gland, midgut, testis) Published in Science 16 Oct.
50 Tibetan’s and 40 Han’s exomes has been sequenced Function further validated in ,[object Object]
Expression level difference in placentaEPAS1: endothelial Per-Arnt-Sim (PAS) domain protein 1 The signal of selection The gene (EPAS1) showing strongest selection signal (up to 80% frequency change in allele distribution), Han: 9%; Tibetan: 87%
Your Micro-Environment, Your other genome?
PCA analysis for 85 Danish samples (based on gene profiling) BMI data Gene level
Misson 2. Tree of Individuals Draw (Complete spectrum of) variation identification Population frequencies and spectrums Map Selection and evolution Phenotypic traits Intermediate phenotypes
Misson 3: Tree of Cells Cell lineages are characterized by single biological levels and their inter-correlations.
On DNA Differentiate the cancer and normal cells by PCA analysis ET AML + : cancer *:  normal *:cells possibly mixed  (from tumor, but clustered to normal cells) these cancers are really heterogeneous. BTCC
Phylogenetic trees clearly show subpopulations in ET and AML cancers   ET AML Essential Thrombocythemia Acute Myeloid Leukemia
Inferring key genes in AML (a typical heterozygous cancer) Key Gene? Key Gene for sub-pop? Consensus Tree
Key genes for AML MLL ALK G1~G6: different subpopulations from AML cancer  MLL: myeloid/lymphoid or mixed-lineage leukemia, recurrent translocations in acute leukemias that may be characterized as acute myeloid leukemia (AML; MIM 601626), acute lymphoblastic leukemia (ALL), or mixed lineage (biphenotypic) leukemia (MLL).
LILRA1 G1~G6: different subpopulations from AML cancer  LILRA1: leukocyte immunoglobulin-like receptor  Inferring key genes in AML (a typical heterozygous cancer)
CTNNA1 G1~G6: different subpopulations from AML cancer  CTNNA1:Leukocyte transendothelial migration; Pathways in cancer  Inferring key genes in AML (a typical heterozygous cancer)
CTSS G1~G6: different subpopulations from AML cancer  CTSS: cathepsin Inferring key genes in AML (a typical heterozygous cancer)
PPP2R1A G1~G6: different subpopulations from AML cancer  PPP2R1A: TGF-beta signaling pathway  Inferring key genes in AML (a typical heterozygous cancer)
DIAPH1 G1~G6: different subpopulations from AML cancer  DIAPH1: Focal adhesion; Regulation of actin cytoskeleton  Inferring key genes in AML (a typical heterozygous cancer)
LILRA1 G1~G6: different subpopulations from AML cancer  LILRA1: leukocyte immunoglobulin-like receptor  Inferring key genes in AML (a typical heterozygous cancer)
3. Tree of cells Draw Single-cell information acquisition technologies Map Single-cell metrics measurement technologies

Más contenido relacionado

La actualidad más candente

The opportunity of stem cell to treat diabetes and cancer
The opportunity of stem cell to treat diabetes and cancerThe opportunity of stem cell to treat diabetes and cancer
The opportunity of stem cell to treat diabetes and cancerResearchsio
 
2000 weinberg. hallsmarcks of cancer new generation
2000 weinberg. hallsmarcks of cancer new generation2000 weinberg. hallsmarcks of cancer new generation
2000 weinberg. hallsmarcks of cancer new generationNatalia Muñoz Roa
 
Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentationEmmanuel Aguon
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsprateek kumar
 
R. Redfield's SMBE talk slides
R. Redfield's SMBE talk slidesR. Redfield's SMBE talk slides
R. Redfield's SMBE talk slidesRosie Redfield
 
Genetic polymorphism
Genetic polymorphismGenetic polymorphism
Genetic polymorphismmanorama12
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAthira RG
 
Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Leighton Pritchard
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicskiran singh
 
The two hit hypothesis, Dr BÙI ĐắC CHÍ
The two hit hypothesis, Dr BÙI ĐắC CHÍThe two hit hypothesis, Dr BÙI ĐắC CHÍ
The two hit hypothesis, Dr BÙI ĐắC CHÍhungnguyenthien
 
Alfred knudson and the two hit hypothesis
Alfred knudson and the two hit hypothesisAlfred knudson and the two hit hypothesis
Alfred knudson and the two hit hypothesisWEI LIN
 
Patologia bilogia del cancer
Patologia bilogia del cancerPatologia bilogia del cancer
Patologia bilogia del cancerzulieth
 
BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS
 
ABSTRACT (MSc Thesis)
ABSTRACT (MSc Thesis)ABSTRACT (MSc Thesis)
ABSTRACT (MSc Thesis)Faith Galabe
 

La actualidad más candente (20)

The opportunity of stem cell to treat diabetes and cancer
The opportunity of stem cell to treat diabetes and cancerThe opportunity of stem cell to treat diabetes and cancer
The opportunity of stem cell to treat diabetes and cancer
 
2000 weinberg. hallsmarcks of cancer new generation
2000 weinberg. hallsmarcks of cancer new generation2000 weinberg. hallsmarcks of cancer new generation
2000 weinberg. hallsmarcks of cancer new generation
 
Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentation
 
Genetic polymorphism
Genetic polymorphismGenetic polymorphism
Genetic polymorphism
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
R. Redfield's SMBE talk slides
R. Redfield's SMBE talk slidesR. Redfield's SMBE talk slides
R. Redfield's SMBE talk slides
 
Gene therapy
Gene therapyGene therapy
Gene therapy
 
Genetic polymorphism
Genetic polymorphismGenetic polymorphism
Genetic polymorphism
 
Genetics varaiation.1
Genetics varaiation.1Genetics varaiation.1
Genetics varaiation.1
 
Viral oncogenesis
Viral oncogenesisViral oncogenesis
Viral oncogenesis
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1
 
Scanned abstract from the proceedings
Scanned abstract from the proceedingsScanned abstract from the proceedings
Scanned abstract from the proceedings
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
The two hit hypothesis, Dr BÙI ĐắC CHÍ
The two hit hypothesis, Dr BÙI ĐắC CHÍThe two hit hypothesis, Dr BÙI ĐắC CHÍ
The two hit hypothesis, Dr BÙI ĐắC CHÍ
 
Alfred knudson and the two hit hypothesis
Alfred knudson and the two hit hypothesisAlfred knudson and the two hit hypothesis
Alfred knudson and the two hit hypothesis
 
Patologia bilogia del cancer
Patologia bilogia del cancerPatologia bilogia del cancer
Patologia bilogia del cancer
 
poster sandy final
poster sandy finalposter sandy final
poster sandy final
 
BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomics
 
ABSTRACT (MSc Thesis)
ABSTRACT (MSc Thesis)ABSTRACT (MSc Thesis)
ABSTRACT (MSc Thesis)
 

Similar a Yingrui Li: Complete Solutions for Now-Generation Bioinformatics

Genetics In Psychiatry
Genetics In PsychiatryGenetics In Psychiatry
Genetics In PsychiatryFrank Meissner
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationJoaquin Dopazo
 
DNA Methylation & C Value.pdf
DNA Methylation & C Value.pdfDNA Methylation & C Value.pdf
DNA Methylation & C Value.pdfsoniaangeline
 
Genotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataGenotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataFAO
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicJoaquin Dopazo
 
thesis_final dhwani.docx
thesis_final dhwani.docxthesis_final dhwani.docx
thesis_final dhwani.docxssuser1e2788
 
Conferencia SBECH
Conferencia SBECHConferencia SBECH
Conferencia SBECHrodrimedel
 
Una revisión de los conocimientos fundamentales de la biología de la célula. ...
Una revisión de los conocimientos fundamentales de la biología de la célula. ...Una revisión de los conocimientos fundamentales de la biología de la célula. ...
Una revisión de los conocimientos fundamentales de la biología de la célula. ...Universidad Popular Carmen de Michelena
 
Mark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disordersMark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disorderswef
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxwrite4
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxwrite5
 
Data analytics challenges in genomics
Data analytics challenges in genomicsData analytics challenges in genomics
Data analytics challenges in genomicsmikaelhuss
 
Abraham B. Korol, Lecture presentation
Abraham B. Korol, Lecture presentationAbraham B. Korol, Lecture presentation
Abraham B. Korol, Lecture presentationMoshe Kenigshtein
 
How Can Ngs Forward Research Essay
How Can Ngs Forward Research EssayHow Can Ngs Forward Research Essay
How Can Ngs Forward Research EssayStefanie Yang
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingJoaquin Dopazo
 
POSTGENETICS MEDICINE
POSTGENETICS MEDICINEPOSTGENETICS MEDICINE
POSTGENETICS MEDICINEinemet
 
TLSC Biotech 101 Noc 2010 (Moore)
TLSC Biotech 101 Noc 2010 (Moore)TLSC Biotech 101 Noc 2010 (Moore)
TLSC Biotech 101 Noc 2010 (Moore)jmoore89
 

Similar a Yingrui Li: Complete Solutions for Now-Generation Bioinformatics (20)

Genetics In Psychiatry
Genetics In PsychiatryGenetics In Psychiatry
Genetics In Psychiatry
 
Genetics in Psychiatry
Genetics in PsychiatryGenetics in Psychiatry
Genetics in Psychiatry
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
 
DNA Methylation & C Value.pdf
DNA Methylation & C Value.pdfDNA Methylation & C Value.pdf
DNA Methylation & C Value.pdf
 
Genotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataGenotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary data
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The Clinic
 
thesis_final dhwani.docx
thesis_final dhwani.docxthesis_final dhwani.docx
thesis_final dhwani.docx
 
Conferencia SBECH
Conferencia SBECHConferencia SBECH
Conferencia SBECH
 
Family history
Family history Family history
Family history
 
Una revisión de los conocimientos fundamentales de la biología de la célula. ...
Una revisión de los conocimientos fundamentales de la biología de la célula. ...Una revisión de los conocimientos fundamentales de la biología de la célula. ...
Una revisión de los conocimientos fundamentales de la biología de la célula. ...
 
Mark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disordersMark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disorders
 
Bio
BioBio
Bio
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docx
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docx
 
Data analytics challenges in genomics
Data analytics challenges in genomicsData analytics challenges in genomics
Data analytics challenges in genomics
 
Abraham B. Korol, Lecture presentation
Abraham B. Korol, Lecture presentationAbraham B. Korol, Lecture presentation
Abraham B. Korol, Lecture presentation
 
How Can Ngs Forward Research Essay
How Can Ngs Forward Research EssayHow Can Ngs Forward Research Essay
How Can Ngs Forward Research Essay
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene finding
 
POSTGENETICS MEDICINE
POSTGENETICS MEDICINEPOSTGENETICS MEDICINE
POSTGENETICS MEDICINE
 
TLSC Biotech 101 Noc 2010 (Moore)
TLSC Biotech 101 Noc 2010 (Moore)TLSC Biotech 101 Noc 2010 (Moore)
TLSC Biotech 101 Noc 2010 (Moore)
 

Más de GigaScience, BGI Hong Kong

IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...GigaScience, BGI Hong Kong
 
Scott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByteScott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByteGigaScience, BGI Hong Kong
 
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...GigaScience, BGI Hong Kong
 
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...GigaScience, BGI Hong Kong
 
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...GigaScience, BGI Hong Kong
 
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...GigaScience, BGI Hong Kong
 
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...GigaScience, BGI Hong Kong
 
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...GigaScience, BGI Hong Kong
 
Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...GigaScience, BGI Hong Kong
 
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU GuixRicardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU GuixGigaScience, BGI Hong Kong
 
Anil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browserAnil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browserGigaScience, BGI Hong Kong
 
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...GigaScience, BGI Hong Kong
 
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant scienceVenice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant scienceGigaScience, BGI Hong Kong
 
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...GigaScience, BGI Hong Kong
 
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...GigaScience, BGI Hong Kong
 
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveChris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveGigaScience, BGI Hong Kong
 
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...GigaScience, BGI Hong Kong
 
Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...GigaScience, BGI Hong Kong
 
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...GigaScience, BGI Hong Kong
 

Más de GigaScience, BGI Hong Kong (20)

IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...
 
Scott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByteScott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByte
 
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
 
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
 
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
 
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
 
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
 
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
 
Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...
 
Hong Kong Open Access & GigaScience: CCHK@10
Hong Kong Open Access & GigaScience: CCHK@10Hong Kong Open Access & GigaScience: CCHK@10
Hong Kong Open Access & GigaScience: CCHK@10
 
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU GuixRicardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
 
Anil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browserAnil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browser
 
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
 
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant scienceVenice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
 
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
 
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
 
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveChris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
 
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
 
Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...
 
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
 

Último

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 

Último (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

Yingrui Li: Complete Solutions for Now-Generation Bioinformatics

  • 1. Heading for full solution to Now Generation Informatics BGI-Shenzhen Sep 19, 2011
  • 2. Nothing in biology makes sense except in light of evolutionTheodosius Dobzhansky “Tree” type of thinking of Genomics They are different, they are also related
  • 3. What is the scope of bioinformatics? Bioinformatics is to understand the tree of life. Bioinformatics will: Draw trees (basic information) Map information on trees (association/cause-effect) Show the trees (visualizations, databases, clouds)
  • 4. Mission 1: Tree of Species A set of different genes (sequence) made different forms of life
  • 5. Mission 1: Tree of Species Draw De novo genome assembly Multiple sequence mapping and alignment Phylogenic tree construction Map In-depth Annotation Comparative genomicss Show Genome browsers
  • 6. Dinner “taste good, sequence it!” Peking Duck cucumber Cabbage kung pao chicken Mapotoufu oyster
  • 7. Factory Silk and silkworm Oil and castor bean “Useful, sequence it!” Cloth and cotton
  • 8. Zoo “look cute, sequence it!” Panda Polar bear and Penguin Antelope
  • 9. Misson 2: Tree of Individuals A set of different variations (sequence) made different individuals/cells of Human
  • 10.
  • 11. These oldest alleles are common in all populations worldwide.
  • 12. Approximately 90% of the variability in allele frequencies is of this sort.From Mary-Claire King
  • 13. International project to construct a next generation baseline data set for human genetics Sequence level HapMap, an order of magnitude deeper Consortium with multiple centres, platforms, funders Aims Find >95% accessible SNPs at allele frequencies above 1%, down towards 0.1% in coding regions Genotype them and place on haplotype backgrounds Also discover and characterize indels, structural variants
  • 14.
  • 15. Somatic/LCL substitution rate = 7-12x higher than germline rate
  • 16. Male mutation rate ~7x higher than female mutation rateFrom 1000G Project From Mary-Claire King Development of agriculture in the past 10,000 years and of urbanization and industrialization in the past 700 years has led to rapid populations growth and therefore to the appearance of vast numbers of new alleles, each individually rare and specific to one population or even to one family.
  • 17. What’s the whole picture of genetic variants ? Billion Genomes Project Personal genomics with phenotype information Allele Frequency 50% 5% 0.5% 0.05% Rarer Alleles Stronger Effects Common Alleles Less Effects Very Rare Alleles Strongest Effects Eg: CFTR delta 508 PCSK9 C679X Eg: MC4R, ABCA1 1q21.1 in SCZ Common/rare Disease Mendelian Disease
  • 18.
  • 19. domestication event lead to a 90% reduction in effective population size during the initial bottleneckPublished in Science 16 Oct.
  • 20. from Andersson and Georges, Nature Reviews of Genetic5: 202-212 (2004) selective sweep: inheritance of regions around adaptive alleles extent of selective sweep for domestication in MAIZE: tb1 locus (60 to 90-kb) (Clark et al. 2004), Y1 locus (about 600-kb) (Palaisa et al. 2004)
  • 21. Domestication Genome variation during silkworm domestication 354 candidate domesticated genes 159 tissue-specific expressed (silk gland, midgut, testis) Published in Science 16 Oct.
  • 22.
  • 23. Expression level difference in placentaEPAS1: endothelial Per-Arnt-Sim (PAS) domain protein 1 The signal of selection The gene (EPAS1) showing strongest selection signal (up to 80% frequency change in allele distribution), Han: 9%; Tibetan: 87%
  • 25. PCA analysis for 85 Danish samples (based on gene profiling) BMI data Gene level
  • 26. Misson 2. Tree of Individuals Draw (Complete spectrum of) variation identification Population frequencies and spectrums Map Selection and evolution Phenotypic traits Intermediate phenotypes
  • 27. Misson 3: Tree of Cells Cell lineages are characterized by single biological levels and their inter-correlations.
  • 28. On DNA Differentiate the cancer and normal cells by PCA analysis ET AML + : cancer *: normal *:cells possibly mixed (from tumor, but clustered to normal cells) these cancers are really heterogeneous. BTCC
  • 29. Phylogenetic trees clearly show subpopulations in ET and AML cancers ET AML Essential Thrombocythemia Acute Myeloid Leukemia
  • 30. Inferring key genes in AML (a typical heterozygous cancer) Key Gene? Key Gene for sub-pop? Consensus Tree
  • 31. Key genes for AML MLL ALK G1~G6: different subpopulations from AML cancer MLL: myeloid/lymphoid or mixed-lineage leukemia, recurrent translocations in acute leukemias that may be characterized as acute myeloid leukemia (AML; MIM 601626), acute lymphoblastic leukemia (ALL), or mixed lineage (biphenotypic) leukemia (MLL).
  • 32. LILRA1 G1~G6: different subpopulations from AML cancer LILRA1: leukocyte immunoglobulin-like receptor Inferring key genes in AML (a typical heterozygous cancer)
  • 33. CTNNA1 G1~G6: different subpopulations from AML cancer CTNNA1:Leukocyte transendothelial migration; Pathways in cancer Inferring key genes in AML (a typical heterozygous cancer)
  • 34. CTSS G1~G6: different subpopulations from AML cancer CTSS: cathepsin Inferring key genes in AML (a typical heterozygous cancer)
  • 35. PPP2R1A G1~G6: different subpopulations from AML cancer PPP2R1A: TGF-beta signaling pathway Inferring key genes in AML (a typical heterozygous cancer)
  • 36. DIAPH1 G1~G6: different subpopulations from AML cancer DIAPH1: Focal adhesion; Regulation of actin cytoskeleton Inferring key genes in AML (a typical heterozygous cancer)
  • 37. LILRA1 G1~G6: different subpopulations from AML cancer LILRA1: leukocyte immunoglobulin-like receptor Inferring key genes in AML (a typical heterozygous cancer)
  • 38.
  • 39. 3. Tree of cells Draw Single-cell information acquisition technologies Map Single-cell metrics measurement technologies
  • 40. Integrating DNA variation, molecular traits, and phenotypes to construct causal gene networks Gene works in a network!
  • 41.
  • 42. Finally: Where are the papers? On what paper you draw and map and show? It is harder and harder to find a platform efficient enough Sample house High-throughput biology Capable computing system with high I/O performance Interlinked database and standardized formats Bioinformatics workflows to perform in silico analysis on data
  • 43. Making data PUBLIC! Does not mean making data downloadable in theory Does mean the public could make use of data New types of databases with operations to the data are required New academic credit system to motivate high-quality easy-to-access datasets. http://www.gigasciencejournal.com http://climb.genomics.cn
  • 44. Acknowledgements Great International Efforts The Genome 10K Consortium The 1000 Genomes Project Consortium The 1000 Plant Genomes Project Consortium The 5000 insects Project Consortium (pending) BGI Initiatives and collaboration framework The 1000 Plant and Animal Genomes Project The 10K Microbial Genomes Project http://ldl.genomics.org.cn
  • 45. Acknowledgements Prof. Rasmus Nielson’s lab in UC Berkeley and in University of Copenhagen Prof. Richard Durbin’s lab in Wellcome Trust Sanger Insititute Prof. Tak-Wah Lam and Siu-Ming Yiu’s lab in Department of Computer Sciences, Hong Kong University Dr. Heng Li in Broad Insititute …