SlideShare una empresa de Scribd logo
1 de 25
Assembly
Understanding the human reference genome




             Deanna M. Church
             Staff Scientist, NCBI
                26 Mar 2013
Valerie Schneider




http://genomereference.org
The Reference Assembly
     is NOT Static
      NCBI35 (hg17)
      NCBI36 (hg18)
      GRCh37 (hg19)
      GRCh37.p10
An assembly is a   MODEL of the genome
CD1E


chr1:g.158324425A>G
CD1E:c.317A>G
GeT-RM            http://www.ncbi.nlm.nih.gov/projects/variation/get-rm
NC_000012.11:g.22066016delA




                      Missed in ARUP Exome, but not covered by capture probes
Kidd et al, 2007 APOBEC cluster




BLACK: Deletion
White: Insertion
IHGSC, Nature 2004




    Clones




Clones
Build sequence contigs based on contigs
defined in TPF (Tiling Path File).
 Check for orientation consistencies
 Select switch points
 Instantiate sequence for further analysis
                 Switch point



                      Representative chromosome
                               sequence
RP11-34P13     64E8   RP4-669L17   RP5-857K21 RP11-206L10   RP11-54O7




             Gaps
NCBI35 (Assembly described in last HGP paper)       chrX:g.153054447G>A
chrX                                                TKTL1:c.31G>A



                                                                   TKTL1
                   CXorf2
153,019,779                 153,044,285              153,054,417           153,079,546

                                                  chrX:g.153533600G>A
                                                  TKTL1:c.135-74G>A
GRCh37 (current reference assembly)               TKTL1:c.-90G>A
chrX                                              TKTL1:c.135-56G>A




153,498,930             153,523,564 153,524,027                            153,558,713
Data tracking

ABC14-1065514J1
                Date       Gaps      Length

FP565796.1   21-Oct-2009    1

FP565796.2   14-Oct-2010    0

FP565796.3   07-Nov-2010    0
NCBI35 (Assembly described in last HGP paper)
chrX




                      chrX:g.153054447G>A
                      NC_000023.8:g.153054447G>A
GRCh37 (current reference assembly)
chrX




                      chrX:g.153533600G>A
                      NC_000023.10:g.153533600G>A
NM_012253.3
NM_001145933.1         TKTL1
NM_001145934.1


   NM_001145933.1:c.135-74G>A
   NM_00114594.1:c.-90G>A
   NM_012253.3:c.135-56G>A
GRCh37 (current reference assembly)
chrX




Preview of GRCh38 (scheduled Fall 2013)


       TEX28                                       TKTL1


        LOC101060233          LOC101060234
          (opsin related)        (TEX28 related)
http://genomereference.org
The human reference assembly is a COMPOSITE of many individuals

The human reference assembly is NOT static
Accession.versions are KEY to data management
When the reference assembly updates:
  Your favorite region may have the same SEQUENCE but different COORDINATES
  Your favorite region may CHANGE significantly




              We have the TOOLS to help!

                                  http://www.ncbi.nlm.nih.gov/variation

Más contenido relacionado

Más de Deanna Church

Imgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorialImgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorial
Deanna Church
 

Más de Deanna Church (17)

Church SFAF2014 keynote
Church SFAF2014 keynoteChurch SFAF2014 keynote
Church SFAF2014 keynote
 
Church_NCBIvariation2013
Church_NCBIvariation2013Church_NCBIvariation2013
Church_NCBIvariation2013
 
Church_GenomeAccess_2013_genome2013
Church_GenomeAccess_2013_genome2013Church_GenomeAccess_2013_genome2013
Church_GenomeAccess_2013_genome2013
 
Church iowa2013
Church iowa2013Church iowa2013
Church iowa2013
 
Church emory2013
Church emory2013Church emory2013
Church emory2013
 
Church GeT-RM
Church GeT-RMChurch GeT-RM
Church GeT-RM
 
Church sfaf13
Church sfaf13Church sfaf13
Church sfaf13
 
Church gia13
Church gia13Church gia13
Church gia13
 
Church apr2013
Church apr2013Church apr2013
Church apr2013
 
Church agbt13 merge
Church agbt13 mergeChurch agbt13 merge
Church agbt13 merge
 
Church clinical2012
Church clinical2012Church clinical2012
Church clinical2012
 
Church isca2012
Church isca2012Church isca2012
Church isca2012
 
Church nhgri 2012
Church nhgri 2012Church nhgri 2012
Church nhgri 2012
 
Church gmod2012 pt2
Church gmod2012 pt2Church gmod2012 pt2
Church gmod2012 pt2
 
Church gmod2012 pt1
Church gmod2012 pt1Church gmod2012 pt1
Church gmod2012 pt1
 
Imgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorialImgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorial
 
Church Fif2009
Church Fif2009Church Fif2009
Church Fif2009
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 

Church ngs

  • 1. Assembly Understanding the human reference genome Deanna M. Church Staff Scientist, NCBI 26 Mar 2013
  • 2.
  • 3.
  • 5. The Reference Assembly is NOT Static NCBI35 (hg17) NCBI36 (hg18) GRCh37 (hg19) GRCh37.p10
  • 6.
  • 7.
  • 8.
  • 9.
  • 10. An assembly is a MODEL of the genome
  • 12. GeT-RM http://www.ncbi.nlm.nih.gov/projects/variation/get-rm NC_000012.11:g.22066016delA Missed in ARUP Exome, but not covered by capture probes
  • 13.
  • 14.
  • 15. Kidd et al, 2007 APOBEC cluster BLACK: Deletion White: Insertion
  • 16. IHGSC, Nature 2004 Clones Clones
  • 17. Build sequence contigs based on contigs defined in TPF (Tiling Path File). Check for orientation consistencies Select switch points Instantiate sequence for further analysis Switch point Representative chromosome sequence
  • 18. RP11-34P13 64E8 RP4-669L17 RP5-857K21 RP11-206L10 RP11-54O7 Gaps
  • 19. NCBI35 (Assembly described in last HGP paper) chrX:g.153054447G>A chrX TKTL1:c.31G>A TKTL1 CXorf2 153,019,779 153,044,285 153,054,417 153,079,546 chrX:g.153533600G>A TKTL1:c.135-74G>A GRCh37 (current reference assembly) TKTL1:c.-90G>A chrX TKTL1:c.135-56G>A 153,498,930 153,523,564 153,524,027 153,558,713
  • 20. Data tracking ABC14-1065514J1 Date Gaps Length FP565796.1 21-Oct-2009 1 FP565796.2 14-Oct-2010 0 FP565796.3 07-Nov-2010 0
  • 21. NCBI35 (Assembly described in last HGP paper) chrX chrX:g.153054447G>A NC_000023.8:g.153054447G>A GRCh37 (current reference assembly) chrX chrX:g.153533600G>A NC_000023.10:g.153533600G>A
  • 22. NM_012253.3 NM_001145933.1 TKTL1 NM_001145934.1 NM_001145933.1:c.135-74G>A NM_00114594.1:c.-90G>A NM_012253.3:c.135-56G>A
  • 23. GRCh37 (current reference assembly) chrX Preview of GRCh38 (scheduled Fall 2013) TEX28 TKTL1 LOC101060233 LOC101060234 (opsin related) (TEX28 related)
  • 25. The human reference assembly is a COMPOSITE of many individuals The human reference assembly is NOT static Accession.versions are KEY to data management When the reference assembly updates: Your favorite region may have the same SEQUENCE but different COORDINATES Your favorite region may CHANGE significantly We have the TOOLS to help! http://www.ncbi.nlm.nih.gov/variation

Notas del editor

  1. Alignments refer to pairs of sequence. Once you know how a pair of sequences go together, you can look at stringing the pairs along into a contig. The contig is essentially the consensus sequence that is produced from the components.To create a contig, we use the steps shown on this slide.What are switch points? As you create the consensus sequence of the contig, the switch points tell you where to stop using the sequence from one component and begin using the sequence from the next.