Daniela Puiu in the GigaScience Prize Track at ICG12: The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum. #ICG12, 26th October 2017
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
Daniela Puiu at #ICG12: The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum
1. The first near-complete assembly of
the hexaploid bread wheat genome,
Tritricum aestivum
Daniela Puiu
Aleksey Zimin, Richard Hall, Sarah Kingan, Bernardo Clavijo, Steven Salzberg
ICG-12
Oct 27 2017
2. IGC-12The Wheat Genome 2
Sequencing and Assembly of the
Ancestral and Common Wheat
Aegilops tauschii ssp strangulata accession AL8/78
Chinese spring variety (CS42, accession Dv418)
2013-2017
4. IGC-12The Wheat Genome 4
The Wheat Genome
One of the most complex genomes !
1) Genome size: over 15 billion bases
2) Allohexapoild : six copies of each chromosome
3) >90% repeats
Multiple past attempts to assemble =>
assemblies shorter than the estimated genome size.
12. IGC-12The Wheat Genome 12
Run Time: 100 CPU years
Main
Steps
Run
Time
CPUhrs
Wall
Time
Months
MaSuRCA 100K 1.5
Celera WGS 470K 5
FALCON 150K 0.75
ARROW 160K 0.75
total 880K 9
100K CPU hrs=11.5 years
800K CPU hrs=100 years
15. IGC-12The Wheat Genome 15
Conclusions
The most challenging genome (we) assembled!
Learning experience!
Assembly quality vs computational resources?
Share your data!
The most challenging genome (we) assembled!
Learning experience!
Assembly quality vs computational resources?
Share your data!
16. IGC-12The Wheat Genome 16
Acknowledgements
Steven Salzberg
Aleksey ZImin
Johns Hopkins University UCDavis Plant Sciences
Jan Dvorak
Earlham Institute
Bernardo Clavijo
Mingcheng Luo