Se ha denunciado esta presentación.
Se está descargando tu SlideShare. ×

GPU Computation and the Next Gen Cloud

Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Próximo SlideShare
Team 04 3 d open land use
Team 04 3 d open land use
Cargando en…3
×

Eche un vistazo a continuación

1 de 27 Anuncio

Más Contenido Relacionado

Presentaciones para usted (17)

Similares a GPU Computation and the Next Gen Cloud (20)

Anuncio

Más reciente (20)

GPU Computation and the Next Gen Cloud

  1. 1. GPU Computing and the Next Gen Cloud JAYPRASAD HEGDE jayprasad_hegde AT haasalum.berkeley.edu
  2. 2. Contents • GPU Computing • Applications • Usage and Cloud Offerings June 3, 2017 2
  3. 3. Where are we headed? • Plateau in performance improvements with Single Threaded Architecture Image: Courtesy Nvidia GTC 2017 Keynote June 3, 2017 3
  4. 4. Picking up the slack Image: Courtesy Nvidia GTC 2017 KeynoteJune 3, 2017 4
  5. 5. Key Trigger applications • Bitcoin Mining • AMD’s Radeon cards being used for mining on the cheap • Temporary shortage of graphics cards • Geoffrey Hinton, Univ of Toronto applying Neural Networks on an massive scale using GPUs --- Origin of Deep Learning • Limited adoption of NN until 2004. Invented in 1948 (Hebb, Rosenblatt). • Popular Machine Learning method even prior to 2004. Small scale – CPU based • Also, Kaggle winners in 2012 – NNs to run on GPUs using CUDA • http://blog.kaggle.com/2012/11/01/deep-learning-how-i-did-it-merck-1st-place-interview/ • https://arxiv.org/pdf/1702.07800.pdf (Origin of Deep Learning) June 3, 2017 5
  6. 6. Computation super-heavy applications June 3, 2017 6
  7. 7. Newer Super-duper computers or super-farms June 3, 2017 7
  8. 8. CPU: Designed for Low Latency June 3, 2017 8
  9. 9. GPU Architecture: Two Main Components June 3, 2017 9
  10. 10. GPU: Designed for Throughput June 3, 2017 10
  11. 11. June 3, 2017 11
  12. 12. June 3, 2017 12
  13. 13. June 3, 2017 13
  14. 14. June 3, 2017 14
  15. 15. June 3, 2017 15
  16. 16. Sense of proportions with a Kepler GK110 (2014) June 3, 2017 16 Each SMX has 2048 threads 15 x 2048 threads
  17. 17. Upcoming startups in Deep Learning June 3, 2017 17
  18. 18. Upcoming startups with Deep Learning Higher level platforms June 3, 2017 18
  19. 19. Current State of the Art – Commodity Hardware: Tesla V100 June 3, 2017 19 Image: Courtesy Nvidia Announcement in GTC 2017
  20. 20. Frameworks performance for Volta K80 is Kepler, P100 is Pascal and V100 is Volta June 3, 2017 20
  21. 21. For your home • 4 way SLI bridge • GTX 1080 Ti or Titan Xp • $10,000+ June 3, 2017 21
  22. 22. For a mini setup June 3, 2017 22 $ 69,000
  23. 23. For the Datacentre June 3, 2017 23 Food for thought: GTX 1080 in your Desktop will do 8-10 TFLOPS
  24. 24. June 3, 2017 24
  25. 25. Computation Frameworks, Cloud and GPU systems June 3, 2017 25
  26. 26. June 3, 2017 26
  27. 27. Thank you! • Image credits: Nvidia Keynote GTC 2017 and CSCSCH talk on CUDA by Paul Bessmer June 3, 2017 27

Notas del editor

  • More and more transistors running at a faster performance
  • ResNet – 7 seconds for every single super computer in the world – to process this network.
    One CPU only server will take 2 years to run through the server 1 time – Baidu Deep Speech 2

×