GPU Computing In Higher Education And Research


NVIDIA Tesla GPUs can accelerate computational research by delivering more performance per dollar and per watt than CPU-only systems. That means shorter simulation times, higher accuracy, and more research completed within the same budget. UCLA's Department of Physics and Astronomy gained 20% more performance at the same power cost by upgrading its cluster from Tesla M2070 to Tesla M2090 GPUs. By 2012, about 22,500 academic papers had been published on GPU computing and CUDA had been downloaded 1.5 million times, evidence of broad adoption for accelerating science applications.


Lift the Barriers of HPC

Faster / More Research: faster, more discovery, higher accuracy
Maximum Performance: more performance per dollar
Greater Budget & Power Efficiencies: more performance per watt

GPU Impact to Computational Research

More Research + Maximum Performance + Efficient Power

More Research: 88 ns/day, 6x faster JAC simulation time (23,558 atoms, DHFR, AMBER 11). Axel Kohlmeyer: Temple University
Maximum Performance: 318% higher performance for 54% added cost
Efficient Power: 2.5x flops per watt, Tianhe-1A (CPU + GPU) compared with Jaguar (CPU only). Tianhe-1A: #2 Top500; Jaguar: #3 Top500
CPU: dual-socket Intel Xeon X5670, 2.93 GHz (12 cores)

GPU Computing by Numbers

                    2008       2012
Universities        60         583
CUDA Downloads      150K       1.5M
Academic Papers     4,000      22,500
Supercomputers      1          52

UCLA
Department of Physics and Astronomy
Challenge
Accelerate Plasma Research with innovative Particle-in-Cell (PIC) Simulations
Overcome space and power constraints in data centers
Integrate into shared computing strategy across institutes and centers at UCLA

Solution
GPU cluster
96 server nodes
288 NVIDIA Tesla GPUs
Upgraded GPUs to NVIDIA Tesla M2090s (from M2070)
Impact
Upgrades resulted in 20% higher performance with same power cost
GPUs extended to new groups within department for greatly accelerated modeling
Meets higher performance requirements within limited space and power constraints
#235 on prestigious Top500 list with only 6 Racks

Add GPUs: Accelerate Science Applications

CPU GPU

207 GPU-Accelerated Applications
www.nvidia.com/appscatalog

3 Ways to Accelerate Applications

Libraries ("drop-in" acceleration): Thrust, BLAS, LAPACK, FFT, NPP, Sparse, Imaging, RNG
OpenACC Directives (easily accelerate applications): PGI Accelerator, CAPS HMPP, CRAY
Programming Languages (maximum flexibility): C, C++, Fortran, OpenCL, DirectCompute, Java, Python

GPU-Accelerated MATLAB Results

10x speedup in data clustering via K-means clustering algorithm
14x speedup in template matching routine (part of cancer cell image analysis)
3x speedup in estimating 7.6 million contract prices using Black-Scholes model
17x speedup in simulating the movement of 3072 celestial objects
4x speedup in adaptive filtering routine (part of acoustic tracking algorithm)
4x speedup in wave equation solving (part of seismic data processing algorithm)

AMBER 12 - Extreme Performance with K20

DHFR JAC, 23K atoms (NVE), running AMBER 12 GPU Support Revision 12.1, SPFP with CUDA 4.2.9, ECC off

[Bar chart: nanoseconds/day for DHFR on one node]
CPU node (blue; 2x Intel E5-2687W CPUs, 8 cores per CPU): 12.47 ns/day
CPU + GPU node (green; 2x Intel E5-2687W CPUs, 8 cores per CPU, plus 2x NVIDIA K20 GPUs): 95.59 ns/day

Gain > 7.5X throughput/performance by adding just 2 K20 GPUs when compared to dual CPU performance
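
The headline gain can be checked directly from the two bars in the chart. The short Python sketch below does that arithmetic. Its only inputs from the slide are the two ns/day figures; the variable names and the 1 microsecond target trajectory are illustrative assumptions.

```python
# Throughput reported for the DHFR JAC benchmark (ns/day, one node each).
cpu_node_ns_per_day = 12.47   # 2x Intel E5-2687W
gpu_node_ns_per_day = 95.59   # same CPUs plus 2x NVIDIA K20

# Speedup of the CPU+GPU node over the CPU-only node.
speedup = gpu_node_ns_per_day / cpu_node_ns_per_day

# Wall-clock days for a hypothetical 1 microsecond (1000 ns) trajectory.
target_ns = 1000.0  # assumed target, not from the slide
cpu_days = target_ns / cpu_node_ns_per_day
gpu_days = target_ns / gpu_node_ns_per_day

print(f"speedup: {speedup:.2f}x")
print(f"1 us trajectory: {cpu_days:.1f} days on CPUs, {gpu_days:.1f} days with GPUs")
```

The computed ratio is a little above 7.5, consistent with the "> 7.5X" claim on the slide.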

NAMD 2.9

Outstanding Strong Scaling with Multi-STMV: 100 STMV on Hundreds of Nodes

Running NAMD version 2.9. Benchmark: concatenation of 100 Satellite Tobacco Mosaic Virus (STMV) systems.
Each blue XE6 CPU node contains 1x AMD 1600 Opteron (16 cores per CPU).
Each green XK6 CPU+GPU node contains 1x AMD 1600 Opteron (16 cores per CPU) and an additional 1x NVIDIA X2090 GPU.

[Line chart: nanoseconds/day (0 to 1.2) against number of nodes (32, 64, 128, 256, 512, 640, 768) for Fermi XK6 and CPU XK6; annotated speedups of 3.8x, 3.6x, 2.9x, and 2.7x]

Accelerate your science by 2.7-3.8x when compared to CPU-based supercomputers

Try NVIDIA GPUs

Available applications: Applications Catalog, www.nvidia.com/appscatalog
Quick application acceleration: OpenACC Directives, www.nvidia.com/gpudirectives
Easy & free GPU test drive: GPU Test Drive Cluster, www.nvidia.com/gputestdrive



Editor's notes

Welcome. Today I am excited to show you how NVIDIA Tesla GPU solutions are having a profound impact on science by breaking new barriers in computing performance. Researchers all over the world have embraced computing as the third pillar of science. Now, with Tesla GPU computing, explosive performance gains are allowing academic researchers to discover new theories, build more robust models, and publish more papers. I will share highlights of academic institutions and researchers achieving their goals of faster, better science within academic budget constraints.
With the growing need to use computing to reach new frontiers in science and research, we quickly identified the barriers to meeting that need. First, we need to enable researchers and scientists to do faster and more discovery with higher accuracy. We also need to do that with maximum performance per dollar, because we all have budgets. And we need to do it in the most efficient manner, whether that is efficiency of power or efficiency of space.
It's exciting to show that GPU computing can address all of the most important barriers and deliver game-changing capability in computational research. For example, AMBER, a very popular computational chemistry application, lets researchers see 6x more simulation data per day, achieving 88 nanoseconds in a day, which would take about a week to simulate on CPUs alone. Now let's see how much that actually costs: by adding about 50% to the cost of a system, you get a performance gain of over 300%. And finally, GPUs are very power efficient. The #2 and #3 most powerful supercomputers in the world are a great example. China's Tianhe-1A, in the #2 spot, is 2.5x more power efficient than Oak Ridge's Jaguar, a CPU-only system.
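
The cost argument in this note reduces to a performance-per-dollar ratio. A minimal Python sketch of that arithmetic follows. It uses the slide's figures (318% higher performance, 54% added cost, 88 ns/day at 6x) and assumes that "318% higher" means 4.18 times the CPU-only baseline; the variable names are illustrative.

```python
# Figures from the slide; the CPU-only system is normalized to 1.0.
performance_gain = 3.18   # "318% higher performance" (assumed to mean +318%)
added_cost = 0.54         # "54% added cost"

relative_performance = 1.0 + performance_gain   # 4.18x the baseline
relative_cost = 1.0 + added_cost                # 1.54x the baseline

# Performance per dollar relative to the CPU-only system.
perf_per_dollar = relative_performance / relative_cost

# AMBER example: 88 ns/day with GPUs is described as 6x faster than CPUs alone.
gpu_ns_per_day = 88.0
cpu_ns_per_day = gpu_ns_per_day / 6.0
days_on_cpu_for_88_ns = gpu_ns_per_day / cpu_ns_per_day   # 6 days, roughly a week

print(f"performance per dollar: {perf_per_dollar:.2f}x the CPU-only system")
print(f"CPU-only rate: {cpu_ns_per_day:.1f} ns/day")
```

Under that reading, each dollar buys roughly 2.7 times as much simulation throughput as in the CPU-only configuration.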
We have certainly reached the inflection point of broad adoption of GPU computing. Over 580 universities are teaching GPU computing as part of their regular curriculum. In fact, this year the Chinese Ministry of Education will require 200 of its higher education institutions to make NVIDIA's CUDA parallel programming part of the curriculum. There is also a growing trend of government funding being awarded to GPU projects by the NIH, NSF, and DOE. This includes not only large projects, like Oak Ridge's Titan, which incorporates some 18 thousand GPUs, but also university infrastructure grants and department and research grants to develop GPU computing applications, which are awarded regularly.
UCLA faced many of the barriers of HPC. They needed to accelerate a new, innovative plasma simulation, and they needed to overcome space and power constraints. Their solution was a cluster with 96 nodes and 288 NVIDIA Tesla GPUs. The impact was considerable. The GPU upgrade resulted in 20% higher performance at the same power cost. Additionally, GPU use extended to new groups within the department for greatly accelerated modeling. So they were able to deliver more performance while staying within the budget they had for both space and power.
NVIDIA's GPU-accelerated application footprint is growing exponentially year over year. Computational scientists and developers have realized that the future is in parallel computing. Native GPU acceleration has now made its way into the scientific applications that are most widely used and most often cited in publications. This breadth of applications enables the domain scientists in each school and department, specifically those who aren't programmers, to reap the benefits of GPU acceleration.
Equally important to ready-made applications for domain scientists, we have been developing easier and easier ways to build your own applications for GPUs. The fastest and easiest approach is our "drop-in" libraries. Many scientific applications make wide use of standard template or math libraries. NVIDIA makes the most commonly used ones freely available, such as Thrust, a templated library, and many math libraries, such as BLAS, FFT, and sparse matrix routines. Another extremely non-invasive way to accelerate an application is to apply OpenACC directives to your existing code. It takes only a few lines of code to get a 2-10 times speedup in just a matter of days or hours. Finally, if you are a developer and need the maximum amount of performance, we support you in your native programming language.
Engineers and scientists worldwide rely on MATLAB to accelerate the pace of discovery, innovation, and development in disciplines such as automotive, aerospace, electronics, financial services, biotech, and many other industries. Engineers and scientists are successfully employing GPU technology to accelerate their discipline-specific calculations. With minimal effort and without extensive knowledge of GPUs, you can now use the power of GPUs with MATLAB.
(Previous script from AMBER 11 benchmarks; the slide shows K20 results.) I briefly spoke about AMBER's price performance in our opening. Now that you see how easy it is for researchers and scientists to benefit from GPU computing with ready-to-go applications or easy-to-implement developer approaches such as directives, we should revisit price performance. Again, on a single node, adding 2 GPUs increases the node cost by about 50%, but we get much more than a 50% performance improvement. In fact, with this application we achieve greater than 300% higher performance, making GPUs a clear winning investment.
Additional information on the K20 slide: 1 CPU node (dual CPUs) = 12.47 ns/day; 1 CPU + GPU node (dual CPUs and GPUs) = 95.59 ns/day.
NAMD, another extremely popular molecular dynamics package, is shown here getting a speedup of 2.7x to 3.8x with GPUs. We've benchmarked it with a typical STMV benchmark, which is 1 million atoms. So this is a very large system. But these are the systems and simulation times researchers need to make breakthroughs in science.

Nodes   s/step GPU XK6   s/step CPU XK6   ns/day Fermi XK6   ns/day CPU XK6
32      1.2414           4.62633          0.069599           0.018676
64      0.660887         2.36707          0.13073339         0.03650082
128     0.342743         1.19722          0.252084           0.072167
256     0.199465         0.609124         0.433159           0.141843
512     0.10837          0.314745         0.797269           0.274508
640     0.089752         0.255016         0.962655           0.338802
768     0.0774948        0.209511         1.114913517        0.412388848
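
The benchmark figures in this note can be cross-checked against each other. The Python sketch below converts seconds per step to nanoseconds per day and computes the GPU speedup at each node count. The 1 fs timestep is an inference from the listed numbers (it reproduces the ns/day values), not something the slide states, and the variable names are illustrative.

```python
SECONDS_PER_DAY = 86_400
TIMESTEP_NS = 1e-6  # 1 femtosecond; inferred from the data, not stated in the source

nodes = [32, 64, 128, 256, 512, 640, 768]
gpu_s_per_step = [1.2414, 0.660887, 0.342743, 0.199465, 0.10837, 0.089752, 0.0774948]
cpu_s_per_step = [4.62633, 2.36707, 1.19722, 0.609124, 0.314745, 0.255016, 0.209511]

def ns_per_day(seconds_per_step: float) -> float:
    """Simulated nanoseconds per wall-clock day for a given step time."""
    return SECONDS_PER_DAY / seconds_per_step * TIMESTEP_NS

gpu_ns_day = [ns_per_day(s) for s in gpu_s_per_step]
cpu_ns_day = [ns_per_day(s) for s in cpu_s_per_step]

# GPU speedup over the CPU-only nodes at each node count.
speedups = [cpu / gpu for cpu, gpu in zip(cpu_s_per_step, gpu_s_per_step)]

for n, g, c, s in zip(nodes, gpu_ns_day, cpu_ns_day, speedups):
    print(f"{n:4d} nodes: GPU {g:.4f} ns/day, CPU {c:.4f} ns/day, speedup {s:.2f}x")
```

On these numbers the speedup is largest at 32 nodes and shrinks steadily as nodes are added, from about 3.7x at 32 nodes to about 2.7x at 768 nodes.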
Today it is easier than ever for researchers, scientists, and academic institutions to benefit from GPU computing. We have ready-to-go GPU-accelerated applications (see the Applications Catalog). We are continuously investing in creating the easiest approaches to quickly accelerating your own applications, with OpenACC directives being our latest development. And finally, the GPU Test Drive cluster is the ideal way to test how a particular application accelerates with GPUs. The GPU Test Drive cluster is also pre-configured for easy purchase and installation.
Thank you for following along. I hope we have shown you that GPU computing is making extraordinary contributions to science and research. Now is the time to reach your next scientific computing achievements by investing in NVIDIA Tesla GPUs, which have worldwide adoption and world-class developer support.
