Module 2 Class 1


DATA CENTER ACCELERATION
Dave Salvator, Senior Manager, Product Management, NVIDIA
EVOLUTION OF COMPUTING

1995: PC Internet. WinTel, Yahoo! 1 billion PC users.
2005: Mobile-Cloud. iPhone, Amazon AWS. 2.5 billion mobile users.
2015: AI & IoT. Deep Learning, GPU. 100s of billions of devices.

NVIDIA
“THE AI COMPUTING COMPANY”

GPU Computing, Computer Graphics, Artificial Intelligence


RISE OF NVIDIA GPU COMPUTING

[Figure: log-scale plot of performance, 1980 to 2020, with the y-axis running from 10^2 to 10^9. Curves: GPU-Computing perf and Single-threaded perf. Annotations: 1000X in 10 years; 2X per year; 1.5X per year; 1.1X per year. Layer labels on the chart: APPLICATIONS, ALGORITHMS, SYSTEMS, CUDA, ARCHITECTURE.]

Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten. New plot and data collected for 2010-2015 by K. Rupp.
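
The per-year rates annotated on the chart can be turned into ten-year totals by simple compounding. The snippet below is a minimal sketch, assuming each rate is applied once per year; the function name compound_growth is illustrative and does not come from the slides.

```python
def compound_growth(rate_per_year: float, years: int) -> float:
    """Total speed-up after multiplying performance by rate_per_year once a year."""
    return rate_per_year ** years


if __name__ == "__main__":
    # Rates taken from the chart annotations: 2X, 1.5X and 1.1X per year.
    for rate in (2.0, 1.5, 1.1):
        total = compound_growth(rate, 10)
        print(f"{rate}X per year over 10 years gives {total:.1f}X")
```

Doubling every year for 10 years gives 2^10 = 1024, which matches the "1000X in 10 years" label. By comparison, 1.5X per year compounds to roughly 58X and 1.1X per year to roughly 2.6X over the same span.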
BEYOND MOORE’S LAW
Progress Of Stack In 7 Years

2013: Accelerated Server with Fermi
2020: Accelerated Server with Ampere

Component       2013          2020
cuBLAS          5.0           11.0
cuFFT           5.0           11.0
cuRAND          5.0           11.0
cuSOLVER        (not listed)  11.0
cuSPARSE        5.0           11.0
NPP             5.0           11.0
Thrust          1.5.3         1.9.0
CUDA            5.0           11.0
Resource Mgr    r304          r384
Base OS         CentOS 6.2    Ubuntu 16.04

[Figure: Relative Performance, 2013 to 2020. Labels: GPU-Accelerated Computing, Moore's Law, CPU.]
25 YEARS OF ACCELERATED COMPUTING

[Figure: four panels titled X-factor Speed-up, Full Stack, Data Center Scale, and One Architecture. Labels on the diagram: DEVELOPERS ++, INSTALLED BASE ++, PERFORMANCE ++, DEVELOPMENT, ACCELERATION, CUDA EVERYWHERE, GPU, DPU, CPU, COMPUTE, NETWORKING.]
NVIDIA DATACENTER PLATFORM

BUSINESS APPLICATIONS: Customer Engagement, Patient Diagnostics, Fraud Detection, Quality Assurance, Industrial Automation, Precision Marketing, Molecular Simulations, ++

APPLICATION FRAMEWORKS: SMART CITY (Metropolis), CONVERSATIONAL AI (Jarvis), AUTONOMOUS VEHICLES (Drive), RECOMMENDATION SYSTEMS (Merlin), HEALTHCARE (Clara), ++

NGC SOFTWARE HUB: Certified Containers, Pre-trained Models

DEVELOPER TOOLKITS: ML & DATA ANALYTICS, AI TRAINING & INFERENCE, HIGH PERFORMANCE COMPUTING, RENDERING & VISUALIZATION. Products shown: TensorRT, TRITON INFERENCE SERVER, NVIDIA HPC SDK, IndeX, OptiX, MDL, CloudXR

OPERATIONS: VIRTUAL GPU SW, FLEET COMMAND, NVIDIA GPU Operator, COMPUTE MANAGEMENT, MONITORING, DCGM, UFM

SDKs: ACCELERATION LIBRARIES (CUDA-X); NETWORKING, STORAGE & SECURITY (DOCA, MAGNUM IO)

NVIDIA CERTIFIED SERVERS & VALIDATED SOLUTIONS: DGX, EGX, HGX, CLOUD CSP Instances (Purpose Built, Mainstream & Edge)

HARDWARE TECHNOLOGIES: GPU, NVSwitch, BF DPU, SMART NIC, NVIDIA Switch
AMAZING EXPANSION OF NVIDIA ECOSYSTEM
Apps for Every Industry Reaching Billions of Users

80 New SDKs, including: DLSS 2.1, RTX DI, OptiX 7.2, HPC SDK 20.9, RAPIDS 0.16, Parabricks 3.5, DeepStream 5.0, NSIGHT 2020.5, cuDNN 8.03, TensorRT 7.2, CUDA 11.1, NCCL 2.7.8, MAGNUM IO, GPUDirect Storage

Platforms: RTX, HPC, RAPIDS, AI, CLARA, METRO, DRIVE, ISAAC, AERIAL 5G, layered on CUDA-X and CUDA

2.3M Developers
6M CUDA Downloads in 2020
1,800 GPU-Accelerated Applications
6,500 AI Startups

[Figure: two panels titled COMPLETE SOFTWARE STACK and GROWING ECOSYSTEM; the growth chart has a year axis from 2010 to 2020.]
