Module 2 Class 1
Module 2 Class 1
Module 2 Class 1
ACCELERATION
Dave Salvator, Senior Manager, Product Management, NVIDIA
EVOLUTION OF COMPUTING
AI & IOT
Deep Learning, GPU
100s of billions of devices
Mobile-Cloud
iPhone, Amazon AWS
2.5 billion mobile users
PC Internet
WinTel, Yahoo!
1 billion PC users
22
NVIDIA
“THE AI COMPUTING COMPANY”
109
103
1.5X per year
ARCHITECTURE 102
Single-threaded perf
1980 1990 2000 2010 2020
Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, 4
K. Olukotun, L. Hammond, and C. Batten New plot and data collected for 2010-2015 by K. Rupp
BEYOND MOORE’S LAW
Progress Of Stack In 7 Years
2013 2020
cuBLAS: 5.0 cuBLAS: 11.0
cuFFT: 5.0 cuFFT: 11.0
cuRAND: 5.0 cuRAND: 11.0
cuSOLVER: 11.0
Relative Performance
cuSPARSE: 5.0
NPP: 5.0 GPU-Accelerated cuSPARSE: 11.0
Computing
Thrust: 1.5.3 NPP: 1`1.0
CUDA: 5.0 Thrust: 1.9.0
Resource Mgr: r304 CUDA: 11.0
Moore’s Law
Base OS: CentOS 6.2 Resource Mgr: r384
Base OS: Ubuntu 16.04
CPU
DEVELOPERS ++
DEVELOPMENT
GPU
INSTALLED PERFORMANCE ++
ACCELERATION BASE ++
DPU
COMPUTE
CUDA
EVERYWHERE
CPU NETWORKING
6
NVIDIA DATACENTER PLATFORM
NGC
SMART CITY CONVERSATIONAL AI AUTONOMOUS RECOMMENDATION HEALTHCARE ++ OPERATIONS
APPLICATION VEHICLES SYSTEMS
SOFTWARE HUB FRAMEWORKS Merlin ...
Metropolis Jarvis Drive Clara
VIRTUAL GPU SW
TRITON
INFERENCE
ML & DATA ANALYTICS AI TRAINING & INFERENCE HIGH PERFORMANCE RENDERING & SERVER
Certified FLEET
Con
Containers
DEVELOPER COMPUTING VISUALIZATION COMMAND
TensorRT
TOOLKITS IndeX OptiX
NVIDIA GPU
NVIDIA HPC SDK MDL
CloudXR Operator
Pre-trained Models
COMPUTE MANAGEMENT
SDKs ACCELERATION NETWORKING, STORAGE & SECURITY
LIBRARIES CUDA-X DOCA MAGNUM IO
HARDWARE
UFM
TECHNOLOGIES
GPU NVSwitch BF DPU SMART NIC NVIDIA Switch
7
AMAZING EXPANSION OF NVIDIA ECOSYSTEM
Apps for Every Industry Reaching Billions of Users
80 2.3M
New SDKs Developers
6M
DLSS 2.1 CUDA Downloads in 2020
RTX DI
OptiX 7.2
AERIAL
HPC SDK 20.9 RTX HPC RAPIDS AI CLARA METRO DRIVE ISAAC 5G
RAPIDS 0.16
Parabricks 3.5
DeepStream 5.0 1,800
GPU-Accelerated Applications
NSIGHT 2020.5
cuDNN 8.03 CUDA-X
TensorRT 7.2
10
11
12
13
14
15
16
17
18
20 19
T
ES
20
20
20
20
20
20
20
20
20
20
20
COMPLETE SOFTWARE STACK GROWING ECOSYSTEM