NVIDIA TESLA V100 GPU ACCELERATOR
The Most Advanced Data Center GPU Ever Built. NVIDIA® Tesla® V100 is the most advanced data center GPU ever built, designed to accelerate AI, HPC, and graphics. Powered by NVIDIA Volta, the latest GPU architecture, Tesla V100 offers the performance of up to 100 CPUs in a single GPU, enabling data scientists, researchers, and engineers to tackle challenges that were once thought impossible.
SPECIFICATIONS
                               Tesla V100 PCIe            Tesla V100 SXM2
GPU Architecture               NVIDIA Volta               NVIDIA Volta
NVIDIA Tensor Cores            640                        640
NVIDIA CUDA® Cores             5,120                      5,120
Double-Precision Performance   7 TFLOPS                   7.8 TFLOPS
Single-Precision Performance   14 TFLOPS                  15.7 TFLOPS
Tensor Performance             112 TFLOPS                 125 TFLOPS
GPU Memory                     16 GB or 32 GB HBM2        16 GB or 32 GB HBM2
Memory Bandwidth               900 GB/sec                 900 GB/sec
ECC                            Yes                        Yes
Interconnect Bandwidth         32 GB/sec                  300 GB/sec
System Interface               PCIe Gen3                  NVIDIA NVLink
Form Factor                    PCIe Full Height/Length    SXM2
Max Power Consumption          250 W                      300 W
Thermal Solution               Passive                    Passive
Compute APIs                   CUDA, DirectCompute, OpenCL™, OpenACC
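The peak figures in the table follow directly from core counts and clock speeds. The sketch below is a rough cross-check, not an official derivation: the boost clocks of roughly 1.38 GHz (PCIe) and 1.53 GHz (SXM2) are assumptions not stated in this sheet, and each Tensor Core is assumed to execute 64 fused multiply-adds (128 FLOPs) per clock.

```python
def peak_tflops(units, flops_per_unit_per_clock, clock_ghz):
    """Peak throughput in TFLOPS = units * FLOPs/clock * clock (GHz) / 1000."""
    return units * flops_per_unit_per_clock * clock_ghz / 1000.0

# FP32: 5,120 CUDA cores, one fused multiply-add (2 FLOPs) per clock.
fp32_sxm2 = peak_tflops(5120, 2, 1.53)     # ~15.7 TFLOPS
fp32_pcie = peak_tflops(5120, 2, 1.38)     # ~14.1 TFLOPS

# FP64: half the FP32 rate on Volta.
fp64_sxm2 = fp32_sxm2 / 2                  # ~7.8 TFLOPS

# Tensor: 640 Tensor Cores, 128 FLOPs per clock each.
tensor_sxm2 = peak_tflops(640, 128, 1.53)  # ~125 TFLOPS

print(round(fp32_sxm2, 1), round(fp64_sxm2, 1), round(tensor_sxm2, 1))
```

The results line up with the table's 15.7, 7.8, and 125 TFLOPS entries, which is a useful sanity check when comparing accelerators.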
TESLA V100 | DATA SHEET | FEB18
[Chart: 1.5X HPC Performance in One Year with NVIDIA Tesla V100. Performance normalized to P100 across workloads: Benchmark (MiniFE), Seismic (RTM), Seismic (SPECFEM3D), Life Sci (Amber), Physics (GTC-P), Physics (QUDA), Life Sci (HOOMD-Blue), Benchmark (STREAM). System Config: 2X Xeon E5-2690 v4, 2.6GHz, with 2X Tesla P100 or V100.]
[Chart: 47X Higher Throughput than CPU Server on Deep Learning Inference. Performance normalized to CPU for 1X CPU, Tesla P100, and Tesla V100. Workload: ResNet-50 | CPU: 1X Xeon E5-2690 v4 @ 2.6GHz | GPU: add 1X NVIDIA® Tesla® P100 or V100.]
[Chart: Deep Learning Training in One Workday. Time to solution in hours, lower is better: 8X V100 in 6 hours, 8X P100 in 18 hours, 8X K80 in 38 hours. Server Config: Dual Xeon E5-2699 v4, 2.6GHz | 8X Tesla K80, Tesla P100, or Tesla V100 | ResNet-50 training on Caffe2 for 90 epochs with the 1.28M-image ImageNet dataset.]
© 2018 NVIDIA Corporation. All rights reserved. NVIDIA, the NVIDIA logo, Tesla, NVIDIA GPU Boost, CUDA, and NVIDIA Volta are trademarks and/or registered trademarks of NVIDIA Corporation in the U.S. and other countries. OpenCL is a trademark of Apple Inc. used under license to the Khronos Group Inc. All other trademarks and copyrights are the property of their respective owners. FEB18
To learn more about the Tesla V100 visit www.microway.com/tesla
GROUNDBREAKING INNOVATIONS

VOLTA ARCHITECTURE
By pairing CUDA Cores and Tensor Cores within a unified architecture, a single server with Tesla V100 GPUs can replace hundreds of commodity CPU servers for traditional HPC and deep learning.
TENSOR CORE
Equipped with 640 Tensor Cores, Tesla V100 delivers 125 TeraFLOPS of deep learning performance. That's 12X the Tensor FLOPS for DL training and 6X the Tensor FLOPS for DL inference compared to NVIDIA Pascal™ GPUs.
NEXT GENERATION NVLINK
NVIDIA NVLink in Tesla V100 delivers 2X higher throughput than the previous generation. Up to eight Tesla V100 accelerators can be interconnected at up to 300 GB/s to unleash the highest application performance possible on a single server.
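The 300 GB/s figure is the aggregate of V100's six NVLink links, each carrying 25 GB/s per direction and counted bidirectionally; the PCIe card's 32 GB/s interconnect figure is likewise PCIe Gen3 x16 (~16 GB/s per direction) counted both ways. A quick sketch of that arithmetic:

```python
# NVLink on V100: 6 links x 25 GB/s per direction, counted bidirectionally.
nvlink_links = 6
nvlink_gbs_per_dir = 25
nvlink_total = nvlink_links * nvlink_gbs_per_dir * 2  # 300 GB/s

# PCIe Gen3 x16: ~16 GB/s per direction, counted bidirectionally.
pcie_total = 16 * 2                                   # 32 GB/s

print(nvlink_total, pcie_total)  # 300 32
```

This is why the SXM2 module, which exposes all six NVLink links, is the form factor of choice for multi-GPU servers.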
HBM2
With a combination of improved raw bandwidth of 900 GB/s and higher DRAM utilization efficiency at 95%, Tesla V100 delivers 1.5X higher memory bandwidth than Pascal GPUs as measured on STREAM. Tesla V100 is now available in a 32 GB configuration that doubles the memory of the standard 16 GB offering.
MAXIMUM EFFICIENCY MODE
The new maximum efficiency mode allows data centers to achieve up to 40% higher compute capacity per rack within the existing power budget. In this mode, Tesla V100 runs at peak processing efficiency, providing up to 80% of the performance at half the power consumption.
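The efficiency claim can be sanity-checked with simple arithmetic: 80% of peak performance at half the power is a 1.6X gain in performance per watt, and within a fixed rack power budget it would in principle allow twice as many boards at 80% speed each, an idealized 1.6X aggregate-throughput upper bound (the sheet's "up to 40% higher compute capacity" is the more conservative quoted figure). A sketch:

```python
perf_frac = 0.80   # fraction of peak performance in maximum efficiency mode
power_frac = 0.50  # fraction of peak power consumption

perf_per_watt_gain = perf_frac / power_frac           # 1.6X efficiency
boards_per_budget = 1 / power_frac                    # 2X boards in same power envelope
rack_throughput_gain = boards_per_budget * perf_frac  # 1.6X aggregate (upper bound)

print(perf_per_watt_gain, rack_throughput_gain)
```

In practice, rack overheads (cooling, host CPUs, networking) eat into the idealized bound, which is consistent with the lower 40% figure quoted above.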
PROGRAMMABILITY
Tesla V100 is architected from the ground up to simplify programmability. Its new independent thread scheduling enables finer-grain synchronization and improves GPU utilization by sharing resources among small jobs.
Tesla V100 is the flagship product of the Tesla data center computing platform for deep learning, HPC, and graphics. The Tesla platform accelerates over 550 HPC applications and every major deep learning framework. It is available everywhere from desktops to servers to cloud services, delivering both dramatic performance gains and cost-savings opportunities.
NumberSmasher 1U
Tesla GPU Server
NUMBERSMASHER 1U GPU WITH 4 TESLA V100 PCI-E GPUS
> 4 Tesla V100 GPUs
> 2 Intel Xeon Processors
> Up to 512GB DDR4 memory
MICROWAY
Incorporated in 1982, Microway designs state-of-the-art, high-end Linux clusters, servers, and workstations built on NVIDIA GPUs for HPC, HPDA, and deep learning. Classified as a small business, woman-owned and operated, Microway's GSA Schedule is GS-35F-0431N.
www.microway.com | +1 (508) 746-7341 | wespeakhpc@microway.com