NVIDIA A10 Tensor Core GPU
Accelerated graphics and video with AI for mainstream enterprise servers.
Manufacturer Part Number: 900-2G133-0020-100
Features and Benefits:
The NVIDIA A10 Tensor Core GPU, combined with NVIDIA RTX Virtual Workstation (vWS) software, brings graphics and video with AI services to mainstream enterprise servers, delivering the solutions that designers, engineers, artists, and scientists need to meet today’s challenges. Built on the NVIDIA Ampere architecture, the A10 combines second-generation RT Cores, third-generation Tensor Cores, and new streaming multiprocessors with 24 gigabytes (GB) of GDDR6 memory, all in a 150W power envelope, for versatile graphics, rendering, AI, and compute performance. From virtual workstations accessible anywhere in the world, to render nodes, to data centers running a variety of workloads, the A10 is built to deliver optimal performance in a single-slot, full-height, full-length (FHFL) PCIe form factor.
The NVIDIA A10 is supported as part of NVIDIA-Certified Systems in the on-premises data center, in the cloud, and at the edge. It builds on the ecosystem of AI frameworks from the NVIDIA NGC catalog, CUDA-X libraries, over 2.3 million developers, and over 1,800 GPU-optimized applications to help enterprises solve the most critical challenges in their business.
Specifications:
FP32 | 31.2 teraFLOPS
TF32 Tensor Core | 62.5 teraFLOPS | 125 teraFLOPS*
BFLOAT16 Tensor Core | 125 teraFLOPS | 250 teraFLOPS*
FP16 Tensor Core | 125 teraFLOPS | 250 teraFLOPS*
INT8 Tensor Core | 250 TOPS | 500 TOPS*
INT4 Tensor Core | 500 TOPS | 1,000 TOPS*
RT Core | 72 RT Cores
Encode/decode | 1 encoder, 2 decoders (+AV1 decode)
GPU memory | 24GB GDDR6
GPU memory bandwidth | 600GB/s
Interconnect | PCIe Gen4 64GB/s
Form factors | Single-slot, full-height, full-length (FHFL)
Max thermal design power (TDP) | 150W
vGPU software support | NVIDIA Virtual PC, NVIDIA Virtual Applications, NVIDIA RTX Virtual Workstation, NVIDIA Virtual Compute Server
* With sparsity
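The figures in the specification table lend themselves to a few back-of-the-envelope derived metrics. The following is a minimal Python sketch, not an NVIDIA tool: the dictionary layout and the function names (`ridge_point`, `tflops_per_watt`, `starred_ratio`) are hypothetical and chosen for illustration. It assumes the listed values are theoretical peaks and that the 150W TDP applies at those peaks; real workloads will typically land below these numbers.

```python
# Peak figures copied from the specification table; everything derived
# from them below is an illustrative estimate, not a measured result.
A10 = {
    "fp32_tflops": 31.2,   # FP32 peak, teraFLOPS
    "mem_bw_gbs": 600.0,   # GPU memory bandwidth, GB/s
    "tdp_w": 150.0,        # max thermal design power, watts
    # (unstarred, starred) Tensor Core peaks in teraFLOPS or TOPS
    "tensor": {
        "TF32": (62.5, 125.0),
        "BFLOAT16": (125.0, 250.0),
        "FP16": (125.0, 250.0),
        "INT8": (250.0, 500.0),
        "INT4": (500.0, 1000.0),
    },
}


def ridge_point(peak_tflops, mem_bw_gbs):
    """Roofline ridge point: operations per byte of memory traffic a kernel
    needs before peak compute, rather than memory bandwidth, is the limit."""
    return (peak_tflops * 1e12) / (mem_bw_gbs * 1e9)


def tflops_per_watt(peak_tflops, tdp_w):
    """Peak throughput divided by TDP (assumes TDP is drawn at peak)."""
    return peak_tflops / tdp_w


def starred_ratio(spec):
    """Ratio of the starred to the unstarred peak for each precision."""
    return {name: starred / base for name, (base, starred) in spec["tensor"].items()}


if __name__ == "__main__":
    print("FP32 ridge point (FLOP/byte):",
          ridge_point(A10["fp32_tflops"], A10["mem_bw_gbs"]))
    print("FP32 TFLOPS per watt:",
          tflops_per_watt(A10["fp32_tflops"], A10["tdp_w"]))
    print("Starred / unstarred ratios:", starred_ratio(A10))
```

Under these assumptions, an FP32 kernel needs roughly 52 floating-point operations per byte read from GPU memory before it stops being bandwidth-bound, and every starred Tensor Core figure is exactly twice its unstarred counterpart.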