Silicon Index
Updated June 2026

AI accelerators, compared

Comparing AI chips is mostly a fight over definitions. Peak FLOPS aren't delivered FLOPS, "efficiency" splits three ways, and half of these you can't actually buy. Fourteen accelerators from NVIDIA, Google, AWS, AMD, Huawei and Groq — normalized to FP8 and stripped to the numbers that decide a deployment.

WHAT IS FP8

FP8 is an 8-bit floating-point number format — a low-precision way to represent numbers that roughly doubles compute throughput and halves memory use versus 16-bit FP16/BF16. It has become the default precision for modern AI training and inference, which makes it the fairest common axis for comparing chips. Every compute figure here is dense FP8, peak theoretical — real delivered throughput runs ~30–50% of peak. Chips that can't do FP8 (Huawei 910C, Groq) are shown at FP16 or hatched, never faked.

Color = company

The efficiency frontier

FP8 compute-per-watt against memory bandwidth · bubble = memory capacity · top-right is better

Only the six chips with both a published FP8 rate and a usable power figure can be placed here. Trainium3 (power undisclosed), Ascend 910C (no native FP8) and Groq (per-chip metrics don't apply) sit in the table instead.

Full spec table

Sort any column · filter by use case. Per-chip figures; rack-scale systems pool these differently.

What the numbers don't show