イメージでわかる GPU の世代

Visual Guide to GPU Generations

Visual GPU Generations

シリーズ: イメージでわかる

GPU の世代が変わると、何が増え、何が減り、どこで差が出るのかを「持ち物の違い」として説明するページです。

What changes between GPU generations, what gets added, what gets dropped — explained as changes in a traveler's equipment.

1. GPU の「世代」ってなに？

1. What Are GPU "Generations"?

スマートフォンに「iPhone 13 → 14 → 15」のようなモデルがあるように、GPU にも世代があります。世代が進むと、計算ユニットの数・メモリの容量・対応する命令セットなどが変わります。

Just like smartphones go from iPhone 13 → 14 → 15, GPUs have generations too. Each new generation may change the number of compute units, memory capacity, supported instruction sets, and more.

🎒 たとえ話: 🎒 Analogy: GPU の世代は「旅行者の持ち物」のようなもの。世代が新しいほどリュックが大きく（メモリ増）、靴が頑丈で（計算ユニット増）、新しい道具（新命令）が入っている。古い世代は持ち物が少ないが、歩けなくなるわけではない。 GPU generations are like a traveler's gear. Newer generations have a bigger backpack (more memory), sturdier boots (more compute units), and new tools (new instructions). Older generations have less gear, but they can still walk.

2. AMD データセンター GPU の世代を見てみよう

2. AMD Datacenter GPU Generations at a Glance

Vega2017

Vega 56/64 / MI25

gfx900 · GCN 5th gen

HBM2 8–16 GB · FP32/FP16 対応 · AI 専用ユニットなし · ROCm の初期世代

HBM2 8–16 GB · FP32/FP16 · No dedicated AI units · Early ROCm generation

CDNA2020

MI100

gfx908 · CDNA 1

HBM2 32 GB · Matrix Core（行列演算専用ユニット）初搭載 · BF16/FP16 高速化

HBM2 32 GB · First Matrix Cores (dedicated matrix units) · BF16/FP16 acceleration

CDNA22021

MI210 / MI250X

gfx90a · CDNA 2

HBM2e 64–128 GB · Matrix Core 強化 · INT8 高速化 · 大規模 AI 向け

HBM2e 64–128 GB · Enhanced Matrix Cores · INT8 acceleration · Large-scale AI

CDNA32023

MI300X / MI300A

gfx942 · CDNA 3

HBM3 192 GB · FP8 対応 · GPU+CPU 統合チップ(MI300A) · 現行フラッグシップ

HBM3 192 GB · FP8 support · GPU+CPU unified chip (MI300A) · Current flagship

3. 世代で何が変わる？

3. What Changes Between Generations?

大きく分けると3つ変わります。

Three main things change:

① メモリが増える: 8 GB → 32 GB → 128 GB → 192 GB と、載せられるモデルサイズが一気に拡大。

① More memory: 8 GB → 32 GB → 128 GB → 192 GB — the model size you can run explodes.

② 専用ユニットが足される: Vega にはなかった Matrix Core（行列高速演算ユニット）が CDNA 以降で搭載。

② Specialized units are added: Matrix Cores (fast matrix compute units), absent on Vega, appear from CDNA onward.

③ 対応データ型が広がる: FP32 → FP16/BF16 → INT8 → FP8 と、より小さなデータ型を高速に扱えるように。

③ Supported data types expand: FP32 → FP16/BF16 → INT8 → FP8 — smaller types can be processed faster.

💡 gfx900 の位置づけ: 💡 Where gfx900 stands: Vega（gfx900）は「初期世代」で、Matrix Core や INT8 高速ユニットはありません。しかし、HBM2 の帯域幅と基本的な FP32/FP16 演算能力は備えており、小〜中規模モデルの推論は可能です。 Vega (gfx900) is the "earliest generation" without Matrix Cores or INT8 acceleration. But it has HBM2 bandwidth and solid FP32/FP16 compute — enough for inference on small to mid-size models.

4. 古い世代はもう使えない？

4. Are Older Generations Useless?

使えなくなるわけではありません。ただし、2つの面で不利になります。

They don't become useless. But they face two disadvantages:

ソフトウェア面: 新しい GPU に合わせて最適化が進むため、古い GPU 向けのテスト・チューニングが減る傾向がある。
ハードウェア面: 新しい命令やユニットがないので、同じ処理でも速度差が開く。

Software: Optimization focuses on newer GPUs, so testing and tuning for older GPUs tends to decrease.
Hardware: Without newer instructions and units, performance gaps widen for the same workload.

📱 たとえ話: 📱 Analogy: 古いスマホでも電話やメールはできるが、最新アプリが「推奨環境外」になることがある。動くけど、動作保証はされなくなっていく——GPU も同じ構造です。 An old smartphone can still make calls and send emails, but the latest apps may declare it "outside recommended specs." It works, but support gradually fades — GPUs follow the same pattern.

5. ROCm では世代差をどう見る？

5. How Does ROCm See Generation Gaps?

ROCm から見る世代差は、「新しいほど速い」だけでなく、「使える速い道が増える」という差でもあります。

From ROCm's point of view, generation gaps are not only "newer is faster" but also "newer has more fast paths available."

新しい世代になると、rocBLAS が使える高速カーネル、MIOpen が選べる solver、扱いやすいデータ型が増えます。逆に古い世代では、基本計算はできても、最新向けに増えた速い経路は使えないことがあります。

Newer generations gain more high-speed kernels for rocBLAS, more solver choices for MIOpen, and more practical data types. Older generations may still handle the basic computation, but they can miss the newer fast paths.

だから gfx900 を考えるときは、「動くか」だけでなく「どのライブラリのどの経路まで残っているか」を見るのが ROCm らしい見方です。これが support boundary の話につながります。

So when thinking about gfx900, a ROCm-style question is not just "does it run?" but "which library paths are still available?" That is exactly what leads into the support-boundary discussion.

💡 ROCm における世代差: 💡 Generation gaps in ROCm: VRAM や演算器の差だけでなく、どの経路がテストされ、最適化され、残されているか も世代差の一部です。 Generation gaps in ROCm are not only about VRAM and compute units, but also about which paths are still tested, tuned, and kept alive.

📖 このあと読むならこれ: 📖 Read next:
イメージでわかるメモリと帯域幅 — 世代が変わるたびに増えるメモリの意味
イメージでわかる「対応している」の意味 — 古い世代が直面するサポートの段階
ROCm バージョン変遷 — 各世代と ROCm リリースの対応関係 Visual Guide to Memory & Bandwidth — What more memory means each generation
Visual Guide to "Supported" — Support tiers facing older generations
ROCm Version History — How generations map to ROCm releases