GPU ケーススタディ

GPU Case Studies

実機・実ログ・再現手順に寄り添って、各 GPU で何が観測されたかを整理するページ群です。ROCm の一般論ではなく、個別のセットアップを丁寧に読むための入口として設計しています。

A collection of pages that organize what was observed on specific GPUs using real machines, runtime logs, and reproducible setups. These pages focus on concrete setups rather than broad ROCm generalizations.

Per-GPU observation pages Per-GPU observation pages Fact / Interpretation / Open Question Fact / Interpretation / Open Question 実機ログベース Grounded in local runtime logs

ここでは「その GPU が ROCm 全体でどう扱われるべきか」を断定するのではなく、観測できた事実、そこから読めること、まだ未確定な点を分けて記録します。 The goal here is not to declare what a GPU means for ROCm as a whole, but to separate observed facts, interpretations grounded in those facts, and the remaining open questions.

読み方

How to read these pages

各ケーススタディは、観測と解釈を混ぜないために、同じ3つの層で整理します。

Each case study uses the same three-layer structure so that observation and interpretation do not get mixed together.

Fact

確認できた事実

Observed facts

実機ログ、生成結果、`rocm-smi`、journal、設定ファイルなど、実際に確認できたものだけを置きます。

Only things that were actually observed on the machine: runtime logs, generation output, `rocm-smi`, journals, and concrete configuration details.

Interpretation

そこから読めること

What the facts suggest

事実から自然に読める範囲で、どこがボトルネックだったか、何が支配的に見えたかを整理します。

Keeps interpretation within the range that the evidence can support, such as likely bottlenecks or the dominant reliability risk in that setup.

Open Question

まだ言えないこと

What remains open

他環境で再現するか、一般法則と言えるか、設定依存かどうかなど、未確定な点も明示します。

Explicitly marks what is still unknown, such as cross-system reproducibility, generality, and setup-specific behavior.

公開中のケース

Available case studies

まずは MI25 / gfx900 の推論経路ケースから始めます。今後、GPU ごとのページをここに追加していく前提です。

The first published case focuses on MI25 / gfx900 inference behavior. Additional per-GPU pages can be added here later.

Case Study

MI25 / gfx900 で GPU inference はどこまで通るか

How far GPU inference runs on MI25 / gfx900

mi25-inference-case-study.html

ROCm 7.2 と Ollama source build を使ったローカル観測。`tinyllama` と `deepseek-r1:14b` を例に、GPU path がどのように observed されたかを整理します。

A local observation page for ROCm 7.2 plus an Ollama source build, using `tinyllama` and `deepseek-r1:14b` as concrete examples of the GPU path being observed.

関連リポジトリ: AETS-MAGI/ROCm-MI25-build

Related repository: AETS-MAGI/ROCm-MI25-build

環境・失敗状態・修正点・証跡を分離
Separates environment, failure state, fix path, and evidence
MI25 全般への一般化は避ける
Avoids over-generalizing to all MI25 setups

Open GitHub

GPU ケーススタディ

GPU Case Studies

読み方

How to read these pages

確認できた事実

Observed facts

そこから読めること

What the facts suggest

まだ言えないこと

What remains open

公開中のケース

Available case studies

MI25 / gfx900 で GPU inference はどこまで通るか

How far GPU inference runs on MI25 / gfx900

関連ページ

Related pages

実験の流れ

Experiment history

ROCm の構造と貢献レイヤ

ROCm Structure and Contribution Layers