Files
agentic-kvc/v2/figs/exp_d_policy_dispatch.png
Gahow Wang 0b180c191e v2 exp(d): expand figure to 6 panels (TTFT/E2E mean+p90, TPS, per-worker GPU util)
Per request: TTFT mean+p90, E2E mean+p90, decode TPS (output goodput; total/
prefill TPS omitted as cache-miss-inflated), and per-worker GPU-util boxplots
(8 workers/arm, tracets vs thinktime) showing utilization level + balance.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-05-30 21:10:27 +08:00

152 KiB
2170x1204px

/gahow/agentic-kvc/raw/commit/54e1f5266ab076aaaaf6a37e24673eecbe700a04/v2/figs/exp_d_policy_dispatch.png