- plot_pd_crossover.py fig_conc: lead with request-completion % (the honest
collapse signal; latency percentiles count successes only), then mean-E2E /
TPS; note PD-capped/colo-uncapped in the title.
- add microbench/fresh_setup/gpu_monitor.sh (referenced by the committed
mb5_run_gpu.sh:73 for per-GPU util collection).
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>