Gahow Wang fefbd71ca9 GPU imbalance analysis + elastic PS verdict + corrected LMetric results
Key findings:
- Session-sticky imbalance is 8.6x at 200 req (small-sample artifact)
  but only 1.24x at 1000 req (moderate, TPOT unaffected)
- Elastic PS not justified: interference reduction 0% at 1/GPU,
  migration reduces imbalance 1.24x→1.18x at 1.5s/event cost
- Corrected LMetric (no affinity) matches Linear (sticky) on all
  metrics (<2%), proving soft affinity from cache-hit scoring works
- Updated §3.4 errata, added §8 GPU imbalance + elastic PS analysis

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-05-23 12:11:23 +08:00
Description
No description provided
48 MiB
Languages
Python 82.9%
Shell 17.1%