e2f94495a1f876260e7fa2851f196259ca970840
Initial 8-section outline for "Elastic Affinity Router" — agentic LLM scheduler with session-affinity routing + hot-triggered session migration. Centerpiece is §2.3's dispatch coupling argument: agentic workloads close Little's Law on themselves (no human think-time), so per-turn W enters Λ, amplifying small latency differences into throughput differences. This is the intellectual hook the design hangs on. §3 attacks three baselines on three orthogonal failure modes (load-balance loses locality, static PD-disagg hits D-side KV wall, pure sticky creates hot pin). §4 frames EAR as the single scheduler that addresses all three. All figures and several numbers (T_hot, T_cool, EAR wall-clock factor) are TBD — see Open Items at bottom. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Description
No description provided
Languages
Python
82.9%
Shell
17.1%