556f3011c6ac3e1981ac9281bd413ee6abbe480c
B1: _inst_cumulative_tokens was written by pick_instance but never read anywhere; delete the variable, global declaration, and per-call increment. Load is already tracked via inst.ongoing_tokens. D1: _send_prefill_async + the --fire-and-forget branch were unreachable in practice (no launch/bench script enabled the flag) and broken even if exercised: D-decode would fire before P registered the transfer_id, guaranteeing a Mooncake 502. Collapse _handle_pd_sep to its synchronous path and drop the CLI flag.
Description
No description provided
Languages
Python
82.9%
Shell
17.1%