Files
Gahow Wang 0881942cf3 Window 1 results: recompute with fixed metrics + reframe limitations
After the B3 audit bug fixes (joined_analysis hotspot median +
b3_analyze percentile interp), regenerate b3_policy_comparison.json
and the per-policy hotspot_index.json from the same raw run on
dash0 and re-render the three affected figures (apc-vs-hotspot,
latency-bars, per-worker TTFT).

Key number changes in window_1_results.md:
- hotspot_index magnitudes corrected (all five policies; lmetric
  smallest delta at +0.7%, sticky largest at +16.1%)
- "capped reduces hotspot 13%" -> "~10% (2.253 -> 2.020)"
- TTFT/E2E/TPOT percentiles shift by <1% from floor->interp
  (unified TTFT p90 7.24 -> 7.35 s)

Restructured "Caveats" into "Limitations (read this before quoting
B3 numbers)":
1. Agentic dispatch coupling is by design — promoted from caveat
   to top-level methodology framing, tied to
   agentic_dispatch_coupling.md
2. B3 interference_index is binary (not size-graded) — added
3. Hot-sweep cache contamination (<1%) — kept
4. Unified interference unrecoverable — kept with explicit warning
   not to read unified's failure attribution as causal
5. w600 is a sample, not full trace — kept
6. Reuse decomposition is per-token in expectation — added

current_results/characterization_claim_matrix.md updates:
- The "heavy-tail not sole cause" claim now cites the corrected
  ~10% drop with the median bug noted
- New supported claim: "B3 saturated-replay latency gaps include an
  agentic dispatch-coupling feedback term, which is intentional and
  matches production"; cited against agentic_dispatch_coupling.md.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-26 01:08:55 +08:00

24 lines
920 B
JSON

{
"hotspot_index_ttft_p90": 2.0204268015410918,
"per_worker_latency_p90_s": {
"http://127.0.0.1:8000": 23.81083881931848,
"http://127.0.0.1:8001": 18.139674991380897,
"http://127.0.0.1:8002": 29.116712999995805,
"http://127.0.0.1:8003": 19.245074290811324,
"http://127.0.0.1:8004": 17.230851700413044,
"http://127.0.0.1:8005": 15.86663371440958,
"http://127.0.0.1:8006": 16.707309890014592,
"http://127.0.0.1:8007": 23.93718611740042
},
"per_worker_ttft_p90_s": {
"http://127.0.0.1:8000": 19.772570010094213,
"http://127.0.0.1:8001": 15.786850639013576,
"http://127.0.0.1:8002": 20.403525242628533,
"http://127.0.0.1:8003": 10.535247699997853,
"http://127.0.0.1:8004": 9.52290979558602,
"http://127.0.0.1:8005": 9.455131393985376,
"http://127.0.0.1:8006": 7.379608143202497,
"http://127.0.0.1:8007": 9.661995008389932
},
"status": "supported"
}