Gahow Wang 0881942cf3 Window 1 results: recompute with fixed metrics + reframe limitations
After the B3 audit bug fixes (joined_analysis hotspot median +
b3_analyze percentile interp), regenerate b3_policy_comparison.json
and the per-policy hotspot_index.json from the same raw run on
dash0 and re-render the three affected figures (apc-vs-hotspot,
latency-bars, per-worker TTFT).

Key number changes in window_1_results.md:
- hotspot_index magnitudes corrected (all five policies; lmetric
  smallest delta at +0.7%, sticky largest at +16.1%)
- "capped reduces hotspot 13%" -> "~10% (2.253 -> 2.020)"
- TTFT/E2E/TPOT percentiles shift by <1% from floor->interp
  (unified TTFT p90 7.24 -> 7.35 s)
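The floor->interp change above can be illustrated with a minimal sketch. It assumes "floor" means taking the sample at the floored rank and "interp" means linear interpolation between the two neighbouring ranks (the convention NumPy uses by default). The actual b3_analyze code is not shown here, so the function names and sample data below are hypothetical.

```python
import math


def percentile_floor(values, q):
    """Return the sample at the floored fractional rank (no interpolation)."""
    ordered = sorted(values)
    pos = q / 100 * (len(ordered) - 1)
    return ordered[math.floor(pos)]


def percentile_interp(values, q):
    """Linearly interpolate between the two samples around the fractional rank."""
    ordered = sorted(values)
    pos = q / 100 * (len(ordered) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(ordered) - 1)
    frac = pos - lo
    return ordered[lo] + frac * (ordered[hi] - ordered[lo])


if __name__ == "__main__":
    # Hypothetical latencies in seconds, not taken from the B3 run.
    sample = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
    print("floor p90 :", percentile_floor(sample, 90))
    print("interp p90:", percentile_interp(sample, 90))
```

Under this convention the floored estimate can never exceed the interpolated one, which agrees in direction with the reported unified TTFT p90 shift from 7.24 s to 7.35 s.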

Restructured "Caveats" into "Limitations (read this before quoting
B3 numbers)":
1. Agentic dispatch coupling is by design — promoted from caveat
   to top-level methodology framing, tied to
   agentic_dispatch_coupling.md
2. B3 interference_index is binary (not size-graded) — added
3. Hot-sweep cache contamination (<1%) — kept
4. Unified interference unrecoverable — kept with explicit warning
   not to read unified's failure attribution as causal
5. w600 is a sample, not full trace — kept
6. Reuse decomposition is per-token in expectation — added

current_results/characterization_claim_matrix.md updates:
- The "heavy-tail not sole cause" claim now cites the corrected
  ~10% drop with the median bug noted
- New supported claim: "B3 saturated-replay latency gaps include an
  agentic dispatch-coupling feedback term, which is intentional and
  matches production"; cited against agentic_dispatch_coupling.md.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-26 01:08:55 +08:00

{
  "hotspot_index_ttft_p90": 2.252837147833725,
  "per_worker_latency_p90_s": {
    "http://127.0.0.1:8000": 34.71445541951107,
    "http://127.0.0.1:8001": 21.922988962882666,
    "http://127.0.0.1:8002": 23.936190764518685,
    "http://127.0.0.1:8003": 26.22220957049285,
    "http://127.0.0.1:8004": 40.318757307820505,
    "http://127.0.0.1:8005": 12.26559703698149,
    "http://127.0.0.1:8006": 27.904838753980588,
    "http://127.0.0.1:8007": 18.430557113309625
  },
  "per_worker_ttft_p90_s": {
    "http://127.0.0.1:8000": 28.18261351052206,
    "http://127.0.0.1:8001": 13.147308969072796,
    "http://127.0.0.1:8002": 13.818959677941162,
    "http://127.0.0.1:8003": 14.003642184572524,
    "http://127.0.0.1:8004": 31.339895512629305,
    "http://127.0.0.1:8005": 7.870992770011071,
    "http://127.0.0.1:8006": 14.149156623415186,
    "http://127.0.0.1:8007": 11.777357225219024
  },
  "status": "supported"
}
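The stored hotspot_index_ttft_p90 is numerically consistent with dividing the largest per-worker TTFT p90 by the median across workers. The sketch below recomputes it from the values in this file. The max-over-median definition is inferred from the numbers rather than taken from the joined_analysis source, and the function name is hypothetical.

```python
import statistics

# Per-worker TTFT p90 in seconds, copied from per_worker_ttft_p90_s above.
PER_WORKER_TTFT_P90_S = {
    "http://127.0.0.1:8000": 28.18261351052206,
    "http://127.0.0.1:8001": 13.147308969072796,
    "http://127.0.0.1:8002": 13.818959677941162,
    "http://127.0.0.1:8003": 14.003642184572524,
    "http://127.0.0.1:8004": 31.339895512629305,
    "http://127.0.0.1:8005": 7.870992770011071,
    "http://127.0.0.1:8006": 14.149156623415186,
    "http://127.0.0.1:8007": 11.777357225219024,
}


def hotspot_index(per_worker):
    """Assumed definition: worst worker's value divided by the median worker's value."""
    values = list(per_worker.values())
    # statistics.median averages the two middle values when the count is even,
    # which matters here because there are eight workers.
    return max(values) / statistics.median(values)


if __name__ == "__main__":
    hottest = max(PER_WORKER_TTFT_P90_S, key=PER_WORKER_TTFT_P90_S.get)
    print("hottest worker:", hottest)
    print("hotspot index :", hotspot_index(PER_WORKER_TTFT_P90_S))
```

With eight workers the median falls between the fourth and fifth sorted values, so an implementation that picks a single middle element instead of averaging the two would report a different index.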