traces/w600_r0.0015_st30_first600s.jsonl: first-600s cut of the shipped w600 trace (807 reqs, 274 sessions, all turn-1s + early later-turns; theoretical APC ceiling ~70% vs 80% full). Faster iteration (~18 min/arm) but a colder, lower-locality regime; whitelisted alongside the parent anonymized trace. analysis/lpwl_5policy_600s.md: LPWL vs LMetric/sticky/unified/unified+A+B on the 600s trace (dash1 8xH20, cold APC, n=1). LPWL is overall best with zero knobs — TTFT p90 7983ms vs tuned A+B 11562 (-31%), E2E p90 -16%, best request balance; APC 0.648 (emergent affinity, far above LMetric 0.507); only loss is E2E p99 from heavy-class decode concentration. Demonstrates anti-overfit: A+B was tuned on full w600 yet is beaten by the knob-free policy on this regime. Includes the run_5policy_600s.sh repro driver. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
14 lines
301 B
Plaintext
14 lines
301 B
Plaintext
__pycache__/
|
|
*.pyc
|
|
.venv/
|
|
*.egg-info/
|
|
outputs/
|
|
traces/*
|
|
# ship the anonymized sampled trace + its provenance (metadata only, no cleartext)
|
|
!traces/w600_r0.0015_st30.jsonl
|
|
!traces/README.md
|
|
*.log
|
|
.claude/
|
|
# third_party/vllm tracked in git for patch management
|
|
!traces/w600_r0.0015_st30_first600s.jsonl
|