Commit Graph

  • f09562123b docs(experiments): E4-v8 results on real-timestamp SWE-Bench trace h200-cu130 Claude Code Agent 2026-05-13 19:07:59 +08:00
  • 9cca2c60c9 feat(experiments): expose PREFILL_MEM_FRAC + plumb --prefill-mem-fraction-static Claude Code Agent 2026-05-13 15:31:40 +08:00
  • 5c09a3a0cb feat(experiments): per-second GPU util sampler in E4-pressured sweep Claude Code Agent 2026-05-13 14:25:16 +08:00
  • 19612ff3a3 feat(experiments): parameterize TIME_SCALE in E4-pressured sweep Claude Code Agent 2026-05-13 14:22:13 +08:00
  • a953346a0c feat(experiments): E4-pressured points at third_party/traces SWE-Bench trace Claude Code Agent 2026-05-13 14:19:25 +08:00
  • 2dfe22ab20 refactor(snapshot): dedicated GPU snapshot_buf replaces kv_pool alloc Claude Code Agent 2026-05-13 14:18:23 +08:00
  • 6be5f9b57e docs(d2p): SnapshotStore refactor design — dedicated GPU buffer Claude Code Agent 2026-05-13 14:14:00 +08:00
  • f926a7b87d data: include qwen35-swebench-50sess trace under third_party/traces/ kzlin 2026-05-13 14:04:54 +08:00
  • 8fc31be605 data: include qwen35-swebench-50sess trace under third_party/traces/ kvc-debug-journey-v1-to-v4 kzlin 2026-05-13 14:04:54 +08:00
  • 552f3f564e chore(submodule): add third_party/agentic-kvcache submodule Claude Code Agent 2026-05-13 13:59:05 +08:00
  • 051d9220f4 fix(d2p): remove dangling logger.info refs in seeded_router Claude Code Agent 2026-05-13 12:53:28 +08:00
  • 9aac36fd89 docs: branch executive summary h200-cu130 Claude Code Agent 2026-05-13 12:24:56 +08:00
  • e9ad1c4bc7 feat(experiments): E4 vs E1 results + p99 attribution figures Claude Code Agent 2026-05-13 12:23:11 +08:00
  • af966f2371 fix(cli): plumb --enable-d-to-p-sync through benchmark-live → ReplayConfig Claude Code Agent 2026-05-13 12:17:28 +08:00
  • f6d6dc01ea feat(cli): per-role --mem-fraction-static + use in E4-pressured Claude Code Agent 2026-05-13 10:43:26 +08:00
  • 314c4cda0e docs(kvc): redesign gpu_utilization figure to lead with system-total compute kzlin 2026-05-13 10:39:15 +08:00
  • 722032a13b docs(kvc): add TPOT probability density figure (KVC v2 vs 4DP) kzlin 2026-05-13 10:24:44 +08:00
  • fbeb968f2f feat(experiments): E4-pressured sweep — force reseed via reject_threshold=1 Claude Code Agent 2026-05-13 10:22:58 +08:00
  • e729d62ddf fix(d2p): structural log + relax entrance condition for sync Claude Code Agent 2026-05-13 09:34:09 +08:00
  • 1d68ad66a7 docs(experiments): E4 results — initial scaffold + mid-run observation Claude Code Agent 2026-05-13 09:10:02 +08:00
  • 9149b530c0 feat(experiments): E4 cross-comparison analysis helper Claude Code Agent 2026-05-13 08:30:46 +08:00
  • a4f30e6bd3 docs(d2p): implementation status snapshot — Phase 1-3 audit Claude Code Agent 2026-05-13 08:29:26 +08:00
  • 8a2f72f18e feat(experiments): E4 protocol + sweep script — KVC + D→P vs naive PD Claude Code Agent 2026-05-13 08:27:40 +08:00
  • a369722efe fix(sglang): account snapshot-reserved slots in radix mem leak check Claude Code Agent 2026-05-13 08:26:16 +08:00
  • b9b0cf0fac feat(agentic): D→P snapshot orchestration in reseed path + CLI flag Claude Code Agent 2026-05-13 08:16:46 +08:00
  • 86412bb174 feat(sglang): D→P snapshot link integration — controller + RPC handlers Claude Code Agent 2026-05-13 08:12:04 +08:00
  • 7216507773 feat(snapshot): D→P RDMA Phase 1b — GPU pointer path verified Claude Code Agent 2026-05-13 00:59:43 +08:00
  • dc4867c270 feat(snapshot): D→P RDMA link Phase 1 — minimal byte transport Claude Code Agent 2026-05-13 00:55:55 +08:00
  • 9c35eddc79 docs(design): D→P RDMA snapshot push design Claude Code Agent 2026-05-13 00:44:03 +08:00
  • 110bd68000 docs(failures): consolidated 5-mode failure taxonomy improve/audit-and-foundations Gahow Wang 2026-05-13 00:43:58 +08:00
  • d93228e156 docs(sglang): patch surface inventory + retire-after-refactor list Gahow Wang 2026-05-13 00:42:22 +08:00
  • 9a81c993ab docs(onboarding): link new audit / design / eval docs from the root README + AGENTS.md Gahow Wang 2026-05-12 23:58:56 +08:00
  • dbb9eee471 feat(analysis): paired comparison with bootstrap CI Gahow Wang 2026-05-12 23:57:57 +08:00
  • 4021f27ee2 feat(analysis): stratified latency / TTFT reporter Gahow Wang 2026-05-12 23:57:13 +08:00
  • c5f552e122 test(policy): Theorem 1 no-starvation property tests Gahow Wang 2026-05-12 23:55:57 +08:00
  • a785b83023 test(policy): unit tests for Algorithm 1 lex scoring Gahow Wang 2026-05-12 23:54:48 +08:00
  • 76a79dfdda refactor(policy): extract pure score_candidate() from KvAwarePolicy Gahow Wang 2026-05-12 23:53:17 +08:00
  • 591cd6d382 docs(eval): paper-quality evaluation protocol (M1–M6) Gahow Wang 2026-05-12 23:51:46 +08:00
  • fd37eda367 docs(design): D->P sync interface contract + 4-phase rollout Gahow Wang 2026-05-12 23:50:39 +08:00
  • 683c44bd71 docs(design): block-level eviction refactor — concrete API plan Gahow Wang 2026-05-12 23:49:18 +08:00
  • baa843a3f9 docs(index): collaborator-facing doc index Gahow Wang 2026-05-12 23:47:28 +08:00
  • 6cdea52f28 docs(audit): cross-branch audit + 3-milestone roadmap Gahow Wang 2026-05-12 23:46:40 +08:00
  • 6d1c9237fa docs(architecture): KVC eviction granularity is the wrong abstraction tim 2026-05-12 14:21:45 +08:00
  • 7568e041ff docs(kvc): record real Ali KVC experiment results kvc-real-ali-iter-v1 Gahow Wang 2026-05-12 05:28:06 +00:00
  • 4e8f943875 feat(kvc): add real Ali replay workflow Gahow Wang 2026-05-12 05:28:00 +00:00
  • 986f351365 feat(sglang): drop streaming-session reqs with fill_ids < prefix_indices tim 2026-05-12 12:12:14 +08:00
  • d40db1f117 docs(experiments): E3 first run — load-floor bonus works, exposes SGLang bug tim 2026-05-12 12:05:51 +08:00
  • a1abdcd50c feat(experiments): E3 sweep — KVC v2 + RDMA + load-floor bonus tim 2026-05-12 11:45:09 +08:00
  • 93fce42747 feat(policy): load-floor bonus for KvAwarePolicy (Q2.B) tim 2026-05-12 11:45:09 +08:00
  • 905d671135 feat(env): MC_TRANSFER_TIMEOUT=1800s default in setup_env + stack tim 2026-05-12 11:45:09 +08:00
  • 9a166ac43b docs(experiments): design space for Q1 (mooncake stall) + Q2 (cold-D) tim 2026-05-12 11:20:00 +08:00
  • 976115ea5e Revert "feat(policy): cold-D bonus to break overlap-pinning death spiral" tim 2026-05-12 11:17:16 +08:00
  • 786cbb8d91 feat(policy): cold-D bonus to break overlap-pinning death spiral tim 2026-05-12 11:14:00 +08:00
  • bf4da281c0 docs(experiments): mooncake "is not alive" deep-dives to LRU starvation tim 2026-05-12 11:14:00 +08:00
  • 7f2ebf3d87 docs(experiments): forensic on Q1 (mooncake death) and Q2 (no D2 migration) tim 2026-05-12 10:45:18 +08:00
  • ef4dc81ea9 docs(experiments): forensic explanation for E2 80% failure rate tim 2026-05-12 10:38:49 +08:00
  • 3db2d84df8 docs(experiments): E2 complete — qualified H1 with a surprise tim 2026-05-12 03:23:33 +08:00
  • e3e5c45ed4 docs(experiments): E2 mid-run finding — D2 stays cold in KVC v2 too tim 2026-05-12 02:08:00 +08:00
  • 631b2c8847 docs(experiments): E1 results — naive 1P3D + kv-aware confirms H1 baseline tim 2026-05-12 01:49:52 +08:00
  • ad8aaa8c5a feat(experiments): E2 sweep — KVC v2 + RDMA on the matched subset tim 2026-05-12 00:49:53 +08:00
  • bb9cc249cd feat(experiments): E1 sweep on 50-session deterministic subset tim 2026-05-12 00:21:36 +08:00
  • b55371fe69 docs: H200 + driver 570 setup guide + 11 lessons learned tim 2026-05-12 00:10:14 +08:00
  • d11a66d11b feat(scripts): cu12.8 env wrapper + Inferact trace converter tim 2026-05-12 00:10:06 +08:00
  • a418aafeed feat(stack): pin PD workers to --disable-overlap-schedule tim 2026-05-12 00:09:56 +08:00
  • e874b1f055 feat(env): install vendored SGLang via uv path source tim 2026-05-12 00:09:50 +08:00
  • 7590e55189 docs: archive deprecated docs to docs/archive/, drop E1 from onboarding kzlin 2026-05-11 22:40:35 +08:00
  • 5a2fb8799c docs(kvc): onboarding manual for the next SWE agent kzlin 2026-05-11 22:31:08 +08:00
  • 506d360160 fix(figures): GPU utilization figure annotation/headroom polish kzlin 2026-05-11 22:28:39 +08:00
  • c01d6101d6 docs(kvc): freeze reseed slow-path audit + three reviewer challenges feat/d-to-p-sync kzlin 2026-05-11 22:20:34 +08:00
  • 9ccd853066 docs(kvc): correct reseed cost decomposition + flag D->P sync gap kzlin 2026-05-11 22:07:14 +08:00
  • 517677d7f2 docs(kvc): add GPU-utilization and cache-efficiency figures (rebut critic) kzlin 2026-05-11 18:04:49 +08:00
  • c5519066de docs(kvc): add TTFT probability density figure (KVC v2 vs 4DP) kzlin 2026-05-11 17:46:27 +08:00
  • b5af19583b docs(kvc): replace v2 path breakdown tables with generated figures kzlin 2026-05-11 17:38:43 +08:00
  • 37e9caa431 docs(kvc): production-decision reframe + formal router algorithm spec kzlin 2026-05-11 17:29:18 +08:00
  • 5eac9b4f6b fix(metrics): exclude aborted requests from latency/ttft/tpot stats kzlin 2026-05-11 17:29:18 +08:00
  • 0c25168cad docs(kvc): v2 deep analysis vs TEAM_REPORT baseline kzlin 2026-05-11 11:17:00 +08:00
  • 2ec0debef4 feat(kvc): session migration with reset-on-success + direct-append threshold tuning kzlin 2026-05-09 01:18:13 +08:00
  • 1d51704dad docs(kvc): agentic-fit analysis, refactor plan, validation report kzlin 2026-05-06 21:30:11 +08:00
  • 7affb565b2 feat(kvc): add backpressure smoke sweep + analyzer (and v6 p1 profile script) kzlin 2026-05-06 21:29:56 +08:00
  • c47adaf8e3 feat(kvc): honor admission backpressure hints + structural event logging kzlin 2026-05-06 21:29:46 +08:00
  • ca4b64c79a feat(sglang): expose backpressure pause hint in admit_direct_append kzlin 2026-05-06 21:29:30 +08:00
  • 4978c0d0cd profile(kvc): rewrite v5+profile report after critic audit + P0/P1 instrument kzlin 2026-04-29 22:29:21 +08:00
  • 51f5386691 profile(kvc): add D KV pool timeseries poller + analyzer for v6 root-cause kzlin 2026-04-29 20:04:21 +08:00
  • 6572d7f3f4 docs: add v5 chapter (Option D worker-mode admission) and rename to V1_TO_V5 kzlin 2026-04-29 16:13:25 +08:00
  • 6e5ed8da80 feat(kvc): Option D - delegate seed/reseed admission to D worker kzlin 2026-04-28 23:40:03 +08:00
  • 74194e660a docs: v4 final results, error analysis, and updated journey kzlin 2026-04-28 23:34:01 +08:00
  • c9d350b372 docs: KVC v1-v4 debug journey + raise session soft_cap to 16 kzlin 2026-04-28 21:10:41 +08:00
  • e9062b1d6e Document PD baseline comparison main Gahow Wang 2026-04-25 17:29:27 +00:00
  • c928c7db23 Add transfer queue admission knobs Gahow Wang 2026-04-25 17:29:15 +00:00
  • fe583fb413 Document kvcache-centric experiment progress Gahow Wang 2026-04-25 16:01:31 +00:00
  • 13bb31a446 Add kvcache-centric profiling and admission controls Gahow Wang 2026-04-25 16:00:52 +00:00
  • 08b13d22bc docs: rewrite project docs in concise chinese Gahow Wang 2026-04-24 12:41:52 +00:00
  • 5bdc0ed4f0 docs: document sglang maintenance workflow Gahow Wang 2026-04-24 12:31:32 +00:00
  • b8e6f13c20 feat(sglang): support decode session cache admission Gahow Wang 2026-04-24 12:30:41 +00:00
  • bded08301f chore: vendor sglang v0.5.10 snapshot Gahow Wang 2026-04-24 12:29:36 +00:00
  • 78f0d15221 docs: document project design and status Gahow Wang 2026-04-24 12:17:55 +00:00
  • 4bca741f32 feat: add agentic pd hybrid benchmark prototype Gahow Wang 2026-04-24 12:17:46 +00:00
  • d2fe014db7 chore: initialize repo hygiene Gahow Wang 2026-04-24 12:17:40 +00:00