Files
agentic-pd-hybrid/scripts
Claude Code Agent 9cca2c60c9 feat(experiments): expose PREFILL_MEM_FRAC + plumb --prefill-mem-fraction-static
v7 with --decode-mem-fraction-static=0.8 + SGLANG_SNAPSHOT_LINK_BUF_BYTES=16GB
silently fell back to 1 GB snapshot_buf because Prefill (mem-fraction
default 0.88) left only 10.8 GB free on GPU 0. Reducing prefill
mem-fraction lets 16 GB snapshot_buf fit.
2026-05-13 15:31:40 +08:00
..