Turns the architectural manifesto
(KVC_EVICTION_GRANULARITY_DESIGN_ZH.md) into a
function-by-function design the next collaborator can
implement against.
Contents:
- §1 current SessionAwareCache state with exact field
semantics (req_pool_idx / kv_committed_len /
kv_allocated_len / cache_protected_len)
- §3.1–§3.6 post-refactor source sketches for
SessionSlot, cache_finished_req,
cache_unfinished_req, match_prefix,
release_session, get_session_status
- §3.7 the correction block at
schedule_batch.py:1572-1646 that we can remove (the
E3 landmine)
- §4 five invariants the PR must defend
- §5 GPU-free unit + property test plan with a
MockRadixCache shape
- §6 ~1 week engineering estimate and three risks
- §7 dependency relationship to the planned
D->P sync work
- §8 minimal step list for the implementing agent
No code changes yet. Future commits on a
feat/block-level-evict branch will implement this
spec.