Files
obsidian/phd/weekly-report/25/250323.md

513 B

Objectives

  • Serverless KVCache cache
  • DeepSeek deployment study

Key Results

  • Write a KVCache simulator to speed up policy test
  • Refine S3-FIFO to get some improvement

Last Week

  • Write a naive KVCache simulator to align with vLLM's KVCache management. And have very small bias comparing to real vLLM.
  • Refine the S3-FIFO in vLLM and evaluate it. It can have a little improvement in relatively small cache space.
  • Write the middle-stage report for graduation thesis.

Next Week

  • Refine the cache policy.