513 B
513 B
Objectives
- Serverless KVCache cache
- DeepSeek deployment study
Key Results
- Write a KVCache simulator to speed up policy test
- Refine S3-FIFO to get some improvement
Last Week
- Write a naive KVCache simulator to align with vLLM's KVCache management. And have very small bias comparing to real vLLM.
- Refine the S3-FIFO in vLLM and evaluate it. It can have a little improvement in relatively small cache space.
- Write the middle-stage report for graduation thesis.
Next Week
- Refine the cache policy.