15 lines
513 B
Markdown
15 lines
513 B
Markdown
Objectives
|
|
- Serverless KVCache cache
|
|
- DeepSeek deployment study
|
|
|
|
Key Results
|
|
- Write a KVCache simulator to speed up policy test
|
|
- Refine S3-FIFO to get some improvement
|
|
|
|
Last Week
|
|
- Write a *naive* KVCache simulator to align with vLLM's KVCache management. And have very small bias comparing to real vLLM.
|
|
- Refine the S3-FIFO in vLLM and evaluate it. It can have a little improvement in relatively small cache space.
|
|
- Write the middle-stage report for graduation thesis.
|
|
|
|
Next Week
|
|
- Refine the cache policy. |