Objective
- Serverless KVCache caching
Key Result
- Analyze the differences between the LRU, WA, and oracle cache policies
Last Week
- Define the difference between cache policies using a reuse rank: for each cache hit, record the rank of the hit key under the given cache policy. Evaluate the cache policies by reuse rank and plot the CDF.
- Prepare for and complete the midterm graduation thesis defense.
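
A minimal sketch of the reuse-rank idea, assuming LRU as the policy and an unbounded cache, with rank 0 meaning the most recently used key. The function names and the toy trace are hypothetical and are not taken from the actual evaluation code; WA and oracle would need their own ordering and are not shown.

```python
from collections import OrderedDict


def lru_reuse_ranks(trace):
    """Return the LRU rank of each re-accessed key (0 = most recently used)."""
    order = OrderedDict()  # keys ordered from least to most recently used
    ranks = []
    for key in trace:
        if key in order:
            keys = list(order)  # O(n) scan per hit; acceptable for a sketch
            ranks.append(len(keys) - 1 - keys.index(key))
            order.move_to_end(key)  # the hit key becomes most recently used
        else:
            order[key] = None  # first access is a cold miss, so no rank
    return ranks


def empirical_cdf(ranks):
    """Return sorted (rank, fraction of hits with rank <= this value) pairs."""
    total = len(ranks)
    points = []
    for i, r in enumerate(sorted(ranks), start=1):
        if points and points[-1][0] == r:
            points[-1] = (r, i / total)  # keep only the last point per rank
        else:
            points.append((r, i / total))
    return points


# Hypothetical toy trace of cache keys.
trace = ["a", "b", "c", "a", "b", "b", "d", "a"]
ranks = lru_reuse_ranks(trace)
cdf = empirical_cdf(ranks)
```

For LRU specifically, a cache of capacity C hits a reuse exactly when its rank is below C, so the CDF value at rank C - 1 is the fraction of reuses that such a cache would serve.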
Next Week
- Write the rebuttal for ATC.
- Implement the WA policy in vLLM and test it.