Files
obsidian/phd/weekly-report/25/250601.md

18 lines
473 B
Markdown

Objectives
- Serverless KVCache cache
- MoE autoscaling
Key Results
- [10/10] Refine a final version of KV$ cache for ATC'25
- [10/10] Graduation thesis defense
- [2/10] Run MoE model in Ali
- [0/10] Analysis the pattern of experts loading in Ali trace
Last Week
- Prepare and finish graduation defense.
- Polish the final version of KV$ cache and send to the shepherd.
- Run Qwen3-32B on latest vLLM.
Next Week
- Modify vLLM to support tracing the expert load pattern.