Initial commit: obsidian to gitea

This commit is contained in:
2026-05-07 15:04:41 +08:00
commit a57afa86b4
323 changed files with 42569 additions and 0 deletions

View File

@@ -0,0 +1,19 @@
Objectives
- Serverless KVCache cache
- MoE pattern feature
- EP design for inference performance
Key Results
- [10/10] Prepare slides for ATC'25 presentation w/ Jinbo
- [6/10] Survey MoE works and their observations
- [9/10] Analysis experts load balance's temporal locality
- [4/10] Analysis correlations between MoE layers
- [0/10] Understand how EP influence performance fully
- [0/10] Verify how dynamic EP influence performance
Last Week
- Survey the architecture of Bailian, read their docs, get some knowledge of their gateway, cluster setup and some serverless service.
- Refine KVCache slides w/ Jinbo and Dingyan.
Next Week
- Skip for one week.