Files
obsidian/phd/weekly-report/25/250727.md

19 lines
600 B
Markdown

Objectives
- MoE pattern feature
- EP design for inference performance
Key Results
- [6/10] Survey MoE works and their observations
- [9/10] Analysis experts load balance's temporal locality
- [4/10] Analysis correlations between MoE layers
- [0/10] Understand how EP influence performance fully
- [0/10] Verify how dynamic EP influence performance
Last Week
- Survey about heterogeneous parallelism config setup for different workloads and SLO.
- Finish the review for all papers as a shadow PC.
Next Week
- Survey the chance and challenges for EP reconfiguration.
- Survey the agentic AI infra.