14 lines
472 B
Markdown
14 lines
472 B
Markdown
Objective
|
|
- Serverless KVCache cache
|
|
|
|
Key Results
|
|
- Test traceA and traceB and fix bugs
|
|
- Survey the hardware for MoE deploying in medium-scale cluster
|
|
|
|
Last Week
|
|
- Do test on traceA and traceB, then fix bugs for the format pass to handle corner cases.
|
|
- Learn the calculation details of MLA and MoE to estimate the memory and calculation requirements, and compare with the different hardware.
|
|
|
|
Next Week
|
|
- Re-plot all the figures about trace.
|
|
- Survey the MoE deployment. |