472 B
472 B
Objective
- Serverless KVCache cache
Key Results
- Test traceA and traceB and fix bugs
- Survey the hardware for MoE deploying in medium-scale cluster
Last Week
- Do test on traceA and traceB, then fix bugs for the format pass to handle corner cases.
- Learn the calculation details of MLA and MoE to estimate the memory and calculation requirements, and compare with the different hardware.
Next Week
- Re-plot all the figures about trace.
- Survey the MoE deployment.