653 B
653 B
Objective
- Serverless KVCache cache
- MoE study
Key Results
- Check the trace from Ali and fix problems
- Define a formatted trace structure for incoming refine
- Study papers about MoE, run int4 DeepSeek v3 671B in 8 * A800
Last Week
- Communicate with a colleague in Ali to get a desired trace and check the problems in trace to give feedback.
- Design a standard trace structure for better refining, then start format the trace in 12h for test.
- Study on MoE and find a int4 quantization version DeepSeek v3 671B to run in 8 * A800.
Next Week
- Format all trace to desired structure.
- Study on DeepSeek v3 to see how the experts do parallelism.