Objective
- Serverless KVCache cache
- MoE study

Key Results
- Check the trace from Ali and fix its problems
- Define a standard trace structure for upcoming refinement
- Study MoE papers; run int4 DeepSeek v3 671B on 8 * A800

Last Week
- Communicated with a colleague at Ali to obtain the desired trace, then checked the trace for problems and gave feedback.
- Designed a standard trace structure to make refinement easier, then started formatting the trace in 12h for testing.
- Studied MoE and found an int4-quantized version of DeepSeek v3 671B to run on 8 * A800.

Next Week
- Format all traces into the desired structure.
- Study DeepSeek v3 to see how its experts are parallelized.