obsidian/phd/weekly-report/25/250223.md

Objective

  • Serverless KVCache
  • MoE study

Key Results

  • Check the trace from Ali and fix its problems
  • Define a standard trace structure for upcoming refinement
  • Study papers on MoE; run int4 DeepSeek v3 671B on 8 × A800

Last Week

  • Communicated with a colleague at Ali to obtain the desired trace, checked it for problems, and gave feedback.
  • Designed a standard trace structure to make refinement easier, then started formatting 12 h of the trace as a test.
  • Studied MoE and found an int4-quantized version of DeepSeek v3 671B to run on 8 × A800.
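
A rough check of why an int4 version is needed on this hardware, as a minimal sketch: it counts weight memory only and assumes the 80 GB A800 variant (the report does not say which variant is used). KV cache, activations, and any layers a real checkpoint keeps at higher precision are ignored.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes), weights only."""
    return num_params * bits_per_param / 8 / 1e9


NUM_PARAMS = 671e9      # DeepSeek v3 total parameter count, from the report
NUM_GPUS = 8            # from the report
GPU_MEMORY_GB = 80      # ASSUMPTION: 80 GB A800 variant

total_gpu_memory_gb = NUM_GPUS * GPU_MEMORY_GB

for bits in (16, 8, 4):
    weights_gb = weight_memory_gb(NUM_PARAMS, bits)
    verdict = "fits" if weights_gb < total_gpu_memory_gb else "does not fit"
    print(f"{bits:>2}-bit weights: {weights_gb:.1f} GB, {verdict} "
          f"in {total_gpu_memory_gb} GB")
```

Under these assumptions only the 4-bit weights leave headroom for KV cache and activations on a single 8-GPU node.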

Next Week

  • Format all traces into the desired structure.
  • Study DeepSeek v3 to see how expert parallelism is implemented.