Objective - Serverless KVCache cache - MoE study Key Results - Check the trace from Ali and fix problems - Define a formatted trace structure for incoming refine - Study papers about MoE, run int4 DeepSeek v3 671B in 8 * A800 Last Week - Communicate with a colleague in Ali to get a desired trace and check the problems in trace to give feedback. - Design a standard trace structure for better refining, then start format the trace in 12h for test. - Study on MoE and find a int4 quantization version DeepSeek v3 671B to run in 8 * A800. Next Week - Format all trace to desired structure. - Study on DeepSeek v3 to see how the experts do parallelism.