Initial commit: obsidian to gitea

This commit is contained in:
2026-05-07 15:04:41 +08:00
commit a57afa86b4
323 changed files with 42569 additions and 0 deletions

View File

@@ -0,0 +1,16 @@
Objectives
- Analysis of QWen trace
- Customize vLLM(Ali ver) with new features
Key Results
- Tokenize Qwen trace with Qwen-agent and some other tools [60%]
- Modify vLLM to support different KV cache block number
- Profile open source dataset with different cache blocks
Last Week
- Use Qwen-agent to handle workloads with file, get a more precise token length for these workloads.
- Modify vLLM's cache manager to support specific KVCache cache blocks, then measure the KV cache hit rate trend by block number in different workloads.
Next Week
- Tokenize all Qwen trace especially multimodal (image) workloads and measure with these trace.
- Profile KVCache cache hit rate in actual trace and compare with other open source trace to find different.