Commit Graph

27 Commits

Author SHA1 Message Date
9919b9a7bd configs: add q235b prefill 1s 2s 0-32k study 2026-04-17 19:25:32 +08:00
34eb495b3e configs: add qwen235b prefill 0-32k study 2026-04-17 19:20:44 +08:00
26f3b46966 compare: add multi-candidate runner 2026-04-13 20:50:39 +08:00
18ff644b32 configs: add qwen235b prefill tight ttft 0323 study 2026-04-13 09:39:32 +08:00
edfd61a696 Add qwen235b prefill docs and tight TTFT spec 2026-04-12 11:24:23 +08:00
3f20ddf87e Add qwen235b prefill-only tuning support 2026-04-11 21:00:02 +08:00
5e54e9c8f5 Add multi-window baseline vs tuned compare flow 2026-04-11 13:51:54 +08:00
31dd44c54b Align qwen27b baseline proposal to TP1 run script 2026-04-11 00:40:05 +08:00
a4d54442db Fix topology-aware incumbents for qwen27b tuning 2026-04-11 00:32:41 +08:00
06d4c380b3 Align qwen27b baseline proposal with topology study 2026-04-10 17:43:02 +08:00
8d0777e5e2 Add topology-aware qwen27b 0-8k tuning 2026-04-10 17:41:54 +08:00
9422d43737 Prioritize topology exploration in decode tuning 2026-04-10 10:25:41 +08:00
d582a8ed1b Validate served model name consistency 2026-04-09 22:50:23 +08:00
ef78fe7eb5 Add topology-aware tuning constraints 2026-04-09 21:07:51 +08:00
581ef7ccea Add qwen235b decode TPOT40 study config 2026-04-09 12:57:05 +08:00
c158807fac Add decode-only study mode support 2026-04-09 11:23:17 +08:00
94c89e1103 Add codex and bailian LLM provider presets 2026-04-07 11:31:26 +08:00
46ed688ace Add trace length bucket tuning support 2026-04-07 11:03:16 +08:00
e9b5e9b957 Add targeted low-threshold probe specs 2026-04-05 02:08:27 +08:00
84c5d6bd80 Add deeper infeasible probe diagnostics 2026-04-05 01:44:38 +08:00
8b024c72f1 Tighten LLM proposal schema 2026-04-04 23:24:32 +08:00
7e8523fdaa Add probe early stop guards 2026-04-04 22:58:33 +08:00
56fa6747d2 Add replay time scaling for smoke tuning 2026-04-04 22:40:49 +08:00
dcb972014a Enable BLADNN for dash0 fp4 smoke study 2026-04-04 22:32:55 +08:00
f192c741ed Add study tune loop and smoke configs 2026-04-04 22:29:59 +08:00
7b7eaafd78 Use time-based trace window ids 2026-04-04 22:09:43 +08:00
gahow cdcca1d9d7 Initial AITuner study orchestrator 2026-04-04 21:26:37 +08:00