|
|
f18765b235
|
Document eight-GPU harness rerun
|
2026-05-13 09:04:14 +08:00 |
|
|
|
5c2958e6c1
|
Constrain harness topology by visible GPUs
|
2026-05-13 01:25:31 +08:00 |
|
|
|
fb6d74a18c
|
Document harness v2 rerun criteria
|
2026-05-12 22:23:12 +08:00 |
|
|
|
ef359c8eea
|
Document profile-driven harness run
|
2026-05-12 21:40:19 +08:00 |
|
|
|
17e9681ca0
|
Add profile-driven harness planner
|
2026-05-12 21:28:44 +08:00 |
|
|
|
63d6a111f4
|
Document profile-driven harness design
|
2026-05-12 21:09:29 +08:00 |
|
|
|
14259fcec9
|
Measure lower-range performance for infeasible trials
|
2026-05-10 14:30:34 +08:00 |
|
|
|
bf7c02e721
|
Clarify qwen27b raw per-iteration performance
|
2026-05-10 14:24:10 +08:00 |
|
|
|
b0325ecfd9
|
Clarify qwen235b raw per-iteration performance
|
2026-05-10 14:21:49 +08:00 |
|
|
|
4cfd3757b6
|
Document qwen235b prefill harness ablation
|
2026-05-10 13:05:49 +08:00 |
|
|
|
307e2eb0e8
|
Document qwen27b harness ablation
|
2026-05-10 01:12:21 +08:00 |
|
|
|
adc4351e5d
|
Report latency stats for infeasible baseline
|
2026-05-08 11:10:34 +08:00 |
|
|
|
eb137a0b62
|
Document TPOT40 baseline infeasible run
|
2026-05-08 02:57:03 +08:00 |
|
|
|
d7df1ebdac
|
Add open source project metadata
CI / test (3.11) (push) Has been cancelled
CI / test (3.12) (push) Has been cancelled
|
2026-05-06 21:18:21 +08:00 |
|
|
|
871c4cfc02
|
Document qwen27b chat setup audit
|
2026-05-06 20:32:09 +08:00 |
|
|
|
98cd6dd81a
|
Document qwen27b current config harness curve
|
2026-05-06 18:00:43 +08:00 |
|
|
|
5d96689ea6
|
Make harness runtime refinement memory safe
|
2026-05-06 17:37:31 +08:00 |
|
|
|
cf2e741550
|
Document high search rerun
|
2026-05-06 03:19:51 +08:00 |
|
|
|
915861b706
|
Document community vllm harness ablation
|
2026-05-02 11:17:24 +08:00 |
|
|
|
ccbf24ac47
|
Use time-compressed community vllm ablation
|
2026-05-02 10:03:59 +08:00 |
|
|
|
d3d4c234f6
|
Bound community vllm ablation replay
|
2026-05-02 09:58:56 +08:00 |
|
|
|
4ef69cce78
|
Make harness stop conservative for ablation
|
2026-05-02 09:47:16 +08:00 |
|
|
|
664aeb49b2
|
Use local cache for qwen30b vllm runs
|
2026-05-02 08:47:16 +08:00 |
|
|
|
1880e859b5
|
Use vllm cu129 wheel on dash0
|
2026-05-02 08:28:23 +08:00 |
|
|
|
e215827503
|
Use uv auto torch backend for vllm 0.20
|
2026-05-02 08:21:27 +08:00 |
|
|
|
a7c9518ef6
|
Use local vllm venv for dash0 community run
|
2026-05-02 08:17:04 +08:00 |
|
|
|
1a3d628268
|
Add harness early stop ablation
|
2026-05-02 08:08:14 +08:00 |
|
|
|
6d3459c82d
|
Document decode harness one-shot mechanism
|
2026-05-02 06:25:06 +08:00 |
|
|
|
9e5394b557
|
Inherit incumbent topology for runtime validation
|
2026-04-30 09:33:49 +08:00 |
|
|
|
f59919e21c
|
Clarify base-relative validation patches
|
2026-04-30 06:52:09 +08:00 |
|
|
|
46e9040613
|
Record decode validation follow-up
|
2026-04-28 21:20:41 +08:00 |
|
|
|
38ff4380e5
|
Make strong incumbent trigger validation phase
|
2026-04-28 20:54:05 +08:00 |
|
|
|
68cdaf56a8
|
Summarize qwen235b decode harness result
|
2026-04-28 20:36:17 +08:00 |
|
|
|
f982395aad
|
Record qwen235b decode harness launch
|
2026-04-28 07:02:13 +08:00 |
|
|
|
c9089cf4f0
|
Ignore non-SLO probe bookkeeping in bottleneck diagnosis
|
2026-04-28 06:58:38 +08:00 |
|
|
|
a9943e0240
|
Use probe sequence bottlenecks in harness
|
2026-04-28 06:57:45 +08:00 |
|
|
|
39aa47fbf1
|
Add generic decode-only harness guidance
|
2026-04-28 06:46:18 +08:00 |
|
|
|
71902b9fc2
|
Record qwen235b harness convergence test
|
2026-04-27 18:59:25 +08:00 |
|
|
|
bc884f6701
|
Document AITuner harness behavior
|
2026-04-27 16:34:19 +08:00 |
|
|
|
a962781b6c
|
Document qwen27b harness convergence curve
|
2026-04-26 01:32:18 +08:00 |
|
|
|
440f5b491b
|
Record plateau guard verification
|
2026-04-25 18:50:23 +08:00 |
|
|
|
6bac389aae
|
Add infeasible plateau guard to harness
|
2026-04-25 18:49:23 +08:00 |
|
|
|
6c04b9dbbc
|
Evaluate baseline before LLM tuning
|
2026-04-25 17:14:05 +08:00 |
|
|
|
2d7ebe50ee
|
Drain inflight requests after early stop
|
2026-04-25 16:57:01 +08:00 |
|
|
|
2dc2815620
|
Make harness verification portable
|
2026-04-25 16:37:13 +08:00 |
|
|
|
2c5e9af02a
|
Add harness-guided tuning prompts
|
2026-04-25 16:35:33 +08:00 |
|
|
|
dfe792ff6f
|
docs: add q235b prefill 0-32k tight summary
|
2026-04-18 16:10:29 +08:00 |
|
|
|
d237fc2723
|
docs: expand qwen27b 0-8k compare summary
|
2026-04-17 20:45:24 +08:00 |
|
|
|
bf286ef2a6
|
docs: add qwen235b prefill 7-day compare
|
2026-04-14 10:27:08 +08:00 |
|
|
|
bbecec4e9f
|
docs: add qwen235b tight ttft prefill summary
|
2026-04-13 09:37:06 +08:00 |
|