aituner

Files

Gahow Wang a1cbab0e69 Document harness-vs-naive ablation: setup, substrate calibration, blocker

Sets up the controlled use_harness ON-vs-OFF ablation on dense 27B:
- both configs committed and validated on dash0 (differ only in
  use_harness + study_id), LLM auth + clean engine launch confirmed;
- characterizes exactly what the harness toggles (Harnesses: prompt
  section with ranked bottleneck hypotheses + knob-family steering,
  deterministic guided/stop proposals, Stop-B validator/veto) vs naive;
- substrate calibration from a real harness-ON run: at scale=0.2 the
  180s elapsed cap fires correctly but TP1 is uniformly infeasible even
  at u=0.125 (pass=0, elapsed-capped) -> recommend scale 0.4-0.5 for a
  real baseline; comparability caveat documented.

Honest status: full two-run sweep NOT completed in-session (~5-6
GPU-hours, sequential); GPUs left clean (all 0 MiB, no orphans; SIGTERM
teardown re-validated). Includes a precise continuation recipe and the
scripts/ablation_trajectory.py helper (validated against a prior store).

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-16 20:16:27 +08:00

harness-ablation

Document harness-vs-naive ablation: setup, substrate calibration, blocker

2026-06-16 20:16:27 +08:00

qwen27b-chat-0-8k-7day-compare

docs: expand qwen27b 0-8k compare summary

2026-04-17 20:45:24 +08:00

qwen27b-chat-pd-colocation

Add qwen27b and qwen235b tuning notes

2026-04-11 12:07:42 +08:00

qwen30b-community-vllm020

Add open source project metadata