aituner

Files

Gahow Wang f2ff0faebd Document Stop-B end-to-end on dense 27B: the improving climb + no-regression

A real gpt-5.4 agentic loop raised the per-GPU figure from 0.123 at TP1 to 0.2925 at TP2
to 1.0012 at TP4 (8.1x overall), each step a correctly diagnosed, real gain. A TP4
runtime tweak then measured 0.942 < 1.00 and was correctly rejected (no regression).
Together with the 30B run (validator stop + LLM-stop veto), all Stop-B behaviors are now
validated end-to-end. The SIGTERM-teardown fix was also validated in practice (clean
engine teardown, no GPU leak on stop). Efficiency finding: at scale=1.0, infeasible
high-theta probes burn the full 900s elapsed cap, so a practical loop needs a lower cap.
This is why the run was stopped after iter-4 rather than driven to an explicit Stop-B
firing.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-16 18:07:00 +08:00
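
The accept/reject behavior described in the commit message above can be sketched as a simple keep-the-best rule. This is only an illustration under stated assumptions: the function name `climb` and the strict greater-than acceptance test are hypothetical and not taken from the repository, and the per-GPU numbers are copied from the commit message.

```python
from typing import List, Tuple


def climb(measurements: List[Tuple[str, float]]) -> Tuple[str, float, List[str]]:
    """Keep the best-scoring config; reject any candidate that does not improve on it.

    Hypothetical helper: the real loop's acceptance rule is assumed, not confirmed.
    """
    best_name, best_score = measurements[0]
    rejected: List[str] = []
    for name, score in measurements[1:]:
        if score > best_score:
            # Real gain: the candidate becomes the new incumbent.
            best_name, best_score = name, score
        else:
            # No improvement: keep the incumbent so the loop never regresses.
            rejected.append(name)
    return best_name, best_score, rejected


# Per-GPU figures as reported in the commit message.
measurements = [
    ("TP1", 0.123),
    ("TP2", 0.2925),
    ("TP4", 1.0012),
    ("TP4 + runtime tweak", 0.942),
]

best_name, best_score, rejected = climb(measurements)
speedup = best_score / measurements[0][1]
print(f"best={best_name} score={best_score} speedup={speedup:.1f}x rejected={rejected}")
```

With these inputs the incumbent stays at TP4 and the runtime tweak is the only rejected candidate, which matches the no-regression outcome the commit reports.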

harness-ablation

Document Stop-B end-to-end on dense 27B: the improving climb + no-regression

2026-06-16 18:07:00 +08:00

qwen27b-chat-0-8k-7day-compare

docs: expand qwen27b 0-8k compare summary

2026-04-17 20:45:24 +08:00

qwen27b-chat-pd-colocation

Add qwen27b and qwen235b tuning notes

2026-04-11 12:07:42 +08:00

qwen30b-community-vllm020

Add open source project metadata