aituner

Files

Gahow Wang 90c3eb51c8 Document Stop-B end-to-end validation (Phase 5)

Real gpt-5.4 agentic loop on Qwen3-30B-A3B/H20 with Stop-A enabled. Validates both
Stop-B paths: search-high-saturation (validator-authorized immediate stop) and
multi-iteration convergence. The TP1 baseline stays the per-GPU incumbent (2.90
req/s/GPU); TP/DP scaling raises raw throughput but lowers per-GPU efficiency and is
correctly never adopted (no regression). The Phase-4 authority model is exercised
live: a premature LLM stop is vetoed (validator_did_not_authorize_stop), then a later
justified stop is honored after the veto budget. EP launch-failures handled as
hard-negative evidence. Auditable reason chains throughout.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-15 17:58:44 +08:00

harness-ablation

Document Stop-B end-to-end validation (Phase 5)

2026-06-15 17:58:44 +08:00

qwen27b-chat-0-8k-7day-compare

docs: expand qwen27b 0-8k compare summary

2026-04-17 20:45:24 +08:00

qwen27b-chat-pd-colocation

Add qwen27b and qwen235b tuning notes

2026-04-11 12:07:42 +08:00

qwen30b-community-vllm020

Add open source project metadata