xtrain

Files

Gahow Wang 1c76573cb4 export: safetensors + config.json for xserv qwen3

New bin export_safetensors: load an xtrain checkpoint, map every param to its
HF Qwen3 tensor name, transpose 2D projection weights [in,out]->[out,in]
(1D norms + [vocab,dim] embed/lm_head kept), cast to BF16 (xserv's qwen3
forward is BF16-only), and write config.json + model.safetensors + a copy of
the gpt2 tokenizer.json. Sized exactly like bin/train.rs. safetensors 0.5 to
match xserv. GPU body gated behind not(no_cuda).

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-15 17:33:26 +08:00

xtrain-autodiff

ops: grad-check the T5 structural ops

2026-06-15 16:05:20 +08:00

xtrain-cuda

perf: streams / drop per-op sync

2026-06-15 16:56:17 +08:00

xtrain-distributed

dist: lengthen scaling bench so NCCL init amortizes

2026-06-15 17:18:23 +08:00

xtrain-model

model: add per-head QK-norm (Qwen3-compat) for xserv export