gahow/xtrain
39df0b40c1d354c3775f4a2557843dd80dd1821d
xtrain/crates
Gahow Wang 39df0b40c1 gqa: fix kv-proj shape test param indices (embed,attn_norm precede wq)
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-18 01:38:42 +08:00
xtrain-autodiff | gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests) | 2026-06-18 01:37:37 +08:00
xtrain-cuda | gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests) | 2026-06-18 01:37:37 +08:00
xtrain-distributed | gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests) | 2026-06-18 01:37:37 +08:00
xtrain-model | gqa: fix kv-proj shape test param indices (embed,attn_norm precede wq) | 2026-06-18 01:38:42 +08:00
xtrain-optim | perf: make xtrain-cuda a regular dep of xtrain-optim (GPU AdamW) | 2026-06-15 16:53:52 +08:00
xtrain-tensor | gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests) | 2026-06-18 01:37:37 +08:00
xtrain-train | gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests) | 2026-06-18 01:37:37 +08:00