This website requires JavaScript.
Explore
Help
Sign In
gahow
/
xtrain
Watch
1
Star
0
Fork
0
You've already forked xtrain
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
c470c627a7d60aabb701ef0774b8dfb2a3d8e784
xtrain
/
crates
History
Gahow Wang
39df0b40c1
gqa: fix kv-proj shape test param indices (embed,attn_norm precede wq)
...
Co-Authored-By: Claude Opus 4.8 <
noreply@anthropic.com
>
2026-06-18 01:38:42 +08:00
..
xtrain-autodiff
gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests)
2026-06-18 01:37:37 +08:00
xtrain-cuda
gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests)
2026-06-18 01:37:37 +08:00
xtrain-distributed
gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests)
2026-06-18 01:37:37 +08:00
xtrain-model
gqa: fix kv-proj shape test param indices (embed,attn_norm precede wq)
2026-06-18 01:38:42 +08:00
xtrain-optim
perf: make xtrain-cuda a regular dep of xtrain-optim (GPU AdamW)
2026-06-15 16:53:52 +08:00
xtrain-tensor
gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests)
2026-06-18 01:37:37 +08:00
xtrain-train
gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests)
2026-06-18 01:37:37 +08:00