This website requires JavaScript.
Explore
Help
Sign In
gahow
/
xtrain
Watch
1
Star
0
Fork
0
You've already forked xtrain
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
980605474ba881e0024973dffb2af76d098f164f
xtrain
/
crates
/
xtrain-model
History
Gahow Wang
39df0b40c1
gqa: fix kv-proj shape test param indices (embed,attn_norm precede wq)
...
Co-Authored-By: Claude Opus 4.8 <
noreply@anthropic.com
>
2026-06-18 01:38:42 +08:00
..
src
gqa: real grouped-query attention (repeat_kv op + both SDPA paths + wiring + tests)
2026-06-18 01:37:37 +08:00
tests
gqa: fix kv-proj shape test param indices (embed,attn_norm precede wq)
2026-06-18 01:38:42 +08:00
build.rs
model: tiny RoPE+RMSNorm+SwiGLU transformer + overfit test
2026-06-15 16:05:20 +08:00
Cargo.toml
model: tiny RoPE+RMSNorm+SwiGLU transformer + overfit test
2026-06-15 16:05:20 +08:00