Files
xserv/crates
Gahow Wang ce10e4a998 sampling: NaN-safe sample() top-k/top-p path
partial_cmp().unwrap() in the top-k / top-p sort and softmax paths would
panic the engine thread on a single NaN logit. The greedy argmax path
is already NaN-safe. Add a one-pass NaN → -inf sweep on the extracted
last_row before temperature scaling, which is equivalent to masking the
token and keeps the sampler deterministic. Warn once when triggered so
the underlying numeric bug isn't silently hidden.
2026-07-01 15:13:19 +08:00
..
2026-06-18 18:11:58 +08:00
2026-06-18 18:11:58 +08:00