xserv

Files

Gahow Wang 1d0ec32e8d server: Jinja chat template rendering via minijinja

Load the model's chat_template.jinja (or tokenizer_config.json
chat_template field) at startup and render it with minijinja instead of
hardcoded per-model prompt builders.

Custom Jinja functions: strftime_now (date formatting), raise_exception
(template validation errors).  Falls back to Qwen3 ChatML template if
no Jinja template is found.

Removes the hardcoded build_prompt_gpt_oss() — the model's own template
now drives prompt formatting, matching llama.cpp's behavior exactly.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-05-31 13:23:18 +08:00

src

server: Jinja chat template rendering via minijinja

2026-05-31 13:23:18 +08:00

Cargo.toml

server: Jinja chat template rendering via minijinja

2026-05-31 13:23:18 +08:00