Cache-size sweep: build_meta is O(|cache|), +85.6 μs / 1k blocks

Follow-up to Microbench 3 that finally tests H5 (cache-size
dependence) and instruments worker-side connector callbacks the
original patch missed.

Patch v2 (apply_step_timing_v2.py) adds:
  scheduler: `cache_size` field in engine_step.jsonl
  worker:    `get_finished_us` + `start_load_kv_us` in worker_step.r0.jsonl
  uses BLOCK_BEGIN/END sentinels for safe multi-line revert
  (the original v1 patch survives this v2's apply/revert cycle)

Driver: continuous open-loop (1.5 req/s, 4096x256 random per req)
that lets APC fill from 0 → ceiling within one vLLM lifetime so a
single run produces the full cache_size sweep. Decode-only steps
are filtered post-hoc to remove prefill-mix variance.

Findings (H20 96GB, ceiling reached ~17.5k blocks; n=15-18k decode
steps per config):

  config         | slope (μs / 1k blocks) | step_dur p50 @ |cache|=16.6k
  ---------------|------------------------|-----------------------------
  mooncake_both  | +85.6                  | 1528 μs (build_meta=1442, 94%)
  noop_connector | -0.8 (≈0)              |  79 μs
  plain          | +1.0 (≈0)              |  84 μs

  Worker-side get_finished p50/p90/p99 (μs/step):
    mooncake_both:  180 / 257 / 333
    noop_connector:   0 /   0 /   2

H5 PASSES. mooncake_both step_duration scales linearly with |cache|
because build_connector_meta walks set(cache.keys()) every step
(`mooncake_connector.py:434-450`). plain and noop are flat.

The previously-uninstrumented get_finished() adds a constant
180 μs/step on top — two `run_coroutine_threadsafe(...).result()`
blocking waits in kv_both mode (`mooncake_connector.py:1107-1137`)
fire every step even when no transfer is pending.

Trace-replay reconciliation (APC ≈ 79% → |cache| ≈ 13k blocks):
  build_meta @ 13k ≈ 1060 μs + get_finished ≈ 180 μs = 1.24 ms/step
  On ~7 ms decode forward → +15-20% TPOT per step.
  This explains most of the trace-replay +25% TPOT p90 gap from
  single-instance per-step cost alone, leaving a smaller residual
  for multi-instance coupling than originally assumed.

Two clear fixes pointed out in REPORT.md:
  1. replace O(|cache|) per-step walk with incremental delta
     listener using block_pool's add/remove callbacks
  2. short-circuit get_finished() when both producer/consumer
     queues are empty in kv_both

Heavy raw artifacts (engine_step.jsonl, vllm_stdout/stderr,
.vllm.pid) are .gitignored — they re-derive from `bash run_all.sh`
and SUMMARY.md / per_config.json fully capture the conclusions.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This commit is contained in:
2026-05-26 23:34:21 +08:00
parent 54de78eb11
commit 8829928fc5
21 changed files with 39354 additions and 0 deletions

View File

@@ -0,0 +1,98 @@
# Cache-size Sweep — testing H5 from connector_tax DESIGN.md
## Hypothesis under test
**H5**: `MooncakeConnectorScheduler.build_connector_meta()` walks
`set(self._block_pool.cached_block_hash_to_block._cache.keys())` every
scheduler step, so `step_duration_us` and `build_meta_us` should
grow **linearly with |cache|** (= the number of cached block-hash
entries in the block pool). The +45 % trace-replay tax is hypothesised
to come from running this O(|cache|) loop at APC ≈ 79 %, which the
prior microbench never tested (random content → cache stays small).
## What we instrument
The original `apply_step_timing.py` only recorded
`step_duration_us` and `build_meta_us`. This sweep adds:
| Field | Source | Why |
|---|---|---|
| `cache_size` | `len(scheduler.kv_cache_manager.block_pool.cached_block_hash_to_block._cache)` | The exact dict that `set(...)` walks |
| `get_finished_us` | wraps `kv_connector.get_finished(...)` in worker mixin | The other suspected cost (two `run_coroutine_threadsafe(...).result()` blocking waits for `kv_both`) |
| `start_load_kv_us` | wraps `kv_connector.start_load_kv(...)` in worker mixin | Mostly fast for `kv_both` w/o transfers, but include for completeness |
Scheduler-side fields go to `engine_step.jsonl` (existing channel).
Worker-side timings go to `worker_step.jsonl` (one file per worker
process).
## Method
For each config in {`plain`, `noop_connector`, `mooncake_both`}:
1. Launch one fresh vLLM (TP=1, H20, max_model_len=200000,
gpu-memory-utilization=0.9, enable_prefix_caching).
2. Read /metrics once to record `kv_cache_max_blocks` (the dict
ceiling).
3. Drive an open-loop stream:
- shape = 4096 in / 256 out
- rate = 2 req/s (kept below saturation to keep step duration
dominated by scheduler-not-queueing)
- content random per request (UUID + hash), zero prefix-cache
hit ⇒ `|cache|` grows monotonically until hit by LRU eviction
- duration = until cache fills (≤ 12 min)
4. Collect `engine_step.jsonl` + `worker_step.jsonl` + the per-request
metrics from `bench_loop.py`.
5. Tear down vLLM, wait for GPU release.
After all three configs:
- Apply LWESS-style binning on (`cache_size`, `step_duration_us`) to
show the curve per config.
- Compute linear fit per config: `step_duration_us ≈ a + b · cache_size`.
- Connector-attributable per-step tax at a given |cache|:
`tax_us(cache_size) = mc_step(cache_size) plain_step(cache_size)`.
- Same decomposition for `build_meta_us` (only mooncake / noop have
non-zero values; plain is 0 by construction).
- For worker side: `get_finished_us` distribution per config; in
`kv_both` mode this should be non-zero even when no transfer fires.
## What "passes" or "fails" H5
- **PASS**: `step_duration_us` (mooncake_both) grows roughly
linearly with |cache|, with slope **> 5 μs / 1 000 blocks** so that
at |cache| ≈ 200 k it is ≥ 1 ms of per-step overhead. `plain`
shows no slope. This matches the source code reading.
- **FAIL**: no measurable slope, or slope is similar for plain and
mooncake_both → the O(|cache|) walk is not the actual cost driver
and we should look elsewhere (e.g. `get_finished` blocking waits,
delay_free overhead).
Either outcome is informative.
## What this sweep does *not* answer
- Multi-instance coupling (8 schedulers running the walk concurrently
vs proxy load-balancing).
- Agentic session structure (long prefix reuse + short uncached tail).
- The 8-instance trace-replay 45 % figure can only be reconciled
once we know the slope and combine with concurrency / coupling
measurements. This sweep is a necessary input, not the full
reconciliation.
## Files
```
cache_sweep/
├── DESIGN.md # this file
├── apply_step_timing_v2.py # extends apply_step_timing.py with cache_size + worker timings
├── run_cache_sweep.py # bench driver: per-config continuous open-loop
├── analyze.py # join engine_step + worker_step, plot, fit
├── run_all.sh # orchestrator (apply patch → run 3 configs → revert → analyze)
└── results/<date>/ # one subdir per run
└── <config>/
├── engine_step.jsonl
├── worker_step.jsonl
├── requests.jsonl
├── summary.json
├── vllm_stdout.log
└── vllm_stderr.log
```

View File

@@ -0,0 +1,153 @@
# Cache-size Sweep — Results
Run: `results/20260526_1507/`
Hardware: H20 96 GB × 1, TP=1, Qwen3-Coder-30B-A3B-Instruct,
`gpu-memory-utilization=0.9`, `enable_prefix_caching=true`.
Cache ceiling reached on this GPU: **17 528 blocks**.
## TL;DR
H5 (build_connector_meta walks `set(cache.keys())` per step, so cost
grows linearly with |cache|) **passes**.
- mooncake_both: step_duration_us p50 grows from **276 μs (cache=2.6k blocks)**
to **1528 μs (cache=16.6k blocks)** — linear fit slope **+85.6 μs / 1 000 blocks**.
- plain: **+1.0 μs / 1 000 blocks** (≈ zero, control).
- noop_connector: **0.8 μs / 1 000 blocks** (≈ zero, control).
`build_connector_meta` accounts for **94 % of the scheduler-side
cost at full cache** (1442 / 1528 μs at the top bin). The vLLM v1
framework dispatch alone (noop_connector vs plain) is **~20 μs**.
The original microbench's **"100 % from build_meta"** claim was an
artefact of *not measuring* the worker-side path. With both sides
measured here, the picture is:
| cost component | mooncake_both (μs/step) | scaling |
|---|---:|---|
| **scheduler `build_connector_meta`** | 207 (cache=2.6k) → **1442 (cache=16.6k)** | **O(\|cache\|)** |
| **worker `get_finished()`** | **p50 = 180 μs, p99 = 333 μs** (independent of \|cache\|) | constant |
| **worker `start_load_kv()`** | p50 = 2-5 μs | constant |
| **framework dispatch** (noopplain) | ≈ 20 μs | constant |
So the previously-uninstrumented `get_finished()` adds another **180 μs
per step on top** of the cache-dependent build_meta. At low cache size
that's the dominant connector cost; at high cache size it's
overshadowed by build_meta but still adds ~10 %.
## The figure
![per-step time vs cache_size](figure.png)
Left: full step time. Right: just the `build_connector_meta`
component. plain and noop stay flat at ~80 μs across the whole range;
mooncake_both rises near-linearly.
## How this changes the trace-replay reconciliation
The 8-instance trace replay (`analysis/characterization/elastic_migration_v2`)
ran with APC ≈ 79 %, i.e. each instance's block pool held **~13 000
blocks**. Plugging that into the fit:
```
mooncake build_meta @ |cache|=13 000 ≈ 1060 μs / step
mooncake get_finished ≈ 180 μs / step
total per-step connector cost ≈ 1240 μs ≈ 1.24 ms / step
```
Decode-step model forward on Qwen3-Coder-30B-A3B / H20 is ~6-9 ms
TPOT, so 1.24 ms of extra scheduler-and-worker time per step is a
**+15-20 % TPOT inflation** purely from the per-step connector cost —
before any inter-instance coupling.
This matches the trace-replay TPOT p90 +25 % gap quite well. The
**residual ~7 pp** can be attributed to:
1. **Block-pool LRU churn under capacity pressure** (random-content
bench reaches ceiling quickly; trace-replay holds at ceiling
for the full session-coupled workload).
2. **Block-lifecycle changes** (`delay_free_blocks=True` once any
connector is loaded; the freed-block backlog is larger under
high APC).
3. **Multi-instance scheduler coupling**: the slowest scheduler in
8-way load_only sets the proxy's batch latency.
For the **+45 % TTFT p90 gap**, the same scheduler tax compounds
across many chunked-prefill steps. A 50-step prefill at 1.24 ms extra
each step is +62 ms, which is on the order of the typical TTFT delta
we see at moderate load.
## How this changes the "decomposition" attribution
The original RESULTS.md said:
> +7-9 % from build_connector_meta per-step cost (this microbench)
> +20-30 % from multi-instance coupling amplification (not measurable)
> remainder from large-cache O(\|cache\|) scaling (Phase B follow-up)
The cache-size sweep replaces the third row with a measurement and
forces the first row to be re-read:
| factor | original claim | revised |
|---|---|---|
| single-instance high-conc tax | +7-9 % | unchanged — that was measured at low \|cache\| |
| multi-instance coupling | +20-30 % | still un-measured, but a *smaller* slice than thought |
| large-cache O(\|cache\|) scaling | "likely 2-3×" | **measured: +85.6 μs/1k blocks; ≈ 1 ms/step at \|cache\|=13k** |
| worker-side get_finished | not in the model | **measured: +180 μs/step (constant)** |
The "trace-replay 45 % TTFT p90" is now explainable mostly from
cache-size + worker get_finished + framework dispatch, without
having to invoke a large multi-instance coupling term. The data is
also consistent with NIXL's smaller tax (NIXL doesn't walk the
block-pool dict in scheduler.build_connector_meta; the trace-replay
NIXL vs plain gap of +38 % is consistent with "only the get_finished
+ framework constant" parts, lacking the O(\|cache\|) component).
## What this still doesn't settle
1. **Multi-instance coupling**: the 8-instance run would need its
own cache-size sweep + per-instance step instrumentation. We
know the per-instance per-step cost; what we don't know is how
that propagates through the cache-aware proxy's routing
decisions.
2. **Larger \|cache\| extrapolation**: H20 96 GB caps at ~17.5 k
blocks at the configured memory. Settings with smaller models
(or `gpu-memory-utilization` ≥ 0.95 on bigger GPUs) reach
higher \|cache\|. The slope is linear in this range, but we
have not extrapolated past ~17 k.
3. **NIXL slope**: NIXL was in the prior microbench's plan but
not in this run. Same instrumentation on NIXL would confirm
whether NIXL has a different (smaller) slope.
## Practical recommendation
The root cause is clearly identifiable: the per-scheduler-step
`set(self._block_pool.cached_block_hash_to_block._cache.keys())` walk
in `mooncake_connector.py:434-450`. Replacing it with an incremental
delta listener (using the block-pool's existing
`add`/`remove`/`evict` callbacks) would zero out the cache-size
slope and bring mooncake_both into the same ballpark as noop_connector
on the scheduler side.
The worker-side `get_finished` cost (180 μs constant) is also
fixable: in `kv_both` mode it submits two empty `coroutine_threadsafe`
futures every step. Caching/coalescing or short-circuiting when both
queues are empty would eliminate this constant.
## Reproducibility
```
cd microbench/connector_tax/cache_sweep
bash run_all.sh # ~22 min on H20 single-GPU
```
The orchestrator applies v1 + v2 patches, runs the three configs
sequentially, reverts both patches on exit, and produces
`results/<date>/SUMMARY.md` + `figure.png`.
Artifacts in `results/20260526_1507/`:
- `figure.png` — the headline plot
- `SUMMARY.md` — per-config tables (this report's source)
- `per_config.json` — machine-readable
- per-config: `engine_step.jsonl`, `worker_step.r0.jsonl`,
`requests.jsonl`, `metrics_final.txt`, vLLM stdout/stderr

View File

@@ -0,0 +1,300 @@
#!/usr/bin/env python3
"""Analyse cache-size sweep results.
For each subdir under --run-root containing engine_step.jsonl:
- read all per-step records
- bin by cache_size
- report median/p90 of step_duration_us, build_meta_us per bin
- fit step_duration_us ~ a + b * cache_size (linear least squares)
- tabulate connector tax(cache_size) = mc_step - plain_step (if plain present)
- optionally render matplotlib plot if matplotlib available
Outputs:
results/<date>/SUMMARY.md human report
results/<date>/per_config.json machine-readable
results/<date>/figure.png (optional)
"""
import argparse
import json
import statistics
from pathlib import Path
def load_steps(p: Path):
rows = []
with open(p) as f:
for line in f:
line = line.strip()
if not line:
continue
try:
rows.append(json.loads(line))
except Exception:
pass
return rows
def percentile(xs, p):
if not xs:
return None
xs = sorted(xs)
k = max(0, min(len(xs) - 1, int(p / 100.0 * (len(xs) - 1))))
return xs[k]
def linfit(xs, ys):
"""Tiny linear least squares. Returns (slope, intercept)."""
n = len(xs)
if n < 2:
return None, None
sx = sum(xs); sy = sum(ys); sxx = sum(x*x for x in xs)
sxy = sum(x*y for x, y in zip(xs, ys))
denom = n * sxx - sx * sx
if denom == 0:
return None, None
b = (n * sxy - sx * sy) / denom
a = (sy - b * sx) / n
return b, a
def bucket(rows, key="cache_size", n_bins=10):
"""Equal-width bin on cache_size; returns dict bin_id -> list of rows."""
if not rows:
return {}
vmax = max(r.get(key, 0) for r in rows)
if vmax <= 0:
return {}
width = vmax / n_bins
out: dict[int, list[dict]] = {}
for r in rows:
v = r.get(key, 0)
bid = min(n_bins - 1, max(0, int(v / width))) if width > 0 else 0
out.setdefault(bid, []).append(r)
return out, width
def analyse_config(cfg_name: str, cfg_dir: Path):
eng_path = cfg_dir / "engine_step.jsonl"
if not eng_path.exists() or eng_path.stat().st_size == 0:
return None
raw = load_steps(eng_path)
if not raw:
return None
# Filter: skip first 500 steps (cold start), and steps with no cache_size.
base = [r for r in raw[500:] if r.get("cache_size", -1) >= 0]
if not base:
return None
# Decode-only filter: steps where the scheduler did NOT touch any
# new/resumed request and total tokens == n_running_total (each running
# request emits exactly one token). This gives the cleanest per-step
# baseline since prefill chunks dominate step time at high token counts.
decode_only = [r for r in base
if r.get("prefill_tokens", 0) == 0
and r.get("decode_tokens", 0) > 0]
# Fall back to "all post-warmup" if decode-only is too sparse
rows = decode_only if len(decode_only) >= 200 else base
decode_share = len(decode_only) / max(1, len(base))
cache_max = max(r["cache_size"] for r in rows)
bins, width = bucket(rows, n_bins=10)
per_bin = []
for bid in sorted(bins):
rs = bins[bid]
sd = [r["step_duration_us"] for r in rs if "step_duration_us" in r]
bm = [r.get("build_meta_us", 0) for r in rs]
cs = [r["cache_size"] for r in rs]
per_bin.append({
"bin_id": bid,
"cache_size_mid": (bid + 0.5) * width,
"n": len(rs),
"cache_size_p50": percentile(cs, 50),
"step_duration_us_p50": percentile(sd, 50),
"step_duration_us_p90": percentile(sd, 90),
"build_meta_us_p50": percentile(bm, 50),
"build_meta_us_p90": percentile(bm, 90),
})
# Fit per-step duration vs cache size on all records (not bin averages)
sd_b, sd_a = linfit([r["cache_size"] for r in rows if "step_duration_us" in r],
[r["step_duration_us"] for r in rows if "step_duration_us" in r])
bm_b, bm_a = linfit([r["cache_size"] for r in rows if "build_meta_us" in r],
[r.get("build_meta_us", 0) for r in rows if "build_meta_us" in r])
# Worker-side timings if available. Filename is `worker_step.r<rank>.jsonl`
# because os.path.splitext keeps the .jsonl extension.
worker_path = None
for c in sorted(cfg_dir.glob("worker_step.r*.jsonl")):
worker_path = c
break
if worker_path is None:
worker_path = cfg_dir / "missing"
worker_summary = None
if worker_path.exists() and worker_path.stat().st_size > 0:
wrows = load_steps(worker_path)
if wrows:
gf = [r["get_finished_us"] for r in wrows if "get_finished_us" in r]
sl = [r["start_load_kv_us"] for r in wrows if "start_load_kv_us" in r]
worker_summary = {
"n": len(wrows),
"get_finished_us_p50": percentile(gf, 50),
"get_finished_us_p90": percentile(gf, 90),
"get_finished_us_p99": percentile(gf, 99),
"start_load_kv_us_p50": percentile(sl, 50),
"start_load_kv_us_p90": percentile(sl, 90),
}
return {
"config": cfg_name,
"n_steps_total": len(raw),
"n_steps_after_warmup": len(base),
"n_steps_decode_only": len(decode_only),
"decode_share": decode_share,
"rows_used_for_fit": "decode_only" if rows is decode_only else "all_post_warmup",
"cache_size_max": cache_max,
"per_bin": per_bin,
"fit_step_duration": {"slope_us_per_block": sd_b, "intercept_us": sd_a},
"fit_build_meta": {"slope_us_per_block": bm_b, "intercept_us": bm_a},
"worker_summary": worker_summary,
}
def render(root: Path, all_cfg: dict):
lines = ["# Cache-size sweep — summary\n"]
lines.append(f"Run root: `{root}`\n")
lines.append("## Per-config fit (`step_duration_us ≈ a + b · cache_size`)\n")
lines.append("| config | n steps | cache max | step_dur p50 (μs) | build_meta p50 (μs) | slope (μs / 1k blocks) | intercept (μs) |")
lines.append("|---|---:|---:|---:|---:|---:|---:|")
for cfg, r in all_cfg.items():
if r is None:
lines.append(f"| {cfg} | — | — | — | — | — | — |")
continue
last_bin = r["per_bin"][-1] if r["per_bin"] else {}
slope = r["fit_step_duration"]["slope_us_per_block"]
intercept = r["fit_step_duration"]["intercept_us"]
slope1k = (slope or 0) * 1000
lines.append(
f"| {cfg} | {r['n_steps_after_warmup']} | {r['cache_size_max']} | "
f"{last_bin.get('step_duration_us_p50','-')} | "
f"{last_bin.get('build_meta_us_p50','-')} | "
f"{slope1k:.1f} | {intercept:.1f} |"
if slope is not None else
f"| {cfg} | {r['n_steps_after_warmup']} | {r['cache_size_max']} | - | - | - | - |"
)
# Per-bin tables
for cfg, r in all_cfg.items():
if r is None:
continue
lines.append(f"\n### {cfg} — per-bin\n")
lines.append("| bin | cache mid | n | step_dur p50 | step_dur p90 | build_meta p50 | build_meta p90 |")
lines.append("|---:|---:|---:|---:|---:|---:|---:|")
for b in r["per_bin"]:
lines.append(
f"| {b['bin_id']} | {b['cache_size_mid']:.0f} | {b['n']} | "
f"{b['step_duration_us_p50']} | {b['step_duration_us_p90']} | "
f"{b['build_meta_us_p50']} | {b['build_meta_us_p90']} |"
)
if r["worker_summary"]:
w = r["worker_summary"]
lines.append(f"\n*worker side (n={w['n']})* — "
f"get_finished p50/p90/p99 = "
f"{w['get_finished_us_p50']}/{w['get_finished_us_p90']}/"
f"{w['get_finished_us_p99']} μs; "
f"start_load_kv p50/p90 = "
f"{w['start_load_kv_us_p50']}/{w['start_load_kv_us_p90']} μs\n")
# Tax vs cache for mc vs plain
plain = all_cfg.get("plain")
mc = all_cfg.get("mooncake_both")
noop = all_cfg.get("noop_connector")
if plain and mc:
lines.append("\n## Connector tax(cache_size) — mooncake_both vs plain\n")
lines.append("| bin | cache mid | plain step p50 | mc step p50 | tax (μs) | tax (%) |")
lines.append("|---:|---:|---:|---:|---:|---:|")
for bp, bm in zip(plain["per_bin"], mc["per_bin"]):
if bp["step_duration_us_p50"] and bm["step_duration_us_p50"]:
tax = bm["step_duration_us_p50"] - bp["step_duration_us_p50"]
pct = tax / bp["step_duration_us_p50"] * 100
lines.append(
f"| {bp['bin_id']} | {bp['cache_size_mid']:.0f} | "
f"{bp['step_duration_us_p50']} | {bm['step_duration_us_p50']} | "
f"{tax:+d} | {pct:+.1f} |"
)
if plain and noop:
# Framework cost: noop_connector tax = pure dispatch
lines.append("\n## Framework cost — noop_connector vs plain\n")
lines.append("| bin | cache mid | plain step p50 | noop step p50 | tax (μs) |")
lines.append("|---:|---:|---:|---:|---:|")
for bp, bn in zip(plain["per_bin"], noop["per_bin"]):
if bp["step_duration_us_p50"] and bn["step_duration_us_p50"]:
tax = bn["step_duration_us_p50"] - bp["step_duration_us_p50"]
lines.append(
f"| {bp['bin_id']} | {bp['cache_size_mid']:.0f} | "
f"{bp['step_duration_us_p50']} | {bn['step_duration_us_p50']} | "
f"{tax:+d} |"
)
out_md = root / "SUMMARY.md"
out_md.write_text("\n".join(lines) + "\n")
out_json = root / "per_config.json"
out_json.write_text(json.dumps(all_cfg, indent=2, default=str))
print(f" wrote {out_md}")
print(f" wrote {out_json}")
# Optional plot
try:
import matplotlib
matplotlib.use("Agg")
import matplotlib.pyplot as plt
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(13, 5))
colors = {"plain": "tab:blue", "noop_connector": "tab:orange",
"mooncake_both": "tab:red"}
for cfg, r in all_cfg.items():
if r is None: continue
xs = [b["cache_size_mid"] for b in r["per_bin"]]
ys = [b["step_duration_us_p50"] or 0 for b in r["per_bin"]]
zs = [b["build_meta_us_p50"] or 0 for b in r["per_bin"]]
c = colors.get(cfg, None)
ax1.plot(xs, ys, marker="o", label=cfg, color=c)
ax2.plot(xs, zs, marker="s", label=cfg, color=c)
ax1.set_xlabel("cache_size (blocks)")
ax1.set_ylabel("step_duration_us p50")
ax1.set_title("Per-step scheduler time vs |cache|")
ax1.legend(); ax1.grid(True, alpha=0.3)
ax2.set_xlabel("cache_size (blocks)")
ax2.set_ylabel("build_meta_us p50")
ax2.set_title("build_connector_meta time vs |cache|")
ax2.legend(); ax2.grid(True, alpha=0.3)
fig.tight_layout()
fig.savefig(root / "figure.png", dpi=120)
print(f" wrote {root/'figure.png'}")
except Exception as e:
print(f" (skipped plot: {e})")
def main():
ap = argparse.ArgumentParser()
ap.add_argument("--run-root", type=Path, required=True)
args = ap.parse_args()
cfgs = {}
for d in sorted(args.run_root.iterdir()):
if not d.is_dir(): continue
r = analyse_config(d.name, d)
cfgs[d.name] = r
if r:
sl = r["fit_step_duration"]["slope_us_per_block"]
print(f" {d.name}: n={r['n_steps_after_warmup']} "
f"cache_max={r['cache_size_max']} "
f"slope={(sl or 0)*1000:.2f} μs/1k blocks")
else:
print(f" {d.name}: no data")
render(args.run_root, cfgs)
if __name__ == "__main__":
main()

View File

@@ -0,0 +1,251 @@
#!/usr/bin/env python3
"""Cache-sweep patch v2.
Extends `microbench/connector_tax/patches/apply_step_timing.py` with:
scheduler.py → adds `cache_size` field (block_pool dict len) to
the per-step JSONL.
kv_connector_model_runner_mixin.py
→ wraps `kv_connector.start_load_kv(...)` and
`kv_connector.get_finished(...)` to record their
duration into per-process `worker_step.jsonl`
(path from env var `CT_WORKER_STEP_LOG_PATH`).
All inserts carry the marker `# CT_CACHE_SWEEP_PATCH` so revert is
"delete every line that contains the marker". The original
`CONNECTOR_TAX_PATCH` lines (step_duration_us, build_meta_us) are
left alone.
Usage:
python apply_step_timing_v2.py --apply [--vllm-root PATH]
python apply_step_timing_v2.py --revert [--vllm-root PATH]
"""
import argparse
import re
import sys
from pathlib import Path
MARKER = "# CT_CACHE_SWEEP_PATCH"
BLOCK_BEGIN = f"{MARKER}_BLOCK_BEGIN"
BLOCK_END = f"{MARKER}_BLOCK_END"
# Default vLLM is the in-tree third_party copy; on dash0 the .venv links to it.
DEFAULT_VLLM_ROOT = Path(
"/home/admin/cpfs/wjh/agentic-kv/.venv/lib/python3.12/site-packages/vllm"
)
def already_patched(text: str) -> bool:
return MARKER in text
def revert_text(text: str) -> str:
"""Two pass revert:
1. drop everything between BLOCK_BEGIN / BLOCK_END (inclusive)
2. drop any remaining single line that contains MARKER
Order matters so single-line MARKERs inside a block don't shrink-wrap.
"""
out_lines = []
inside_block = False
for line in text.splitlines():
if BLOCK_BEGIN in line:
inside_block = True
continue
if BLOCK_END in line:
inside_block = False
continue
if inside_block:
continue
if MARKER in line:
continue
out_lines.append(line)
return "\n".join(out_lines) + ("\n" if text.endswith("\n") else "")
# ── scheduler.py: add cache_size field to record dict ───────────────────────
def patch_scheduler(text: str) -> str:
if already_patched(text):
print(" scheduler.py already has CT_CACHE_SWEEP_PATCH, skipping")
return text
# Inject `cache_size` field into the record built by
# `_agentic_emit_step_log`. We piggy-back on the existing
# CONNECTOR_TAX_PATCH lines that already inject `step_duration_us`
# and `build_meta_us`. Insert immediately after `"t_unix": _time.time(),`.
pat = (
r"( record = \{\n"
r"(?:[^\n]*CONNECTOR_TAX_PATCH\n)*"
r" \"t_unix\": _time\.time\(\),\n)"
)
def repl(m):
return (
m.group(1)
+ " \"cache_size\": _ct_cache_size(self), " + MARKER + "\n"
)
text, n = re.subn(pat, repl, text, count=1)
if n == 0:
raise RuntimeError(
"Failed to inject cache_size — is the base CONNECTOR_TAX_PATCH "
"applied?"
)
# Add a helper at module bottom (only once). Wrap in block sentinels
# so revert removes the whole block, not just lines that happen to
# contain MARKER.
helper = (
f"\n# {BLOCK_BEGIN}\n"
f"def _ct_cache_size(scheduler):\n"
f" try:\n"
f" return len(scheduler.kv_cache_manager.block_pool.cached_block_hash_to_block._cache)\n"
f" except Exception:\n"
f" return -1\n"
f"# {BLOCK_END}\n"
)
if BLOCK_BEGIN not in text:
text = text.rstrip() + "\n" + helper
return text
# ── worker mixin: time start_load_kv + get_finished ─────────────────────────
def patch_worker_mixin(text: str) -> str:
if already_patched(text):
print(" kv_connector_model_runner_mixin.py already patched, skipping")
return text
# Wrap the start_load_kv call.
pat = (
r" kv_connector\.start_load_kv\(get_forward_context\(\)\)\n"
)
repl = (
" _ct_t0 = __import__('time').perf_counter_ns() " + MARKER + "\n"
" kv_connector.start_load_kv(get_forward_context())\n"
" _ct_start_load_kv_us = (__import__('time').perf_counter_ns() - _ct_t0) // 1000 " + MARKER + "\n"
)
text, n = re.subn(pat, repl, text, count=1)
if n == 0:
raise RuntimeError("Failed to patch start_load_kv call")
# Wrap the get_finished call. We assign into local _ct_get_finished_us
# in the same `try/finally` block where get_finished is called.
pat = (
r" output\.finished_sending, output\.finished_recving = \(\n"
r" kv_connector\.get_finished\(scheduler_output\.finished_req_ids\)\n"
r" \)\n"
)
repl = (
" _ct_t1 = __import__('time').perf_counter_ns() " + MARKER + "\n"
" output.finished_sending, output.finished_recving = (\n"
" kv_connector.get_finished(scheduler_output.finished_req_ids)\n"
" )\n"
" _ct_get_finished_us = (__import__('time').perf_counter_ns() - _ct_t1) // 1000 " + MARKER + "\n"
" _ct_emit_worker_step(_ct_start_load_kv_us, _ct_get_finished_us) " + MARKER + "\n"
)
text, n = re.subn(pat, repl, text, count=1)
if n == 0:
raise RuntimeError("Failed to patch get_finished call")
# Add the emitter at module bottom. Lazy-opens worker_step.jsonl on
# the first call. Wrapped in block sentinels so revert is clean.
emitter = f"""
# {BLOCK_BEGIN}
_ct_worker_fh = None
_ct_worker_init_done = False
_ct_worker_step_id = 0
def _ct_emit_worker_step(start_load_kv_us, get_finished_us):
global _ct_worker_fh, _ct_worker_init_done, _ct_worker_step_id
import os, json, time
if not _ct_worker_init_done:
_ct_worker_init_done = True
path = os.environ.get("CT_WORKER_STEP_LOG_PATH")
if not path:
return
try:
rank = os.environ.get("RANK", os.environ.get("LOCAL_RANK", "0"))
base, ext = os.path.splitext(path)
full = f"{{base}}.r{{rank}}{{ext or '.jsonl'}}"
_ct_worker_fh = open(full, "a", buffering=1)
except Exception:
_ct_worker_fh = None
fh = _ct_worker_fh
if fh is None:
return
try:
fh.write(json.dumps({{
"t_unix": time.time(),
"step_id": _ct_worker_step_id,
"start_load_kv_us": int(start_load_kv_us),
"get_finished_us": int(get_finished_us),
}}) + "\\n")
_ct_worker_step_id += 1
except Exception:
try:
fh.close()
except Exception:
pass
_ct_worker_fh = None
# {BLOCK_END}
"""
if BLOCK_BEGIN not in text:
text = text.rstrip() + "\n" + emitter
return text
# ── driver ──────────────────────────────────────────────────────────────────
def apply_to_file(path: Path, fn) -> bool:
if not path.exists():
print(f" SKIP {path} (not found)")
return False
orig = path.read_text()
new = fn(orig)
if new == orig:
print(f" unchanged: {path}")
return False
path.write_text(new)
print(f" patched ({new.count(MARKER)} marks): {path}")
return True
def revert_file(path: Path) -> bool:
if not path.exists():
return False
orig = path.read_text()
new = revert_text(orig)
if new == orig:
print(f" no marks: {path}")
return False
path.write_text(new)
print(f" reverted: {path}")
return True
def main():
ap = argparse.ArgumentParser()
ap.add_argument("--apply", action="store_true")
ap.add_argument("--revert", action="store_true")
ap.add_argument("--vllm-root", type=Path, default=DEFAULT_VLLM_ROOT)
args = ap.parse_args()
if not (args.apply ^ args.revert):
ap.error("Specify exactly one of --apply / --revert")
sched = args.vllm_root / "v1/core/sched/scheduler.py"
mixin = args.vllm_root / "v1/worker/kv_connector_model_runner_mixin.py"
if args.apply:
print(f"Applying CT_CACHE_SWEEP_PATCH to {args.vllm_root}")
apply_to_file(sched, patch_scheduler)
apply_to_file(mixin, patch_worker_mixin)
else:
print(f"Reverting CT_CACHE_SWEEP_PATCH from {args.vllm_root}")
revert_file(sched)
revert_file(mixin)
if __name__ == "__main__":
main()

View File

@@ -0,0 +1,6 @@
# Heavy raw artifacts — re-derivable from a re-run.
# Per-bin summaries are preserved in SUMMARY.md and per_config.json.
**/engine_step.jsonl
**/vllm_stdout.log
**/vllm_stderr.log
**/.vllm.pid

View File

@@ -0,0 +1,85 @@
# Cache-size sweep — summary
Run root: `microbench/connector_tax/cache_sweep/results/20260526_1507`
## Per-config fit (`step_duration_us ≈ a + b · cache_size`)
| config | n steps | cache max | step_dur p50 (μs) | build_meta p50 (μs) | slope (μs / 1k blocks) | intercept (μs) |
|---|---:|---:|---:|---:|---:|---:|
| mooncake_both | 17675 | 17528 | 1528 | 1442 | 85.6 | 194.4 |
| noop_connector | 16127 | 17529 | 79 | 0 | -0.8 | 102.6 |
| plain | 15692 | 17529 | 84 | 0 | 1.0 | 74.7 |
### mooncake_both — per-bin
| bin | cache mid | n | step_dur p50 | step_dur p90 | build_meta p50 | build_meta p90 |
|---:|---:|---:|---:|---:|---:|---:|
| 1 | 2629 | 282 | 276 | 430 | 207 | 339 |
| 2 | 4382 | 115 | 763 | 993 | 609 | 831 |
| 3 | 6135 | 120 | 960 | 1055 | 815 | 889 |
| 4 | 7888 | 248 | 972 | 1254 | 834 | 1102 |
| 5 | 9640 | 508 | 941 | 1356 | 860 | 1237 |
| 6 | 11393 | 220 | 1025 | 1456 | 943 | 1352 |
| 7 | 13146 | 679 | 1135 | 1616 | 1060 | 1515 |
| 8 | 14899 | 308 | 1261 | 1881 | 1174 | 1769 |
| 9 | 16652 | 14849 | 1528 | 2208 | 1442 | 2079 |
*worker side (n=18175)* — get_finished p50/p90/p99 = 180/257/333 μs; start_load_kv p50/p90 = 2/5 μs
### noop_connector — per-bin
| bin | cache mid | n | step_dur p50 | step_dur p90 | build_meta p50 | build_meta p90 |
|---:|---:|---:|---:|---:|---:|---:|
| 2 | 4382 | 270 | 100 | 126 | 0 | 1 |
| 3 | 6135 | 173 | 110 | 128 | 0 | 1 |
| 4 | 7888 | 437 | 84 | 116 | 0 | 1 |
| 5 | 9641 | 17 | 101 | 123 | 0 | 1 |
| 6 | 11394 | 408 | 130 | 164 | 0 | 1 |
| 7 | 13147 | 458 | 73 | 100 | 0 | 1 |
| 8 | 14900 | 621 | 67 | 101 | 0 | 1 |
| 9 | 16653 | 13386 | 79 | 139 | 0 | 0 |
*worker side (n=16627)* — get_finished p50/p90/p99 = 0/0/2 μs; start_load_kv p50/p90 = 0/1 μs
### plain — per-bin
| bin | cache mid | n | step_dur p50 | step_dur p90 | build_meta p50 | build_meta p90 |
|---:|---:|---:|---:|---:|---:|---:|
| 1 | 2629 | 555 | 59 | 79 | 0 | 0 |
| 2 | 4382 | 505 | 69 | 95 | 0 | 0 |
| 3 | 6135 | 173 | 74 | 99 | 0 | 0 |
| 4 | 7888 | 132 | 139 | 185 | 0 | 0 |
| 5 | 9641 | 241 | 125 | 161 | 0 | 0 |
| 6 | 11394 | 267 | 82 | 104 | 0 | 0 |
| 7 | 13147 | 402 | 91 | 128 | 0 | 0 |
| 8 | 14900 | 85 | 136 | 163 | 0 | 0 |
| 9 | 16653 | 12973 | 84 | 141 | 0 | 0 |
## Connector tax(cache_size) — mooncake_both vs plain
| bin | cache mid | plain step p50 | mc step p50 | tax (μs) | tax (%) |
|---:|---:|---:|---:|---:|---:|
| 1 | 2629 | 59 | 276 | +217 | +367.8 |
| 2 | 4382 | 69 | 763 | +694 | +1005.8 |
| 3 | 6135 | 74 | 960 | +886 | +1197.3 |
| 4 | 7888 | 139 | 972 | +833 | +599.3 |
| 5 | 9641 | 125 | 941 | +816 | +652.8 |
| 6 | 11394 | 82 | 1025 | +943 | +1150.0 |
| 7 | 13147 | 91 | 1135 | +1044 | +1147.3 |
| 8 | 14900 | 136 | 1261 | +1125 | +827.2 |
| 9 | 16653 | 84 | 1528 | +1444 | +1719.0 |
## Framework cost — noop_connector vs plain
| bin | cache mid | plain step p50 | noop step p50 | tax (μs) |
|---:|---:|---:|---:|---:|
| 1 | 2629 | 59 | 100 | +41 |
| 2 | 4382 | 69 | 110 | +41 |
| 3 | 6135 | 74 | 84 | +10 |
| 4 | 7888 | 139 | 101 | -38 |
| 5 | 9641 | 125 | 130 | +5 |
| 6 | 11394 | 82 | 73 | -9 |
| 7 | 13147 | 91 | 67 | -24 |
| 8 | 14900 | 136 | 79 | -57 |

Binary file not shown.

After

Width:  |  Height:  |  Size: 86 KiB

View File

@@ -0,0 +1,624 @@
# HELP python_gc_objects_collected_total Objects collected during gc
# TYPE python_gc_objects_collected_total counter
python_gc_objects_collected_total{generation="0"} 11970.0
python_gc_objects_collected_total{generation="1"} 1549.0
python_gc_objects_collected_total{generation="2"} 855.0
# HELP python_gc_objects_uncollectable_total Uncollectable objects found during GC
# TYPE python_gc_objects_uncollectable_total counter
python_gc_objects_uncollectable_total{generation="0"} 0.0
python_gc_objects_uncollectable_total{generation="1"} 0.0
python_gc_objects_uncollectable_total{generation="2"} 0.0
# HELP python_gc_collections_total Number of times this generation was collected
# TYPE python_gc_collections_total counter
python_gc_collections_total{generation="0"} 1331.0
python_gc_collections_total{generation="1"} 121.0
python_gc_collections_total{generation="2"} 9.0
# HELP python_info Python platform information
# TYPE python_info gauge
python_info{implementation="CPython",major="3",minor="12",patchlevel="3",version="3.12.3"} 1.0
# HELP process_virtual_memory_bytes Virtual memory size in bytes.
# TYPE process_virtual_memory_bytes gauge
process_virtual_memory_bytes 4.0688443392e+010
# HELP process_resident_memory_bytes Resident memory size in bytes.
# TYPE process_resident_memory_bytes gauge
process_resident_memory_bytes 1.380319232e+09
# HELP process_start_time_seconds Start time of the process since unix epoch in seconds.
# TYPE process_start_time_seconds gauge
process_start_time_seconds 1.77980878561e+09
# HELP process_cpu_seconds_total Total user and system CPU time spent in seconds.
# TYPE process_cpu_seconds_total counter
process_cpu_seconds_total 40.3
# HELP process_open_fds Number of open file descriptors.
# TYPE process_open_fds gauge
process_open_fds 67.0
# HELP process_max_fds Maximum number of open file descriptors.
# TYPE process_max_fds gauge
process_max_fds 1.048575e+06
# HELP vllm:estimated_flops_per_gpu_total Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_flops_per_gpu_total counter
vllm:estimated_flops_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:estimated_flops_per_gpu_created Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_flops_per_gpu_created gauge
vllm:estimated_flops_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920332582e+09
# HELP vllm:estimated_read_bytes_per_gpu_total Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_read_bytes_per_gpu_total counter
vllm:estimated_read_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:estimated_read_bytes_per_gpu_created Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_read_bytes_per_gpu_created gauge
vllm:estimated_read_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920332909e+09
# HELP vllm:estimated_write_bytes_per_gpu_total Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_write_bytes_per_gpu_total counter
vllm:estimated_write_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:estimated_write_bytes_per_gpu_created Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_write_bytes_per_gpu_created gauge
vllm:estimated_write_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920333076e+09
# HELP vllm:num_requests_running Number of requests in model execution batches.
# TYPE vllm:num_requests_running gauge
vllm:num_requests_running{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:num_requests_waiting Number of requests waiting to be processed.
# TYPE vllm:num_requests_waiting gauge
vllm:num_requests_waiting{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:engine_sleep_state Engine sleep state; awake = 0 means engine is sleeping; awake = 1 means engine is awake; weights_offloaded = 1 means sleep level 1; discard_all = 1 means sleep level 2.
# TYPE vllm:engine_sleep_state gauge
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="awake"} 1.0
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="weights_offloaded"} 0.0
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="discard_all"} 0.0
# HELP vllm:kv_cache_usage_perc KV-cache usage. 1 means 100 percent usage.
# TYPE vllm:kv_cache_usage_perc gauge
vllm:kv_cache_usage_perc{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prefix_cache_queries_total Prefix cache queries, in terms of number of queried tokens.
# TYPE vllm:prefix_cache_queries_total counter
vllm:prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.344332e+06
# HELP vllm:prefix_cache_queries_created Prefix cache queries, in terms of number of queried tokens.
# TYPE vllm:prefix_cache_queries_created gauge
vllm:prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920334654e+09
# HELP vllm:prefix_cache_hits_total Prefix cache hits, in terms of number of cached tokens.
# TYPE vllm:prefix_cache_hits_total counter
vllm:prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prefix_cache_hits_created Prefix cache hits, in terms of number of cached tokens.
# TYPE vllm:prefix_cache_hits_created gauge
vllm:prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920334804e+09
# HELP vllm:external_prefix_cache_queries_total External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
# TYPE vllm:external_prefix_cache_queries_total counter
vllm:external_prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.344332e+06
# HELP vllm:external_prefix_cache_queries_created External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
# TYPE vllm:external_prefix_cache_queries_created gauge
vllm:external_prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920334933e+09
# HELP vllm:external_prefix_cache_hits_total External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
# TYPE vllm:external_prefix_cache_hits_total counter
vllm:external_prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:external_prefix_cache_hits_created External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
# TYPE vllm:external_prefix_cache_hits_created gauge
vllm:external_prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920335064e+09
# HELP vllm:mm_cache_queries_total Multi-modal cache queries, in terms of number of queried items.
# TYPE vllm:mm_cache_queries_total counter
vllm:mm_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:mm_cache_queries_created Multi-modal cache queries, in terms of number of queried items.
# TYPE vllm:mm_cache_queries_created gauge
vllm:mm_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808892033517e+09
# HELP vllm:mm_cache_hits_total Multi-modal cache hits, in terms of number of cached items.
# TYPE vllm:mm_cache_hits_total counter
vllm:mm_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:mm_cache_hits_created Multi-modal cache hits, in terms of number of cached items.
# TYPE vllm:mm_cache_hits_created gauge
vllm:mm_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920335355e+09
# HELP vllm:num_preemptions_total Cumulative number of preemption from the engine.
# TYPE vllm:num_preemptions_total counter
vllm:num_preemptions_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:num_preemptions_created Cumulative number of preemption from the engine.
# TYPE vllm:num_preemptions_created gauge
vllm:num_preemptions_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920335562e+09
# HELP vllm:prompt_tokens_total Number of prefill tokens processed.
# TYPE vllm:prompt_tokens_total counter
vllm:prompt_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.344332e+06
# HELP vllm:prompt_tokens_created Number of prefill tokens processed.
# TYPE vllm:prompt_tokens_created gauge
vllm:prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.77980889203357e+09
# HELP vllm:prompt_tokens_by_source_total Number of prompt tokens by source.
# TYPE vllm:prompt_tokens_by_source_total counter
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.344332e+06
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 0.0
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 0.0
# HELP vllm:prompt_tokens_by_source_created Number of prompt tokens by source.
# TYPE vllm:prompt_tokens_by_source_created gauge
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.7798088920335863e+09
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 1.7798088920335915e+09
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 1.779808892033597e+09
# HELP vllm:prompt_tokens_cached_total Number of cached prompt tokens (local + external).
# TYPE vllm:prompt_tokens_cached_total counter
vllm:prompt_tokens_cached_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prompt_tokens_cached_created Number of cached prompt tokens (local + external).
# TYPE vllm:prompt_tokens_cached_created gauge
vllm:prompt_tokens_cached_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808892033609e+09
# HELP vllm:prompt_tokens_recomputed_total Number of cached tokens recomputed for forward pass.
# TYPE vllm:prompt_tokens_recomputed_total counter
vllm:prompt_tokens_recomputed_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prompt_tokens_recomputed_created Number of cached tokens recomputed for forward pass.
# TYPE vllm:prompt_tokens_recomputed_created gauge
vllm:prompt_tokens_recomputed_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.77980889203362e+09
# HELP vllm:generation_tokens_total Number of generation tokens processed.
# TYPE vllm:generation_tokens_total counter
vllm:generation_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 86272.0
# HELP vllm:generation_tokens_created Number of generation tokens processed.
# TYPE vllm:generation_tokens_created gauge
vllm:generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920336325e+09
# HELP vllm:request_success_total Count of successfully processed requests.
# TYPE vllm:request_success_total counter
vllm:request_success_total{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_success_total{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_success_total{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_success_total{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_success_total{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:request_success_created Count of successfully processed requests.
# TYPE vllm:request_success_created gauge
vllm:request_success_created{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808892033657e+09
vllm:request_success_created{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920336642e+09
vllm:request_success_created{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920336797e+09
vllm:request_success_created{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920336847e+09
vllm:request_success_created{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920336902e+09
# HELP vllm:request_prompt_tokens Number of prefill tokens processed.
# TYPE vllm:request_prompt_tokens histogram
vllm:request_prompt_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prompt_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prompt_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prompt_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prompt_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prompt_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prompt_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prompt_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prompt_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.344332e+06
# HELP vllm:request_prompt_tokens_created Number of prefill tokens processed.
# TYPE vllm:request_prompt_tokens_created gauge
vllm:request_prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808892033737e+09
# HELP vllm:request_generation_tokens Number of generation tokens processed.
# TYPE vllm:request_generation_tokens histogram
vllm:request_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 86272.0
# HELP vllm:request_generation_tokens_created Number of generation tokens processed.
# TYPE vllm:request_generation_tokens_created gauge
vllm:request_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920337977e+09
# HELP vllm:iteration_tokens_total Histogram of number of tokens per engine_step.
# TYPE vllm:iteration_tokens_total histogram
vllm:iteration_tokens_total_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2159.0
vllm:iteration_tokens_total_bucket{engine="0",le="8.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15439.0
vllm:iteration_tokens_total_bucket{engine="0",le="16.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17819.0
vllm:iteration_tokens_total_bucket{engine="0",le="32.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17820.0
vllm:iteration_tokens_total_bucket{engine="0",le="64.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17820.0
vllm:iteration_tokens_total_bucket{engine="0",le="128.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17820.0
vllm:iteration_tokens_total_bucket{engine="0",le="256.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17820.0
vllm:iteration_tokens_total_bucket{engine="0",le="512.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17820.0
vllm:iteration_tokens_total_bucket{engine="0",le="1024.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17820.0
vllm:iteration_tokens_total_bucket{engine="0",le="2048.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17820.0
vllm:iteration_tokens_total_bucket{engine="0",le="4096.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 18129.0
vllm:iteration_tokens_total_bucket{engine="0",le="8192.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 18143.0
vllm:iteration_tokens_total_bucket{engine="0",le="16384.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 18143.0
vllm:iteration_tokens_total_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 18143.0
vllm:iteration_tokens_total_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 18143.0
vllm:iteration_tokens_total_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.430604e+06
# HELP vllm:iteration_tokens_total_created Histogram of number of tokens per engine_step.
# TYPE vllm:iteration_tokens_total_created gauge
vllm:iteration_tokens_total_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920338387e+09
# HELP vllm:request_max_num_generation_tokens Histogram of maximum number of requested generation tokens.
# TYPE vllm:request_max_num_generation_tokens histogram
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_max_num_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 86272.0
# HELP vllm:request_max_num_generation_tokens_created Histogram of maximum number of requested generation tokens.
# TYPE vllm:request_max_num_generation_tokens_created gauge
vllm:request_max_num_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920338805e+09
# HELP vllm:request_params_n Histogram of the n request parameter.
# TYPE vllm:request_params_n histogram
vllm:request_params_n_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_n_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_n_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_n_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_n_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_n_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_n_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_n_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
# HELP vllm:request_params_n_created Histogram of the n request parameter.
# TYPE vllm:request_params_n_created gauge
vllm:request_params_n_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920339143e+09
# HELP vllm:request_params_max_tokens Histogram of the max_tokens request parameter.
# TYPE vllm:request_params_max_tokens histogram
vllm:request_params_max_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_params_max_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 86272.0
# HELP vllm:request_params_max_tokens_created Histogram of the max_tokens request parameter.
# TYPE vllm:request_params_max_tokens_created gauge
vllm:request_params_max_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920339627e+09
# HELP vllm:time_to_first_token_seconds Histogram of time to first token in seconds.
# TYPE vllm:time_to_first_token_seconds histogram
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.001",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.005",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.02",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.04",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.06",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.08",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.25",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 303.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 327.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 335.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="160.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="640.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="2560.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:time_to_first_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 115.4521017074585
# HELP vllm:time_to_first_token_seconds_created Histogram of time to first token in seconds.
# TYPE vllm:time_to_first_token_seconds_created gauge
vllm:time_to_first_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808892034025e+09
# HELP vllm:inter_token_latency_seconds Histogram of inter-token latency in seconds.
# TYPE vllm:inter_token_latency_seconds histogram
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 40831.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 82698.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 84107.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 84189.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 84190.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 84190.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 84373.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85851.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85851.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 85935.0
vllm:inter_token_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1308.0858185302932
# HELP vllm:inter_token_latency_seconds_created Histogram of inter-token latency in seconds.
# TYPE vllm:inter_token_latency_seconds_created gauge
vllm:inter_token_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808892034075e+09
# HELP vllm:request_time_per_output_token_seconds Histogram of time_per_output_token_seconds per request.
# TYPE vllm:request_time_per_output_token_seconds histogram
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 58.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 328.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_time_per_output_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 5.129748307961927
# HELP vllm:request_time_per_output_token_seconds_created Histogram of time_per_output_token_seconds per request.
# TYPE vllm:request_time_per_output_token_seconds_created gauge
vllm:request_time_per_output_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920341132e+09
# HELP vllm:e2e_request_latency_seconds Histogram of e2e request latency in seconds.
# TYPE vllm:e2e_request_latency_seconds histogram
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 12.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 32.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 230.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:e2e_request_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1423.348400592804
# HELP vllm:e2e_request_latency_seconds_created Histogram of e2e request latency in seconds.
# TYPE vllm:e2e_request_latency_seconds_created gauge
vllm:e2e_request_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808892034203e+09
# HELP vllm:request_queue_time_seconds Histogram of time spent in WAITING phase for request.
# TYPE vllm:request_queue_time_seconds histogram
vllm:request_queue_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_queue_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.23973538324935362
# HELP vllm:request_queue_time_seconds_created Histogram of time spent in WAITING phase for request.
# TYPE vllm:request_queue_time_seconds_created gauge
vllm:request_queue_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808892034244e+09
# HELP vllm:request_inference_time_seconds Histogram of time spent in RUNNING phase for request.
# TYPE vllm:request_inference_time_seconds histogram
vllm:request_inference_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 13.0
vllm:request_inference_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 35.0
vllm:request_inference_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 233.0
vllm:request_inference_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_inference_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1407.2252857190324
# HELP vllm:request_inference_time_seconds_created Histogram of time spent in RUNNING phase for request.
# TYPE vllm:request_inference_time_seconds_created gauge
vllm:request_inference_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920342817e+09
# HELP vllm:request_prefill_time_seconds Histogram of time spent in PREFILL phase for request.
# TYPE vllm:request_prefill_time_seconds histogram
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 276.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 322.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 335.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 336.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99.13946718873922
# HELP vllm:request_prefill_time_seconds_created Histogram of time spent in PREFILL phase for request.
# TYPE vllm:request_prefill_time_seconds_created gauge
vllm:request_prefill_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920343258e+09
# HELP vllm:request_decode_time_seconds Histogram of time spent in DECODE phase for request.
# TYPE vllm:request_decode_time_seconds histogram
vllm:request_decode_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
vllm:request_decode_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 20.0
vllm:request_decode_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 51.0
vllm:request_decode_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 254.0
vllm:request_decode_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_decode_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1308.0858185302932
# HELP vllm:request_decode_time_seconds_created Histogram of time spent in DECODE phase for request.
# TYPE vllm:request_decode_time_seconds_created gauge
vllm:request_decode_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920343637e+09
# HELP vllm:request_prefill_kv_computed_tokens Histogram of new KV tokens computed during prefill (excluding cached tokens).
# TYPE vllm:request_prefill_kv_computed_tokens histogram
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_kv_computed_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 337.0
vllm:request_prefill_kv_computed_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.344332e+06
# HELP vllm:request_prefill_kv_computed_tokens_created Histogram of new KV tokens computed during prefill (excluding cached tokens).
# TYPE vllm:request_prefill_kv_computed_tokens_created gauge
vllm:request_prefill_kv_computed_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798088920344057e+09
# HELP vllm:cache_config_info Information of the LLMEngine CacheConfig
# TYPE vllm:cache_config_info gauge
vllm:cache_config_info{_block_size_resolved="True",block_size="16",cache_dtype="auto",calculate_kv_scales="False",cpu_kvcache_space_bytes="None",enable_prefix_caching="True",engine="0",gpu_memory_utilization="0.9",is_attention_free="False",kv_cache_memory_bytes="None",kv_offloading_backend="native",kv_offloading_size="None",kv_sharing_fast_prefill="False",mamba_block_size="None",mamba_cache_dtype="auto",mamba_cache_mode="none",mamba_page_size_padded="None",mamba_ssm_cache_dtype="auto",num_cpu_blocks="None",num_gpu_blocks="17590",num_gpu_blocks_override="None",prefix_caching_hash_algo="sha256",sliding_window="None",user_specified_block_size="False"} 1.0
# HELP http_requests_total Total number of requests by method, status and handler.
# TYPE http_requests_total counter
http_requests_total{handler="/v1/models",method="GET",status="2xx"} 1.0
http_requests_total{handler="/v1/chat/completions",method="POST",status="2xx"} 337.0
# HELP http_requests_created Total number of requests by method, status and handler.
# TYPE http_requests_created gauge
http_requests_created{handler="/v1/models",method="GET",status="2xx"} 1.7798088933366792e+09
http_requests_created{handler="/v1/chat/completions",method="POST",status="2xx"} 1.779808896853593e+09
# HELP http_request_size_bytes Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_request_size_bytes summary
http_request_size_bytes_count{handler="/v1/models"} 1.0
http_request_size_bytes_sum{handler="/v1/models"} 0.0
http_request_size_bytes_count{handler="/v1/chat/completions"} 337.0
http_request_size_bytes_sum{handler="/v1/chat/completions"} 1.777338e+06
# HELP http_request_size_bytes_created Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_request_size_bytes_created gauge
http_request_size_bytes_created{handler="/v1/models"} 1.77980889333671e+09
http_request_size_bytes_created{handler="/v1/chat/completions"} 1.7798088968536212e+09
# HELP http_response_size_bytes Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_response_size_bytes summary
http_response_size_bytes_count{handler="/v1/models"} 1.0
http_response_size_bytes_sum{handler="/v1/models"} 558.0
http_response_size_bytes_count{handler="/v1/chat/completions"} 337.0
http_response_size_bytes_sum{handler="/v1/chat/completions"} 0.0
# HELP http_response_size_bytes_created Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_response_size_bytes_created gauge
http_response_size_bytes_created{handler="/v1/models"} 1.7798088933367383e+09
http_response_size_bytes_created{handler="/v1/chat/completions"} 1.779808896853654e+09
# HELP http_request_duration_highr_seconds Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
# TYPE http_request_duration_highr_seconds histogram
http_request_duration_highr_seconds_bucket{le="0.01"} 1.0
http_request_duration_highr_seconds_bucket{le="0.025"} 1.0
http_request_duration_highr_seconds_bucket{le="0.05"} 1.0
http_request_duration_highr_seconds_bucket{le="0.075"} 1.0
http_request_duration_highr_seconds_bucket{le="0.1"} 1.0
http_request_duration_highr_seconds_bucket{le="0.25"} 1.0
http_request_duration_highr_seconds_bucket{le="0.5"} 1.0
http_request_duration_highr_seconds_bucket{le="0.75"} 1.0
http_request_duration_highr_seconds_bucket{le="1.0"} 1.0
http_request_duration_highr_seconds_bucket{le="1.5"} 1.0
http_request_duration_highr_seconds_bucket{le="2.0"} 13.0
http_request_duration_highr_seconds_bucket{le="2.5"} 33.0
http_request_duration_highr_seconds_bucket{le="3.0"} 66.0
http_request_duration_highr_seconds_bucket{le="3.5"} 113.0
http_request_duration_highr_seconds_bucket{le="4.0"} 164.0
http_request_duration_highr_seconds_bucket{le="4.5"} 206.0
http_request_duration_highr_seconds_bucket{le="5.0"} 231.0
http_request_duration_highr_seconds_bucket{le="7.5"} 338.0
http_request_duration_highr_seconds_bucket{le="10.0"} 338.0
http_request_duration_highr_seconds_bucket{le="30.0"} 338.0
http_request_duration_highr_seconds_bucket{le="60.0"} 338.0
http_request_duration_highr_seconds_bucket{le="+Inf"} 338.0
http_request_duration_highr_seconds_count 338.0
http_request_duration_highr_seconds_sum 1423.86807049677
# HELP http_request_duration_highr_seconds_created Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
# TYPE http_request_duration_highr_seconds_created gauge
http_request_duration_highr_seconds_created 1.7798088925083833e+09
# HELP http_request_duration_seconds Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
# TYPE http_request_duration_seconds histogram
http_request_duration_seconds_bucket{handler="/v1/models",le="0.1",method="GET"} 1.0
http_request_duration_seconds_bucket{handler="/v1/models",le="0.5",method="GET"} 1.0
http_request_duration_seconds_bucket{handler="/v1/models",le="1.0",method="GET"} 1.0
http_request_duration_seconds_bucket{handler="/v1/models",le="+Inf",method="GET"} 1.0
http_request_duration_seconds_count{handler="/v1/models",method="GET"} 1.0
http_request_duration_seconds_sum{handler="/v1/models",method="GET"} 0.0025588699500076473
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.1",method="POST"} 0.0
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.5",method="POST"} 0.0
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="1.0",method="POST"} 0.0
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="+Inf",method="POST"} 337.0
http_request_duration_seconds_count{handler="/v1/chat/completions",method="POST"} 337.0
http_request_duration_seconds_sum{handler="/v1/chat/completions",method="POST"} 1423.86551162682
# HELP http_request_duration_seconds_created Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
# TYPE http_request_duration_seconds_created gauge
http_request_duration_seconds_created{handler="/v1/models",method="GET"} 1.7798088933367746e+09
http_request_duration_seconds_created{handler="/v1/chat/completions",method="POST"} 1.77980889685369e+09

View File

@@ -0,0 +1,337 @@
{"req_id": "aea2448b4a684f10", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378029226149444, "t_first_token_ns": 378030010080192, "t_last_token_ns": 378031839512347, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "ba9a9c7670d9454b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378030111082182, "t_first_token_ns": 378030407369776, "t_last_token_ns": 378031970569180, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "fb9d2dbe66964120", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378033073417506, "t_first_token_ns": 378033328926763, "t_last_token_ns": 378036549241190, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "6be84ae0769740af", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378033160364166, "t_first_token_ns": 378033568087554, "t_last_token_ns": 378036566577848, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "9cab1305fb0c422a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378033835082314, "t_first_token_ns": 378034092419044, "t_last_token_ns": 378036946796591, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "edabb47d8f8e419b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378033875528395, "t_first_token_ns": 378034325653535, "t_last_token_ns": 378036953161453, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "005d455e32544dfc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378034588824147, "t_first_token_ns": 378034853968395, "t_last_token_ns": 378037145884312, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "f3b40c7777004be8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378037391824422, "t_first_token_ns": 378037644412230, "t_last_token_ns": 378042872668974, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "7a04a4b73e33472a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378037728175713, "t_first_token_ns": 378037984323237, "t_last_token_ns": 378043182429564, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "ac4ce66df1fd449e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378037847354185, "t_first_token_ns": 378038223910380, "t_last_token_ns": 378043240656855, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "19559520ef984c05", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378039021441830, "t_first_token_ns": 378039281073958, "t_last_token_ns": 378045233938165, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "bb0c8fee32af4b81", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378039415646004, "t_first_token_ns": 378039679431648, "t_last_token_ns": 378046898600201, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "8d3ff3e0ef044a7b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378039480232591, "t_first_token_ns": 378039922579284, "t_last_token_ns": 378046925916200, "prompt_tokens": 3941, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "7ef6bddca6984f3d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378039706707539, "t_first_token_ns": 378040160089875, "t_last_token_ns": 378046940215026, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "bf1f26989e784806", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378039967023843, "t_first_token_ns": 378040410163123, "t_last_token_ns": 378046969516402, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "7a01c1bc84414273", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378040218722174, "t_first_token_ns": 378040704529245, "t_last_token_ns": 378046996862620, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "62f8ca63250c4a58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378040554648726, "t_first_token_ns": 378040897672758, "t_last_token_ns": 378047010709830, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "bc35ef9281184ffe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378040651786111, "t_first_token_ns": 378041136804477, "t_last_token_ns": 378047023176776, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "e8877d36f8a54846", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378041214903484, "t_first_token_ns": 378041482880115, "t_last_token_ns": 378047371272248, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "440e08d5264940df", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378041558398395, "t_first_token_ns": 378041833975307, "t_last_token_ns": 378047489599722, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "04537492d0b04483", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378043178821160, "t_first_token_ns": 378043452133908, "t_last_token_ns": 378049234911828, "prompt_tokens": 3948, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "fd546085228e4ba0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378044227335218, "t_first_token_ns": 378044507354223, "t_last_token_ns": 378050329790507, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "3e083392d9944223", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378045227452975, "t_first_token_ns": 378045508104334, "t_last_token_ns": 378051096860985, "prompt_tokens": 4043, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "7685c3db8c7846cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378045610571204, "t_first_token_ns": 378045894283778, "t_last_token_ns": 378051473022634, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "4cc93aba1ebe4c67", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378045646084617, "t_first_token_ns": 378046178267658, "t_last_token_ns": 378051516029283, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "20baa0ae95034302", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378045869454633, "t_first_token_ns": 378046590619464, "t_last_token_ns": 378051730915380, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "0a2454da838240a3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378045699984019, "t_first_token_ns": 378046590860190, "t_last_token_ns": 378051731095368, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "3163dea44871465c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378046587334205, "t_first_token_ns": 378046880743826, "t_last_token_ns": 378052001455113, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "2d75f2f3565841c0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378047095898596, "t_first_token_ns": 378047371534536, "t_last_token_ns": 378052227689711, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "2aee24cf2cd249bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378047894435807, "t_first_token_ns": 378048165269073, "t_last_token_ns": 378053186436502, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "746e58a3bc7f4be1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378048167761025, "t_first_token_ns": 378048448755900, "t_last_token_ns": 378053232611682, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "145a50a0c09f4b87", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378049590401423, "t_first_token_ns": 378049866899342, "t_last_token_ns": 378054445044586, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "9b73ccb973594105", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378051116960247, "t_first_token_ns": 378051386281454, "t_last_token_ns": 378055311879831, "prompt_tokens": 4008, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "925f851e265b421d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378051442408467, "t_first_token_ns": 378051731419128, "t_last_token_ns": 378055392874381, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "ce1f8a72f9334c2c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378051660103033, "t_first_token_ns": 378051979653433, "t_last_token_ns": 378055409517940, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "d52722418e8b4297", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378052522390257, "t_first_token_ns": 378052796421214, "t_last_token_ns": 378055826853931, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "ae20a626417c4205", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378052567232326, "t_first_token_ns": 378053035984209, "t_last_token_ns": 378055832760957, "prompt_tokens": 4046, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "829a9dafe5fe4d37", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378053260844192, "t_first_token_ns": 378053526963583, "t_last_token_ns": 378055954181178, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "d5f9286d72de4f60", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378056625184141, "t_first_token_ns": 378056884741109, "t_last_token_ns": 378059057771036, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "521edcf4fbfe4629", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378057758811531, "t_first_token_ns": 378058016647607, "t_last_token_ns": 378061802567814, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "83d110418d8942b8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378057986712509, "t_first_token_ns": 378058257129787, "t_last_token_ns": 378061825416130, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "58a4dbe340ef414f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378058749773866, "t_first_token_ns": 378059011534726, "t_last_token_ns": 378062495758587, "prompt_tokens": 4008, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "dd9d059edc4f48eb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378059589264663, "t_first_token_ns": 378059850387912, "t_last_token_ns": 378063286338379, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "6e5e3be7ae164a6d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378059847580779, "t_first_token_ns": 378060110214789, "t_last_token_ns": 378063325300924, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "0a3d8fed74e346cb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378060741553741, "t_first_token_ns": 378061011890896, "t_last_token_ns": 378063838831624, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "3c57fead4a0d4ea4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378060789230729, "t_first_token_ns": 378061248530046, "t_last_token_ns": 378063845101054, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "d675cf7b90ee4387", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378061085660255, "t_first_token_ns": 378061495827627, "t_last_token_ns": 378063854376254, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "158be90bee124881", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378064807616700, "t_first_token_ns": 378065065634069, "t_last_token_ns": 378068190634445, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "fedce93a49db465f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378064815079772, "t_first_token_ns": 378065299162216, "t_last_token_ns": 378068199906469, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "32b88aff207f4f5c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378065131982106, "t_first_token_ns": 378065535119793, "t_last_token_ns": 378068215132115, "prompt_tokens": 3939, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "27770cd54f284c1a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378066982013325, "t_first_token_ns": 378067247625525, "t_last_token_ns": 378069941263212, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "864f6fceb0a04580", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378067320942542, "t_first_token_ns": 378067589229432, "t_last_token_ns": 378070047382463, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "d496a36d730e432b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378067582763214, "t_first_token_ns": 378067850172578, "t_last_token_ns": 378070531322650, "prompt_tokens": 3943, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "9e26de6a23d14a50", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378069541033162, "t_first_token_ns": 378069805847465, "t_last_token_ns": 378073094528952, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "5d83466d93fe408e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378070037169590, "t_first_token_ns": 378070295488219, "t_last_token_ns": 378073430315232, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "4249e7c77340441b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378070069052697, "t_first_token_ns": 378070531643424, "t_last_token_ns": 378073438717802, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "1c66a3f5e115429a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378070724338583, "t_first_token_ns": 378070982660689, "t_last_token_ns": 378073907085887, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "4443327cdb394373", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378071892537634, "t_first_token_ns": 378072152972593, "t_last_token_ns": 378076657194817, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "482b81d58c7a4bee", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378072259417382, "t_first_token_ns": 378072531515909, "t_last_token_ns": 378076830580259, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "1b2611a1d0c54d3d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378073518714647, "t_first_token_ns": 378073783385272, "t_last_token_ns": 378078869031291, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "082c99e03c12454c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378074094333551, "t_first_token_ns": 378074361559896, "t_last_token_ns": 378079914194506, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "8f752a3219e14b6f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378074616264442, "t_first_token_ns": 378074881643608, "t_last_token_ns": 378080412404708, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "b4797e4a864045fc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378074644904834, "t_first_token_ns": 378075115734949, "t_last_token_ns": 378080425288858, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "d1257591bb4c4a87", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378074685464132, "t_first_token_ns": 378075564932030, "t_last_token_ns": 378080437759127, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "ddf7b31d71e947b3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378074805726663, "t_first_token_ns": 378075565268913, "t_last_token_ns": 378080437952090, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "bb5b68c66065446e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378075420664856, "t_first_token_ns": 378075812650685, "t_last_token_ns": 378080458614196, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "b73072a874d54a0e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378075978109218, "t_first_token_ns": 378076252381061, "t_last_token_ns": 378080635647305, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "1f5da0dfcdd5475e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378076957997959, "t_first_token_ns": 378077229989705, "t_last_token_ns": 378081265869975, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "caea94f626ac4e9d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378078043390533, "t_first_token_ns": 378078317468950, "t_last_token_ns": 378082425623355, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "ccd4b4b55e17408f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378078515790855, "t_first_token_ns": 378078795951792, "t_last_token_ns": 378082609265629, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "37e972f60f8d45ed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378078792730162, "t_first_token_ns": 378079079918113, "t_last_token_ns": 378082640346565, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "fda8e45b166a430d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378078844157054, "t_first_token_ns": 378079317862803, "t_last_token_ns": 378082647256744, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "5b05ae896ab445f9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378081338533285, "t_first_token_ns": 378081605570197, "t_last_token_ns": 378084929890025, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "8b39866686024ed8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378081404026126, "t_first_token_ns": 378081851884618, "t_last_token_ns": 378084983886564, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "a3c02cf035a8493f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378083144292135, "t_first_token_ns": 378083404030067, "t_last_token_ns": 378087504479875, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "09780e9ad42f46b7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378083226757985, "t_first_token_ns": 378083645140518, "t_last_token_ns": 378087527829271, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "779f002d97994593", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378083587942010, "t_first_token_ns": 378083889502856, "t_last_token_ns": 378087545227435, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "364b1f27dc104ef8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378084007193205, "t_first_token_ns": 378084276723748, "t_last_token_ns": 378087700432365, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "209cb424dbb84f8b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378084923876587, "t_first_token_ns": 378085193996472, "t_last_token_ns": 378088520893506, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "752672d18e0f45b4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378085924157253, "t_first_token_ns": 378086194884331, "t_last_token_ns": 378089369370126, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "6aa9800bcbdd4c84", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378086928745445, "t_first_token_ns": 378087201819576, "t_last_token_ns": 378090266591136, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "864d5978f0964501", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378087028594564, "t_first_token_ns": 378087448366918, "t_last_token_ns": 378090282486041, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "bc3df0272afb4d41", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378088014719896, "t_first_token_ns": 378088273491007, "t_last_token_ns": 378090798067142, "prompt_tokens": 3948, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "b022391dede64e13", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378088985996328, "t_first_token_ns": 378089247956166, "t_last_token_ns": 378091324100654, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "796c9330781b4b97", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378089841345857, "t_first_token_ns": 378090106439464, "t_last_token_ns": 378092631724367, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "a9ffbbe6b7ad480f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378091614400947, "t_first_token_ns": 378091873961156, "t_last_token_ns": 378097665830970, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "448a70e8c8bf4eeb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378091837797232, "t_first_token_ns": 378092327341786, "t_last_token_ns": 378097695402397, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "8bd016e190154647", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378091727300888, "t_first_token_ns": 378092327587386, "t_last_token_ns": 378097695801716, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "b1a53102c5f84fa1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378092312149911, "t_first_token_ns": 378092578266704, "t_last_token_ns": 378097735836902, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "7c42971dd5064d7e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378092785454865, "t_first_token_ns": 378093046809055, "t_last_token_ns": 378098389526565, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "5f404eda051a4640", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378093296435388, "t_first_token_ns": 378093560165090, "t_last_token_ns": 378098785722482, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "2ba0c67df2114644", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378094033487826, "t_first_token_ns": 378094304576217, "t_last_token_ns": 378099682814934, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "a079c9121f564900", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378094325589028, "t_first_token_ns": 378094601404183, "t_last_token_ns": 378099753473420, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "aeceb64e146b4ba6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378094822295820, "t_first_token_ns": 378095098911390, "t_last_token_ns": 378100265021189, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "9c3e57e3481e42b6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378095482775931, "t_first_token_ns": 378095758443031, "t_last_token_ns": 378101235403105, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "161d9333a4264e02", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378095584683222, "t_first_token_ns": 378096009398899, "t_last_token_ns": 378101487550743, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "6f3a06e9dbb54bb7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378096166213874, "t_first_token_ns": 378096440063158, "t_last_token_ns": 378101888792209, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "b726faa4a6334a04", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378096656712526, "t_first_token_ns": 378096930217388, "t_last_token_ns": 378102365384829, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "22c7c7749883486f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378097013155164, "t_first_token_ns": 378097296043781, "t_last_token_ns": 378102731085485, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "4eeec26d5b114b0c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378098118388346, "t_first_token_ns": 378098390390624, "t_last_token_ns": 378103446282583, "prompt_tokens": 3952, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "7e56ea0506a94567", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378099028703242, "t_first_token_ns": 378099299334474, "t_last_token_ns": 378104062906388, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "533c77980c684396", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378099737596275, "t_first_token_ns": 378100015743454, "t_last_token_ns": 378104505813799, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "06ac7daf2fde47a7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378100375137872, "t_first_token_ns": 378100647707963, "t_last_token_ns": 378104820503919, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "fecef290d88a42c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378100409129682, "t_first_token_ns": 378100887443596, "t_last_token_ns": 378104830037606, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "f9e04cab2f9b4509", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378101163266210, "t_first_token_ns": 378101449097145, "t_last_token_ns": 378105279351259, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "2f4fa999fac246e6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378101374026945, "t_first_token_ns": 378101702716874, "t_last_token_ns": 378105524507382, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "b52ec264bf8a408c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378101963244960, "t_first_token_ns": 378102238370607, "t_last_token_ns": 378106639916931, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "75a292efbb114e25", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378102341880766, "t_first_token_ns": 378102617397514, "t_last_token_ns": 378106764509797, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "b57d5a01833d4613", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378104849221279, "t_first_token_ns": 378105117612024, "t_last_token_ns": 378110309552153, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "ed6dc9556b5c4e04", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378105255718046, "t_first_token_ns": 378105525011001, "t_last_token_ns": 378110799745763, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "5526d74607844cb5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378105573959007, "t_first_token_ns": 378105838839849, "t_last_token_ns": 378110946134165, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "ba0d1568e12e4ff5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378105756138367, "t_first_token_ns": 378106349677572, "t_last_token_ns": 378110969970402, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "9631bc67ca5340af", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378105665915613, "t_first_token_ns": 378106349855707, "t_last_token_ns": 378110970239915, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "02fe7b8fe2da4515", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378105765311528, "t_first_token_ns": 378106540487183, "t_last_token_ns": 378110981088088, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "48c249acbf344730", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378106818165279, "t_first_token_ns": 378107087992045, "t_last_token_ns": 378111272390947, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "608f8acfd9244894", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378106966917096, "t_first_token_ns": 378107337797191, "t_last_token_ns": 378111291528906, "prompt_tokens": 4030, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "ea08db78d57a4e74", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378108854193783, "t_first_token_ns": 378109130445148, "t_last_token_ns": 378112393117116, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "aa978ae0e0de4266", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378109018385638, "t_first_token_ns": 378109378370796, "t_last_token_ns": 378112407394636, "prompt_tokens": 3950, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "4ef217f1cc1749b1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378109268824641, "t_first_token_ns": 378109631948163, "t_last_token_ns": 378112418309508, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "41198ce829fa4729", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378110526574329, "t_first_token_ns": 378110799588925, "t_last_token_ns": 378113011106903, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "84562524891444c7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378112542144042, "t_first_token_ns": 378112799347065, "t_last_token_ns": 378114507688417, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "440ba55f75ab4a41", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378113113296955, "t_first_token_ns": 378113372284114, "t_last_token_ns": 378114811937014, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "e9e3669917ec48ab", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378114800288964, "t_first_token_ns": 378115058139592, "t_last_token_ns": 378118014390356, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "975beb3e4a494a33", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378115075659433, "t_first_token_ns": 378115334638478, "t_last_token_ns": 378118102146426, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "f27e169d0f4d44a0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378116257582428, "t_first_token_ns": 378116520619270, "t_last_token_ns": 378119978117783, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "1ba4bead82e542c0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378116335152404, "t_first_token_ns": 378116765052428, "t_last_token_ns": 378119997094797, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "45ee817a12694a2a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378116855013114, "t_first_token_ns": 378117121650303, "t_last_token_ns": 378120126464179, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "b4c6f69bd9984fdb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378117306136746, "t_first_token_ns": 378117577251686, "t_last_token_ns": 378120298075118, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "99f4d02cf05a4be0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378118536282961, "t_first_token_ns": 378118800880106, "t_last_token_ns": 378121309570236, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "1ba326910bc844bd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378118545791877, "t_first_token_ns": 378119039061827, "t_last_token_ns": 378121315811144, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "f6d39ba1f3924ada", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378120502838662, "t_first_token_ns": 378120767230782, "t_last_token_ns": 378122181459261, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "ce0aee3714094aa2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378123043946096, "t_first_token_ns": 378123303581673, "t_last_token_ns": 378124948757893, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "96f8e9248b084377", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378123652181250, "t_first_token_ns": 378123913394955, "t_last_token_ns": 378125326741777, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "4366927a7aee4c4b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378126495531713, "t_first_token_ns": 378126756838846, "t_last_token_ns": 378130570461369, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "d2be011e82f3466b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378126628653257, "t_first_token_ns": 378127211378416, "t_last_token_ns": 378130589346883, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "282238cce0744bbf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378126542667416, "t_first_token_ns": 378127211548521, "t_last_token_ns": 378130589469002, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "2ca4e631df3147f4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378127313070110, "t_first_token_ns": 378127573504787, "t_last_token_ns": 378130750906649, "prompt_tokens": 3950, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "d03651eba8254cc1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378127547185549, "t_first_token_ns": 378127816579138, "t_last_token_ns": 378130765080734, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "9dbcd00fb0174137", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378128621892609, "t_first_token_ns": 378128894591293, "t_last_token_ns": 378131288010289, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "12d22a8914774896", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378129583669871, "t_first_token_ns": 378129849482170, "t_last_token_ns": 378131911614480, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "e63a1ef5daf341f4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378131569566285, "t_first_token_ns": 378131832929509, "t_last_token_ns": 378133960856402, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "a69ff11e457e4635", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378132181224565, "t_first_token_ns": 378132439053768, "t_last_token_ns": 378134934417109, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "713e61810b724d91", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378132476216625, "t_first_token_ns": 378132738871636, "t_last_token_ns": 378135020973257, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "676d8fe02ddf4abb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378134132585577, "t_first_token_ns": 378134396966904, "t_last_token_ns": 378136626711040, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "a62f9b37e483498a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378134171338608, "t_first_token_ns": 378134630750096, "t_last_token_ns": 378136632805689, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "53d5570ba6a84779", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378135390210533, "t_first_token_ns": 378135652668759, "t_last_token_ns": 378137207668065, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "74fdef0582c74bce", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378138138231005, "t_first_token_ns": 378138397186940, "t_last_token_ns": 378140674779114, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "68e4927e2a124bea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378138403241548, "t_first_token_ns": 378138662592067, "t_last_token_ns": 378140715365032, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "21687d57bb5c4252", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378138499202208, "t_first_token_ns": 378138903246544, "t_last_token_ns": 378140723979331, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "194f96ec1e2e43c7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378140859624483, "t_first_token_ns": 378141112695900, "t_last_token_ns": 378145013466711, "prompt_tokens": 3958, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "72dc6bb04c4644d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378140882101313, "t_first_token_ns": 378141350669854, "t_last_token_ns": 378145023415828, "prompt_tokens": 4035, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "ddbaf38767b44921", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378141385488084, "t_first_token_ns": 378141647528313, "t_last_token_ns": 378145134271423, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "d949d4574670408a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378141691194708, "t_first_token_ns": 378141953135767, "t_last_token_ns": 378145240696001, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "78a76e5f1e6b4a9e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378142179076228, "t_first_token_ns": 378142441559050, "t_last_token_ns": 378145507016354, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "c86f5cfe091641b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378142461925948, "t_first_token_ns": 378142726218611, "t_last_token_ns": 378145550338293, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "006c58af6f4d463d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378144341323631, "t_first_token_ns": 378144611329985, "t_last_token_ns": 378147676876018, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "4a12d83960ce42fa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378144460865769, "t_first_token_ns": 378144859863291, "t_last_token_ns": 378147696286302, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "34a8901e36044fc1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378146183196912, "t_first_token_ns": 378146445971329, "t_last_token_ns": 378149641537011, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "3314ab6c781d4a1a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378146217364952, "t_first_token_ns": 378146681096809, "t_last_token_ns": 378149649701187, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "6d472a09a79a4923", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378146253705730, "t_first_token_ns": 378146920064068, "t_last_token_ns": 378149657389135, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "b9c947d0d59642a9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378146809412288, "t_first_token_ns": 378147161332415, "t_last_token_ns": 378149669582437, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "064e4f05ba7542c5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378148357406600, "t_first_token_ns": 378148619586326, "t_last_token_ns": 378150378133427, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "231018f70172449a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378150483824041, "t_first_token_ns": 378150743163708, "t_last_token_ns": 378153265706456, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "879f9b525ae84409", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378150688097924, "t_first_token_ns": 378150981896318, "t_last_token_ns": 378153279935625, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "62f771e9f5ef43c9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378151315054649, "t_first_token_ns": 378151579893130, "t_last_token_ns": 378153659710750, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "43aac6047c774a63", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378152123751768, "t_first_token_ns": 378152389271550, "t_last_token_ns": 378155127890734, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "6d714acf4019437d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378153771803019, "t_first_token_ns": 378154033938751, "t_last_token_ns": 378157240815209, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "991f85cf9487449c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378153992597344, "t_first_token_ns": 378154489790771, "t_last_token_ns": 378157260533441, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "42ad8273d24147d6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378153872713501, "t_first_token_ns": 378154489688733, "t_last_token_ns": 378157260744803, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "a5d677d7dbe24139", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378154678184867, "t_first_token_ns": 378154941575561, "t_last_token_ns": 378157484281762, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "873a87204cb0428f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378156588931242, "t_first_token_ns": 378156856896407, "t_last_token_ns": 378158710166414, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "786ae5ccacce4602", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378156865104396, "t_first_token_ns": 378157132578103, "t_last_token_ns": 378158733031280, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "d02034d8a806462c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378159399342172, "t_first_token_ns": 378159660737371, "t_last_token_ns": 378163197285388, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "8607194990c5433c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378159575294465, "t_first_token_ns": 378159900432516, "t_last_token_ns": 378163220319567, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "d5cbe99cc1564aa0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378160375836431, "t_first_token_ns": 378160635933234, "t_last_token_ns": 378164045087884, "prompt_tokens": 4008, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "abb8606ed5da4eff", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378161126901507, "t_first_token_ns": 378161388544172, "t_last_token_ns": 378165299856265, "prompt_tokens": 3954, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "fa0edd39659c49ec", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378161400384454, "t_first_token_ns": 378161664308256, "t_last_token_ns": 378165572681388, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "65706bfcd51649cf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378161719795805, "t_first_token_ns": 378161987581830, "t_last_token_ns": 378165670441389, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "29962405dc6045ca", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378162510225356, "t_first_token_ns": 378162778186669, "t_last_token_ns": 378166459690216, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "4075343da31f4010", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378162869233378, "t_first_token_ns": 378163143642958, "t_last_token_ns": 378166579940742, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "c00e97827679411f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378164593196406, "t_first_token_ns": 378164866269883, "t_last_token_ns": 378168256465150, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "43ad1dfbd4e34000", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378164791663133, "t_first_token_ns": 378165114884100, "t_last_token_ns": 378168270756122, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "0814ac286eb14cc3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378165235779813, "t_first_token_ns": 378165514048147, "t_last_token_ns": 378168367833387, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "d325e17c531745a4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378165721862914, "t_first_token_ns": 378165995060911, "t_last_token_ns": 378168515270434, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "75b1449dee3345bb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378166797707656, "t_first_token_ns": 378167060455732, "t_last_token_ns": 378168964935597, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "60a6229a3d7446b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378168956860175, "t_first_token_ns": 378169211219164, "t_last_token_ns": 378174601994248, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "8eaf21d5adb2410f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378169148134909, "t_first_token_ns": 378169449277097, "t_last_token_ns": 378174635467882, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "ae77200372554b8b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378169737188081, "t_first_token_ns": 378170001812343, "t_last_token_ns": 378176086548755, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "5c614c6ab9814c5f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378169751969083, "t_first_token_ns": 378170238249588, "t_last_token_ns": 378176100508493, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "28c6624c78d44f07", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378169901328279, "t_first_token_ns": 378170512950463, "t_last_token_ns": 378176115922601, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "09aaf7835d264c4a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378170196872075, "t_first_token_ns": 378170930554600, "t_last_token_ns": 378176129714757, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "c2a9c1521c7e4f47", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378170166389506, "t_first_token_ns": 378170930666378, "t_last_token_ns": 378176130216505, "prompt_tokens": 4040, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "6737496c43314fc0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378171565557949, "t_first_token_ns": 378171853261018, "t_last_token_ns": 378177599558735, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "bd22b537db0b4d4b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378171948649489, "t_first_token_ns": 378172229665170, "t_last_token_ns": 378177797790062, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "43be507009904a8e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378171983318729, "t_first_token_ns": 378172461387945, "t_last_token_ns": 378177810952655, "prompt_tokens": 3922, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "b77c4048510f4758", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378172027845773, "t_first_token_ns": 378172704871529, "t_last_token_ns": 378177820854036, "prompt_tokens": 4021, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "12a6cb60ba9e4489", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378172590763394, "t_first_token_ns": 378172957774213, "t_last_token_ns": 378177843772759, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "000e781fe54e4cac", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378174676541763, "t_first_token_ns": 378174955628144, "t_last_token_ns": 378180419903585, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "669f965743e341d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378174739167647, "t_first_token_ns": 378175210287642, "t_last_token_ns": 378180446885080, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "ff26ef7207d34e96", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378175426367880, "t_first_token_ns": 378175700428912, "t_last_token_ns": 378180672940471, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "479978e13bb24642", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378176239715225, "t_first_token_ns": 378176509882953, "t_last_token_ns": 378181390464650, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "fd936022773f4cd7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378176684629230, "t_first_token_ns": 378176958397234, "t_last_token_ns": 378181838772404, "prompt_tokens": 4022, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "3db04cb3ad5047a8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378177083921315, "t_first_token_ns": 378177362988094, "t_last_token_ns": 378181976650436, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "63cb940b13d14d31", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378178340449630, "t_first_token_ns": 378178606096718, "t_last_token_ns": 378184119011772, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "7e1f8275faf541b5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378178589369315, "t_first_token_ns": 378178869072989, "t_last_token_ns": 378184162272789, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "d8ddfede78994023", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378178807775988, "t_first_token_ns": 378179331989156, "t_last_token_ns": 378184185935176, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "c3dfa749aab04c2b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378178793032566, "t_first_token_ns": 378179332566203, "t_last_token_ns": 378184186806863, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "ee95ae4de1bb4f51", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378179380242402, "t_first_token_ns": 378179660875119, "t_last_token_ns": 378184274811154, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "de99fb851eb34ea2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378181033452590, "t_first_token_ns": 378181310067030, "t_last_token_ns": 378185724939972, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "bb8c5905821e4c03", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378181496990800, "t_first_token_ns": 378181770383985, "t_last_token_ns": 378185908033227, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "b1c6385c06724df7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378182178788429, "t_first_token_ns": 378182456787200, "t_last_token_ns": 378186999908383, "prompt_tokens": 4073, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "37f79e2e35f3491b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378182256444540, "t_first_token_ns": 378182708644815, "t_last_token_ns": 378187024482424, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "2804eeb5318041e8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378182468537340, "t_first_token_ns": 378182948383647, "t_last_token_ns": 378187035389184, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "5f378cd5470e4b51", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378182944622932, "t_first_token_ns": 378183225503838, "t_last_token_ns": 378187075797046, "prompt_tokens": 3958, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "b1e0170d40c94b3e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378184319796813, "t_first_token_ns": 378184590134808, "t_last_token_ns": 378188334803994, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "8aa6e091a87b4700", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378185952971053, "t_first_token_ns": 378186222725852, "t_last_token_ns": 378189897362377, "prompt_tokens": 3935, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "8e246e4c58bf4e8e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378186390880084, "t_first_token_ns": 378186659236897, "t_last_token_ns": 378190302790507, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "e292069af0bb46be", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378186521360725, "t_first_token_ns": 378186909150286, "t_last_token_ns": 378190354596140, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "bcc29a86df084ec8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378187172985787, "t_first_token_ns": 378187436490779, "t_last_token_ns": 378190868769425, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "bcda7b8c093f4f9a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378187961693959, "t_first_token_ns": 378188234857265, "t_last_token_ns": 378191556620338, "prompt_tokens": 4022, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "50122a980a5246d6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378189405970414, "t_first_token_ns": 378189674640311, "t_last_token_ns": 378192505243751, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "6da6c693fd714065", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378189833989156, "t_first_token_ns": 378190109857928, "t_last_token_ns": 378193356541243, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "e8bc03689427418e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378190290227568, "t_first_token_ns": 378190566841096, "t_last_token_ns": 378193797574855, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "3e45fa91acdd4797", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378191237079119, "t_first_token_ns": 378191502696626, "t_last_token_ns": 378194790339137, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "f9771edf68de4ead", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378192546118357, "t_first_token_ns": 378192815695009, "t_last_token_ns": 378196180049127, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "d0c94432444c4131", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378192644477064, "t_first_token_ns": 378193058479382, "t_last_token_ns": 378196197590913, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "23244c3a2b90451a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378192965812076, "t_first_token_ns": 378193304602864, "t_last_token_ns": 378196212393959, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "3a8fbc1db2cd41dc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378193406938428, "t_first_token_ns": 378193680340763, "t_last_token_ns": 378196310341169, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "bf7f32d23b28470e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378193990285522, "t_first_token_ns": 378194260197482, "t_last_token_ns": 378196753247488, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "ca5c3dd88c184ee3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378196323511208, "t_first_token_ns": 378196582804225, "t_last_token_ns": 378198759935416, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "2e08f36cf5094678", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378196852523885, "t_first_token_ns": 378197116588548, "t_last_token_ns": 378199083854004, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "4fdc98962bde46ae", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378197058596698, "t_first_token_ns": 378197356463748, "t_last_token_ns": 378199092375331, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "b3a85331457146e8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378199390300654, "t_first_token_ns": 378199648261175, "t_last_token_ns": 378202239379423, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "37cd7d87c8224567", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378199828555539, "t_first_token_ns": 378200090653970, "t_last_token_ns": 378202565065207, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "7d60a2376f8644f2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378200088458332, "t_first_token_ns": 378200348710357, "t_last_token_ns": 378202594295692, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "06c66b41e9db493a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378200545737980, "t_first_token_ns": 378200818164199, "t_last_token_ns": 378203014508827, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "59812db546f54306", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378202590911421, "t_first_token_ns": 378202851215975, "t_last_token_ns": 378204583600384, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "0572f190a8d64051", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378204014711968, "t_first_token_ns": 378204279573725, "t_last_token_ns": 378209221557596, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "0490cacb55e04e68", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378204072427392, "t_first_token_ns": 378204517465432, "t_last_token_ns": 378209248044431, "prompt_tokens": 3951, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "36c0d201c9104701", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378205317143295, "t_first_token_ns": 378205581126835, "t_last_token_ns": 378211209653689, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "da650ce516ff4bc8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378205445383138, "t_first_token_ns": 378205824908636, "t_last_token_ns": 378211237393037, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "ed513d08f60345cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378205578422029, "t_first_token_ns": 378206062978575, "t_last_token_ns": 378211250287150, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "43e397cf198c47f0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378205801881533, "t_first_token_ns": 378206294683139, "t_last_token_ns": 378211261029547, "prompt_tokens": 3946, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "f77ea4b7d37d4542", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378205966229007, "t_first_token_ns": 378206530553082, "t_last_token_ns": 378211273910139, "prompt_tokens": 3958, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "1cecfcd2e1334a48", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378207110019098, "t_first_token_ns": 378207387200880, "t_last_token_ns": 378211833352582, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "1cffb71a734d4f66", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378207428677360, "t_first_token_ns": 378207704711247, "t_last_token_ns": 378212144424976, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "3060485037cd4640", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378207569889652, "t_first_token_ns": 378207954405633, "t_last_token_ns": 378212392548859, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "15e2c506de6c4c9b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378208097556797, "t_first_token_ns": 378208384943459, "t_last_token_ns": 378212542189287, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "4a33ed68da634656", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378208490252894, "t_first_token_ns": 378208775942421, "t_last_token_ns": 378212641161402, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "07d7319400b4439f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378208649138250, "t_first_token_ns": 378209032083914, "t_last_token_ns": 378212655926763, "prompt_tokens": 4040, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "46ae895aba9a4de3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378211851854685, "t_first_token_ns": 378212123974833, "t_last_token_ns": 378214459424929, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "1f5a5e011bec4e5d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378212118171197, "t_first_token_ns": 378212392451635, "t_last_token_ns": 378214483362438, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "cd0819447fe54c5b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378212944647315, "t_first_token_ns": 378213209487443, "t_last_token_ns": 378214880167967, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "afce6f70ed9f4302", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378215585757291, "t_first_token_ns": 378215843189683, "t_last_token_ns": 378217569448092, "prompt_tokens": 3945, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "4c17564dd8b7444f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378216931097916, "t_first_token_ns": 378217195908829, "t_last_token_ns": 378220826060109, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "7a1be77e9f5a41fd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378217057025656, "t_first_token_ns": 378217435856195, "t_last_token_ns": 378220849749037, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "4208d10f22e54057", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378217826479918, "t_first_token_ns": 378218088922986, "t_last_token_ns": 378222075423191, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "ae538548957e4d7c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378218706482288, "t_first_token_ns": 378218974244422, "t_last_token_ns": 378223131552574, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "5bee0fd9d5494bec", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378218740347909, "t_first_token_ns": 378219210100066, "t_last_token_ns": 378223141094408, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "fba5effcbb564de1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378219226906985, "t_first_token_ns": 378219495043192, "t_last_token_ns": 378223200316388, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "6f120d71db064a2f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378220007828214, "t_first_token_ns": 378220274303049, "t_last_token_ns": 378223945871836, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "03be81baea254f63", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378220536191299, "t_first_token_ns": 378220815645798, "t_last_token_ns": 378224154350893, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "5b0f5580d437484a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378220855278702, "t_first_token_ns": 378221132113941, "t_last_token_ns": 378224212860564, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "113c37b841fc43c3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378221008462395, "t_first_token_ns": 378221377181014, "t_last_token_ns": 378224225114892, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "e3b6a2da98fe4303", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378223597825054, "t_first_token_ns": 378223870955074, "t_last_token_ns": 378226137602046, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "367948373f294169", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378224703816512, "t_first_token_ns": 378224967173818, "t_last_token_ns": 378227833019821, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "043cb38603824f7f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378225172373173, "t_first_token_ns": 378225434131252, "t_last_token_ns": 378228136899592, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "062632607e104805", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378225759501689, "t_first_token_ns": 378226027806748, "t_last_token_ns": 378228507340438, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "8fbcb715488e4c96", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378226118432671, "t_first_token_ns": 378226381134369, "t_last_token_ns": 378228602449687, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "1c877981e0c448d9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378227183232937, "t_first_token_ns": 378227447972999, "t_last_token_ns": 378229686858745, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "edb31b2032e84f52", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378228776368293, "t_first_token_ns": 378229042457643, "t_last_token_ns": 378232887532022, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "a2fafa09384c40a6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378229282217146, "t_first_token_ns": 378229545309572, "t_last_token_ns": 378233645259258, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "59a5993c3c784552", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378229870868089, "t_first_token_ns": 378230131867432, "t_last_token_ns": 378234273427519, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "d03ad09ecc974c80", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378230006654429, "t_first_token_ns": 378230375129218, "t_last_token_ns": 378234298855235, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "c506936441734fa6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378231009222395, "t_first_token_ns": 378231279955206, "t_last_token_ns": 378235132924824, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "180d3f079047491f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378231137433622, "t_first_token_ns": 378231526357282, "t_last_token_ns": 378235151894852, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "f12f0f3a373443b1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378231602876296, "t_first_token_ns": 378231874402540, "t_last_token_ns": 378235251036163, "prompt_tokens": 4025, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "6bf91bf05af84c14", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378231723984091, "t_first_token_ns": 378232122405476, "t_last_token_ns": 378235264885710, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "5e9c9fbb2d854cff", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378232198095380, "t_first_token_ns": 378232468180334, "t_last_token_ns": 378235329377161, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "20f04c7f1b00424e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378232950237353, "t_first_token_ns": 378233219593841, "t_last_token_ns": 378236300511516, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "81743176f8974e28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378235385490199, "t_first_token_ns": 378235650651815, "t_last_token_ns": 378239576646739, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "22ca60d6f2a84dcd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378235472490223, "t_first_token_ns": 378236108926736, "t_last_token_ns": 378239596664794, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "6241ebace2744510", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378235607800273, "t_first_token_ns": 378236109054397, "t_last_token_ns": 378239597357961, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "8b4a070cd6dc45c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378236876049109, "t_first_token_ns": 378237143812476, "t_last_token_ns": 378241157245016, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "ca62d518e6cd4f1a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378237863852157, "t_first_token_ns": 378238133958463, "t_last_token_ns": 378243355175556, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "fc6767e54b6c446d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378237879834132, "t_first_token_ns": 378238371652317, "t_last_token_ns": 378243370071451, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "b36cc591e7cc462b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378238348549608, "t_first_token_ns": 378238619796504, "t_last_token_ns": 378243393882035, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "1d4c083e69a4470d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378238569567403, "t_first_token_ns": 378239082879183, "t_last_token_ns": 378243418115429, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "6b5e7cbddca343b7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378238445841948, "t_first_token_ns": 378239082676766, "t_last_token_ns": 378243418625375, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "a4ef1c97cba54e8d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378240129666443, "t_first_token_ns": 378240401115050, "t_last_token_ns": 378244886749253, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "e58e1d2f92834db0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378240600562200, "t_first_token_ns": 378240873276169, "t_last_token_ns": 378245108850797, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "20068c614b144f91", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378241350291982, "t_first_token_ns": 378241629972579, "t_last_token_ns": 378245541066126, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "532a287ccccc41bd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378241459768296, "t_first_token_ns": 378241879719094, "t_last_token_ns": 378245557335198, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "8083b913de714e45", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378241655444256, "t_first_token_ns": 378242127851874, "t_last_token_ns": 378245606573204, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "fb4b84bd84c64f26", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378242625145144, "t_first_token_ns": 378242906667177, "t_last_token_ns": 378246834061068, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "7e294328416a439b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378243582832389, "t_first_token_ns": 378243858452531, "t_last_token_ns": 378247386161262, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "75b258bddd754d61", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378245542976948, "t_first_token_ns": 378245810401629, "t_last_token_ns": 378251801163162, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "02552871e8014bfc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378245655171623, "t_first_token_ns": 378246266524777, "t_last_token_ns": 378251830479054, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "b50a0314c3d54f06", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378245706375593, "t_first_token_ns": 378246267039000, "t_last_token_ns": 378251831775574, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "f15ee18c62094447", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378246565476971, "t_first_token_ns": 378246833967822, "t_last_token_ns": 378252582178580, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "6eb8e1f60fd445f9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378247989181936, "t_first_token_ns": 378248259732737, "t_last_token_ns": 378255274285997, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "7374148d493f43f9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378248130654581, "t_first_token_ns": 378248505227564, "t_last_token_ns": 378255305643858, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "34bae8feb9bf4056", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378248350174445, "t_first_token_ns": 378248753217297, "t_last_token_ns": 378255335294684, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "169afc94360f46a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378248979065099, "t_first_token_ns": 378249257862939, "t_last_token_ns": 378255924474123, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "81fa27493b474d09", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378249011459307, "t_first_token_ns": 378249493456063, "t_last_token_ns": 378255938473340, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "1dbdc64036dd4bea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378249055575851, "t_first_token_ns": 378249993774563, "t_last_token_ns": 378255953658447, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "64d05190c06744ad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378249153147922, "t_first_token_ns": 378249994195085, "t_last_token_ns": 378255954410603, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "6eb8c94414ea4b3f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378249162760403, "t_first_token_ns": 378250399435542, "t_last_token_ns": 378255967370398, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "5e0584773c454e06", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378249235768724, "t_first_token_ns": 378250399763796, "t_last_token_ns": 378255967578824, "prompt_tokens": 4044, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "2fcf1ca3e8de41f7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378249946454400, "t_first_token_ns": 378250638138942, "t_last_token_ns": 378255979767661, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "0de986dcfb7944c6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378251211936025, "t_first_token_ns": 378251499795480, "t_last_token_ns": 378256483055018, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "9836974aab404835", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378251515077434, "t_first_token_ns": 378251801030820, "t_last_token_ns": 378256530847287, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "5d4da06af09e49c4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378251940343608, "t_first_token_ns": 378252213874364, "t_last_token_ns": 378256660039186, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "ecb450278d6844c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378253352309451, "t_first_token_ns": 378253629290808, "t_last_token_ns": 378257567757526, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "480dcb8a11d84d78", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378253662586795, "t_first_token_ns": 378253946502328, "t_last_token_ns": 378257612797547, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "c55bb23f18dc4024", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378254140152612, "t_first_token_ns": 378254434307194, "t_last_token_ns": 378257734971074, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "0702480149b64201", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378255340211741, "t_first_token_ns": 378255617528634, "t_last_token_ns": 378258620204227, "prompt_tokens": 3940, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "14c96648e5d24b71", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378257078785160, "t_first_token_ns": 378257350063597, "t_last_token_ns": 378259665087273, "prompt_tokens": 4037, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "1448760f86c74f95", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378257913413609, "t_first_token_ns": 378258174714853, "t_last_token_ns": 378260141085056, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "3e592b3e4f81495f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378258182301637, "t_first_token_ns": 378258446841762, "t_last_token_ns": 378260168569800, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "20797f5a0ef34700", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378260180078110, "t_first_token_ns": 378260438077832, "t_last_token_ns": 378264939940611, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "cee46dac5c1045ec", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378260207670771, "t_first_token_ns": 378260670135674, "t_last_token_ns": 378264952948241, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "cd1a8ef3b06d458e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378260946688089, "t_first_token_ns": 378261210504849, "t_last_token_ns": 378265675654146, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "4f382589450b4963", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378261563684986, "t_first_token_ns": 378261828382247, "t_last_token_ns": 378266284176080, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "f407336728064b36", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378262474596475, "t_first_token_ns": 378262742785165, "t_last_token_ns": 378268259314774, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "e56c2d55ff1541dd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378262964533180, "t_first_token_ns": 378263237630727, "t_last_token_ns": 378268885034107, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "140c032008474e71", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378263257923936, "t_first_token_ns": 378263534121734, "t_last_token_ns": 378268983118666, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "33ffba54c13c41bd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378263270396671, "t_first_token_ns": 378263769944054, "t_last_token_ns": 378268997404825, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "2fb9661e282147a3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378263331032973, "t_first_token_ns": 378264010218336, "t_last_token_ns": 378269009214842, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "5cbb70a1f31044bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378263770266110, "t_first_token_ns": 378264471504519, "t_last_token_ns": 378269030912431, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "d2fdd2ba87274d90", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378263908234141, "t_first_token_ns": 378264471597205, "t_last_token_ns": 378269031124259, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "79ac86da881a4d65", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378264275035747, "t_first_token_ns": 378264717966459, "t_last_token_ns": 378269049142048, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "dec24431eace4926", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378266899928606, "t_first_token_ns": 378267171241052, "t_last_token_ns": 378270873281151, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "fb5ba63e90684055", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378266966893832, "t_first_token_ns": 378267419041511, "t_last_token_ns": 378270890382792, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "b0137c20cdea43b8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378267356267631, "t_first_token_ns": 378267667255317, "t_last_token_ns": 378270904360730, "prompt_tokens": 3958, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "cdfb2a760c544e67", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378267973952018, "t_first_token_ns": 378268258823255, "t_last_token_ns": 378271055809523, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "07c3da02a5c84d7d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 378268203965311, "t_first_token_ns": 378268512935241, "t_last_token_ns": 378271064033834, "prompt_tokens": 4019, "completion_tokens": 256, "inflight_at_send": 13, "error": null}

View File

@@ -0,0 +1,8 @@
{
"rate": 1.5,
"input_tokens": 4096,
"output_tokens": 256,
"duration_target_s": 240.0,
"duration_actual_s": 242.16401139402296,
"n_requests": 337
}

View File

@@ -0,0 +1,624 @@
# HELP python_gc_objects_collected_total Objects collected during gc
# TYPE python_gc_objects_collected_total counter
python_gc_objects_collected_total{generation="0"} 11970.0
python_gc_objects_collected_total{generation="1"} 1549.0
python_gc_objects_collected_total{generation="2"} 855.0
# HELP python_gc_objects_uncollectable_total Uncollectable objects found during GC
# TYPE python_gc_objects_uncollectable_total counter
python_gc_objects_uncollectable_total{generation="0"} 0.0
python_gc_objects_uncollectable_total{generation="1"} 0.0
python_gc_objects_uncollectable_total{generation="2"} 0.0
# HELP python_gc_collections_total Number of times this generation was collected
# TYPE python_gc_collections_total counter
python_gc_collections_total{generation="0"} 1344.0
python_gc_collections_total{generation="1"} 122.0
python_gc_collections_total{generation="2"} 9.0
# HELP python_info Python platform information
# TYPE python_info gauge
python_info{implementation="CPython",major="3",minor="12",patchlevel="3",version="3.12.3"} 1.0
# HELP process_virtual_memory_bytes Virtual memory size in bytes.
# TYPE process_virtual_memory_bytes gauge
process_virtual_memory_bytes 3.8946234368e+010
# HELP process_resident_memory_bytes Resident memory size in bytes.
# TYPE process_resident_memory_bytes gauge
process_resident_memory_bytes 1.349824512e+09
# HELP process_start_time_seconds Start time of the process since unix epoch in seconds.
# TYPE process_start_time_seconds gauge
process_start_time_seconds 1.77980843334e+09
# HELP process_cpu_seconds_total Total user and system CPU time spent in seconds.
# TYPE process_cpu_seconds_total counter
process_cpu_seconds_total 40.91
# HELP process_open_fds Number of open file descriptors.
# TYPE process_open_fds gauge
process_open_fds 64.0
# HELP process_max_fds Maximum number of open file descriptors.
# TYPE process_max_fds gauge
process_max_fds 65535.0
# HELP vllm:estimated_flops_per_gpu_total Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_flops_per_gpu_total counter
vllm:estimated_flops_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:estimated_flops_per_gpu_created Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_flops_per_gpu_created gauge
vllm:estimated_flops_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808537012282e+09
# HELP vllm:estimated_read_bytes_per_gpu_total Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_read_bytes_per_gpu_total counter
vllm:estimated_read_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:estimated_read_bytes_per_gpu_created Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_read_bytes_per_gpu_created gauge
vllm:estimated_read_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370123103e+09
# HELP vllm:estimated_write_bytes_per_gpu_total Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_write_bytes_per_gpu_total counter
vllm:estimated_write_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:estimated_write_bytes_per_gpu_created Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_write_bytes_per_gpu_created gauge
vllm:estimated_write_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370123277e+09
# HELP vllm:num_requests_running Number of requests in model execution batches.
# TYPE vllm:num_requests_running gauge
vllm:num_requests_running{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:num_requests_waiting Number of requests waiting to be processed.
# TYPE vllm:num_requests_waiting gauge
vllm:num_requests_waiting{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:engine_sleep_state Engine sleep state; awake = 0 means engine is sleeping; awake = 1 means engine is awake; weights_offloaded = 1 means sleep level 1; discard_all = 1 means sleep level 2.
# TYPE vllm:engine_sleep_state gauge
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="awake"} 1.0
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="weights_offloaded"} 0.0
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="discard_all"} 0.0
# HELP vllm:kv_cache_usage_perc KV-cache usage. 1 means 100 percent usage.
# TYPE vllm:kv_cache_usage_perc gauge
vllm:kv_cache_usage_perc{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prefix_cache_queries_total Prefix cache queries, in terms of number of queried tokens.
# TYPE vllm:prefix_cache_queries_total counter
vllm:prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.444181e+06
# HELP vllm:prefix_cache_queries_created Prefix cache queries, in terms of number of queried tokens.
# TYPE vllm:prefix_cache_queries_created gauge
vllm:prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370124936e+09
# HELP vllm:prefix_cache_hits_total Prefix cache hits, in terms of number of cached tokens.
# TYPE vllm:prefix_cache_hits_total counter
vllm:prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prefix_cache_hits_created Prefix cache hits, in terms of number of cached tokens.
# TYPE vllm:prefix_cache_hits_created gauge
vllm:prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370125084e+09
# HELP vllm:external_prefix_cache_queries_total External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
# TYPE vllm:external_prefix_cache_queries_total counter
vllm:external_prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.444181e+06
# HELP vllm:external_prefix_cache_queries_created External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
# TYPE vllm:external_prefix_cache_queries_created gauge
vllm:external_prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808537012521e+09
# HELP vllm:external_prefix_cache_hits_total External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
# TYPE vllm:external_prefix_cache_hits_total counter
vllm:external_prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:external_prefix_cache_hits_created External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
# TYPE vllm:external_prefix_cache_hits_created gauge
vllm:external_prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370125337e+09
# HELP vllm:mm_cache_queries_total Multi-modal cache queries, in terms of number of queried items.
# TYPE vllm:mm_cache_queries_total counter
vllm:mm_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:mm_cache_queries_created Multi-modal cache queries, in terms of number of queried items.
# TYPE vllm:mm_cache_queries_created gauge
vllm:mm_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370125554e+09
# HELP vllm:mm_cache_hits_total Multi-modal cache hits, in terms of number of cached items.
# TYPE vllm:mm_cache_hits_total counter
vllm:mm_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:mm_cache_hits_created Multi-modal cache hits, in terms of number of cached items.
# TYPE vllm:mm_cache_hits_created gauge
vllm:mm_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808537012577e+09
# HELP vllm:num_preemptions_total Cumulative number of preemption from the engine.
# TYPE vllm:num_preemptions_total counter
vllm:num_preemptions_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:num_preemptions_created Cumulative number of preemption from the engine.
# TYPE vllm:num_preemptions_created gauge
vllm:num_preemptions_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370125916e+09
# HELP vllm:prompt_tokens_total Number of prefill tokens processed.
# TYPE vllm:prompt_tokens_total counter
vllm:prompt_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.444181e+06
# HELP vllm:prompt_tokens_created Number of prefill tokens processed.
# TYPE vllm:prompt_tokens_created gauge
vllm:prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370126052e+09
# HELP vllm:prompt_tokens_by_source_total Number of prompt tokens by source.
# TYPE vllm:prompt_tokens_by_source_total counter
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.444181e+06
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 0.0
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 0.0
# HELP vllm:prompt_tokens_by_source_created Number of prompt tokens by source.
# TYPE vllm:prompt_tokens_by_source_created gauge
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.7798085370126219e+09
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 1.779808537012627e+09
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 1.7798085370126324e+09
# HELP vllm:prompt_tokens_cached_total Number of cached prompt tokens (local + external).
# TYPE vllm:prompt_tokens_cached_total counter
vllm:prompt_tokens_cached_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prompt_tokens_cached_created Number of cached prompt tokens (local + external).
# TYPE vllm:prompt_tokens_cached_created gauge
vllm:prompt_tokens_cached_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370126455e+09
# HELP vllm:prompt_tokens_recomputed_total Number of cached tokens recomputed for forward pass.
# TYPE vllm:prompt_tokens_recomputed_total counter
vllm:prompt_tokens_recomputed_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prompt_tokens_recomputed_created Number of cached tokens recomputed for forward pass.
# TYPE vllm:prompt_tokens_recomputed_created gauge
vllm:prompt_tokens_recomputed_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370126567e+09
# HELP vllm:generation_tokens_total Number of generation tokens processed.
# TYPE vllm:generation_tokens_total counter
vllm:generation_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92672.0
# HELP vllm:generation_tokens_created Number of generation tokens processed.
# TYPE vllm:generation_tokens_created gauge
vllm:generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808537012669e+09
# HELP vllm:request_success_total Count of successfully processed requests.
# TYPE vllm:request_success_total counter
vllm:request_success_total{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_success_total{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_success_total{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_success_total{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_success_total{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:request_success_created Count of successfully processed requests.
# TYPE vllm:request_success_created gauge
vllm:request_success_created{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808537012699e+09
vllm:request_success_created{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370127063e+09
vllm:request_success_created{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370127132e+09
vllm:request_success_created{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370127187e+09
vllm:request_success_created{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370127244e+09
# HELP vllm:request_prompt_tokens Number of prefill tokens processed.
# TYPE vllm:request_prompt_tokens histogram
vllm:request_prompt_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prompt_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prompt_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prompt_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prompt_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prompt_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prompt_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prompt_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prompt_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.444181e+06
# HELP vllm:request_prompt_tokens_created Number of prefill tokens processed.
# TYPE vllm:request_prompt_tokens_created gauge
vllm:request_prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370127792e+09
# HELP vllm:request_generation_tokens Number of generation tokens processed.
# TYPE vllm:request_generation_tokens histogram
vllm:request_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92672.0
# HELP vllm:request_generation_tokens_created Number of generation tokens processed.
# TYPE vllm:request_generation_tokens_created gauge
vllm:request_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370128436e+09
# HELP vllm:iteration_tokens_total Histogram of number of tokens per engine_step.
# TYPE vllm:iteration_tokens_total histogram
vllm:iteration_tokens_total_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2330.0
vllm:iteration_tokens_total_bucket{engine="0",le="8.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 12556.0
vllm:iteration_tokens_total_bucket{engine="0",le="16.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15964.0
vllm:iteration_tokens_total_bucket{engine="0",le="32.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16256.0
vllm:iteration_tokens_total_bucket{engine="0",le="64.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16256.0
vllm:iteration_tokens_total_bucket{engine="0",le="128.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16256.0
vllm:iteration_tokens_total_bucket{engine="0",le="256.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16256.0
vllm:iteration_tokens_total_bucket{engine="0",le="512.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16256.0
vllm:iteration_tokens_total_bucket{engine="0",le="1024.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16256.0
vllm:iteration_tokens_total_bucket{engine="0",le="2048.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16256.0
vllm:iteration_tokens_total_bucket{engine="0",le="4096.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16572.0
vllm:iteration_tokens_total_bucket{engine="0",le="8192.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16595.0
vllm:iteration_tokens_total_bucket{engine="0",le="16384.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16595.0
vllm:iteration_tokens_total_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16595.0
vllm:iteration_tokens_total_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16595.0
vllm:iteration_tokens_total_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.536853e+06
# HELP vllm:iteration_tokens_total_created Histogram of number of tokens per engine_step.
# TYPE vllm:iteration_tokens_total_created gauge
vllm:iteration_tokens_total_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808537012879e+09
# HELP vllm:request_max_num_generation_tokens Histogram of maximum number of requested generation tokens.
# TYPE vllm:request_max_num_generation_tokens histogram
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_max_num_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92672.0
# HELP vllm:request_max_num_generation_tokens_created Histogram of maximum number of requested generation tokens.
# TYPE vllm:request_max_num_generation_tokens_created gauge
vllm:request_max_num_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370129192e+09
# HELP vllm:request_params_n Histogram of the n request parameter.
# TYPE vllm:request_params_n histogram
vllm:request_params_n_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_n_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_n_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_n_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_n_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_n_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_n_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_n_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
# HELP vllm:request_params_n_created Histogram of the n request parameter.
# TYPE vllm:request_params_n_created gauge
vllm:request_params_n_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808537012967e+09
# HELP vllm:request_params_max_tokens Histogram of the max_tokens request parameter.
# TYPE vllm:request_params_max_tokens histogram
vllm:request_params_max_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_params_max_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92672.0
# HELP vllm:request_params_max_tokens_created Histogram of the max_tokens request parameter.
# TYPE vllm:request_params_max_tokens_created gauge
vllm:request_params_max_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370129972e+09
# HELP vllm:time_to_first_token_seconds Histogram of time to first token in seconds.
# TYPE vllm:time_to_first_token_seconds histogram
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.001",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.005",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.02",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.04",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.06",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.08",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.25",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 11.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 312.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 356.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="160.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="640.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="2560.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 126.39743399620056
# HELP vllm:time_to_first_token_seconds_created Histogram of time to first token in seconds.
# TYPE vllm:time_to_first_token_seconds_created gauge
vllm:time_to_first_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370130339e+09
# HELP vllm:inter_token_latency_seconds Histogram of inter-token latency in seconds.
# TYPE vllm:inter_token_latency_seconds histogram
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 29463.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 88034.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 89987.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 90034.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 90034.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 90034.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 90149.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92128.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92128.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92310.0
vllm:inter_token_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1626.6343603626592
# HELP vllm:inter_token_latency_seconds_created Histogram of inter-token latency in seconds.
# TYPE vllm:inter_token_latency_seconds_created gauge
vllm:inter_token_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808537013079e+09
# HELP vllm:request_time_per_output_token_seconds Histogram of time_per_output_token_seconds per request.
# TYPE vllm:request_time_per_output_token_seconds histogram
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 45.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 316.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_time_per_output_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 6.378958275931998
# HELP vllm:request_time_per_output_token_seconds_created Histogram of time_per_output_token_seconds per request.
# TYPE vllm:request_time_per_output_token_seconds_created gauge
vllm:request_time_per_output_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370131166e+09
# HELP vllm:e2e_request_latency_seconds Histogram of e2e request latency in seconds.
# TYPE vllm:e2e_request_latency_seconds histogram
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 11.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 24.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 195.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:e2e_request_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1752.9047212600708
# HELP vllm:e2e_request_latency_seconds_created Histogram of e2e request latency in seconds.
# TYPE vllm:e2e_request_latency_seconds_created gauge
vllm:e2e_request_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370142846e+09
# HELP vllm:request_queue_time_seconds Histogram of time spent in WAITING phase for request.
# TYPE vllm:request_queue_time_seconds histogram
vllm:request_queue_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_queue_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0027433858485892415
# HELP vllm:request_queue_time_seconds_created Histogram of time spent in WAITING phase for request.
# TYPE vllm:request_queue_time_seconds_created gauge
vllm:request_queue_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808537014356e+09
# HELP vllm:request_inference_time_seconds Histogram of time spent in RUNNING phase for request.
# TYPE vllm:request_inference_time_seconds histogram
vllm:request_inference_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
vllm:request_inference_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 12.0
vllm:request_inference_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 27.0
vllm:request_inference_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 200.0
vllm:request_inference_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_inference_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1733.5705015579006
# HELP vllm:request_inference_time_seconds_created Histogram of time spent in RUNNING phase for request.
# TYPE vllm:request_inference_time_seconds_created gauge
vllm:request_inference_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370144022e+09
# HELP vllm:request_prefill_time_seconds Histogram of time spent in PREFILL phase for request.
# TYPE vllm:request_prefill_time_seconds histogram
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 288.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 351.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 106.93614119524136
# HELP vllm:request_prefill_time_seconds_created Histogram of time spent in PREFILL phase for request.
# TYPE vllm:request_prefill_time_seconds_created gauge
vllm:request_prefill_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370144622e+09
# HELP vllm:request_decode_time_seconds Histogram of time spent in DECODE phase for request.
# TYPE vllm:request_decode_time_seconds histogram
vllm:request_decode_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 6.0
vllm:request_decode_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16.0
vllm:request_decode_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 42.0
vllm:request_decode_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 221.0
vllm:request_decode_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_decode_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1626.6343603626592
# HELP vllm:request_decode_time_seconds_created Histogram of time spent in DECODE phase for request.
# TYPE vllm:request_decode_time_seconds_created gauge
vllm:request_decode_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370145073e+09
# HELP vllm:request_prefill_kv_computed_tokens Histogram of new KV tokens computed during prefill (excluding cached tokens).
# TYPE vllm:request_prefill_kv_computed_tokens histogram
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_kv_computed_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:request_prefill_kv_computed_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.444181e+06
# HELP vllm:request_prefill_kv_computed_tokens_created Histogram of new KV tokens computed during prefill (excluding cached tokens).
# TYPE vllm:request_prefill_kv_computed_tokens_created gauge
vllm:request_prefill_kv_computed_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798085370145638e+09
# HELP vllm:cache_config_info Information of the LLMEngine CacheConfig
# TYPE vllm:cache_config_info gauge
vllm:cache_config_info{_block_size_resolved="True",block_size="16",cache_dtype="auto",calculate_kv_scales="False",cpu_kvcache_space_bytes="None",enable_prefix_caching="True",engine="0",gpu_memory_utilization="0.9",is_attention_free="False",kv_cache_memory_bytes="None",kv_offloading_backend="native",kv_offloading_size="None",kv_sharing_fast_prefill="False",mamba_block_size="None",mamba_cache_dtype="auto",mamba_cache_mode="none",mamba_page_size_padded="None",mamba_ssm_cache_dtype="auto",num_cpu_blocks="None",num_gpu_blocks="17590",num_gpu_blocks_override="None",prefix_caching_hash_algo="sha256",sliding_window="None",user_specified_block_size="False"} 1.0
# HELP http_requests_total Total number of requests by method, status and handler.
# TYPE http_requests_total counter
http_requests_total{handler="/v1/models",method="GET",status="2xx"} 1.0
http_requests_total{handler="/v1/chat/completions",method="POST",status="2xx"} 362.0
# HELP http_requests_created Total number of requests by method, status and handler.
# TYPE http_requests_created gauge
http_requests_created{handler="/v1/models",method="GET",status="2xx"} 1.7798085390617669e+09
http_requests_created{handler="/v1/chat/completions",method="POST",status="2xx"} 1.7798085444920475e+09
# HELP http_request_size_bytes Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_request_size_bytes summary
http_request_size_bytes_count{handler="/v1/models"} 1.0
http_request_size_bytes_sum{handler="/v1/models"} 0.0
http_request_size_bytes_count{handler="/v1/chat/completions"} 362.0
http_request_size_bytes_sum{handler="/v1/chat/completions"} 1.909188e+06
# HELP http_request_size_bytes_created Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_request_size_bytes_created gauge
http_request_size_bytes_created{handler="/v1/models"} 1.7798085390617936e+09
http_request_size_bytes_created{handler="/v1/chat/completions"} 1.7798085444920697e+09
# HELP http_response_size_bytes Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_response_size_bytes summary
http_response_size_bytes_count{handler="/v1/models"} 1.0
http_response_size_bytes_sum{handler="/v1/models"} 558.0
http_response_size_bytes_count{handler="/v1/chat/completions"} 362.0
http_response_size_bytes_sum{handler="/v1/chat/completions"} 0.0
# HELP http_response_size_bytes_created Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_response_size_bytes_created gauge
http_response_size_bytes_created{handler="/v1/models"} 1.7798085390618207e+09
http_response_size_bytes_created{handler="/v1/chat/completions"} 1.779808544492094e+09
# HELP http_request_duration_highr_seconds Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
# TYPE http_request_duration_highr_seconds histogram
http_request_duration_highr_seconds_bucket{le="0.01"} 1.0
http_request_duration_highr_seconds_bucket{le="0.025"} 1.0
http_request_duration_highr_seconds_bucket{le="0.05"} 1.0
http_request_duration_highr_seconds_bucket{le="0.075"} 1.0
http_request_duration_highr_seconds_bucket{le="0.1"} 1.0
http_request_duration_highr_seconds_bucket{le="0.25"} 1.0
http_request_duration_highr_seconds_bucket{le="0.5"} 1.0
http_request_duration_highr_seconds_bucket{le="0.75"} 1.0
http_request_duration_highr_seconds_bucket{le="1.0"} 1.0
http_request_duration_highr_seconds_bucket{le="1.5"} 4.0
http_request_duration_highr_seconds_bucket{le="2.0"} 12.0
http_request_duration_highr_seconds_bucket{le="2.5"} 25.0
http_request_duration_highr_seconds_bucket{le="3.0"} 57.0
http_request_duration_highr_seconds_bucket{le="3.5"} 92.0
http_request_duration_highr_seconds_bucket{le="4.0"} 108.0
http_request_duration_highr_seconds_bucket{le="4.5"} 151.0
http_request_duration_highr_seconds_bucket{le="5.0"} 196.0
http_request_duration_highr_seconds_bucket{le="7.5"} 349.0
http_request_duration_highr_seconds_bucket{le="10.0"} 363.0
http_request_duration_highr_seconds_bucket{le="30.0"} 363.0
http_request_duration_highr_seconds_bucket{le="60.0"} 363.0
http_request_duration_highr_seconds_bucket{le="+Inf"} 363.0
http_request_duration_highr_seconds_count 363.0
http_request_duration_highr_seconds_sum 1753.430982518359
# HELP http_request_duration_highr_seconds_created Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
# TYPE http_request_duration_highr_seconds_created gauge
http_request_duration_highr_seconds_created 1.7798085375314133e+09
# HELP http_request_duration_seconds Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
# TYPE http_request_duration_seconds histogram
http_request_duration_seconds_bucket{handler="/v1/models",le="0.1",method="GET"} 1.0
http_request_duration_seconds_bucket{handler="/v1/models",le="0.5",method="GET"} 1.0
http_request_duration_seconds_bucket{handler="/v1/models",le="1.0",method="GET"} 1.0
http_request_duration_seconds_bucket{handler="/v1/models",le="+Inf",method="GET"} 1.0
http_request_duration_seconds_count{handler="/v1/models",method="GET"} 1.0
http_request_duration_seconds_sum{handler="/v1/models",method="GET"} 0.0023236559936776757
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.1",method="POST"} 0.0
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.5",method="POST"} 0.0
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="1.0",method="POST"} 0.0
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="+Inf",method="POST"} 362.0
http_request_duration_seconds_count{handler="/v1/chat/completions",method="POST"} 362.0
http_request_duration_seconds_sum{handler="/v1/chat/completions",method="POST"} 1753.4286588623654
# HELP http_request_duration_seconds_created Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
# TYPE http_request_duration_seconds_created gauge
http_request_duration_seconds_created{handler="/v1/models",method="GET"} 1.7798085390618532e+09
http_request_duration_seconds_created{handler="/v1/chat/completions",method="POST"} 1.7798085444921227e+09

View File

@@ -0,0 +1,362 @@
{"req_id": "9a98116bc3b4444b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377675101705638, "t_first_token_ns": 377675942906368, "t_last_token_ns": 377679478305026, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "048e018bafbc4d85", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377676149114603, "t_first_token_ns": 377676405014768, "t_last_token_ns": 377680423248646, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "d1eb2195da0d4bbc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377676358707491, "t_first_token_ns": 377676645092254, "t_last_token_ns": 377680481089501, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "1f2bff69e5bd4dbf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377676756303839, "t_first_token_ns": 377677014996055, "t_last_token_ns": 377681119322455, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "6d12050111934bbb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377677735398924, "t_first_token_ns": 377678001902315, "t_last_token_ns": 377682234909754, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "f6d3be00d4684851", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377677850760922, "t_first_token_ns": 377678248213107, "t_last_token_ns": 377682254444772, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "4f4166f860dc42ad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377678687005559, "t_first_token_ns": 377678960201648, "t_last_token_ns": 377683425343956, "prompt_tokens": 4035, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "d014838e06e74e28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377679587308337, "t_first_token_ns": 377679852452788, "t_last_token_ns": 377684126823290, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "d744eab2769b440e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377679903590469, "t_first_token_ns": 377680169641125, "t_last_token_ns": 377684204514144, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "57732b474a3f4eea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377680411449720, "t_first_token_ns": 377680692790835, "t_last_token_ns": 377684919669700, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "f5e6489641524c36", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377680713770111, "t_first_token_ns": 377680986895109, "t_last_token_ns": 377685434451467, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "e3117a494a384791", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377682657803797, "t_first_token_ns": 377682922041690, "t_last_token_ns": 377687501403184, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "ec90926659a5417b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377682751804758, "t_first_token_ns": 377683165079937, "t_last_token_ns": 377687523267558, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "ef4e7c5f961c4d5b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377682916882743, "t_first_token_ns": 377683402270467, "t_last_token_ns": 377687532425253, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "cdfe6aee5cc8429f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377684406301244, "t_first_token_ns": 377684671341118, "t_last_token_ns": 377688893940629, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "f4e22a054de74397", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377684511116608, "t_first_token_ns": 377684919972966, "t_last_token_ns": 377688914634746, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "2fb4f772eec447ca", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377684755101818, "t_first_token_ns": 377685166028775, "t_last_token_ns": 377688934536227, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "b235b96d8dfb4de8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377684998215054, "t_first_token_ns": 377685414770343, "t_last_token_ns": 377689180541980, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "f69fe4f4b26844aa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377687150092316, "t_first_token_ns": 377687425645845, "t_last_token_ns": 377691989215170, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "5c74feea17114ae8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377687758347952, "t_first_token_ns": 377688023618159, "t_last_token_ns": 377692403732970, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "a8ba046faf0c45b3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377688539153296, "t_first_token_ns": 377688801025685, "t_last_token_ns": 377693345080498, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "7731f895c40843a3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377688913878467, "t_first_token_ns": 377689180755573, "t_last_token_ns": 377693511285253, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "b09ddbd37e754d84", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377688965171405, "t_first_token_ns": 377689424759841, "t_last_token_ns": 377693520896242, "prompt_tokens": 4036, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "ce08aacc97c04987", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377689401590621, "t_first_token_ns": 377689670329880, "t_last_token_ns": 377693540637489, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "70b5076519b940ac", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377689426076240, "t_first_token_ns": 377689907375878, "t_last_token_ns": 377693549051286, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "08d01c32ab0141bb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377689520480170, "t_first_token_ns": 377690146094482, "t_last_token_ns": 377693556459634, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "8dd3c785bbdd41e8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377691594995825, "t_first_token_ns": 377691868761229, "t_last_token_ns": 377694913191965, "prompt_tokens": 4029, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "1a7d1958dbe54414", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377693026867672, "t_first_token_ns": 377693299771473, "t_last_token_ns": 377696512019209, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "8607cf75ff51407f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377694049296021, "t_first_token_ns": 377694308373482, "t_last_token_ns": 377698516567090, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "7e854e2109354740", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377694288065274, "t_first_token_ns": 377694550145312, "t_last_token_ns": 377698983334074, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "1da581798cd348ac", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377695190658465, "t_first_token_ns": 377695457772827, "t_last_token_ns": 377701092667080, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "1faea9ad34e845ea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377695333809928, "t_first_token_ns": 377695700667868, "t_last_token_ns": 377701120429123, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "43b77e5a52774e18", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377696165197910, "t_first_token_ns": 377696435191266, "t_last_token_ns": 377702044953056, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "6608fda7e5e44093", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377697387238744, "t_first_token_ns": 377697657864079, "t_last_token_ns": 377703852596285, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "c17f084e79f24e86", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377697777476193, "t_first_token_ns": 377698041125509, "t_last_token_ns": 377704083958997, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "564f405d67d44ec2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377697805572315, "t_first_token_ns": 377698278351574, "t_last_token_ns": 377704099038767, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "5e2884f375064419", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377697878223790, "t_first_token_ns": 377698516365577, "t_last_token_ns": 377704112699741, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "10d6bee6490c4950", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377698471749141, "t_first_token_ns": 377698982950326, "t_last_token_ns": 377704136759262, "prompt_tokens": 4034, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "5ac31d6823f34e99", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377698485060311, "t_first_token_ns": 377698983696683, "t_last_token_ns": 377704137573355, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "068985b5f1354c26", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377698599224909, "t_first_token_ns": 377699495208195, "t_last_token_ns": 377704160908826, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "774eae1c9c664544", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377698819395058, "t_first_token_ns": 377699495474464, "t_last_token_ns": 377704161226395, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "daf7bf9fe60f4ce6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377698820986306, "t_first_token_ns": 377699683643378, "t_last_token_ns": 377704170529298, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "aec35f81c4d748a7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377699645093226, "t_first_token_ns": 377699936334012, "t_last_token_ns": 377704185367623, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "5c4765572ebc4823", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377701317429825, "t_first_token_ns": 377701586625152, "t_last_token_ns": 377704944319324, "prompt_tokens": 3958, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "5e31156c50e344ef", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377702841592048, "t_first_token_ns": 377703125662242, "t_last_token_ns": 377705797946597, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "35b8d624be444d7f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377703407819409, "t_first_token_ns": 377703689990208, "t_last_token_ns": 377705946261311, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "0145793265f04b01", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377705255917102, "t_first_token_ns": 377705517062464, "t_last_token_ns": 377708582167804, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "a5f1c962c723465e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377705963038406, "t_first_token_ns": 377706217631856, "t_last_token_ns": 377709288540903, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "8fa9ea222f2545b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377706546157560, "t_first_token_ns": 377706803554541, "t_last_token_ns": 377709806351662, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "832a6b4ff15941ba", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377707030073681, "t_first_token_ns": 377707294459958, "t_last_token_ns": 377710075246349, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "5ab02c7f231e43b6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377707606369452, "t_first_token_ns": 377707867070710, "t_last_token_ns": 377710339624678, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "24eb0f15760d4fc4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377707712128046, "t_first_token_ns": 377708112647241, "t_last_token_ns": 377710349026269, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "8983b093109040dc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377710392522076, "t_first_token_ns": 377710643884865, "t_last_token_ns": 377711870339161, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "7af7cf1538874eb9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377712710817818, "t_first_token_ns": 377712963834714, "t_last_token_ns": 377716243475239, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "cb8525c3ae384f92", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377712888205834, "t_first_token_ns": 377713204185946, "t_last_token_ns": 377716262697088, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "aff8b87c2ce64fbd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377713148019736, "t_first_token_ns": 377713441020172, "t_last_token_ns": 377716278850282, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "b0892d9280a543bd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377714095883587, "t_first_token_ns": 377714360890497, "t_last_token_ns": 377717006863521, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "88f8c5e35fff49b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377714186908473, "t_first_token_ns": 377714605514477, "t_last_token_ns": 377717017919296, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "972b30eeb9f84878", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377714678438741, "t_first_token_ns": 377714945200542, "t_last_token_ns": 377717074457294, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "23fbacc597a946b1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377717283305364, "t_first_token_ns": 377717540325466, "t_last_token_ns": 377724724183959, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "4f19a2fd4ded410a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377717410424048, "t_first_token_ns": 377717993669772, "t_last_token_ns": 377724762825514, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "3058b8ba991b4469", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377717382914054, "t_first_token_ns": 377717993934944, "t_last_token_ns": 377724763266696, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "daddd3bef0dc481b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377718039532204, "t_first_token_ns": 377718295866731, "t_last_token_ns": 377724944320449, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "fc49513485424723", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377718100923123, "t_first_token_ns": 377718540446971, "t_last_token_ns": 377724976363861, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "e428e2d0bb6b4370", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377718461298625, "t_first_token_ns": 377718784378801, "t_last_token_ns": 377725006054793, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "46927bb40c894ca6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377719063910588, "t_first_token_ns": 377719333770039, "t_last_token_ns": 377725986856662, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "2998c95ea9ab4e45", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377719211398965, "t_first_token_ns": 377719794527479, "t_last_token_ns": 377726014610685, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "16d93f416c414e3c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377719237096815, "t_first_token_ns": 377719795307958, "t_last_token_ns": 377726015640786, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "50ec413493324886", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377720068799696, "t_first_token_ns": 377720344402496, "t_last_token_ns": 377727107058629, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "a4dd60f7c2964d0e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377720545100286, "t_first_token_ns": 377720816186821, "t_last_token_ns": 377727371841789, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "97367b73da5145a9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377720955455213, "t_first_token_ns": 377721232649861, "t_last_token_ns": 377727572655517, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "5aca5a9693fc4194", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377721984500059, "t_first_token_ns": 377722261682089, "t_last_token_ns": 377728378382496, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "eaaa33b0da064402", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377722153574514, "t_first_token_ns": 377722516021763, "t_last_token_ns": 377728407312792, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "013c134b30fa4d9c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377722552098254, "t_first_token_ns": 377722829900083, "t_last_token_ns": 377728480364889, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "15cfc08ace5a4451", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377722638727358, "t_first_token_ns": 377723337414170, "t_last_token_ns": 377728503898356, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "230a716c19a544ca", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377722659911410, "t_first_token_ns": 377723337709557, "t_last_token_ns": 377728504350446, "prompt_tokens": 3956, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "ce6398d72f384f33", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377722888281546, "t_first_token_ns": 377723541420607, "t_last_token_ns": 377728513032186, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "62323b7d4f644c05", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377725208891215, "t_first_token_ns": 377725480125112, "t_last_token_ns": 377731441779226, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "f842a3d381b44e83", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377725340410731, "t_first_token_ns": 377725732379894, "t_last_token_ns": 377731471603028, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "67575b2befa3406d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377726235853825, "t_first_token_ns": 377726514975624, "t_last_token_ns": 377732224839796, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "0649501ae32240f7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377726284189904, "t_first_token_ns": 377726969884970, "t_last_token_ns": 377732238924882, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "b48e725703ef400c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377726252072895, "t_first_token_ns": 377726970308485, "t_last_token_ns": 377732239733651, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "1f3825932cf64f8c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377728579550159, "t_first_token_ns": 377728848650182, "t_last_token_ns": 377734807954163, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "028afbf277694ec0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377728770610157, "t_first_token_ns": 377729092510369, "t_last_token_ns": 377734835471681, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "07d3558931f14462", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377729178083816, "t_first_token_ns": 377729446905469, "t_last_token_ns": 377734985370916, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "9ae26a74a3ae4262", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377729482062435, "t_first_token_ns": 377729751162781, "t_last_token_ns": 377735081159801, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "d00859cfd217417e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377729764414588, "t_first_token_ns": 377730033543772, "t_last_token_ns": 377735137218533, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "c24e4d3724564bfe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377729991361131, "t_first_token_ns": 377730285830024, "t_last_token_ns": 377735159037549, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "fcbaaad3a09d4863", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377730065058121, "t_first_token_ns": 377730539718407, "t_last_token_ns": 377735182205922, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "273dfe8f35f04935", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377731453104595, "t_first_token_ns": 377731725797576, "t_last_token_ns": 377735836115098, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "ed8070ac37284ba0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377732532737353, "t_first_token_ns": 377732798325619, "t_last_token_ns": 377737466708209, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "10896a678de743be", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377733312399485, "t_first_token_ns": 377733581825382, "t_last_token_ns": 377738022474201, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "13154174e64c454f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377733877758965, "t_first_token_ns": 377734158185484, "t_last_token_ns": 377738327770921, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "20f2186fe2994fea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377734074805475, "t_first_token_ns": 377734409432966, "t_last_token_ns": 377738346183657, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "d3d16c38ad3640d6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377734317706731, "t_first_token_ns": 377734661118333, "t_last_token_ns": 377738365644396, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "2d8ac64b6c804ee2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377736145674947, "t_first_token_ns": 377736417713153, "t_last_token_ns": 377739553104570, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "64b867cfa2ea480a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377736194529383, "t_first_token_ns": 377736655927147, "t_last_token_ns": 377739560227701, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "38b7d839caec4f05", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377736438150304, "t_first_token_ns": 377736904942639, "t_last_token_ns": 377739571311558, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "627006d80ddd47bb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377736667473726, "t_first_token_ns": 377737145315744, "t_last_token_ns": 377739575443923, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "92638b61ab434949", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377740490937178, "t_first_token_ns": 377740743848914, "t_last_token_ns": 377745848630588, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "a530cf81dacc41b2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377740534090991, "t_first_token_ns": 377740983996807, "t_last_token_ns": 377745875989459, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "20893f72243644d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377740783628692, "t_first_token_ns": 377741222911889, "t_last_token_ns": 377745902429009, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "1dbe0221b2cd42da", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377741863366137, "t_first_token_ns": 377742125427048, "t_last_token_ns": 377747544232377, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "dc8d500b03e04830", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377742109154653, "t_first_token_ns": 377742368667146, "t_last_token_ns": 377747800479316, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "be014b24abcd4dd9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377742432643083, "t_first_token_ns": 377742702366752, "t_last_token_ns": 377747952597075, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "8e66d14b3d7c4aa3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377742703625264, "t_first_token_ns": 377742966936920, "t_last_token_ns": 377747997336504, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "f87661ef65594b14", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377743302600693, "t_first_token_ns": 377743574529553, "t_last_token_ns": 377748893982556, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "d12ae175f0ad4618", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377743817707128, "t_first_token_ns": 377744092263746, "t_last_token_ns": 377749476119437, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "8c3e0151b85d4873", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377743929578169, "t_first_token_ns": 377744344047679, "t_last_token_ns": 377749499529200, "prompt_tokens": 4022, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "18ac55e144dc4b4c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377744427813924, "t_first_token_ns": 377744706322722, "t_last_token_ns": 377749605067910, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "80085216940f4deb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377744453070569, "t_first_token_ns": 377744947192151, "t_last_token_ns": 377749616849669, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "9059b9977d0d4387", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377746652669495, "t_first_token_ns": 377746927662033, "t_last_token_ns": 377751434877072, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "3b94a113b43a4282", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377747269707256, "t_first_token_ns": 377747545452201, "t_last_token_ns": 377751946964930, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "68ca605046cc4cbb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377747304097436, "t_first_token_ns": 377747788427621, "t_last_token_ns": 377751957354066, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "284bd5b5a0f040e7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377748085575668, "t_first_token_ns": 377748361751831, "t_last_token_ns": 377752236748018, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "29320c5c64d54334", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377748204884606, "t_first_token_ns": 377748612104200, "t_last_token_ns": 377752252511392, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "a698406406b84045", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377748964152041, "t_first_token_ns": 377749233544892, "t_last_token_ns": 377752462731685, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "1923e7f405f54267", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377749719477356, "t_first_token_ns": 377749990626158, "t_last_token_ns": 377752724234736, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "593b8316b3074fac", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377751583677892, "t_first_token_ns": 377751853144305, "t_last_token_ns": 377753689263008, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "d474a5d9f1fb4094", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377753150899779, "t_first_token_ns": 377753407027273, "t_last_token_ns": 377755383120274, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "a8a0ce116d0f40b6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377754164373258, "t_first_token_ns": 377754417162450, "t_last_token_ns": 377757132294951, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "6c39211993074393", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377754435783463, "t_first_token_ns": 377754693782395, "t_last_token_ns": 377757434629319, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "c9790f2ebb3d46e0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377755810930714, "t_first_token_ns": 377756069638471, "t_last_token_ns": 377760260802131, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "fa48ad7a2e4a4ded", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377756011877231, "t_first_token_ns": 377756312424528, "t_last_token_ns": 377760283262213, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "91b506c60e3b4be9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377756755066047, "t_first_token_ns": 377757017112371, "t_last_token_ns": 377761454913963, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "9312ca2d7e1e4ea1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377757119823150, "t_first_token_ns": 377757387386920, "t_last_token_ns": 377761620568014, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "707f8d2275cd4747", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377757509873532, "t_first_token_ns": 377757774681070, "t_last_token_ns": 377761842068909, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "5a177ea2b3404127", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377757764211651, "t_first_token_ns": 377758031602970, "t_last_token_ns": 377761869719801, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "4c7c6df45b6d4044", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377758917288712, "t_first_token_ns": 377759184166230, "t_last_token_ns": 377763058466387, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "13665c6b167046a6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377759155724827, "t_first_token_ns": 377759435860934, "t_last_token_ns": 377763079470108, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "09181c259e974492", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377759226648096, "t_first_token_ns": 377759684736925, "t_last_token_ns": 377763095131845, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "99385fb3eef5424f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377760517274129, "t_first_token_ns": 377760790687548, "t_last_token_ns": 377763674659138, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "dfe65f9511f04bd1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377760874691951, "t_first_token_ns": 377761141985397, "t_last_token_ns": 377763740361333, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "db45bf2b095c4dc8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377762276707376, "t_first_token_ns": 377762542304673, "t_last_token_ns": 377764264158157, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "eb511ac6ba3d4aee", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377764592175829, "t_first_token_ns": 377764842827006, "t_last_token_ns": 377768150061691, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "92ffd80f1bf34986", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377764595798716, "t_first_token_ns": 377765077376501, "t_last_token_ns": 377768160255650, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "d671c6673c3d43b4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377765299668022, "t_first_token_ns": 377765559395419, "t_last_token_ns": 377768523541795, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "213f27b24a9c477e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377765621516055, "t_first_token_ns": 377765883201245, "t_last_token_ns": 377768622934373, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "41eeed1f9ec64016", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377766222964681, "t_first_token_ns": 377766485716932, "t_last_token_ns": 377768911347693, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "9206be145e27448d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377767384309202, "t_first_token_ns": 377767654471998, "t_last_token_ns": 377769375461450, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "98fcf47fb8274c43", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377769481183188, "t_first_token_ns": 377769733805762, "t_last_token_ns": 377771436884743, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "56241fa764fa41b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377769816846226, "t_first_token_ns": 377770075181818, "t_last_token_ns": 377771546362605, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "940d6b3d23064c7d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377772114572452, "t_first_token_ns": 377772367848626, "t_last_token_ns": 377776858772205, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "2efe8bd71c08434c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377772894992692, "t_first_token_ns": 377773152226810, "t_last_token_ns": 377778708329701, "prompt_tokens": 3956, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "020fc3681fb84853", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377773049936968, "t_first_token_ns": 377773610524535, "t_last_token_ns": 377778736098455, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "959c5afdd5314a70", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377772997330366, "t_first_token_ns": 377773610117112, "t_last_token_ns": 377778736519760, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "5277baacc9e34acb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377773328794995, "t_first_token_ns": 377773851943301, "t_last_token_ns": 377778759051698, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "0ca787ba70be4c1c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377774658528897, "t_first_token_ns": 377774929084192, "t_last_token_ns": 377780004744246, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "830574cb234a4265", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377774774797001, "t_first_token_ns": 377775175407047, "t_last_token_ns": 377780027279993, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "3a444729ccf645a7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377775271801056, "t_first_token_ns": 377775540748708, "t_last_token_ns": 377780167900585, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "46f723092e27470b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377775558845014, "t_first_token_ns": 377775835269783, "t_last_token_ns": 377780225578623, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "61a1a2b9e274432e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377775832427071, "t_first_token_ns": 377776108096426, "t_last_token_ns": 377780263515520, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "8a2c621174a044c4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377776163377701, "t_first_token_ns": 377776442488208, "t_last_token_ns": 377780326392143, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "9734f37b3cab4c07", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377776574087154, "t_first_token_ns": 377776846066396, "t_last_token_ns": 377780420271440, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "363792fea6a54e40", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377777542228907, "t_first_token_ns": 377777816519132, "t_last_token_ns": 377780740448703, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "604c050896a7407e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377779458614495, "t_first_token_ns": 377779735245804, "t_last_token_ns": 377783694789140, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "4a148254445b47a0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377780781774635, "t_first_token_ns": 377781041012382, "t_last_token_ns": 377785760924179, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "27df370272994ab8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377780915719285, "t_first_token_ns": 377781282310080, "t_last_token_ns": 377785784538570, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "58ba03b6e74c41b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377781136802940, "t_first_token_ns": 377781739465226, "t_last_token_ns": 377785808556102, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "d2d6d1b31884452a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377781079906616, "t_first_token_ns": 377781739683676, "t_last_token_ns": 377785808962458, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "44dda3e17c6342ae", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377781744198989, "t_first_token_ns": 377782010412615, "t_last_token_ns": 377786086682937, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "d691387b34bf49b7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377782077754602, "t_first_token_ns": 377782347251765, "t_last_token_ns": 377786208512307, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "b4a9487e0fb94800", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377783261658694, "t_first_token_ns": 377783530577948, "t_last_token_ns": 377786861352110, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "38a0e6392aeb4f55", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377783854756554, "t_first_token_ns": 377784121707234, "t_last_token_ns": 377787338461399, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "879c0db8712e442a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377784232928150, "t_first_token_ns": 377784507767055, "t_last_token_ns": 377787675996850, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "7c641b1fe17440f8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377785793178032, "t_first_token_ns": 377786068727257, "t_last_token_ns": 377788724888033, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "3211021e5fbe4e6f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377786871253035, "t_first_token_ns": 377787135966258, "t_last_token_ns": 377789494348907, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "16458a97195f4cc5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377787358351730, "t_first_token_ns": 377787621777702, "t_last_token_ns": 377789693973495, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "bdbca50f7cfa4749", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377788308962298, "t_first_token_ns": 377788574283448, "t_last_token_ns": 377790738084432, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "001ba79de57f48ed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377789713696280, "t_first_token_ns": 377789968412103, "t_last_token_ns": 377792720581734, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "0ca05f476efd4db6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377790421662219, "t_first_token_ns": 377790682147633, "t_last_token_ns": 377793730183403, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "f3670bed93d04b82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377791481637648, "t_first_token_ns": 377791741863840, "t_last_token_ns": 377796869927426, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "a3d7ec8fe9de478b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377791698651089, "t_first_token_ns": 377791984820898, "t_last_token_ns": 377796897027195, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "f48046b30d284c7b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377792139663684, "t_first_token_ns": 377792403445670, "t_last_token_ns": 377798135720419, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "baf0c1fcf3e34b53", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377792327997491, "t_first_token_ns": 377792649825719, "t_last_token_ns": 377798164115573, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "8c216c2f65734318", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377793033717139, "t_first_token_ns": 377793298832027, "t_last_token_ns": 377799985407672, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "010a5effba7040a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377794386546763, "t_first_token_ns": 377794651119528, "t_last_token_ns": 377803245338493, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "df08a59fcfcb4f12", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377794690578929, "t_first_token_ns": 377794953559738, "t_last_token_ns": 377803415741790, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "5385a9672ef44bc9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377794897850610, "t_first_token_ns": 377795417909242, "t_last_token_ns": 377803449698524, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "36a061ca6bcc409a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377794813151570, "t_first_token_ns": 377795417524267, "t_last_token_ns": 377803450684016, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "deed8d72fe574d30", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377795112656349, "t_first_token_ns": 377795884085959, "t_last_token_ns": 377803487054435, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "3f8b7cf3f37b454a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377795223124786, "t_first_token_ns": 377795884179008, "t_last_token_ns": 377803487328315, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "046e19e5768f4c91", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377796392405397, "t_first_token_ns": 377796668150501, "t_last_token_ns": 377804087201607, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "0e53ce1b78714d3f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377796878637399, "t_first_token_ns": 377797150229898, "t_last_token_ns": 377804582775384, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "b85a25459b15417c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377796906075761, "t_first_token_ns": 377797609219171, "t_last_token_ns": 377804597338269, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "e8bd533149444a2e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377796919841741, "t_first_token_ns": 377797609839767, "t_last_token_ns": 377804598237485, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "03fabb8e14b84864", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377797201643559, "t_first_token_ns": 377797861126808, "t_last_token_ns": 377804625079233, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "4f76474427644a30", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377798238839125, "t_first_token_ns": 377798519508604, "t_last_token_ns": 377805961625930, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "376f786bdfba41ff", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377798339062232, "t_first_token_ns": 377798774788708, "t_last_token_ns": 377805991456460, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "98e873d1631f4624", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377798755867718, "t_first_token_ns": 377799245073971, "t_last_token_ns": 377806020374378, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "9ce83ec8c1a34a42", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377798670076564, "t_first_token_ns": 377799245374666, "t_last_token_ns": 377806020711568, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "ac2457ed2da7478d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377799030817080, "t_first_token_ns": 377799498990881, "t_last_token_ns": 377806048519126, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "56ae21ec76cd4bf0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377800449595128, "t_first_token_ns": 377800737295893, "t_last_token_ns": 377807636409676, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "f29d3eeb2e1849fc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377800650080941, "t_first_token_ns": 377800999043490, "t_last_token_ns": 377807665255324, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "c9a7d9447a0b4854", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377801145560806, "t_first_token_ns": 377801432271863, "t_last_token_ns": 377807833062521, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
{"req_id": "f47e2adf33644a54", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377801235389050, "t_first_token_ns": 377801693741383, "t_last_token_ns": 377807861207939, "prompt_tokens": 4008, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
{"req_id": "018fa3aae80148b5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377802021180579, "t_first_token_ns": 377802305815343, "t_last_token_ns": 377808080584773, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
{"req_id": "49005cde6f7f48eb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377804134088890, "t_first_token_ns": 377804425312066, "t_last_token_ns": 377809261598465, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "76fdde68d6af4da8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377804809156322, "t_first_token_ns": 377805091394511, "t_last_token_ns": 377809641860573, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "acf0422162f844ae", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377804830788258, "t_first_token_ns": 377805333025680, "t_last_token_ns": 377809652072956, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "d2f0d51564294c4d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377805217894893, "t_first_token_ns": 377805589366864, "t_last_token_ns": 377809671828487, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "0ba04744269e496b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377805654537954, "t_first_token_ns": 377805932801343, "t_last_token_ns": 377809740560126, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "ca9a52e05f1d481a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377806297095593, "t_first_token_ns": 377806572638568, "t_last_token_ns": 377809965263663, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "9207e8da10c44193", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377806333544872, "t_first_token_ns": 377806813082957, "t_last_token_ns": 377810003624445, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "0e095b80aa4043a6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377806655109228, "t_first_token_ns": 377807066336607, "t_last_token_ns": 377810218266304, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "d40409eedd9c4e42", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377809943918005, "t_first_token_ns": 377810214241388, "t_last_token_ns": 377813566981597, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "eb982e2b42f3474a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377810237065135, "t_first_token_ns": 377810492549786, "t_last_token_ns": 377813669801272, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "23007af1678c4139", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377810277248490, "t_first_token_ns": 377810727877577, "t_last_token_ns": 377813678744867, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "8f88466d80f44a5f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377811116712377, "t_first_token_ns": 377811378467899, "t_last_token_ns": 377814119896487, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "ae28d64d7a164f3a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377811161804858, "t_first_token_ns": 377811624342892, "t_last_token_ns": 377814131225904, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "e8204d9908134449", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377811510551707, "t_first_token_ns": 377811870326837, "t_last_token_ns": 377814139939432, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "8d03c79d873947d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377814795217114, "t_first_token_ns": 377815050843260, "t_last_token_ns": 377817928307862, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "2914eb04004a4574", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377815005385742, "t_first_token_ns": 377815290653277, "t_last_token_ns": 377817944393318, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "aafdb1d73ea34478", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377815644659614, "t_first_token_ns": 377815905688434, "t_last_token_ns": 377818441032943, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "8eff35fafd754157", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377816142079567, "t_first_token_ns": 377816401790682, "t_last_token_ns": 377818657155722, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "8f64c12e3bbc4b3f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377817103574051, "t_first_token_ns": 377817364704441, "t_last_token_ns": 377819788458400, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "e393d0047b6942b1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377818649381479, "t_first_token_ns": 377818908813974, "t_last_token_ns": 377821670062440, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "026bce27e5cb4f45", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377818888148700, "t_first_token_ns": 377819147787872, "t_last_token_ns": 377821684747136, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "3ae6710a94784213", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377820361772758, "t_first_token_ns": 377820623048664, "t_last_token_ns": 377823047928492, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "e6305686016e48a2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377820556515769, "t_first_token_ns": 377820864751326, "t_last_token_ns": 377823059283472, "prompt_tokens": 3954, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "41707ba140134233", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377820643411529, "t_first_token_ns": 377821104030390, "t_last_token_ns": 377823063267466, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "f9426c029a354cda", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377823219983660, "t_first_token_ns": 377823471734290, "t_last_token_ns": 377824691723485, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "7fa66112bf044449", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377825341196743, "t_first_token_ns": 377825595771471, "t_last_token_ns": 377828175392032, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "9e1a1a201ac843c4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377826023014505, "t_first_token_ns": 377826282873541, "t_last_token_ns": 377830471915830, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "42964caf98df42f9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377826072714030, "t_first_token_ns": 377826522546545, "t_last_token_ns": 377830495240798, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "7d2c1f1477c7495b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377827378106026, "t_first_token_ns": 377827633187429, "t_last_token_ns": 377833220171220, "prompt_tokens": 3956, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "2628d823d17c4835", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377827742297680, "t_first_token_ns": 377828005039342, "t_last_token_ns": 377833466059748, "prompt_tokens": 4034, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "1a2cce694b9f455b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377828484425701, "t_first_token_ns": 377828745278703, "t_last_token_ns": 377835475257289, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "027586d810344b82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377828690537949, "t_first_token_ns": 377828990769770, "t_last_token_ns": 377835505526986, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "83068b91148d4fe9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377829043537016, "t_first_token_ns": 377829312389800, "t_last_token_ns": 377835659075760, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "112e387486574455", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377829086650141, "t_first_token_ns": 377829553220827, "t_last_token_ns": 377835674743517, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "5d373fc9790740b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377829507112128, "t_first_token_ns": 377829804342132, "t_last_token_ns": 377835703086084, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "9dd80903dfcd4d54", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377829982276093, "t_first_token_ns": 377830258940692, "t_last_token_ns": 377835983986333, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "ea4a74acd591474b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377830700830541, "t_first_token_ns": 377830969195105, "t_last_token_ns": 377836542652819, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "3bf05c0ae3024817", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377831357745603, "t_first_token_ns": 377831635040870, "t_last_token_ns": 377837232102888, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "2ba7bc8f24804489", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377831724828906, "t_first_token_ns": 377832003815902, "t_last_token_ns": 377837346275933, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "d0b04c3fee0843b7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377832116050831, "t_first_token_ns": 377832388304135, "t_last_token_ns": 377837698584760, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "33bbf57d00d743f4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377832322882772, "t_first_token_ns": 377832643118980, "t_last_token_ns": 377837719421764, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "2136362379f74bab", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377833911260422, "t_first_token_ns": 377834194740155, "t_last_token_ns": 377839018826568, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "c7d7bbb4a7554fb8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377833974395043, "t_first_token_ns": 377834665677222, "t_last_token_ns": 377839042946401, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "c75c63aedf9d49e1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377834156253412, "t_first_token_ns": 377834665770026, "t_last_token_ns": 377839043341165, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "22fc2bb059524e50", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377834545357776, "t_first_token_ns": 377834920642248, "t_last_token_ns": 377839062602961, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "9695ccb39db04fa8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377834665930343, "t_first_token_ns": 377835163601973, "t_last_token_ns": 377839070206589, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "8fb2ef8a5d2643ab", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377836717778430, "t_first_token_ns": 377836992879047, "t_last_token_ns": 377840143666760, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "b189474cadc44d40", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377837358153260, "t_first_token_ns": 377837633444496, "t_last_token_ns": 377840389260698, "prompt_tokens": 4035, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "09d4e9fb87e44fd9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377838447085739, "t_first_token_ns": 377838718934510, "t_last_token_ns": 377840840860324, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "f0f5b366f9e74007", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377839680548277, "t_first_token_ns": 377839942287678, "t_last_token_ns": 377841415512779, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "837c2f492f734e77", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377841971837676, "t_first_token_ns": 377842225504592, "t_last_token_ns": 377845055902096, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "ab9cee883e984fa2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377842332546976, "t_first_token_ns": 377842588798104, "t_last_token_ns": 377845270705229, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "2715644827c7443c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377842727262761, "t_first_token_ns": 377842985321515, "t_last_token_ns": 377845713477602, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "f6bf849b3beb4fef", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377843441366051, "t_first_token_ns": 377843702298987, "t_last_token_ns": 377846448404115, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "43b0b26da1ff4ee7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377844233311758, "t_first_token_ns": 377844491410828, "t_last_token_ns": 377846984959857, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "4c9860133cff4d63", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377845348933486, "t_first_token_ns": 377845613184360, "t_last_token_ns": 377847922303156, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "58230279f5644658", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377845845649953, "t_first_token_ns": 377846105584934, "t_last_token_ns": 377848130800061, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "13e47c1fbeb24231", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377847112480198, "t_first_token_ns": 377847375014102, "t_last_token_ns": 377848825322053, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "ac9abe6314eb420d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377849058278880, "t_first_token_ns": 377849313709230, "t_last_token_ns": 377850533614854, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "f119a1dcafb0419e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377851357246142, "t_first_token_ns": 377851610337830, "t_last_token_ns": 377855960681043, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "2bea76ba48b34c09", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377851538987714, "t_first_token_ns": 377851849481375, "t_last_token_ns": 377856012469886, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "3ca6975184bc4605", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377852823087739, "t_first_token_ns": 377853083020520, "t_last_token_ns": 377858799172928, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "4055216303034350", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377852887654997, "t_first_token_ns": 377853326551590, "t_last_token_ns": 377858826676754, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "658e7af818c24d4e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377853543759040, "t_first_token_ns": 377853808143576, "t_last_token_ns": 377859508454219, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "0cffcbfc8a9148e1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377853776570109, "t_first_token_ns": 377854057716742, "t_last_token_ns": 377859535470602, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "45bbc81063dd411e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377853885580552, "t_first_token_ns": 377854518418109, "t_last_token_ns": 377859558050446, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "86bd8f825d5e4a41", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377853906349037, "t_first_token_ns": 377854518695898, "t_last_token_ns": 377859558381955, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "25593abb313546ce", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377854671693124, "t_first_token_ns": 377854946119620, "t_last_token_ns": 377859754325360, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "e3451bca6d29403c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377854946290741, "t_first_token_ns": 377855220361000, "t_last_token_ns": 377859791427257, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "fbec81b5f9e248cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377855180335244, "t_first_token_ns": 377855685965067, "t_last_token_ns": 377859810872157, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "8a0fe11ef9ee4ca2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377855026965300, "t_first_token_ns": 377855686230636, "t_last_token_ns": 377859811109042, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "6b05cb74736042f8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377855954005801, "t_first_token_ns": 377856232884703, "t_last_token_ns": 377859978109765, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "323e0ddc93b54a66", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377857855430403, "t_first_token_ns": 377858131863276, "t_last_token_ns": 377860697693246, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "14602d965298429b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377858886087461, "t_first_token_ns": 377859165573606, "t_last_token_ns": 377860974784364, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "88fe30bdffba4234", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377861089528979, "t_first_token_ns": 377861347173069, "t_last_token_ns": 377867708713762, "prompt_tokens": 4044, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "17e02f02e41448cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377861345760763, "t_first_token_ns": 377861603845604, "t_last_token_ns": 377867802768205, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "b27f73dabdea4d8c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377861547331221, "t_first_token_ns": 377862060786271, "t_last_token_ns": 377867832465392, "prompt_tokens": 4030, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "d174d0a4c5994713", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377861458154236, "t_first_token_ns": 377862060437316, "t_last_token_ns": 377867833026059, "prompt_tokens": 3948, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "479b607207174dbb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377861897277706, "t_first_token_ns": 377862516218467, "t_last_token_ns": 377867860972519, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "63161a7039f24e9e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377861997579442, "t_first_token_ns": 377862516473473, "t_last_token_ns": 377867861480419, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "e88a0a069b1c499b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377862790823527, "t_first_token_ns": 377863056754754, "t_last_token_ns": 377869460416940, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "43cff1583edb46ab", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377863542973174, "t_first_token_ns": 377863809482245, "t_last_token_ns": 377871176522195, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "5ded354b6d4d4914", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377863742013325, "t_first_token_ns": 377864062266894, "t_last_token_ns": 377871205418912, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "41b064dc96fa4298", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377863906715549, "t_first_token_ns": 377864527072068, "t_last_token_ns": 377871233169289, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "e6317a71147c4d15", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377863915160810, "t_first_token_ns": 377864527286895, "t_last_token_ns": 377871233311951, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "9f9f604f19364dd5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377864609675479, "t_first_token_ns": 377864884252833, "t_last_token_ns": 377871376069399, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "d7a7fe31cda44c73", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377864847031479, "t_first_token_ns": 377865136860502, "t_last_token_ns": 377871404367581, "prompt_tokens": 3956, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "a1d43785c48f46f6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377865360970634, "t_first_token_ns": 377865639287385, "t_last_token_ns": 377871680774566, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "90dd2dd43a1b4fff", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377866633489250, "t_first_token_ns": 377866918323708, "t_last_token_ns": 377872654817766, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "5f69a1aeea7344f8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377868076201268, "t_first_token_ns": 377868348650767, "t_last_token_ns": 377874521590008, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "eaf92ad8e9f54c9d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377868116104090, "t_first_token_ns": 377868589640755, "t_last_token_ns": 377874535242655, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "6a96338757754887", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377868636081294, "t_first_token_ns": 377868915074949, "t_last_token_ns": 377874633031356, "prompt_tokens": 4034, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "f0de415a1f3f45af", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377868764418394, "t_first_token_ns": 377869168735544, "t_last_token_ns": 377874655700165, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "6f21a52f59ee4d70", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377869062732553, "t_first_token_ns": 377869423924970, "t_last_token_ns": 377874678706237, "prompt_tokens": 3957, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "d83a33df1fb34b9e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377869297992955, "t_first_token_ns": 377869903154147, "t_last_token_ns": 377874701518428, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "b193493d275e4180", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377869205693652, "t_first_token_ns": 377869903482533, "t_last_token_ns": 377874701791685, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "c271111700874e49", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377869319154909, "t_first_token_ns": 377870133781478, "t_last_token_ns": 377874711460549, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "4bc1ab08307b43fd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377870104563007, "t_first_token_ns": 377870391564200, "t_last_token_ns": 377874726391948, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "8885e9fbc7804188", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377872683653204, "t_first_token_ns": 377872954333711, "t_last_token_ns": 377878510702167, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "2570e48e7a344424", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377872948678781, "t_first_token_ns": 377873234416765, "t_last_token_ns": 377878590185615, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "ae8cc3d5d15a45c6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377873477679469, "t_first_token_ns": 377873752257389, "t_last_token_ns": 377879076936805, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "a68933b904d44d27", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377874972806395, "t_first_token_ns": 377875238129160, "t_last_token_ns": 377881353462263, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "ccd82e647b67456d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377875128040047, "t_first_token_ns": 377875480874302, "t_last_token_ns": 377881381293227, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "972b1249566f4a3f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377875905587163, "t_first_token_ns": 377876172209759, "t_last_token_ns": 377882064332391, "prompt_tokens": 4025, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "b66388e3178d4ad0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377876412215458, "t_first_token_ns": 377876684794460, "t_last_token_ns": 377882485953867, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "64f30f0b34a64844", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377876852805055, "t_first_token_ns": 377877333788495, "t_last_token_ns": 377882742868566, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "af318d2a9bbb4b69", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377876845400860, "t_first_token_ns": 377877334144086, "t_last_token_ns": 377882743405873, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "6917d608581c43c5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377877198400277, "t_first_token_ns": 377877800722496, "t_last_token_ns": 377882766273606, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "3bd85e1ee25c4085", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377876903206670, "t_first_token_ns": 377877801239947, "t_last_token_ns": 377882766685122, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "f01c589f495e42ed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377877649894232, "t_first_token_ns": 377878055569959, "t_last_token_ns": 377882785916883, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "f825e5a1107249f5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377878023254941, "t_first_token_ns": 377878310962331, "t_last_token_ns": 377882805073743, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "dcb787f5ca13473f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377878524239520, "t_first_token_ns": 377878806700201, "t_last_token_ns": 377882975138707, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "ce0a51c637cd4fb2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377879249547285, "t_first_token_ns": 377879527013738, "t_last_token_ns": 377883236365077, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "6a6395184d2c4d1b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377880038119865, "t_first_token_ns": 377880323640445, "t_last_token_ns": 377883721419724, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "e064c8e5e8c7460b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377880402305052, "t_first_token_ns": 377880678358124, "t_last_token_ns": 377883774568474, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "31e6de0a2d0d41b4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377883375460448, "t_first_token_ns": 377883630164558, "t_last_token_ns": 377885178754071, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "f4942ec93fc1422e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377884629542555, "t_first_token_ns": 377884889123285, "t_last_token_ns": 377887049483531, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "93c5a6bf8a934ba2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377885331344943, "t_first_token_ns": 377885587583056, "t_last_token_ns": 377887873163673, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "5ce807bf7ce64697", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377885682312015, "t_first_token_ns": 377885942150644, "t_last_token_ns": 377887996143806, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "a33daddcd83949a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377887188774719, "t_first_token_ns": 377887451593327, "t_last_token_ns": 377889133251224, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "8393acc5702940fb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377888524933378, "t_first_token_ns": 377888781590884, "t_last_token_ns": 377891193531321, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "a41124a0a91c4986", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377889147345632, "t_first_token_ns": 377889402943538, "t_last_token_ns": 377891685860316, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "d68286b8ec55445c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377889673178897, "t_first_token_ns": 377889929189316, "t_last_token_ns": 377891985739464, "prompt_tokens": 3952, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "f290accda5f442c9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377890125430954, "t_first_token_ns": 377890390035266, "t_last_token_ns": 377892133535176, "prompt_tokens": 4021, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "a261a6e6a0a84c19", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377892766841935, "t_first_token_ns": 377893020766647, "t_last_token_ns": 377897488430790, "prompt_tokens": 3940, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "f7b7cda8556c435c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377892962676592, "t_first_token_ns": 377893474511570, "t_last_token_ns": 377897510788215, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "5082828bf3314c94", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377892822979757, "t_first_token_ns": 377893474270972, "t_last_token_ns": 377897511378450, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "61832ef903cd406c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377893675888133, "t_first_token_ns": 377893931850302, "t_last_token_ns": 377898114619071, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "7fef0c99d0204a64", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377894571993455, "t_first_token_ns": 377894835680891, "t_last_token_ns": 377900107621903, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "dd6213be70484fae", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377895166640731, "t_first_token_ns": 377895437825332, "t_last_token_ns": 377902717567128, "prompt_tokens": 4051, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "fd700f7552ae44f3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377895305436363, "t_first_token_ns": 377895684230071, "t_last_token_ns": 377902775951232, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "057b8cc63a21437a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377895924700221, "t_first_token_ns": 377896201872039, "t_last_token_ns": 377903400875405, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "31f7d35dc87b4561", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377896395084711, "t_first_token_ns": 377896671369766, "t_last_token_ns": 377903809120838, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "a867dd7e55634047", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377896763383888, "t_first_token_ns": 377897034719721, "t_last_token_ns": 377903982596590, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "7b7daefd291846de", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377897738443023, "t_first_token_ns": 377898005186216, "t_last_token_ns": 377904868146823, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "bbe6e5ce16464038", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377898094376027, "t_first_token_ns": 377898365452382, "t_last_token_ns": 377905052383582, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "c27cd43250a94f24", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377898444401676, "t_first_token_ns": 377898715343015, "t_last_token_ns": 377905212441563, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "35fad25c744840b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377899080486718, "t_first_token_ns": 377899350770477, "t_last_token_ns": 377905699530183, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "86bbc71b59684284", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377899458038694, "t_first_token_ns": 377899743066646, "t_last_token_ns": 377905860088494, "prompt_tokens": 4021, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "cde21710fe714026", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377900160750698, "t_first_token_ns": 377900436202904, "t_last_token_ns": 377907238699921, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "a952080ef8a74768", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377900361652120, "t_first_token_ns": 377900688780092, "t_last_token_ns": 377907267970494, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "a4e378c561ca498a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377900619390236, "t_first_token_ns": 377900943405213, "t_last_token_ns": 377907294925791, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "1cae13700a054ca0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377901024145780, "t_first_token_ns": 377901312943433, "t_last_token_ns": 377907429513633, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "b4f9174e8c444941", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377901049589411, "t_first_token_ns": 377901552139717, "t_last_token_ns": 377907442614528, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "a40f6d4cdd0b4fe5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377901241456547, "t_first_token_ns": 377901792246232, "t_last_token_ns": 377907453749412, "prompt_tokens": 3946, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "ed294e9f7ed24bb3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377901673623016, "t_first_token_ns": 377902047215998, "t_last_token_ns": 377907474142132, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "3fcd90c4cb27438c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377901975831600, "t_first_token_ns": 377902307720434, "t_last_token_ns": 377907495885317, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "1c240e5a3ee24ca1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377902324672713, "t_first_token_ns": 377902603831767, "t_last_token_ns": 377907533242305, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
{"req_id": "ce3bcefc47c94fef", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377902705797255, "t_first_token_ns": 377903002788038, "t_last_token_ns": 377907630519685, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
{"req_id": "7c5a57cf072f4549", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377905935088614, "t_first_token_ns": 377906209224457, "t_last_token_ns": 377909220622472, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "c880b8ace9624eaa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377906350048054, "t_first_token_ns": 377906850862143, "t_last_token_ns": 377909552265307, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "5baad3e1c6284c4b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377906363328857, "t_first_token_ns": 377906851545100, "t_last_token_ns": 377909552706971, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "528d9bec9a4c4296", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377906401710535, "t_first_token_ns": 377907091509929, "t_last_token_ns": 377909557549796, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "3e796b13d4b64bf5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377909284286681, "t_first_token_ns": 377909546271328, "t_last_token_ns": 377913530076304, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "73d9d1c34c7a4e7b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377909885235202, "t_first_token_ns": 377910143483466, "t_last_token_ns": 377915219864778, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "a60f2890a4de4429", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377910003477490, "t_first_token_ns": 377910384596452, "t_last_token_ns": 377915246558092, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "eb082451b0c94585", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377910535497491, "t_first_token_ns": 377910801260532, "t_last_token_ns": 377915546107978, "prompt_tokens": 4029, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "e6169dc5dd204efa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377910711863142, "t_first_token_ns": 377911044563569, "t_last_token_ns": 377915567526062, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "4141b44a9ceb4f33", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377911808096441, "t_first_token_ns": 377912070548878, "t_last_token_ns": 377916553913080, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "f424b90520824da8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377911826506547, "t_first_token_ns": 377912309577142, "t_last_token_ns": 377916564680758, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "2dd75bc233444183", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377911983021278, "t_first_token_ns": 377912548012580, "t_last_token_ns": 377916574146907, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "9ed986ed536b4462", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377912264202505, "t_first_token_ns": 377912787244497, "t_last_token_ns": 377916582531169, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "ae5f6622fac44c2e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377913464926302, "t_first_token_ns": 377913746556402, "t_last_token_ns": 377917045106635, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "a9ca64e01caf4181", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377913544791473, "t_first_token_ns": 377913997402769, "t_last_token_ns": 377917056952708, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "a426d6d483b2422e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377913733811530, "t_first_token_ns": 377914237744230, "t_last_token_ns": 377917060983310, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 11, "error": null}

View File

@@ -0,0 +1,8 @@
{
"rate": 1.5,
"input_tokens": 4096,
"output_tokens": 256,
"duration_target_s": 240.0,
"duration_actual_s": 242.44389323098585,
"n_requests": 362
}

View File

@@ -0,0 +1,336 @@
{
"mooncake_both": {
"config": "mooncake_both",
"n_steps_total": 18175,
"n_steps_after_warmup": 17675,
"n_steps_decode_only": 17329,
"decode_share": 0.9804243281471005,
"rows_used_for_fit": "decode_only",
"cache_size_max": 17528,
"per_bin": [
{
"bin_id": 1,
"cache_size_mid": 2629.2,
"n": 282,
"cache_size_p50": 2099,
"step_duration_us_p50": 276,
"step_duration_us_p90": 430,
"build_meta_us_p50": 207,
"build_meta_us_p90": 339
},
{
"bin_id": 2,
"cache_size_mid": 4382.0,
"n": 115,
"cache_size_p50": 5158,
"step_duration_us_p50": 763,
"step_duration_us_p90": 993,
"build_meta_us_p50": 609,
"build_meta_us_p90": 831
},
{
"bin_id": 3,
"cache_size_mid": 6134.8,
"n": 120,
"cache_size_p50": 5743,
"step_duration_us_p50": 960,
"step_duration_us_p90": 1055,
"build_meta_us_p50": 815,
"build_meta_us_p90": 889
},
{
"bin_id": 4,
"cache_size_mid": 7887.599999999999,
"n": 248,
"cache_size_p50": 8122,
"step_duration_us_p50": 972,
"step_duration_us_p90": 1254,
"build_meta_us_p50": 834,
"build_meta_us_p90": 1102
},
{
"bin_id": 5,
"cache_size_mid": 9640.4,
"n": 508,
"cache_size_p50": 10055,
"step_duration_us_p50": 941,
"step_duration_us_p90": 1356,
"build_meta_us_p50": 860,
"build_meta_us_p90": 1237
},
{
"bin_id": 6,
"cache_size_mid": 11393.199999999999,
"n": 220,
"cache_size_p50": 11093,
"step_duration_us_p50": 1025,
"step_duration_us_p90": 1456,
"build_meta_us_p50": 943,
"build_meta_us_p90": 1352
},
{
"bin_id": 7,
"cache_size_mid": 13146.0,
"n": 679,
"cache_size_p50": 13205,
"step_duration_us_p50": 1135,
"step_duration_us_p90": 1616,
"build_meta_us_p50": 1060,
"build_meta_us_p90": 1515
},
{
"bin_id": 8,
"cache_size_mid": 14898.8,
"n": 308,
"cache_size_p50": 15056,
"step_duration_us_p50": 1261,
"step_duration_us_p90": 1881,
"build_meta_us_p50": 1174,
"build_meta_us_p90": 1769
},
{
"bin_id": 9,
"cache_size_mid": 16651.6,
"n": 14849,
"cache_size_p50": 17525,
"step_duration_us_p50": 1528,
"step_duration_us_p90": 2208,
"build_meta_us_p50": 1442,
"build_meta_us_p90": 2079
}
],
"fit_step_duration": {
"slope_us_per_block": 0.08561383027523924,
"intercept_us": 194.4061324441241
},
"fit_build_meta": {
"slope_us_per_block": 0.08639183976007032,
"intercept_us": 87.77312313364038
},
"worker_summary": {
"n": 18175,
"get_finished_us_p50": 180,
"get_finished_us_p90": 257,
"get_finished_us_p99": 333,
"start_load_kv_us_p50": 2,
"start_load_kv_us_p90": 5
}
},
"noop_connector": {
"config": "noop_connector",
"n_steps_total": 16627,
"n_steps_after_warmup": 16127,
"n_steps_decode_only": 15770,
"decode_share": 0.9778632107645563,
"rows_used_for_fit": "decode_only",
"cache_size_max": 17529,
"per_bin": [
{
"bin_id": 2,
"cache_size_mid": 4382.25,
"n": 270,
"cache_size_p50": 4706,
"step_duration_us_p50": 100,
"step_duration_us_p90": 126,
"build_meta_us_p50": 0,
"build_meta_us_p90": 1
},
{
"bin_id": 3,
"cache_size_mid": 6135.150000000001,
"n": 173,
"cache_size_p50": 6799,
"step_duration_us_p50": 110,
"step_duration_us_p90": 128,
"build_meta_us_p50": 0,
"build_meta_us_p90": 1
},
{
"bin_id": 4,
"cache_size_mid": 7888.05,
"n": 437,
"cache_size_p50": 7908,
"step_duration_us_p50": 84,
"step_duration_us_p90": 116,
"build_meta_us_p50": 0,
"build_meta_us_p90": 1
},
{
"bin_id": 5,
"cache_size_mid": 9640.95,
"n": 17,
"cache_size_p50": 8976,
"step_duration_us_p50": 101,
"step_duration_us_p90": 123,
"build_meta_us_p50": 0,
"build_meta_us_p90": 1
},
{
"bin_id": 6,
"cache_size_mid": 11393.85,
"n": 408,
"cache_size_p50": 11873,
"step_duration_us_p50": 130,
"step_duration_us_p90": 164,
"build_meta_us_p50": 0,
"build_meta_us_p90": 1
},
{
"bin_id": 7,
"cache_size_mid": 13146.75,
"n": 458,
"cache_size_p50": 13721,
"step_duration_us_p50": 73,
"step_duration_us_p90": 100,
"build_meta_us_p50": 0,
"build_meta_us_p90": 1
},
{
"bin_id": 8,
"cache_size_mid": 14899.650000000001,
"n": 621,
"cache_size_p50": 14799,
"step_duration_us_p50": 67,
"step_duration_us_p90": 101,
"build_meta_us_p50": 0,
"build_meta_us_p90": 1
},
{
"bin_id": 9,
"cache_size_mid": 16652.55,
"n": 13386,
"cache_size_p50": 17527,
"step_duration_us_p50": 79,
"step_duration_us_p90": 139,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
}
],
"fit_step_duration": {
"slope_us_per_block": -0.0007809282168828985,
"intercept_us": 102.62857247143907
},
"fit_build_meta": {
"slope_us_per_block": -3.064002303475271e-05,
"intercept_us": 0.651343083865946
},
"worker_summary": {
"n": 16627,
"get_finished_us_p50": 0,
"get_finished_us_p90": 0,
"get_finished_us_p99": 2,
"start_load_kv_us_p50": 0,
"start_load_kv_us_p90": 1
}
},
"plain": {
"config": "plain",
"n_steps_total": 16192,
"n_steps_after_warmup": 15692,
"n_steps_decode_only": 15333,
"decode_share": 0.9771221004333418,
"rows_used_for_fit": "decode_only",
"cache_size_max": 17529,
"per_bin": [
{
"bin_id": 1,
"cache_size_mid": 2629.3500000000004,
"n": 555,
"cache_size_p50": 3149,
"step_duration_us_p50": 59,
"step_duration_us_p90": 79,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
},
{
"bin_id": 2,
"cache_size_mid": 4382.25,
"n": 505,
"cache_size_p50": 4733,
"step_duration_us_p50": 69,
"step_duration_us_p90": 95,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
},
{
"bin_id": 3,
"cache_size_mid": 6135.150000000001,
"n": 173,
"cache_size_p50": 5533,
"step_duration_us_p50": 74,
"step_duration_us_p90": 99,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
},
{
"bin_id": 4,
"cache_size_mid": 7888.05,
"n": 132,
"cache_size_p50": 8093,
"step_duration_us_p50": 139,
"step_duration_us_p90": 185,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
},
{
"bin_id": 5,
"cache_size_mid": 9640.95,
"n": 241,
"cache_size_p50": 9736,
"step_duration_us_p50": 125,
"step_duration_us_p90": 161,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
},
{
"bin_id": 6,
"cache_size_mid": 11393.85,
"n": 267,
"cache_size_p50": 10826,
"step_duration_us_p50": 82,
"step_duration_us_p90": 104,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
},
{
"bin_id": 7,
"cache_size_mid": 13146.75,
"n": 402,
"cache_size_p50": 12938,
"step_duration_us_p50": 91,
"step_duration_us_p90": 128,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
},
{
"bin_id": 8,
"cache_size_mid": 14899.650000000001,
"n": 85,
"cache_size_p50": 15743,
"step_duration_us_p50": 136,
"step_duration_us_p90": 163,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
},
{
"bin_id": 9,
"cache_size_mid": 16652.55,
"n": 12973,
"cache_size_p50": 17527,
"step_duration_us_p50": 84,
"step_duration_us_p90": 141,
"build_meta_us_p50": 0,
"build_meta_us_p90": 0
}
],
"fit_step_duration": {
"slope_us_per_block": 0.0010177705924683715,
"intercept_us": 74.7264113650295
},
"fit_build_meta": {
"slope_us_per_block": 0.0,
"intercept_us": 0.0
},
"worker_summary": null
}
}

View File

@@ -0,0 +1,624 @@
# HELP python_gc_objects_collected_total Objects collected during gc
# TYPE python_gc_objects_collected_total counter
python_gc_objects_collected_total{generation="0"} 11967.0
python_gc_objects_collected_total{generation="1"} 1552.0
python_gc_objects_collected_total{generation="2"} 855.0
# HELP python_gc_objects_uncollectable_total Uncollectable objects found during GC
# TYPE python_gc_objects_uncollectable_total counter
python_gc_objects_uncollectable_total{generation="0"} 0.0
python_gc_objects_uncollectable_total{generation="1"} 0.0
python_gc_objects_uncollectable_total{generation="2"} 0.0
# HELP python_gc_collections_total Number of times this generation was collected
# TYPE python_gc_collections_total counter
python_gc_collections_total{generation="0"} 1341.0
python_gc_collections_total{generation="1"} 121.0
python_gc_collections_total{generation="2"} 9.0
# HELP python_info Python platform information
# TYPE python_info gauge
python_info{implementation="CPython",major="3",minor="12",patchlevel="3",version="3.12.3"} 1.0
# HELP process_virtual_memory_bytes Virtual memory size in bytes.
# TYPE process_virtual_memory_bytes gauge
process_virtual_memory_bytes 3.8945837056e+010
# HELP process_resident_memory_bytes Resident memory size in bytes.
# TYPE process_resident_memory_bytes gauge
process_resident_memory_bytes 1.350602752e+09
# HELP process_start_time_seconds Start time of the process since unix epoch in seconds.
# TYPE process_start_time_seconds gauge
process_start_time_seconds 1.77980807988e+09
# HELP process_cpu_seconds_total Total user and system CPU time spent in seconds.
# TYPE process_cpu_seconds_total counter
process_cpu_seconds_total 41.7
# HELP process_open_fds Number of open file descriptors.
# TYPE process_open_fds gauge
process_open_fds 64.0
# HELP process_max_fds Maximum number of open file descriptors.
# TYPE process_max_fds gauge
process_max_fds 65535.0
# HELP vllm:estimated_flops_per_gpu_total Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_flops_per_gpu_total counter
vllm:estimated_flops_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:estimated_flops_per_gpu_created Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_flops_per_gpu_created gauge
vllm:estimated_flops_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808185822307e+09
# HELP vllm:estimated_read_bytes_per_gpu_total Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_read_bytes_per_gpu_total counter
vllm:estimated_read_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:estimated_read_bytes_per_gpu_created Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_read_bytes_per_gpu_created gauge
vllm:estimated_read_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858223376e+09
# HELP vllm:estimated_write_bytes_per_gpu_total Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_write_bytes_per_gpu_total counter
vllm:estimated_write_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:estimated_write_bytes_per_gpu_created Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
# TYPE vllm:estimated_write_bytes_per_gpu_created gauge
vllm:estimated_write_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858223584e+09
# HELP vllm:num_requests_running Number of requests in model execution batches.
# TYPE vllm:num_requests_running gauge
vllm:num_requests_running{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:num_requests_waiting Number of requests waiting to be processed.
# TYPE vllm:num_requests_waiting gauge
vllm:num_requests_waiting{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:engine_sleep_state Engine sleep state; awake = 0 means engine is sleeping; awake = 1 means engine is awake; weights_offloaded = 1 means sleep level 1; discard_all = 1 means sleep level 2.
# TYPE vllm:engine_sleep_state gauge
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="awake"} 1.0
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="weights_offloaded"} 0.0
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="discard_all"} 0.0
# HELP vllm:kv_cache_usage_perc KV-cache usage. 1 means 100 percent usage.
# TYPE vllm:kv_cache_usage_perc gauge
vllm:kv_cache_usage_perc{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prefix_cache_queries_total Prefix cache queries, in terms of number of queried tokens.
# TYPE vllm:prefix_cache_queries_total counter
vllm:prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.483835e+06
# HELP vllm:prefix_cache_queries_created Prefix cache queries, in terms of number of queried tokens.
# TYPE vllm:prefix_cache_queries_created gauge
vllm:prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808185822533e+09
# HELP vllm:prefix_cache_hits_total Prefix cache hits, in terms of number of cached tokens.
# TYPE vllm:prefix_cache_hits_total counter
vllm:prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prefix_cache_hits_created Prefix cache hits, in terms of number of cached tokens.
# TYPE vllm:prefix_cache_hits_created gauge
vllm:prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858225553e+09
# HELP vllm:external_prefix_cache_queries_total External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
# TYPE vllm:external_prefix_cache_queries_total counter
vllm:external_prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:external_prefix_cache_queries_created External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
# TYPE vllm:external_prefix_cache_queries_created gauge
vllm:external_prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808185822571e+09
# HELP vllm:external_prefix_cache_hits_total External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
# TYPE vllm:external_prefix_cache_hits_total counter
vllm:external_prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:external_prefix_cache_hits_created External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
# TYPE vllm:external_prefix_cache_hits_created gauge
vllm:external_prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858225827e+09
# HELP vllm:mm_cache_queries_total Multi-modal cache queries, in terms of number of queried items.
# TYPE vllm:mm_cache_queries_total counter
vllm:mm_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:mm_cache_queries_created Multi-modal cache queries, in terms of number of queried items.
# TYPE vllm:mm_cache_queries_created gauge
vllm:mm_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858225946e+09
# HELP vllm:mm_cache_hits_total Multi-modal cache hits, in terms of number of cached items.
# TYPE vllm:mm_cache_hits_total counter
vllm:mm_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:mm_cache_hits_created Multi-modal cache hits, in terms of number of cached items.
# TYPE vllm:mm_cache_hits_created gauge
vllm:mm_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858226066e+09
# HELP vllm:num_preemptions_total Cumulative number of preemption from the engine.
# TYPE vllm:num_preemptions_total counter
vllm:num_preemptions_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:num_preemptions_created Cumulative number of preemption from the engine.
# TYPE vllm:num_preemptions_created gauge
vllm:num_preemptions_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858226185e+09
# HELP vllm:prompt_tokens_total Number of prefill tokens processed.
# TYPE vllm:prompt_tokens_total counter
vllm:prompt_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.483835e+06
# HELP vllm:prompt_tokens_created Number of prefill tokens processed.
# TYPE vllm:prompt_tokens_created gauge
vllm:prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858226302e+09
# HELP vllm:prompt_tokens_by_source_total Number of prompt tokens by source.
# TYPE vllm:prompt_tokens_by_source_total counter
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.483835e+06
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 0.0
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 0.0
# HELP vllm:prompt_tokens_by_source_created Number of prompt tokens by source.
# TYPE vllm:prompt_tokens_by_source_created gauge
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.779808185822646e+09
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 1.7798081858226511e+09
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 1.7798081858226562e+09
# HELP vllm:prompt_tokens_cached_total Number of cached prompt tokens (local + external).
# TYPE vllm:prompt_tokens_cached_total counter
vllm:prompt_tokens_cached_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prompt_tokens_cached_created Number of cached prompt tokens (local + external).
# TYPE vllm:prompt_tokens_cached_created gauge
vllm:prompt_tokens_cached_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858226674e+09
# HELP vllm:prompt_tokens_recomputed_total Number of cached tokens recomputed for forward pass.
# TYPE vllm:prompt_tokens_recomputed_total counter
vllm:prompt_tokens_recomputed_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:prompt_tokens_recomputed_created Number of cached tokens recomputed for forward pass.
# TYPE vllm:prompt_tokens_recomputed_created gauge
vllm:prompt_tokens_recomputed_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858226821e+09
# HELP vllm:generation_tokens_total Number of generation tokens processed.
# TYPE vllm:generation_tokens_total counter
vllm:generation_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 95232.0
# HELP vllm:generation_tokens_created Number of generation tokens processed.
# TYPE vllm:generation_tokens_created gauge
vllm:generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858226945e+09
# HELP vllm:request_success_total Count of successfully processed requests.
# TYPE vllm:request_success_total counter
vllm:request_success_total{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_success_total{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_success_total{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_success_total{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_success_total{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
# HELP vllm:request_success_created Count of successfully processed requests.
# TYPE vllm:request_success_created gauge
vllm:request_success_created{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858227305e+09
vllm:request_success_created{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858227377e+09
vllm:request_success_created{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858227463e+09
vllm:request_success_created{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858227522e+09
vllm:request_success_created{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808185822759e+09
# HELP vllm:request_prompt_tokens Number of prefill tokens processed.
# TYPE vllm:request_prompt_tokens histogram
vllm:request_prompt_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prompt_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prompt_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prompt_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prompt_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prompt_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prompt_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prompt_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prompt_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prompt_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.483835e+06
# HELP vllm:request_prompt_tokens_created Number of prefill tokens processed.
# TYPE vllm:request_prompt_tokens_created gauge
vllm:request_prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858228157e+09
# HELP vllm:request_generation_tokens Number of generation tokens processed.
# TYPE vllm:request_generation_tokens histogram
vllm:request_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 95232.0
# HELP vllm:request_generation_tokens_created Number of generation tokens processed.
# TYPE vllm:request_generation_tokens_created gauge
vllm:request_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858228867e+09
# HELP vllm:iteration_tokens_total Histogram of number of tokens per engine_step.
# TYPE vllm:iteration_tokens_total histogram
vllm:iteration_tokens_total_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1612.0
vllm:iteration_tokens_total_bucket{engine="0",le="8.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 12089.0
vllm:iteration_tokens_total_bucket{engine="0",le="16.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15377.0
vllm:iteration_tokens_total_bucket{engine="0",le="32.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15824.0
vllm:iteration_tokens_total_bucket{engine="0",le="64.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15824.0
vllm:iteration_tokens_total_bucket{engine="0",le="128.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15824.0
vllm:iteration_tokens_total_bucket{engine="0",le="256.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15824.0
vllm:iteration_tokens_total_bucket{engine="0",le="512.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15824.0
vllm:iteration_tokens_total_bucket{engine="0",le="1024.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15824.0
vllm:iteration_tokens_total_bucket{engine="0",le="2048.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15824.0
vllm:iteration_tokens_total_bucket{engine="0",le="4096.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16156.0
vllm:iteration_tokens_total_bucket{engine="0",le="8192.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16176.0
vllm:iteration_tokens_total_bucket{engine="0",le="16384.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16176.0
vllm:iteration_tokens_total_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16176.0
vllm:iteration_tokens_total_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16176.0
vllm:iteration_tokens_total_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.579067e+06
# HELP vllm:iteration_tokens_total_created Histogram of number of tokens per engine_step.
# TYPE vllm:iteration_tokens_total_created gauge
vllm:iteration_tokens_total_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858229313e+09
# HELP vllm:request_max_num_generation_tokens Histogram of maximum number of requested generation tokens.
# TYPE vllm:request_max_num_generation_tokens histogram
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_max_num_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 95232.0
# HELP vllm:request_max_num_generation_tokens_created Histogram of maximum number of requested generation tokens.
# TYPE vllm:request_max_num_generation_tokens_created gauge
vllm:request_max_num_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858229792e+09
# HELP vllm:request_params_n Histogram of the n request parameter.
# TYPE vllm:request_params_n histogram
vllm:request_params_n_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_n_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_n_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_n_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_n_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_n_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_n_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_n_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
# HELP vllm:request_params_n_created Histogram of the n request parameter.
# TYPE vllm:request_params_n_created gauge
vllm:request_params_n_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858230217e+09
# HELP vllm:request_params_max_tokens Histogram of the max_tokens request parameter.
# TYPE vllm:request_params_max_tokens histogram
vllm:request_params_max_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_params_max_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_params_max_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 95232.0
# HELP vllm:request_params_max_tokens_created Histogram of the max_tokens request parameter.
# TYPE vllm:request_params_max_tokens_created gauge
vllm:request_params_max_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858230462e+09
# HELP vllm:time_to_first_token_seconds Histogram of time to first token in seconds.
# TYPE vllm:time_to_first_token_seconds histogram
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.001",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.005",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.02",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.04",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.06",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.08",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.25",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 5.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 318.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 362.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 368.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="160.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="640.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="2560.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:time_to_first_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 132.1748490333557
# HELP vllm:time_to_first_token_seconds_created Histogram of time to first token in seconds.
# TYPE vllm:time_to_first_token_seconds_created gauge
vllm:time_to_first_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858230832e+09
# HELP vllm:inter_token_latency_seconds Histogram of inter-token latency in seconds.
# TYPE vllm:inter_token_latency_seconds histogram
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 30101.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 90102.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92270.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92321.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92321.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92321.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 92482.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94700.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94700.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94847.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 94860.0
vllm:inter_token_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1720.9519141139463
# HELP vllm:inter_token_latency_seconds_created Histogram of inter-token latency in seconds.
# TYPE vllm:inter_token_latency_seconds_created gauge
vllm:inter_token_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858231328e+09
# HELP vllm:request_time_per_output_token_seconds Histogram of time_per_output_token_seconds per request.
# TYPE vllm:request_time_per_output_token_seconds histogram
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 40.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 319.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_time_per_output_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 6.74883103574097
# HELP vllm:request_time_per_output_token_seconds_created Histogram of time_per_output_token_seconds per request.
# TYPE vllm:request_time_per_output_token_seconds_created gauge
vllm:request_time_per_output_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808185823172e+09
# HELP vllm:e2e_request_latency_seconds Histogram of e2e request latency in seconds.
# TYPE vllm:e2e_request_latency_seconds histogram
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 8.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 26.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 195.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:e2e_request_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1853.0146126747131
# HELP vllm:e2e_request_latency_seconds_created Histogram of e2e request latency in seconds.
# TYPE vllm:e2e_request_latency_seconds_created gauge
vllm:e2e_request_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808185824654e+09
# HELP vllm:request_queue_time_seconds Histogram of time spent in WAITING phase for request.
# TYPE vllm:request_queue_time_seconds histogram
vllm:request_queue_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_queue_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0028111001010984182
# HELP vllm:request_queue_time_seconds_created Histogram of time spent in WAITING phase for request.
# TYPE vllm:request_queue_time_seconds_created gauge
vllm:request_queue_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858247194e+09
# HELP vllm:request_inference_time_seconds Histogram of time spent in RUNNING phase for request.
# TYPE vllm:request_inference_time_seconds histogram
vllm:request_inference_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_inference_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 8.0
vllm:request_inference_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 27.0
vllm:request_inference_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 201.0
vllm:request_inference_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_inference_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1831.554692038335
# HELP vllm:request_inference_time_seconds_created Histogram of time spent in RUNNING phase for request.
# TYPE vllm:request_inference_time_seconds_created gauge
vllm:request_inference_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779808185824763e+09
# HELP vllm:request_prefill_time_seconds Histogram of time spent in PREFILL phase for request.
# TYPE vllm:request_prefill_time_seconds histogram
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 304.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 355.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 369.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 371.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 110.60277792438865
# HELP vllm:request_prefill_time_seconds_created Histogram of time spent in PREFILL phase for request.
# TYPE vllm:request_prefill_time_seconds_created gauge
vllm:request_prefill_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858248146e+09
# HELP vllm:request_decode_time_seconds Histogram of time spent in DECODE phase for request.
# TYPE vllm:request_decode_time_seconds histogram
vllm:request_decode_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_decode_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
vllm:request_decode_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 14.0
vllm:request_decode_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 35.0
vllm:request_decode_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 224.0
vllm:request_decode_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_decode_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1720.9519141139463
# HELP vllm:request_decode_time_seconds_created Histogram of time spent in DECODE phase for request.
# TYPE vllm:request_decode_time_seconds_created gauge
vllm:request_decode_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858248522e+09
# HELP vllm:request_prefill_kv_computed_tokens Histogram of new KV tokens computed during prefill (excluding cached tokens).
# TYPE vllm:request_prefill_kv_computed_tokens histogram
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_kv_computed_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 372.0
vllm:request_prefill_kv_computed_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.483835e+06
# HELP vllm:request_prefill_kv_computed_tokens_created Histogram of new KV tokens computed during prefill (excluding cached tokens).
# TYPE vllm:request_prefill_kv_computed_tokens_created gauge
vllm:request_prefill_kv_computed_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798081858249016e+09
# HELP vllm:cache_config_info Information of the LLMEngine CacheConfig
# TYPE vllm:cache_config_info gauge
vllm:cache_config_info{_block_size_resolved="True",block_size="16",cache_dtype="auto",calculate_kv_scales="False",cpu_kvcache_space_bytes="None",enable_prefix_caching="True",engine="0",gpu_memory_utilization="0.9",is_attention_free="False",kv_cache_memory_bytes="None",kv_offloading_backend="native",kv_offloading_size="None",kv_sharing_fast_prefill="False",mamba_block_size="None",mamba_cache_dtype="auto",mamba_cache_mode="none",mamba_page_size_padded="None",mamba_ssm_cache_dtype="auto",num_cpu_blocks="None",num_gpu_blocks="17590",num_gpu_blocks_override="None",prefix_caching_hash_algo="sha256",sliding_window="None",user_specified_block_size="False"} 1.0
# HELP http_requests_total Total number of requests by method, status and handler.
# TYPE http_requests_total counter
http_requests_total{handler="/v1/models",method="GET",status="2xx"} 1.0
http_requests_total{handler="/v1/chat/completions",method="POST",status="2xx"} 372.0
# HELP http_requests_created Total number of requests by method, status and handler.
# TYPE http_requests_created gauge
http_requests_created{handler="/v1/models",method="GET",status="2xx"} 1.7798081876221206e+09
http_requests_created{handler="/v1/chat/completions",method="POST",status="2xx"} 1.7798081937688167e+09
# HELP http_request_size_bytes Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_request_size_bytes summary
http_request_size_bytes_count{handler="/v1/models"} 1.0
http_request_size_bytes_sum{handler="/v1/models"} 0.0
http_request_size_bytes_count{handler="/v1/chat/completions"} 372.0
http_request_size_bytes_sum{handler="/v1/chat/completions"} 1.961928e+06
# HELP http_request_size_bytes_created Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_request_size_bytes_created gauge
http_request_size_bytes_created{handler="/v1/models"} 1.7798081876221485e+09
http_request_size_bytes_created{handler="/v1/chat/completions"} 1.7798081937688417e+09
# HELP http_response_size_bytes Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_response_size_bytes summary
http_response_size_bytes_count{handler="/v1/models"} 1.0
http_response_size_bytes_sum{handler="/v1/models"} 558.0
http_response_size_bytes_count{handler="/v1/chat/completions"} 372.0
http_response_size_bytes_sum{handler="/v1/chat/completions"} 0.0
# HELP http_response_size_bytes_created Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
# TYPE http_response_size_bytes_created gauge
http_response_size_bytes_created{handler="/v1/models"} 1.779808187622177e+09
http_response_size_bytes_created{handler="/v1/chat/completions"} 1.7798081937688694e+09
# HELP http_request_duration_highr_seconds Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
# TYPE http_request_duration_highr_seconds histogram
http_request_duration_highr_seconds_bucket{le="0.01"} 1.0
http_request_duration_highr_seconds_bucket{le="0.025"} 1.0
http_request_duration_highr_seconds_bucket{le="0.05"} 1.0
http_request_duration_highr_seconds_bucket{le="0.075"} 1.0
http_request_duration_highr_seconds_bucket{le="0.1"} 1.0
http_request_duration_highr_seconds_bucket{le="0.25"} 1.0
http_request_duration_highr_seconds_bucket{le="0.5"} 1.0
http_request_duration_highr_seconds_bucket{le="0.75"} 1.0
http_request_duration_highr_seconds_bucket{le="1.0"} 1.0
http_request_duration_highr_seconds_bucket{le="1.5"} 1.0
http_request_duration_highr_seconds_bucket{le="2.0"} 9.0
http_request_duration_highr_seconds_bucket{le="2.5"} 27.0
http_request_duration_highr_seconds_bucket{le="3.0"} 48.0
http_request_duration_highr_seconds_bucket{le="3.5"} 78.0
http_request_duration_highr_seconds_bucket{le="4.0"} 114.0
http_request_duration_highr_seconds_bucket{le="4.5"} 151.0
http_request_duration_highr_seconds_bucket{le="5.0"} 196.0
http_request_duration_highr_seconds_bucket{le="7.5"} 342.0
http_request_duration_highr_seconds_bucket{le="10.0"} 373.0
http_request_duration_highr_seconds_bucket{le="30.0"} 373.0
http_request_duration_highr_seconds_bucket{le="60.0"} 373.0
http_request_duration_highr_seconds_bucket{le="+Inf"} 373.0
http_request_duration_highr_seconds_count 373.0
http_request_duration_highr_seconds_sum 1853.5824478221475
# HELP http_request_duration_highr_seconds_created Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
# TYPE http_request_duration_highr_seconds_created gauge
http_request_duration_highr_seconds_created 1.7798081863473852e+09
# HELP http_request_duration_seconds Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
# TYPE http_request_duration_seconds histogram
http_request_duration_seconds_bucket{handler="/v1/models",le="0.1",method="GET"} 1.0
http_request_duration_seconds_bucket{handler="/v1/models",le="0.5",method="GET"} 1.0
http_request_duration_seconds_bucket{handler="/v1/models",le="1.0",method="GET"} 1.0
http_request_duration_seconds_bucket{handler="/v1/models",le="+Inf",method="GET"} 1.0
http_request_duration_seconds_count{handler="/v1/models",method="GET"} 1.0
http_request_duration_seconds_sum{handler="/v1/models",method="GET"} 0.00240229198243469
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.1",method="POST"} 0.0
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.5",method="POST"} 0.0
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="1.0",method="POST"} 0.0
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="+Inf",method="POST"} 372.0
http_request_duration_seconds_count{handler="/v1/chat/completions",method="POST"} 372.0
http_request_duration_seconds_sum{handler="/v1/chat/completions",method="POST"} 1853.580045530165
# HELP http_request_duration_seconds_created Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
# TYPE http_request_duration_seconds_created gauge
http_request_duration_seconds_created{handler="/v1/models",method="GET"} 1.779808187622212e+09
http_request_duration_seconds_created{handler="/v1/chat/completions",method="POST"} 1.7798081937689064e+09

View File

@@ -0,0 +1,372 @@
{"req_id": "2d4b27e5b3b943f4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377323568694606, "t_first_token_ns": 377324820901603, "t_last_token_ns": 377328754851504, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "04f57fc587544821", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377324031433998, "t_first_token_ns": 377324826075536, "t_last_token_ns": 377328766093878, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "b94172400b59478a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377324204772012, "t_first_token_ns": 377324826212587, "t_last_token_ns": 377328766249783, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "995d393bd71b4b8c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377324882950607, "t_first_token_ns": 377325144369252, "t_last_token_ns": 377328889574042, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "4912600328f44eea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377325022707179, "t_first_token_ns": 377325387213500, "t_last_token_ns": 377328909374406, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "b975d9d5272a4427", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377326537575348, "t_first_token_ns": 377326798901899, "t_last_token_ns": 377329902579865, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "ccba743082364781", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377327461164692, "t_first_token_ns": 377327731193197, "t_last_token_ns": 377330697318472, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "6452f810a40947dc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377327958318130, "t_first_token_ns": 377328226147190, "t_last_token_ns": 377330863136458, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "67d0f3ff0f2d455b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377328247250841, "t_first_token_ns": 377328526359717, "t_last_token_ns": 377330899906616, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "c99f6d7100064036", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377330339410243, "t_first_token_ns": 377330602704129, "t_last_token_ns": 377331995245807, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "a2aa28d492b24da1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377333286349337, "t_first_token_ns": 377333542371645, "t_last_token_ns": 377335728172515, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "f48df1b5da0541e8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377333766318154, "t_first_token_ns": 377334026521673, "t_last_token_ns": 377336030718014, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "1728cb8d3bf24f05", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377334268118893, "t_first_token_ns": 377334524615304, "t_last_token_ns": 377336240880861, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "ffda3d4d524a408b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377336374658415, "t_first_token_ns": 377336629779755, "t_last_token_ns": 377339433078425, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "da2ad06814584d64", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377336760570863, "t_first_token_ns": 377337017929818, "t_last_token_ns": 377339690394601, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "a7f1651b6d8a477f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377337354292879, "t_first_token_ns": 377337607223109, "t_last_token_ns": 377340138335353, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "9b70a69a9e4e46eb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377337634996044, "t_first_token_ns": 377337889511668, "t_last_token_ns": 377340414784238, "prompt_tokens": 3934, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "0b11aa3c6fe34db7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377338422526069, "t_first_token_ns": 377338688800408, "t_last_token_ns": 377340851670663, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "7ce8d66a43e14b4f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377340158826307, "t_first_token_ns": 377340414659023, "t_last_token_ns": 377342654946705, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "8e1d2b4b86114d68", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377341206816838, "t_first_token_ns": 377341460844432, "t_last_token_ns": 377346345859143, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "af4ec6776e684bcd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377341376826113, "t_first_token_ns": 377341701444633, "t_last_token_ns": 377346373729072, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "2a738147e8eb413b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377342399434685, "t_first_token_ns": 377342655265362, "t_last_token_ns": 377348693747435, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "98d393d4ca1a4820", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377342716344570, "t_first_token_ns": 377342977645286, "t_last_token_ns": 377348907807624, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "e5c60bf53d954508", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377343247933989, "t_first_token_ns": 377343512146866, "t_last_token_ns": 377349914453756, "prompt_tokens": 4033, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "391e04cedb5f4f82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377343298435768, "t_first_token_ns": 377343960915731, "t_last_token_ns": 377349928936037, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "ca0d59bf7b1b4a6a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377343284751544, "t_first_token_ns": 377343960817378, "t_last_token_ns": 377349929111172, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "ceda3058052843b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377344169056422, "t_first_token_ns": 377344437144111, "t_last_token_ns": 377350244123021, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "46d2eee1b7184a0d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377344524182451, "t_first_token_ns": 377344799654670, "t_last_token_ns": 377350412900015, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "423e3e98eafd4528", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377344653657662, "t_first_token_ns": 377345049261451, "t_last_token_ns": 377350435851419, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "b396226867dd4c0a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377345284662300, "t_first_token_ns": 377345560946443, "t_last_token_ns": 377350673010869, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "967fcd2e1620414b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377345793431652, "t_first_token_ns": 377346072984412, "t_last_token_ns": 377351135484393, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "f7b6e14fb76d4519", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377346389651441, "t_first_token_ns": 377346665276482, "t_last_token_ns": 377351445778221, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "7f0928e0c5f04653", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377346548701667, "t_first_token_ns": 377346915731976, "t_last_token_ns": 377351464553869, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "655a31ddd18c4331", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377347626814046, "t_first_token_ns": 377347900199440, "t_last_token_ns": 377352446767847, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "0ea2c3d280124126", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377348155324276, "t_first_token_ns": 377348434440030, "t_last_token_ns": 377352652087787, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "abe57ce576a54e18", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377348835375030, "t_first_token_ns": 377349120328668, "t_last_token_ns": 377352956218114, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "93267fbbed02452f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377349264902543, "t_first_token_ns": 377349543311748, "t_last_token_ns": 377353311903719, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "bcc1d79459d04f8a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377350744945452, "t_first_token_ns": 377351015962984, "t_last_token_ns": 377354063599369, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "6225f49fee0749db", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377351695054054, "t_first_token_ns": 377351962455019, "t_last_token_ns": 377355517534434, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "8243d841e8d1472f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377352095311078, "t_first_token_ns": 377352356923518, "t_last_token_ns": 377355679300960, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "5136dd17de9b4b1c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377352960277021, "t_first_token_ns": 377353225696220, "t_last_token_ns": 377356853924409, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "d36b8d367f584500", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377354203943416, "t_first_token_ns": 377354462887780, "t_last_token_ns": 377358529646323, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "7c7e868fc3fe467a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377354710717837, "t_first_token_ns": 377354971218546, "t_last_token_ns": 377359135567469, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "373d98540c4e4b94", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377354895162091, "t_first_token_ns": 377355215336565, "t_last_token_ns": 377359153616746, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "d2cdc8ea32bd4c5f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377354996454346, "t_first_token_ns": 377355453391450, "t_last_token_ns": 377359163555945, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "342a9993102c4e83", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377355806814406, "t_first_token_ns": 377356068129448, "t_last_token_ns": 377359482516666, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "8713dbd79c444ede", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377355954487387, "t_first_token_ns": 377356309729168, "t_last_token_ns": 377359496675661, "prompt_tokens": 3941, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "15aa6a283ffd4ecc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377356960464591, "t_first_token_ns": 377357221461292, "t_last_token_ns": 377360102166864, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "91e15f27f4ab4fa1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377358461168817, "t_first_token_ns": 377358738965892, "t_last_token_ns": 377362957709774, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "d11620f576d74e3d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377359783347409, "t_first_token_ns": 377360040453229, "t_last_token_ns": 377366220155812, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "0a6be7a1aef343ad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377360373840936, "t_first_token_ns": 377360631266921, "t_last_token_ns": 377367869746526, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "96d1c02685034bf3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377360842927727, "t_first_token_ns": 377361108062437, "t_last_token_ns": 377368508908819, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "b4edcec7212f4b21", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377361122906199, "t_first_token_ns": 377361382190097, "t_last_token_ns": 377368599423212, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "fcdfb41b75ae478d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377361288147309, "t_first_token_ns": 377361838575332, "t_last_token_ns": 377368632826351, "prompt_tokens": 3956, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "bce44b83ad854fbd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377361233203761, "t_first_token_ns": 377361838694178, "t_last_token_ns": 377368633357465, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "357c2dec78ad4e78", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377361778132441, "t_first_token_ns": 377362298433935, "t_last_token_ns": 377368662738000, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "c9f700d253fa4e84", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377361548451212, "t_first_token_ns": 377362298887323, "t_last_token_ns": 377368662967777, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "2d98a7a98a894dad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377362383784803, "t_first_token_ns": 377362653119323, "t_last_token_ns": 377368837784761, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "58b345665da64055", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377362651319314, "t_first_token_ns": 377362919254021, "t_last_token_ns": 377368881817517, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "261f9787e8834aea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377363075835826, "t_first_token_ns": 377363345147939, "t_last_token_ns": 377369520427954, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "fdac07314f6e48a4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377364092460247, "t_first_token_ns": 377364376629268, "t_last_token_ns": 377370309996776, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "3d45ca007e104ec3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377364265386979, "t_first_token_ns": 377364629293987, "t_last_token_ns": 377370331699710, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "72cd38d1600e4510", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377364764153111, "t_first_token_ns": 377365037803480, "t_last_token_ns": 377370477037673, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "e4ebe8a291614a20", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377365607765355, "t_first_token_ns": 377365891070864, "t_last_token_ns": 377370965691402, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "62eb3644f7384939", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377365883596926, "t_first_token_ns": 377366161390090, "t_last_token_ns": 377371032961370, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "9e3f8005a1f5461f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377366538250284, "t_first_token_ns": 377366817107103, "t_last_token_ns": 377372215778540, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "9f586a7017e945d6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377366936093216, "t_first_token_ns": 377367221982304, "t_last_token_ns": 377372342697504, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "248f22ccdb1042a0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377367508210547, "t_first_token_ns": 377367802259787, "t_last_token_ns": 377373245401182, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "10dff73c96df4028", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377368958159474, "t_first_token_ns": 377369228126399, "t_last_token_ns": 377375496512817, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "94bf988ff6f84a8c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377369156158552, "t_first_token_ns": 377369479128091, "t_last_token_ns": 377375523975275, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "e8f6fb8b5b8b4458", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377370963805713, "t_first_token_ns": 377371235879442, "t_last_token_ns": 377377804991887, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "d6c21b3ee4c94953", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377371093895000, "t_first_token_ns": 377371481540174, "t_last_token_ns": 377377836954679, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "6ae118f379a246ed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377371487814961, "t_first_token_ns": 377371752624491, "t_last_token_ns": 377377893784206, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "fb5925f16a054703", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377371507073026, "t_first_token_ns": 377371989826383, "t_last_token_ns": 377377908627739, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "18a87e78810347eb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377372456602672, "t_first_token_ns": 377372728449995, "t_last_token_ns": 377378792703048, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "ef097e98a7954371", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377372565214779, "t_first_token_ns": 377372978271602, "t_last_token_ns": 377378821595985, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "693e188c82b449c7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377372858236464, "t_first_token_ns": 377373230558921, "t_last_token_ns": 377378850081074, "prompt_tokens": 4019, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "23cc849ebc094ff4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377373702573386, "t_first_token_ns": 377373971018165, "t_last_token_ns": 377379377509980, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "295842851b934600", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377373881833610, "t_first_token_ns": 377374483443555, "t_last_token_ns": 377379400223947, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "540c348868d74fb4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377373765250847, "t_first_token_ns": 377374484388724, "t_last_token_ns": 377379400782729, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "f00bc54030d64137", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377373894221154, "t_first_token_ns": 377374668334815, "t_last_token_ns": 377379410780097, "prompt_tokens": 3950, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "a86c6483c2b84655", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377374496893437, "t_first_token_ns": 377374920618330, "t_last_token_ns": 377379431136250, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "22ef3bed908c47e4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377374776535896, "t_first_token_ns": 377375175374175, "t_last_token_ns": 377379447185063, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "a9d29cc28cbc4533", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377375650671732, "t_first_token_ns": 377375929544787, "t_last_token_ns": 377380413696908, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "97dac52b403347b1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377377065762460, "t_first_token_ns": 377377357087085, "t_last_token_ns": 377381858592797, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "10a9340392c14fe7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377377906728300, "t_first_token_ns": 377378191925954, "t_last_token_ns": 377382781826214, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "73882fd886214d57", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377379634670052, "t_first_token_ns": 377379895506765, "t_last_token_ns": 377385275650405, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "5a327a13079445a4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377379838660552, "t_first_token_ns": 377380352948140, "t_last_token_ns": 377385304516469, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "4204c2d164a44a27", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377379717408379, "t_first_token_ns": 377380353191696, "t_last_token_ns": 377385304751051, "prompt_tokens": 3958, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "b63266c3415c408c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377380767577880, "t_first_token_ns": 377381038237911, "t_last_token_ns": 377385884648357, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "8f2ca0d75fd44ee5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377381212953125, "t_first_token_ns": 377381480341333, "t_last_token_ns": 377386146296800, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "3d2b2bfeffc343cf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377381416820680, "t_first_token_ns": 377381729954168, "t_last_token_ns": 377386170329035, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "eeb5d171cd5444f8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377382035154372, "t_first_token_ns": 377382308111582, "t_last_token_ns": 377386746516645, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "21ea2e608cd443f3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377382269012457, "t_first_token_ns": 377382557547923, "t_last_token_ns": 377386764832993, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "52c60bce6e4e43e3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377383350661322, "t_first_token_ns": 377383620387852, "t_last_token_ns": 377387978208102, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "f30f1b23f2714f6e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377383764139305, "t_first_token_ns": 377384040720905, "t_last_token_ns": 377388132779721, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "16b7d8c1ffdd4c62", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377384583566608, "t_first_token_ns": 377384862409107, "t_last_token_ns": 377388537694427, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "0ce9d89ab22a4692", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377384882380063, "t_first_token_ns": 377385160066268, "t_last_token_ns": 377388577550659, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "338a6daea7e34e95", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377386171845818, "t_first_token_ns": 377386435574911, "t_last_token_ns": 377389225527341, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "ddeb446439f7417f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377387187222293, "t_first_token_ns": 377387449871647, "t_last_token_ns": 377390583384234, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "244f9d9a10624080", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377387553867442, "t_first_token_ns": 377387822282253, "t_last_token_ns": 377390702130828, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "592a14d2448c4b2b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377389273077380, "t_first_token_ns": 377389534864262, "t_last_token_ns": 377393511887729, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "362c865eae664bb0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377389339886639, "t_first_token_ns": 377389779074067, "t_last_token_ns": 377393535177824, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "4f613743c1d34225", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377389690093487, "t_first_token_ns": 377390021998049, "t_last_token_ns": 377393558198846, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "9188709ed7c94817", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377391453565913, "t_first_token_ns": 377391713806470, "t_last_token_ns": 377396917602805, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "77fa5134632945eb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377392043967758, "t_first_token_ns": 377392306723049, "t_last_token_ns": 377398716219888, "prompt_tokens": 3941, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "9de8757e0cca40e3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377392157118605, "t_first_token_ns": 377392554519830, "t_last_token_ns": 377398746555504, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "d17541264648486b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377392399842793, "t_first_token_ns": 377392802161313, "t_last_token_ns": 377398777481833, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "1296e81703c34971", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377392795423837, "t_first_token_ns": 377393064817803, "t_last_token_ns": 377398819205444, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "d5e53d21af4d4600", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377393197270283, "t_first_token_ns": 377393466352604, "t_last_token_ns": 377399025957536, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "862d51c327c84780", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377394009012379, "t_first_token_ns": 377394281203421, "t_last_token_ns": 377400136530417, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "923038446ceb48e4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377394085259896, "t_first_token_ns": 377394527369449, "t_last_token_ns": 377400165769136, "prompt_tokens": 3954, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "75902c11af3c4c39", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377394428127917, "t_first_token_ns": 377394776096578, "t_last_token_ns": 377400194329674, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "fe51b99d386a4585", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377395856834572, "t_first_token_ns": 377396134148873, "t_last_token_ns": 377401610624275, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "8fb4f3ec45034061", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377395876048777, "t_first_token_ns": 377396373815784, "t_last_token_ns": 377401622320535, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "e21229f3c0434766", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377396381974062, "t_first_token_ns": 377396653816517, "t_last_token_ns": 377401668066333, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "849cbc95ab394f15", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377396923336610, "t_first_token_ns": 377397208878446, "t_last_token_ns": 377402171567376, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "b2653f98edd14980", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377397032800607, "t_first_token_ns": 377397455057343, "t_last_token_ns": 377402194648819, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "4dff5e5177bb45cf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377397610912546, "t_first_token_ns": 377397888075711, "t_last_token_ns": 377402327684138, "prompt_tokens": 4008, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "de3a0c5fd4d84118", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377397769048188, "t_first_token_ns": 377398145273587, "t_last_token_ns": 377402346890810, "prompt_tokens": 4008, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "58b186345c11451d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377398001744407, "t_first_token_ns": 377398401983272, "t_last_token_ns": 377402361997644, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "f1d344a6b94148fe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377399239049579, "t_first_token_ns": 377399504488617, "t_last_token_ns": 377402826959132, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "a3add0976ee447fd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377401069805640, "t_first_token_ns": 377401344414131, "t_last_token_ns": 377403558641139, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "ec16058eb54349bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377401831190387, "t_first_token_ns": 377402100693677, "t_last_token_ns": 377403759290502, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "02e37baa12c24303", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377405368629186, "t_first_token_ns": 377405623643623, "t_last_token_ns": 377407823253284, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "7f7e91d535964eb1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377405549024664, "t_first_token_ns": 377405863361572, "t_last_token_ns": 377407835242473, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "d2bebd89c55a4270", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377406445328063, "t_first_token_ns": 377406708173615, "t_last_token_ns": 377409497716395, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "c52f9f1438f84ece", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377408049084541, "t_first_token_ns": 377408306180256, "t_last_token_ns": 377413481372480, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "6baaa495f6e64d19", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377408193553711, "t_first_token_ns": 377408764154041, "t_last_token_ns": 377413509290257, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "8624dde42b0b4475", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377408117842028, "t_first_token_ns": 377408763760613, "t_last_token_ns": 377413509599931, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "dda82a717c6b4f28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377408389846770, "t_first_token_ns": 377409005661387, "t_last_token_ns": 377413532088247, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "a9ace5bdf1814feb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377409579996809, "t_first_token_ns": 377409841381890, "t_last_token_ns": 377414516623101, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "8ef4c65739e048ab", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377410125956428, "t_first_token_ns": 377410399318320, "t_last_token_ns": 377414903039981, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "754a676d5712435e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377410136074795, "t_first_token_ns": 377410636723446, "t_last_token_ns": 377414912401079, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "7e168009159b4250", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377410339507761, "t_first_token_ns": 377410876555957, "t_last_token_ns": 377414921018161, "prompt_tokens": 4039, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "a09d3d613fb34a04", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377411050817264, "t_first_token_ns": 377411315449416, "t_last_token_ns": 377415076442723, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "cd753044012a4e63", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377412040080131, "t_first_token_ns": 377412315193757, "t_last_token_ns": 377415544807431, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "550c29f9a9c44092", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377412067156578, "t_first_token_ns": 377412554425754, "t_last_token_ns": 377415550934204, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "0db31e0409d1462c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377414203780426, "t_first_token_ns": 377414472251218, "t_last_token_ns": 377418360330144, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "fd0692d6ac804e5e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377415673928809, "t_first_token_ns": 377415935888437, "t_last_token_ns": 377420754047048, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "6081da97a3444c3b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377415740387434, "t_first_token_ns": 377416176986800, "t_last_token_ns": 377420778472255, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "54de5de0d7124d7c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377416100328061, "t_first_token_ns": 377416419097593, "t_last_token_ns": 377420801475331, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "38fcc0a19e004aee", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377416827415738, "t_first_token_ns": 377417088033346, "t_last_token_ns": 377421721779715, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "d2bd59bd035546f7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377416838379547, "t_first_token_ns": 377417323939310, "t_last_token_ns": 377421733674466, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "2a964f424b58481c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377417394799906, "t_first_token_ns": 377417664042038, "t_last_token_ns": 377421847619966, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "63dc3ae3bd8540d8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377417550581924, "t_first_token_ns": 377418126980788, "t_last_token_ns": 377421868878551, "prompt_tokens": 3942, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "481788f25c6e48f5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377417461306630, "t_first_token_ns": 377418127258293, "t_last_token_ns": 377421869173478, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "9b6a6cf83a5749d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377419336707388, "t_first_token_ns": 377419609638677, "t_last_token_ns": 377422687988431, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "bf0e5b000beb4f9c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377419635813107, "t_first_token_ns": 377419908221796, "t_last_token_ns": 377422723935681, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "0402d90f375c411b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377421271636785, "t_first_token_ns": 377421544559880, "t_last_token_ns": 377423240063928, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "9b0eda38a6e7401b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377423268737110, "t_first_token_ns": 377423519208123, "t_last_token_ns": 377428518467151, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "6ed3e6861206410f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377423433564265, "t_first_token_ns": 377423760533288, "t_last_token_ns": 377428546024800, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "688a919307d84434", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377423631537395, "t_first_token_ns": 377423998621091, "t_last_token_ns": 377428568627261, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "e6669be317794c31", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377424175241092, "t_first_token_ns": 377424440320482, "t_last_token_ns": 377428896249663, "prompt_tokens": 4019, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "4912171354b64649", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377424347887079, "t_first_token_ns": 377424683555940, "t_last_token_ns": 377428921686875, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "09454dc795cb4897", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377425165468004, "t_first_token_ns": 377425433388786, "t_last_token_ns": 377429943220755, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "45c98d1f49984338", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377425456357389, "t_first_token_ns": 377425720098217, "t_last_token_ns": 377430013901111, "prompt_tokens": 4029, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "3ce04282560f4e25", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377427307597239, "t_first_token_ns": 377427576338574, "t_last_token_ns": 377432019161542, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "53356f791b704fba", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377427545594783, "t_first_token_ns": 377428037688280, "t_last_token_ns": 377432270562910, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "c79fa2ba4f3640a9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377427453904680, "t_first_token_ns": 377428037837491, "t_last_token_ns": 377432270776604, "prompt_tokens": 3954, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "04cc4f8015554a77", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377428147273087, "t_first_token_ns": 377428424424088, "t_last_token_ns": 377432383379058, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "f573205534b44188", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377429139919059, "t_first_token_ns": 377429406828163, "t_last_token_ns": 377433059007785, "prompt_tokens": 4022, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "7acf13f41e504e52", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377429233852539, "t_first_token_ns": 377429657536960, "t_last_token_ns": 377433074918249, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "0061b96aa5d94c19", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377430372523785, "t_first_token_ns": 377430641228094, "t_last_token_ns": 377434204800538, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "8e309c9e9d2044aa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377431370223508, "t_first_token_ns": 377431643220720, "t_last_token_ns": 377435572600634, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "607d450c0bf94c55", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377431995560393, "t_first_token_ns": 377432271552223, "t_last_token_ns": 377435901585705, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "4926fbd623fe4796", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377433077421577, "t_first_token_ns": 377433338893928, "t_last_token_ns": 377436975145745, "prompt_tokens": 3956, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "de26a0876970457c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377433361619558, "t_first_token_ns": 377433623263634, "t_last_token_ns": 377437043701145, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "b0ec2a23067a416e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377434294434139, "t_first_token_ns": 377434553666966, "t_last_token_ns": 377437631340932, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "821e564265744123", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377434400636287, "t_first_token_ns": 377434802258451, "t_last_token_ns": 377437645275830, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "69aa543d383141b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377434588046409, "t_first_token_ns": 377435044827664, "t_last_token_ns": 377437656544654, "prompt_tokens": 3941, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "32867420e7ed4e6b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377436424968127, "t_first_token_ns": 377436696114783, "t_last_token_ns": 377438620693251, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "8b9178af0cc74a89", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377438032760282, "t_first_token_ns": 377438290097933, "t_last_token_ns": 377440378016790, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "4cc212ea0e2e4759", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377438635220524, "t_first_token_ns": 377438892809603, "t_last_token_ns": 377440760199174, "prompt_tokens": 4034, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "7c6d7d5517c4425a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377439436479058, "t_first_token_ns": 377439694106440, "t_last_token_ns": 377441707594344, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "6213c21de78b48eb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377441112977037, "t_first_token_ns": 377441371087068, "t_last_token_ns": 377444750166562, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "6a6818a012164e0b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377441444243266, "t_first_token_ns": 377441708023671, "t_last_token_ns": 377444918850660, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "7153f273155d4a82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377441833296131, "t_first_token_ns": 377442095276064, "t_last_token_ns": 377445180177434, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "125a806c74ff4bdf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377442744147334, "t_first_token_ns": 377443005885535, "t_last_token_ns": 377445935195467, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "c5eba030bc6e47d6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377443254148476, "t_first_token_ns": 377443519303951, "t_last_token_ns": 377446432623861, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "bfdfa0ddac1d4775", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377443319704181, "t_first_token_ns": 377443764085735, "t_last_token_ns": 377446446878568, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "a1debffbb5524a6b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377444331730601, "t_first_token_ns": 377444602253855, "t_last_token_ns": 377446837391789, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "56dd58824ada4d43", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377445967786007, "t_first_token_ns": 377446229438580, "t_last_token_ns": 377448678159632, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "1dbf6c774dec445f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377446890705776, "t_first_token_ns": 377447145188900, "t_last_token_ns": 377449733746505, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "52297a7a93b94b2d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377447271122375, "t_first_token_ns": 377447529009302, "t_last_token_ns": 377449929243507, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "0e0ccc6fc62d4e13", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377447665484229, "t_first_token_ns": 377447923650248, "t_last_token_ns": 377450061478504, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "b22de0314f6446e3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377449088141014, "t_first_token_ns": 377449349351636, "t_last_token_ns": 377453775946900, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "a6147a59c9904a93", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377450352595836, "t_first_token_ns": 377450609967182, "t_last_token_ns": 377456561686185, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "9a4bcda9dc3c4fb5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377450568342623, "t_first_token_ns": 377450851728670, "t_last_token_ns": 377456587978474, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "4a4bd26276e54d17", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377450718484770, "t_first_token_ns": 377451096438315, "t_last_token_ns": 377456614289074, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "b7f9ff69aa2540b8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377450917630065, "t_first_token_ns": 377451556110876, "t_last_token_ns": 377456636985288, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "81cbda9e56b44669", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377450916741950, "t_first_token_ns": 377451556413414, "t_last_token_ns": 377456637386288, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "9ca0f2970bfd442f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377451356708633, "t_first_token_ns": 377451800846620, "t_last_token_ns": 377456658078051, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "901443127ac245be", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377451583644976, "t_first_token_ns": 377452048832772, "t_last_token_ns": 377456675837711, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "9b92577917194961", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377451897461496, "t_first_token_ns": 377452516372609, "t_last_token_ns": 377456695246859, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "de995a87887d4765", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377451876571323, "t_first_token_ns": 377452516753088, "t_last_token_ns": 377456695459002, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "44fc8465535a41a7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377452142995561, "t_first_token_ns": 377452767174988, "t_last_token_ns": 377456709032036, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "125961a2ee184085", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377454016818032, "t_first_token_ns": 377454292517923, "t_last_token_ns": 377457594345383, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "aca3790c548b4988", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377455615641749, "t_first_token_ns": 377455892466728, "t_last_token_ns": 377459099757171, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "a2d44d7f192f4aa8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377456892397092, "t_first_token_ns": 377457150655305, "t_last_token_ns": 377459863907029, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "58a830dcb8344cd9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377457648471819, "t_first_token_ns": 377457907592426, "t_last_token_ns": 377460939360695, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "cc63adb423f04da4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377457946253807, "t_first_token_ns": 377458204518948, "t_last_token_ns": 377461015889165, "prompt_tokens": 3935, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "bea9e7e269064766", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377458329937551, "t_first_token_ns": 377458592112460, "t_last_token_ns": 377461164211974, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "21907e91a86347a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377460218622278, "t_first_token_ns": 377460484688298, "t_last_token_ns": 377462363394230, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "edfeee2c0d0b4bb4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377460400656114, "t_first_token_ns": 377460729054970, "t_last_token_ns": 377462371965357, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "6c2eae33c9eb4960", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377462612697384, "t_first_token_ns": 377462866055004, "t_last_token_ns": 377467893265873, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "1570a2f0da304bb0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377463014416737, "t_first_token_ns": 377463275115455, "t_last_token_ns": 377469983017194, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "4456ed0f21024905", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377463058960802, "t_first_token_ns": 377463509294813, "t_last_token_ns": 377470225451849, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "b5246f449b0f46a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377463192299271, "t_first_token_ns": 377463964307165, "t_last_token_ns": 377470469858457, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "0e58940db16a47a4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377463071557263, "t_first_token_ns": 377463964545819, "t_last_token_ns": 377470470411580, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "d0b0dca81c4d40c0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377463675863299, "t_first_token_ns": 377464423352736, "t_last_token_ns": 377470724677615, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "f77d9aea2c8b4e11", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377463779067732, "t_first_token_ns": 377464423443054, "t_last_token_ns": 377470724821141, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "6949f7b8f5a44943", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377465271542137, "t_first_token_ns": 377465541139948, "t_last_token_ns": 377473287582090, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "16114470831548a4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377465469194069, "t_first_token_ns": 377465789516010, "t_last_token_ns": 377473320514338, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "ea0044203fc94eea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377466372774121, "t_first_token_ns": 377466640855597, "t_last_token_ns": 377474784085221, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "9db3f7eccd5a416a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377466429363933, "t_first_token_ns": 377466895524920, "t_last_token_ns": 377474819208181, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "114f632e936b475e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377468301989057, "t_first_token_ns": 377468575380278, "t_last_token_ns": 377477304889257, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "61b39b7dd12a47ee", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377468536458450, "t_first_token_ns": 377468830425835, "t_last_token_ns": 377477347284910, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "c551cce7a9774027", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377468564896972, "t_first_token_ns": 377469071930842, "t_last_token_ns": 377477364872221, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "eb9e0d02ad9645b6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377468653309155, "t_first_token_ns": 377469576273629, "t_last_token_ns": 377477382115869, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "2e50c63040ba43c7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377468623838887, "t_first_token_ns": 377469576365128, "t_last_token_ns": 377477382507004, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "a33cd093818e4cc6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377468866066219, "t_first_token_ns": 377469982849502, "t_last_token_ns": 377477396179135, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "947911e6902744d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377468792720449, "t_first_token_ns": 377469983417333, "t_last_token_ns": 377477396934739, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "de16d4be26ac4daf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377469055283813, "t_first_token_ns": 377470226099331, "t_last_token_ns": 377477411146544, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "b3b15799de5d4f5e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377469803105007, "t_first_token_ns": 377470470998081, "t_last_token_ns": 377477425881419, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
{"req_id": "b05a6a8cc6444c84", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377470238451609, "t_first_token_ns": 377470723646171, "t_last_token_ns": 377477453951522, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "6e3fd206a8634bec", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377470678448297, "t_first_token_ns": 377471192947635, "t_last_token_ns": 377477482971112, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "dff3376669b54f4e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377470566621316, "t_first_token_ns": 377471193631945, "t_last_token_ns": 377477483451198, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "c3752de7676c43a4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377472031739391, "t_first_token_ns": 377472308519078, "t_last_token_ns": 377478557633936, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "7b4427c1b5e9460a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377472288778789, "t_first_token_ns": 377472783368414, "t_last_token_ns": 377478580804643, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
{"req_id": "0bb36936d138475a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377472221645021, "t_first_token_ns": 377472782992782, "t_last_token_ns": 377478581409475, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "6c406d279a564ef5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377472508714255, "t_first_token_ns": 377473042426915, "t_last_token_ns": 377478603469389, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
{"req_id": "616100a49e6f459c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377473295290870, "t_first_token_ns": 377473581558059, "t_last_token_ns": 377478759176666, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
{"req_id": "eca40c61224c4c81", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377474018501669, "t_first_token_ns": 377474303513612, "t_last_token_ns": 377479742172963, "prompt_tokens": 3957, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
{"req_id": "4e13d82ab3d34423", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377475909772295, "t_first_token_ns": 377476203354588, "t_last_token_ns": 377481705951083, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
{"req_id": "bf371c0ea87742dc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377476599046356, "t_first_token_ns": 377476876790065, "t_last_token_ns": 377482057176947, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
{"req_id": "f6bf4539029b41b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377476618715961, "t_first_token_ns": 377477120824921, "t_last_token_ns": 377482069330135, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
{"req_id": "9ba4c28fbc514d77", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377477902304354, "t_first_token_ns": 377478176871001, "t_last_token_ns": 377483237473539, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "8fae90ad0970468b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377478953271434, "t_first_token_ns": 377479224238654, "t_last_token_ns": 377485826206295, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "6374efe5c6134dcc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377479028249178, "t_first_token_ns": 377479472502184, "t_last_token_ns": 377485855136800, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "c755f2514ad34566", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377479438252639, "t_first_token_ns": 377479721768860, "t_last_token_ns": 377486113418169, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "2ab58c3cb42a4599", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377480322654677, "t_first_token_ns": 377480596657614, "t_last_token_ns": 377486942277114, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "04ad0e8396db47cb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377480887401826, "t_first_token_ns": 377481163620537, "t_last_token_ns": 377487380140342, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "c76320db98e4400c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377481090133956, "t_first_token_ns": 377481627659425, "t_last_token_ns": 377487407870909, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "43c2190936ea4f2f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377481101584990, "t_first_token_ns": 377481627885493, "t_last_token_ns": 377487408379136, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "b7ec4fa80cfc44fe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377482050439537, "t_first_token_ns": 377482319250456, "t_last_token_ns": 377487890784948, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "c04c30103fa94ebf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377482922082459, "t_first_token_ns": 377483198510042, "t_last_token_ns": 377489078859262, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "1d39741cc12a4132", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377482972289076, "t_first_token_ns": 377483456458110, "t_last_token_ns": 377489106991287, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "d1c84fe5a6634b44", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377483345836310, "t_first_token_ns": 377483708812128, "t_last_token_ns": 377489130639680, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "1d5b7c08213a4b42", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377483480081858, "t_first_token_ns": 377483949060483, "t_last_token_ns": 377489141485418, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "e1a7ac75feac440e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377483534524817, "t_first_token_ns": 377484197255686, "t_last_token_ns": 377489152763080, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "2cb68b03c85f49ed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377483720788522, "t_first_token_ns": 377484433638178, "t_last_token_ns": 377489162098166, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "4ef17f1730324bc0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377484640664292, "t_first_token_ns": 377484927072846, "t_last_token_ns": 377489336926291, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "3e604ea792dd4653", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377485530460520, "t_first_token_ns": 377485812833319, "t_last_token_ns": 377489692392547, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "8eb49eb03303425c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377485827861019, "t_first_token_ns": 377486113831863, "t_last_token_ns": 377489729354805, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "c12f96c9fa294fa0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377488122077407, "t_first_token_ns": 377488394594793, "t_last_token_ns": 377491230142230, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "9250c62e4c10435f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377488666793254, "t_first_token_ns": 377488940397258, "t_last_token_ns": 377491401394022, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "42813e11628d474a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377489960249868, "t_first_token_ns": 377490218417417, "t_last_token_ns": 377492733250631, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "355cfc528b60420e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377490172618637, "t_first_token_ns": 377490459315564, "t_last_token_ns": 377492747953843, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "897d9052cbe54626", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377491827412670, "t_first_token_ns": 377492089905814, "t_last_token_ns": 377494410968387, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "59337fbd005f460b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377492086809762, "t_first_token_ns": 377492345874055, "t_last_token_ns": 377494440201723, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "8952de2fe53545ab", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377493788915717, "t_first_token_ns": 377494048409980, "t_last_token_ns": 377497178467111, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "1a0f833c8bf64d45", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377494075732718, "t_first_token_ns": 377494336538107, "t_last_token_ns": 377497259475419, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "9a38df423f034607", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377495053853210, "t_first_token_ns": 377495317428673, "t_last_token_ns": 377498225404271, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "93f34c8fd3d7478d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377495268329483, "t_first_token_ns": 377495558977778, "t_last_token_ns": 377498240362986, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "3fef97c756f54135", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377495429115649, "t_first_token_ns": 377495803660106, "t_last_token_ns": 377498251635451, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "ddc5d189f8bf46c3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377496841585805, "t_first_token_ns": 377497113094001, "t_last_token_ns": 377499330823843, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "3dc2592db90c49f5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377498493332968, "t_first_token_ns": 377498751464025, "t_last_token_ns": 377502042092409, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "77194bfe50654093", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377499058541243, "t_first_token_ns": 377499318482840, "t_last_token_ns": 377503316231061, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "939d4f59120f479a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377499731556210, "t_first_token_ns": 377499989137474, "t_last_token_ns": 377504379535176, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "a396d8c797324d2f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377500264077449, "t_first_token_ns": 377500524934560, "t_last_token_ns": 377504831625310, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "3745f31c63864824", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377500535529374, "t_first_token_ns": 377500800674339, "t_last_token_ns": 377504902567066, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "5504de3e36614a6c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377501259959738, "t_first_token_ns": 377501524222983, "t_last_token_ns": 377505383765544, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "30559303bec14a33", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377501598612383, "t_first_token_ns": 377501864590303, "t_last_token_ns": 377505724341292, "prompt_tokens": 3958, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "218f5263090d45e9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377502153548359, "t_first_token_ns": 377502416836024, "t_last_token_ns": 377506026947687, "prompt_tokens": 3935, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "d63c3df87df74615", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377502457082265, "t_first_token_ns": 377502729481240, "t_last_token_ns": 377506082096920, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "dba51121e11e4b20", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377502582177927, "t_first_token_ns": 377502975568269, "t_last_token_ns": 377506097044458, "prompt_tokens": 3954, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "a05773a48d784f34", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377503732596429, "t_first_token_ns": 377503999857960, "t_last_token_ns": 377506516768981, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "2c7cfb320fe849dc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377505365927724, "t_first_token_ns": 377505632549710, "t_last_token_ns": 377507157705369, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "8d2f8c77330e4488", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377507962232187, "t_first_token_ns": 377508219741777, "t_last_token_ns": 377515641446492, "prompt_tokens": 4035, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "4239469960d74415", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377508326700624, "t_first_token_ns": 377508584784759, "t_last_token_ns": 377516128274366, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "2453c828e2d24662", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377508348529844, "t_first_token_ns": 377508814757158, "t_last_token_ns": 377516145507572, "prompt_tokens": 3929, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "871f5be13526458c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377508526359275, "t_first_token_ns": 377509053365850, "t_last_token_ns": 377516161296146, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "169e4e25c4184df8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377508843029728, "t_first_token_ns": 377509297828823, "t_last_token_ns": 377516192695602, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "a0cffa2cebd64afc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377509273667781, "t_first_token_ns": 377509546315416, "t_last_token_ns": 377516224185711, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "1a94e298d69645b7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377509376387422, "t_first_token_ns": 377509792248180, "t_last_token_ns": 377516253561522, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "6000a7a49cf346e7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377509682460389, "t_first_token_ns": 377510041397480, "t_last_token_ns": 377516283945570, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "3efc4c0d72f84eb5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377509917331100, "t_first_token_ns": 377510291185813, "t_last_token_ns": 377516313226238, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "a8fa667688584722", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377511146553263, "t_first_token_ns": 377511418222206, "t_last_token_ns": 377519758968015, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "b376e235393b485b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377511733115911, "t_first_token_ns": 377512005008366, "t_last_token_ns": 377520239570079, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "9f54077922f449eb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377511735496994, "t_first_token_ns": 377512244367657, "t_last_token_ns": 377520465897806, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "9521007f7bfe4dfb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377511795540789, "t_first_token_ns": 377512486092001, "t_last_token_ns": 377520504456546, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "1c275891ed1342c0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377512055236014, "t_first_token_ns": 377512729317414, "t_last_token_ns": 377520728592687, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "ea7a423b344b47df", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377512344959376, "t_first_token_ns": 377512970596072, "t_last_token_ns": 377520745499026, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "d759ec615fb34629", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377512711591060, "t_first_token_ns": 377513213785041, "t_last_token_ns": 377520760583418, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "189c57c8e50a4897", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377513172377689, "t_first_token_ns": 377513470272581, "t_last_token_ns": 377521018522006, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "bc0a97b1fa1c42d7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377513598365601, "t_first_token_ns": 377513891888807, "t_last_token_ns": 377521179617280, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "cd82c60b6f624955", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377514259692256, "t_first_token_ns": 377514548088334, "t_last_token_ns": 377521792476927, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
{"req_id": "53ebbca6992a4324", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377516556788337, "t_first_token_ns": 377516835302158, "t_last_token_ns": 377524463718287, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "ee5fd756d0bd4342", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377516600431579, "t_first_token_ns": 377517292657994, "t_last_token_ns": 377524478328615, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "8e8a4bc92e6044c5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377516596169200, "t_first_token_ns": 377517292901516, "t_last_token_ns": 377524478730880, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "372e05b3bc6d42bb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377517163421818, "t_first_token_ns": 377517761981237, "t_last_token_ns": 377524508558120, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "c8b2dd44739e43cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377517090542435, "t_first_token_ns": 377517761886428, "t_last_token_ns": 377524509477785, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "6f61841b812c4113", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377518083686764, "t_first_token_ns": 377518368887179, "t_last_token_ns": 377525103590143, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "cdda6e4055004799", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377518412708454, "t_first_token_ns": 377518695097715, "t_last_token_ns": 377525200572788, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "465e67daab224129", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377518689439660, "t_first_token_ns": 377518974914839, "t_last_token_ns": 377525242029280, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "b2520942705b456e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377519031863541, "t_first_token_ns": 377519325310826, "t_last_token_ns": 377525337436983, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
{"req_id": "c1fe40678f404b01", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377519459900268, "t_first_token_ns": 377519739699093, "t_last_token_ns": 377525473423536, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
{"req_id": "a4ec0202a679436c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377520179080198, "t_first_token_ns": 377520466147836, "t_last_token_ns": 377525795082958, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
{"req_id": "2c778f655c444778", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377520440331666, "t_first_token_ns": 377520729240916, "t_last_token_ns": 377525818975097, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
{"req_id": "1168a555d9a84950", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377520739709178, "t_first_token_ns": 377521018120589, "t_last_token_ns": 377525856478674, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
{"req_id": "111d77b1be374860", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377521432697210, "t_first_token_ns": 377521718652512, "t_last_token_ns": 377526395377469, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "d4bfe2b886bd43a3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377522308068883, "t_first_token_ns": 377522586399661, "t_last_token_ns": 377526822065075, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
{"req_id": "f5d66c55422b44a7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377523213074407, "t_first_token_ns": 377523492117887, "t_last_token_ns": 377527177681628, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
{"req_id": "cfda11e93ec44203", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377523368128023, "t_first_token_ns": 377523748801410, "t_last_token_ns": 377527192781913, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
{"req_id": "0ca96d97a26a4081", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377524616626796, "t_first_token_ns": 377524895794964, "t_last_token_ns": 377527775102282, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "9aac54ad5ed64c81", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377526094083575, "t_first_token_ns": 377526358458729, "t_last_token_ns": 377528876195726, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "7d134da190c64a8c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377527307080445, "t_first_token_ns": 377527566048201, "t_last_token_ns": 377531250601923, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "300001d45d1d45af", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377528337567890, "t_first_token_ns": 377528597587683, "t_last_token_ns": 377533758499497, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "97098ed5029848b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377528392317920, "t_first_token_ns": 377528839873715, "t_last_token_ns": 377533787300373, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "ae47a07595c7418b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377529350440003, "t_first_token_ns": 377529611614513, "t_last_token_ns": 377535778100959, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "3f0270989f594a00", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377529525120861, "t_first_token_ns": 377529857171903, "t_last_token_ns": 377535807399166, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "b2a3422af4c848fd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377529682445844, "t_first_token_ns": 377530100778638, "t_last_token_ns": 377535837575853, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "1b4763457669496a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377530398399389, "t_first_token_ns": 377530669953352, "t_last_token_ns": 377536363830199, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "c0b7002ea5bf4e10", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377530703064248, "t_first_token_ns": 377530975279340, "t_last_token_ns": 377536460840542, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "5b645c2ea8c543b6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377530748449106, "t_first_token_ns": 377531214266997, "t_last_token_ns": 377536472568208, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "a69eadb3ca644323", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377531145684705, "t_first_token_ns": 377531468992877, "t_last_token_ns": 377536494226137, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "50b5644e0fb34906", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377532259147311, "t_first_token_ns": 377532535535335, "t_last_token_ns": 377537542716668, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "6a81ee2df818465f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377532317886456, "t_first_token_ns": 377532786643933, "t_last_token_ns": 377537565682840, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "85515d1216244fef", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377532525139376, "t_first_token_ns": 377533027455315, "t_last_token_ns": 377537575353815, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "6ac239c2194d4693", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377534048508966, "t_first_token_ns": 377534319092173, "t_last_token_ns": 377538331328776, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "d423af8d36b14ffc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377534301331051, "t_first_token_ns": 377534572594490, "t_last_token_ns": 377538348176612, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "658fe55c560748bc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377534754358054, "t_first_token_ns": 377535038233133, "t_last_token_ns": 377539165070051, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "27d8f6d2af894e32", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377534821325788, "t_first_token_ns": 377535290261426, "t_last_token_ns": 377539184045886, "prompt_tokens": 3951, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
{"req_id": "17e096100b024d0f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377536863967591, "t_first_token_ns": 377537132361549, "t_last_token_ns": 377540390833643, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "a3a52f2677ec485d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377538373175550, "t_first_token_ns": 377538636028480, "t_last_token_ns": 377541884281428, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "81500a30ffcf4146", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377538650891358, "t_first_token_ns": 377538910750185, "t_last_token_ns": 377542172711454, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "8685e1caa83c4d50", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377538659871033, "t_first_token_ns": 377539149539542, "t_last_token_ns": 377542182312103, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "52edce91c081458c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377539816246762, "t_first_token_ns": 377540081666237, "t_last_token_ns": 377544017254175, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "9083dd61675d48f7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377540953234852, "t_first_token_ns": 377541220377201, "t_last_token_ns": 377546735486087, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "1adce1ea920f401a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377540960887474, "t_first_token_ns": 377541454533002, "t_last_token_ns": 377546748466789, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "563f2fed524145b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377541897072803, "t_first_token_ns": 377542164236497, "t_last_token_ns": 377547466431988, "prompt_tokens": 4025, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "2b9093a2c11c49e6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377542248799111, "t_first_token_ns": 377542512432791, "t_last_token_ns": 377547908350323, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "52f23594727845e1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377542326808514, "t_first_token_ns": 377542757418276, "t_last_token_ns": 377547935565605, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "89c0cc9c16da4798", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377542967125008, "t_first_token_ns": 377543235202843, "t_last_token_ns": 377548493108092, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "21451d7a5ed34dbc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377543089872886, "t_first_token_ns": 377543480297417, "t_last_token_ns": 377548515043904, "prompt_tokens": 3939, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "8d4adc02f4dc4da0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377544531060454, "t_first_token_ns": 377544808124875, "t_last_token_ns": 377549556277980, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "cf2ed6b79fe6410a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377544595242362, "t_first_token_ns": 377545059752500, "t_last_token_ns": 377549579874134, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "4e5072ac6e2c4879", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377545388205368, "t_first_token_ns": 377545663942353, "t_last_token_ns": 377549883116949, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "6e6f8193bbdc4bf1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377545495164966, "t_first_token_ns": 377545916134637, "t_last_token_ns": 377549902358748, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "0e1cb12476634a80", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377546030881432, "t_first_token_ns": 377546314647232, "t_last_token_ns": 377550002532516, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
{"req_id": "9140fc95dcda4fdb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377546237660397, "t_first_token_ns": 377546570252475, "t_last_token_ns": 377550017256520, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
{"req_id": "453ba061ca194670", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377547527785936, "t_first_token_ns": 377547807482865, "t_last_token_ns": 377550443357321, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
{"req_id": "d7980b9ef4ff43db", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377548114071324, "t_first_token_ns": 377548386138188, "t_last_token_ns": 377550576671874, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "11abc92e321740bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377550697375176, "t_first_token_ns": 377550947372055, "t_last_token_ns": 377552404787971, "prompt_tokens": 3957, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
{"req_id": "bcaf172b9c3e43f8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377552105944472, "t_first_token_ns": 377552363237120, "t_last_token_ns": 377554286058230, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "9924ce8e22454da6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377552583078918, "t_first_token_ns": 377552841136847, "t_last_token_ns": 377554813383283, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
{"req_id": "3d6d1e01287445b7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377553930288883, "t_first_token_ns": 377554188815328, "t_last_token_ns": 377557687899654, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "cd92b36d725e4ebf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377554555893056, "t_first_token_ns": 377554813809669, "t_last_token_ns": 377559298273741, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "b815423b0a4e4cf4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377554845638993, "t_first_token_ns": 377555101677063, "t_last_token_ns": 377559414367812, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
{"req_id": "1658ee0702204652", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377555067539439, "t_first_token_ns": 377555343557260, "t_last_token_ns": 377559436356267, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
{"req_id": "e70502b814cb46eb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377555539977946, "t_first_token_ns": 377555804470309, "t_last_token_ns": 377559780662553, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
{"req_id": "86db5dc6effa49b5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377556961754814, "t_first_token_ns": 377557231595149, "t_last_token_ns": 377561537426250, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
{"req_id": "9b6806621cbe4374", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377557412198062, "t_first_token_ns": 377557676823433, "t_last_token_ns": 377562026885951, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "fbd9b68197864500", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377557807367104, "t_first_token_ns": 377558075410500, "t_last_token_ns": 377562230906560, "prompt_tokens": 3952, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "591c5359b2f74382", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377558327969020, "t_first_token_ns": 377558596476178, "t_last_token_ns": 377562459327687, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "066cf6a1835241f9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377558760165995, "t_first_token_ns": 377559035875284, "t_last_token_ns": 377562643452681, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
{"req_id": "e7be8dc5d7074ec3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377558976465418, "t_first_token_ns": 377559285504999, "t_last_token_ns": 377562658472140, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
{"req_id": "d0913de330eb413f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377560480352161, "t_first_token_ns": 377560744788804, "t_last_token_ns": 377563801899357, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
{"req_id": "6134390de49243b6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377560607546318, "t_first_token_ns": 377560993720351, "t_last_token_ns": 377563817032324, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "5c4762ef7bdc4eb0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377561544599747, "t_first_token_ns": 377561814542876, "t_last_token_ns": 377564121440489, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
{"req_id": "4149e157973248fd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 377563066865308, "t_first_token_ns": 377563330614103, "t_last_token_ns": 377564800641116, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 4, "error": null}

View File

@@ -0,0 +1,8 @@
{
"rate": 1.5,
"input_tokens": 4096,
"output_tokens": 256,
"duration_target_s": 240.0,
"duration_actual_s": 241.68477007100591,
"n_requests": 372
}

View File

@@ -0,0 +1,136 @@
#!/bin/bash
# Cache-size sweep orchestrator.
#
# 1. Apply CT_CACHE_SWEEP_PATCH on top of the already-installed
# CONNECTOR_TAX_PATCH (adds cache_size + worker timings).
# 2. For each config in {plain, noop_connector, mooncake_both}:
# launch vLLM → wait ready → 8-min open-loop bench → kill,
# release GPU.
# 3. Revert CT_CACHE_SWEEP_PATCH so a follow-up run is clean.
# 4. Run analyze.py and emit figures + summary.
#
# Usage: bash run_all.sh
#
# Env overrides:
# DURATION per-config bench duration (default 480 s)
# RATE open-loop rate (default 1.5 req/s)
# PORT vLLM port (default 8000)
# GPU_ID GPU index (default 0)
# MODEL_PATH model dir (default $HOME/models/Qwen/Qwen3-Coder-30B-A3B-Instruct)
# CONFIGS space-separated subset (default "plain noop_connector mooncake_both")
# SKIP_PATCH set to 1 to skip apply/revert (e.g. patch already applied)
set -uo pipefail
HERE="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
CT_DIR="$(cd "$HERE/.." && pwd)"
PROJ_DIR="$(cd "$HERE/../../.." && pwd)"
PYTHON="${PYTHON:-$PROJ_DIR/.venv/bin/python}"
VLLM_ROOT="${VLLM_ROOT:-$PROJ_DIR/.venv/lib/python3.12/site-packages/vllm}"
DURATION="${DURATION:-480}"
RATE="${RATE:-1.5}"
PORT="${PORT:-8000}"
GPU_ID="${GPU_ID:-0}"
MODEL_PATH="${MODEL_PATH:-$HOME/models/Qwen/Qwen3-Coder-30B-A3B-Instruct}"
CONFIGS="${CONFIGS:-plain noop_connector mooncake_both}"
SKIP_PATCH="${SKIP_PATCH:-0}"
DATE="$(date +%Y%m%d_%H%M)"
RUN_ROOT="$HERE/results/$DATE"
mkdir -p "$RUN_ROOT"
echo "=== Cache-size sweep ==="
echo "Run dir : $RUN_ROOT"
echo "vLLM root : $VLLM_ROOT"
echo "Configs : $CONFIGS"
echo "Rate : $RATE Duration: ${DURATION}s"
echo ""
# ── kill any leftover vLLM ────────────────────────────────────────────────
kill_all_vllm() {
pkill -9 -f "VLLM::EngineCore" 2>/dev/null || true
pkill -9 -f "vllm.entrypoints" 2>/dev/null || true
pkill -9 -f "vllm serve" 2>/dev/null || true
sleep 4
for _ in $(seq 1 20); do
used=$(nvidia-smi --query-gpu=memory.used --format=csv,noheader,nounits -i "$GPU_ID" 2>/dev/null | tr -d ' ')
[[ -n "$used" && "$used" -lt 1000 ]] && return 0
sleep 3
done
echo "WARN: GPU $GPU_ID not free after kill" >&2
}
trap 'kill_all_vllm; if [[ "$SKIP_PATCH" != "1" ]]; then "$PYTHON" "$HERE/apply_step_timing_v2.py" --revert --vllm-root "$VLLM_ROOT" || true; [[ -f "$CT_DIR/patches/apply_step_timing.py" ]] && "$PYTHON" "$CT_DIR/patches/apply_step_timing.py" --revert --vllm-root "$VLLM_ROOT" || true; fi' EXIT
# ── patch ─────────────────────────────────────────────────────────────────
if [[ "$SKIP_PATCH" != "1" ]]; then
echo "[stage 0] applying CONNECTOR_TAX_PATCH (v1) then CT_CACHE_SWEEP_PATCH (v2)"
# v1 patch adds step_duration_us + build_meta_us; v2 stacks on top.
if [[ -f "$CT_DIR/patches/apply_step_timing.py" ]]; then
"$PYTHON" "$CT_DIR/patches/apply_step_timing.py" --apply --vllm-root "$VLLM_ROOT" || true
fi
"$PYTHON" "$HERE/apply_step_timing_v2.py" --apply --vllm-root "$VLLM_ROOT"
fi
# ── per-config runs ───────────────────────────────────────────────────────
kill_all_vllm
for cfg in $CONFIGS; do
cfg_dir="$RUN_ROOT/$cfg"
mkdir -p "$cfg_dir"
launch_script="$CT_DIR/launch/launch_${cfg}.sh"
if [[ ! -f "$launch_script" ]]; then
echo "SKIP $cfg (no launch script at $launch_script)"
continue
fi
echo ""
echo "====== Config: $cfg ======"
export RUN_DIR="$cfg_dir"
export PORT GPU_ID MODEL_PATH
export AGENTIC_STEP_LOG_PATH="$cfg_dir/engine_step.jsonl"
export CT_WORKER_STEP_LOG_PATH="$cfg_dir/worker_step.jsonl"
export PYTHONPATH="$PROJ_DIR:${PYTHONPATH:-}"
: > "$cfg_dir/engine_step.jsonl" # truncate
rm -f "$cfg_dir/worker_step.jsonl".* # any leftover ranks
: > "$cfg_dir/requests.jsonl"
# Launch (the launch scripts already use common.sh).
bash "$launch_script" 2>&1 | tail -5
rc=$?
if [[ $rc -ne 0 ]]; then
echo "FAIL $cfg (launch rc=$rc) — skipping bench"
kill_all_vllm
continue
fi
echo "[bench] running ${DURATION}s open-loop at rate=$RATE"
"$PYTHON" "$HERE/run_cache_sweep.py" \
--url "http://127.0.0.1:$PORT/v1/chat/completions" \
--model "$MODEL_PATH" \
--rate "$RATE" --duration "$DURATION" \
--output-dir "$cfg_dir" 2>&1 | tail -8
# Final fetch of /metrics so the analyze step has the ceiling.
curl -s "http://127.0.0.1:$PORT/metrics" > "$cfg_dir/metrics_final.txt" 2>&1 || true
echo "[teardown] $cfg"
kill_all_vllm
done
# ── revert + analyze ──────────────────────────────────────────────────────
if [[ "$SKIP_PATCH" != "1" ]]; then
echo ""
echo "[stage Z] reverting CT_CACHE_SWEEP_PATCH then CONNECTOR_TAX_PATCH"
"$PYTHON" "$HERE/apply_step_timing_v2.py" --revert --vllm-root "$VLLM_ROOT"
if [[ -f "$CT_DIR/patches/apply_step_timing.py" ]]; then
"$PYTHON" "$CT_DIR/patches/apply_step_timing.py" --revert --vllm-root "$VLLM_ROOT" || true
fi
fi
echo ""
echo "[analyze]"
"$PYTHON" "$HERE/analyze.py" --run-root "$RUN_ROOT"
echo ""
echo "Done. Artifacts: $RUN_ROOT"

View File

@@ -0,0 +1,220 @@
#!/usr/bin/env python3
"""Continuous open-loop driver for the cache-size sweep.
The intent is to fill the prefix-cache from 0 up to GPU ceiling within a
single vLLM lifetime, so the per-step `cache_size` field (added by
`apply_step_timing_v2.py`) sweeps through every value the engine can
hold. Offline analysis bins by `cache_size` to recover the per-step
overhead curve.
Workload:
- Open-loop Poisson at fixed rate (default 1.5 req/s).
- Random per-request content (UUID + hash, calibrated to ~4096
tokens). Zero prefix-cache hits ⇒ cache strictly grows until
LRU eviction kicks in.
- max_tokens / min_tokens = 256, temperature=0, ignore_eos=True.
- Duration default 8 min.
Writes per-request metrics to `requests.jsonl`. The patch emits
per-step rows to `engine_step.jsonl` (set via AGENTIC_STEP_LOG_PATH)
and worker rows to `worker_step.jsonl.r0` (set via
CT_WORKER_STEP_LOG_PATH).
Usage:
run_cache_sweep.py \\
--url http://127.0.0.1:8000/v1/chat/completions \\
--model /path/to/qwen \\
--rate 1.5 --duration 480 \\
--output-dir results/<date>/<config>
"""
import argparse
import asyncio
import hashlib
import json
import random
import time
import uuid
from dataclasses import asdict, dataclass
from pathlib import Path
import httpx
@dataclass
class ReqMetric:
req_id: str
rate_target: float
input_tokens_target: int
output_tokens_target: int
t_send_ns: int
t_first_token_ns: int | None = None
t_last_token_ns: int | None = None
prompt_tokens: int = 0
completion_tokens: int = 0
inflight_at_send: int = 0
error: str | None = None
def make_random_prompt(target_tokens: int) -> str:
"""Same calibration as the bench_loop.py used elsewhere:
'Block N: <32-hex>' tokenizes to ~35 tokens on Qwen3-Coder."""
n_parts = max(1, target_tokens // 35)
seed = uuid.uuid4().hex
parts = []
for i in range(n_parts):
h = hashlib.md5(f"{seed}_{i}_{time.time_ns()}".encode()).hexdigest()
parts.append(f"Block {i}: {h}")
return " ".join(parts)
async def send_one(client, url, model, inp_tokens, out_tokens,
rate, inflight, inflight_cap, fh):
rid = uuid.uuid4().hex[:16]
if inflight[0] >= inflight_cap:
m = ReqMetric(req_id=rid, rate_target=rate,
input_tokens_target=inp_tokens,
output_tokens_target=out_tokens,
t_send_ns=time.perf_counter_ns(),
inflight_at_send=inflight[0],
error="dropped_inflight_cap")
fh.write(json.dumps(asdict(m)) + "\n")
return
inflight[0] += 1
m = ReqMetric(req_id=rid, rate_target=rate,
input_tokens_target=inp_tokens,
output_tokens_target=out_tokens,
t_send_ns=time.perf_counter_ns(),
inflight_at_send=inflight[0])
try:
prompt = make_random_prompt(inp_tokens)
payload = {
"model": model,
"messages": [{"role": "user", "content": prompt}],
"max_tokens": out_tokens,
"min_tokens": out_tokens,
"temperature": 0,
"ignore_eos": True,
"stream": True,
"stream_options": {"include_usage": True},
}
async with client.stream("POST", url, json=payload, timeout=600.0) as resp:
resp.raise_for_status()
async for line in resp.aiter_lines():
if not line.startswith("data: "):
continue
data = line[6:]
if data.strip() == "[DONE]":
break
try:
chunk = json.loads(data)
except json.JSONDecodeError:
continue
usage = chunk.get("usage")
if usage:
m.prompt_tokens = usage.get("prompt_tokens", m.prompt_tokens)
m.completion_tokens = usage.get(
"completion_tokens", m.completion_tokens)
choices = chunk.get("choices") or []
if not choices:
continue
delta = choices[0].get("delta", {})
if "role" in delta:
continue
now = time.perf_counter_ns()
if m.t_first_token_ns is None:
m.t_first_token_ns = now
m.t_last_token_ns = now
except Exception as e:
m.error = f"{type(e).__name__}: {e}"
finally:
inflight[0] -= 1
fh.write(json.dumps(asdict(m)) + "\n")
async def main_async(args):
out_dir = Path(args.output_dir)
out_dir.mkdir(parents=True, exist_ok=True)
req_path = out_dir / "requests.jsonl"
inflight = [0]
pending: list[asyncio.Task] = []
interval_mean = 1.0 / args.rate
rng = random.Random(int(time.time_ns()) & 0xFFFFFFFF)
print(f"[bench] rate={args.rate} shape=({args.input_tokens},{args.output_tokens}) "
f"duration={args.duration}s output={out_dir}")
fh = open(req_path, "a", buffering=1)
t0 = time.perf_counter()
last_print = t0
async with httpx.AsyncClient(timeout=httpx.Timeout(600.0)) as client:
# producer
async def producer():
while time.perf_counter() - t0 < args.duration:
pending.append(asyncio.create_task(
send_one(client, args.url, args.model,
args.input_tokens, args.output_tokens,
args.rate, inflight, args.inflight_cap, fh)
))
await asyncio.sleep(rng.expovariate(1.0 / interval_mean))
# heartbeat
async def heartbeat():
nonlocal last_print
while time.perf_counter() - t0 < args.duration + 1:
now = time.perf_counter()
if now - last_print >= 30:
print(f" t+{int(now - t0):4d}s inflight={inflight[0]} "
f"pending={len(pending)}")
last_print = now
await asyncio.sleep(2.0)
prod = asyncio.create_task(producer())
hb = asyncio.create_task(heartbeat())
await prod
hb.cancel()
try:
await hb
except asyncio.CancelledError:
pass
# final drain
if pending:
await asyncio.gather(*pending, return_exceptions=True)
fh.close()
# Quick summary so the orchestrator can sanity-check the cell ran.
n_lines = sum(1 for _ in open(req_path))
with open(out_dir / "run_summary.json", "w") as f:
json.dump({
"rate": args.rate,
"input_tokens": args.input_tokens,
"output_tokens": args.output_tokens,
"duration_target_s": args.duration,
"duration_actual_s": time.perf_counter() - t0,
"n_requests": n_lines,
}, f, indent=2)
print(f"[done] {n_lines} requests in {time.perf_counter() - t0:.1f}s")
def main():
ap = argparse.ArgumentParser()
ap.add_argument("--url", required=True)
ap.add_argument("--model", required=True)
ap.add_argument("--rate", type=float, default=1.5)
ap.add_argument("--input-tokens", type=int, default=4096)
ap.add_argument("--output-tokens", type=int, default=256)
ap.add_argument("--duration", type=float, default=480.0,
help="Total run duration in seconds (default 8 min)")
ap.add_argument("--inflight-cap", type=int, default=256)
ap.add_argument("--output-dir", required=True)
args = ap.parse_args()
asyncio.run(main_async(args))
if __name__ == "__main__":
main()