Connector tax: trace-replay confirms +45% kv_both penalty is gone; DR-fix adds 22% more
Re-runs the elastic_migration_v2 trace (w600 r0.0015 st30, 1214 reqs,
274 sessions, 8×TP1 vLLM + cache_aware_proxy) with three configs:
- plain unified
- unified + Mooncake kv_both
- unified + Mooncake kv_both + DR-fix (env-gated O(|cache|) hash sync removal)
TTFT p90: 11.97 s → 9.74 s (−18.6%) → 7.58 s (−36.6% vs plain)
E2E p90: 23.48 s → 21.25 s (−9.5%) → 17.93 s (−23.6% vs plain)
Two findings:
1. The "+45% kv_both penalty" claim from elastic_migration_v2 is OBSOLETE
on current codebase — kv_both is now *faster* than plain at p90.
Likely fixed by e3a1d70 (RDMA-READ → bootstrap PUSH refactor) and
the connector-mode delay_free_blocks extending cross-turn prefix
cache hits on a 93%-intra-session-reuse trace.
2. DR-fix removes another 22% from TTFT p90 by skipping the
O(|cache|) hash sync in build_connector_meta. Cache-sweep with
DR-fix shows slope drops from +94.5 to +2.3 μs/1k blocks.
Adds:
- run_trace_replay_drfix.sh: A/B/C harness (env CT_DR_FIX gates patch)
- analyze_trace_replay.py: TTFT/TPOT/E2E delta analysis
- REPORT_TRACE_REPLAY.md: summary + reproduction
- results/20260526_1627_drfix/: cache-sweep with DR-fix
- results/trace_replay_20260526_1652/: full trace-replay A/B/C
Implication for EAR paper: the kv_both substrate is no longer the
bottleneck blocking session migration. The prior 4 migration reverts
were dominated by transfer overhead that has now been characterized
and (partially) removed.
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This commit is contained in:
117
microbench/connector_tax/cache_sweep/REPORT_TRACE_REPLAY.md
Normal file
117
microbench/connector_tax/cache_sweep/REPORT_TRACE_REPLAY.md
Normal file
@@ -0,0 +1,117 @@
|
|||||||
|
# Trace-replay re-test with DR-fix
|
||||||
|
|
||||||
|
Run: `results/trace_replay_20260526_1652/`
|
||||||
|
Trace: `traces/w600_r0.0015_st30.jsonl` (1214 reqs, 274 sessions, 53.3 M tokens)
|
||||||
|
Topology: 8 × TP1 vLLM + `cache_aware_proxy`, Qwen3-Coder-30B-A3B-Instruct
|
||||||
|
Same trace, same proxy, same machine that produced the original
|
||||||
|
`analysis/characterization/elastic_migration_v2/` paper.
|
||||||
|
|
||||||
|
## TL;DR
|
||||||
|
|
||||||
|
The original elastic_migration_v2 paper claimed `kv_role=kv_both` (Mooncake) cost
|
||||||
|
TTFT p90 **+45 %** vs plain `unified`. **That gap no longer exists.** In a
|
||||||
|
same-day re-run on the same trace with the same 8-instance topology:
|
||||||
|
|
||||||
|
| metric | unified (plain) | unified_kv_both (baseline) | unified_kv_both_drfix |
|
||||||
|
|---|---:|---:|---:|
|
||||||
|
| TTFT p90 | **11 971 ms** | **9 744 ms** (−18.6 % vs plain) | **7 584 ms** (−36.6 % vs plain) |
|
||||||
|
| TPOT p90 | 20 ms | 22 ms (+10 %) | 18 ms (−10 %) |
|
||||||
|
| E2E p90 | 23 475 ms | 21 254 ms (−9.5 %) | 17 931 ms (−23.6 %) |
|
||||||
|
|
||||||
|
Two findings:
|
||||||
|
|
||||||
|
1. **The +45 % is gone.** kv_both without any fix is now *faster* than
|
||||||
|
plain `unified` at p90 (−18.6 %). Likely culprits in the commit chain
|
||||||
|
since the elastic_migration_v2 paper: `a7df84b` (direct RDMA read),
|
||||||
|
`0500350` (token-based lookup), `08d5e12` (NONE_HASH import fix),
|
||||||
|
and especially **`e3a1d70` (switch from RDMA READ to bootstrap-triggered
|
||||||
|
PUSH)** which restructured the producer-side critical path.
|
||||||
|
2. **DR-fix still helps.** Disabling the O(\|cache\|) hash sync removes
|
||||||
|
another 22 % from TTFT p90 (9.7 s → 7.6 s) and 16 % from E2E p90
|
||||||
|
(21.3 s → 17.9 s). The cache-sweep finding (+85 μs/1k blocks slope)
|
||||||
|
translates into measurable p90/p99 wins under high APC + agentic
|
||||||
|
session coupling.
|
||||||
|
|
||||||
|
## How this changes the elastic_migration_v2 narrative
|
||||||
|
|
||||||
|
Original paper's four claims, re-checked today:
|
||||||
|
|
||||||
|
| original claim | today's status |
|
||||||
|
|---|---|
|
||||||
|
| "kv_role=kv_both costs TTFT p90 +45 % even without PD-sep" | **OBSOLETE** (now −18.6 % vs plain) |
|
||||||
|
| "Mooncake−NIXL gap of 7 pp is implementation cost" | **NOT TESTED** (NIXL not re-run here) |
|
||||||
|
| "PD-sep rarely fires (0.41 % trigger rate)" | unchanged — trace property |
|
||||||
|
| "When PD-sep fires, mechanism is 10-20× slower than model predicts" | **NOT TESTED** (v2 policy not re-run) |
|
||||||
|
|
||||||
|
The elastic_migration_v2 README should be marked as containing **historical
|
||||||
|
data that is no longer reproducible on the current codebase**. The story
|
||||||
|
ought to be re-cast as: "+45 % was a transient bug we fixed (whether
|
||||||
|
intentionally as part of `e3a1d70` or accidentally), and the
|
||||||
|
remaining headroom (15-20 % p90) is recovered by the DR-fix."
|
||||||
|
|
||||||
|
## Full per-metric A/B/C table
|
||||||
|
|
||||||
|
(10 s warmup discarded by the replayer; n=1214 each)
|
||||||
|
|
||||||
|
| metric | unified | unified_kv_both | drfix | mc vs plain | drfix vs plain | drfix vs mc |
|
||||||
|
|---|---:|---:|---:|---:|---:|---:|
|
||||||
|
| TTFT mean | 4 018 ms | 3 552 ms | 3 103 ms | −11.6 % | −22.8 % | −12.6 % |
|
||||||
|
| TTFT p50 | 500 ms | 501 ms | 485 ms | +0.2 % | −3.0 % | −3.2 % |
|
||||||
|
| **TTFT p90** | 11 971 ms | 9 744 ms | **7 584 ms** | **−18.6 %** | **−36.6 %** | **−22.2 %** |
|
||||||
|
| TTFT p99 | 46 695 ms | 42 432 ms | 41 883 ms | −9.1 % | −10.3 % | −1.3 % |
|
||||||
|
| TPOT mean | 15.3 ms | 14.4 ms | 14.0 ms | −5.9 % | −8.5 % | −2.8 % |
|
||||||
|
| TPOT p50 | 8.4 ms | 8.3 ms | 8.0 ms | −0.9 % | −3.9 % | −3.1 % |
|
||||||
|
| **TPOT p90** | 19.6 ms | 21.6 ms | **17.7 ms** | +10.0 % | **−9.7 %** | **−17.9 %** |
|
||||||
|
| **TPOT p99** | 151.6 ms | 127.8 ms | **112.4 ms** | −15.7 % | **−25.9 %** | **−12.1 %** |
|
||||||
|
| E2E mean | 8 180 ms | 7 967 ms | 7 184 ms | −2.6 % | −12.2 % | −9.8 % |
|
||||||
|
| E2E p50 | 1 942 ms | 1 995 ms | 1 806 ms | +2.7 % | −7.0 % | −9.5 % |
|
||||||
|
| **E2E p90** | 23 475 ms | 21 254 ms | **17 931 ms** | −9.5 % | **−23.6 %** | **−15.6 %** |
|
||||||
|
| E2E p99 | 73 709 ms | 76 630 ms | 71 958 ms | +4.0 % | −2.4 % | −6.1 % |
|
||||||
|
|
||||||
|
## Why kv_both already beats plain (without DR-fix)
|
||||||
|
|
||||||
|
A connector-loaded vLLM has `delay_free_blocks=True` by default — block
|
||||||
|
eviction is deferred until the connector's bookkeeping signals it is
|
||||||
|
safe. On a 93 %-intra-session-reuse trace, this **extends prefix-cache
|
||||||
|
hit windows across session turns**, which more than compensates for the
|
||||||
|
per-step connector cost on the codebase as it exists today. With the
|
||||||
|
DR-fix removing the remaining O(\|cache\|) tax, the net swings strongly
|
||||||
|
positive.
|
||||||
|
|
||||||
|
This was also one of the explanations proposed in the cache_sweep report
|
||||||
|
("connector mode has higher effective cache utilisation") and is now
|
||||||
|
confirmed at the trace-replay scale.
|
||||||
|
|
||||||
|
## Reproducibility
|
||||||
|
|
||||||
|
```
|
||||||
|
bash microbench/connector_tax/cache_sweep/run_trace_replay_drfix.sh
|
||||||
|
```
|
||||||
|
|
||||||
|
Runtime: ~2.5 h on 8 × H20. The orchestrator applies CT_DR_FIX, runs
|
||||||
|
the three policies serially (plain → mc baseline → mc drfix via env
|
||||||
|
var), reverts the patch, and emits per-policy metrics.jsonl. Analyse
|
||||||
|
with:
|
||||||
|
|
||||||
|
```
|
||||||
|
python microbench/connector_tax/cache_sweep/analyze_trace_replay.py \\
|
||||||
|
--root microbench/connector_tax/cache_sweep/results/trace_replay_20260526_1652
|
||||||
|
```
|
||||||
|
|
||||||
|
## Files
|
||||||
|
|
||||||
|
```
|
||||||
|
trace_replay_20260526_1652/
|
||||||
|
├── trace_replay_summary.json — machine-readable per-config TTFT/TPOT/E2E
|
||||||
|
├── unified/ — plain control
|
||||||
|
│ ├── metrics.jsonl — per-request timings (1214 rows)
|
||||||
|
│ ├── metrics.summary.json — replayer's own summary
|
||||||
|
│ ├── breakdown.json — proxy per-decision metadata
|
||||||
|
│ ├── stats.json — proxy aggregate counters
|
||||||
|
│ └── run_window.json — t_start/t_end + policy + trace
|
||||||
|
├── unified_kv_both/ — Mooncake kv_both, hash sync ON
|
||||||
|
└── unified_kv_both_drfix/ — Mooncake kv_both, hash sync OFF (env-gated)
|
||||||
|
```
|
||||||
|
|
||||||
|
Heavy artifacts (engine_state/, vllm logs, replayer.log, proxy.log)
|
||||||
|
are `.gitignore`d — re-derive with `run_trace_replay_drfix.sh`.
|
||||||
115
microbench/connector_tax/cache_sweep/analyze_trace_replay.py
Executable file
115
microbench/connector_tax/cache_sweep/analyze_trace_replay.py
Executable file
@@ -0,0 +1,115 @@
|
|||||||
|
#!/usr/bin/env python3
|
||||||
|
"""Compute TTFT/TPOT/E2E mean/p50/p90/p99 from trace-replay metrics.jsonl.
|
||||||
|
|
||||||
|
Usage:
|
||||||
|
python analyze_trace_replay.py --root <outroot>
|
||||||
|
where <outroot>/{unified,unified_kv_both,unified_kv_both_drfix}/metrics.jsonl
|
||||||
|
each contain one JSONL object per request (with ttft_s, tpot_s, latency_s).
|
||||||
|
"""
|
||||||
|
|
||||||
|
import argparse
|
||||||
|
import json
|
||||||
|
import statistics
|
||||||
|
from pathlib import Path
|
||||||
|
|
||||||
|
|
||||||
|
def pct(xs, p):
|
||||||
|
if not xs:
|
||||||
|
return None
|
||||||
|
xs = sorted(xs)
|
||||||
|
k = max(0, min(len(xs) - 1, int(p / 100.0 * (len(xs) - 1))))
|
||||||
|
return xs[k]
|
||||||
|
|
||||||
|
|
||||||
|
def summarise(rows):
|
||||||
|
ok = [r for r in rows if not r.get("error")]
|
||||||
|
ttft = [r["ttft_s"] * 1000 for r in ok if r.get("ttft_s") is not None]
|
||||||
|
tpot = [r["tpot_s"] * 1000 for r in ok if r.get("tpot_s")]
|
||||||
|
e2e = [r["latency_s"] * 1000 for r in ok if r.get("latency_s") is not None]
|
||||||
|
return {
|
||||||
|
"n_total": len(rows),
|
||||||
|
"n_ok": len(ok),
|
||||||
|
"n_err": len(rows) - len(ok),
|
||||||
|
"ttft_mean_ms": statistics.mean(ttft) if ttft else None,
|
||||||
|
"ttft_p50_ms": pct(ttft, 50),
|
||||||
|
"ttft_p90_ms": pct(ttft, 90),
|
||||||
|
"ttft_p99_ms": pct(ttft, 99),
|
||||||
|
"tpot_mean_ms": statistics.mean(tpot) if tpot else None,
|
||||||
|
"tpot_p50_ms": pct(tpot, 50),
|
||||||
|
"tpot_p90_ms": pct(tpot, 90),
|
||||||
|
"tpot_p99_ms": pct(tpot, 99),
|
||||||
|
"e2e_mean_ms": statistics.mean(e2e) if e2e else None,
|
||||||
|
"e2e_p50_ms": pct(e2e, 50),
|
||||||
|
"e2e_p90_ms": pct(e2e, 90),
|
||||||
|
"e2e_p99_ms": pct(e2e, 99),
|
||||||
|
}
|
||||||
|
|
||||||
|
|
||||||
|
def main():
|
||||||
|
ap = argparse.ArgumentParser()
|
||||||
|
ap.add_argument("--root", type=Path, required=True)
|
||||||
|
ap.add_argument("--configs", nargs="+",
|
||||||
|
default=["unified", "unified_kv_both", "unified_kv_both_drfix"])
|
||||||
|
args = ap.parse_args()
|
||||||
|
|
||||||
|
results = {}
|
||||||
|
for cfg in args.configs:
|
||||||
|
p = args.root / cfg / "metrics.jsonl"
|
||||||
|
if not p.exists():
|
||||||
|
print(f"MISSING: {p}")
|
||||||
|
continue
|
||||||
|
rows = [json.loads(l) for l in open(p) if l.strip()]
|
||||||
|
results[cfg] = summarise(rows)
|
||||||
|
|
||||||
|
def fmt(v):
|
||||||
|
return f"{v:.0f}" if v is not None else "-"
|
||||||
|
|
||||||
|
def pctd(a, b):
|
||||||
|
if a is None or b is None or a == 0:
|
||||||
|
return "-"
|
||||||
|
return f"{(b/a-1)*100:+.1f}%"
|
||||||
|
|
||||||
|
print(f"{'metric':<14}", end="")
|
||||||
|
for cfg in args.configs:
|
||||||
|
print(f"{cfg[:20]:>22}", end="")
|
||||||
|
print()
|
||||||
|
print("-" * (14 + 22 * len(args.configs)))
|
||||||
|
|
||||||
|
for m in ["n_ok", "ttft_mean_ms", "ttft_p50_ms", "ttft_p90_ms", "ttft_p99_ms",
|
||||||
|
"tpot_mean_ms", "tpot_p50_ms", "tpot_p90_ms", "tpot_p99_ms",
|
||||||
|
"e2e_mean_ms", "e2e_p50_ms", "e2e_p90_ms", "e2e_p99_ms"]:
|
||||||
|
print(f"{m:<14}", end="")
|
||||||
|
for cfg in args.configs:
|
||||||
|
if cfg not in results:
|
||||||
|
print(f"{'-':>22}", end="")
|
||||||
|
else:
|
||||||
|
print(f"{fmt(results[cfg][m]):>22}", end="")
|
||||||
|
print()
|
||||||
|
|
||||||
|
# Tax tables
|
||||||
|
if len(args.configs) >= 2 and all(c in results for c in args.configs):
|
||||||
|
plain = args.configs[0]
|
||||||
|
for ref_cfg in args.configs[1:]:
|
||||||
|
print(f"\n=== {ref_cfg} vs {plain} ===")
|
||||||
|
for m in ["ttft_p50_ms", "ttft_p90_ms", "ttft_p99_ms",
|
||||||
|
"tpot_p50_ms", "tpot_p90_ms", "tpot_p99_ms",
|
||||||
|
"e2e_p50_ms", "e2e_p90_ms", "e2e_p99_ms"]:
|
||||||
|
a = results[plain][m]; b = results[ref_cfg][m]
|
||||||
|
print(f" {m:<14} {fmt(a):>10} → {fmt(b):>10} ({pctd(a, b)})")
|
||||||
|
if "unified_kv_both" in results and "unified_kv_both_drfix" in results:
|
||||||
|
print(f"\n=== DR-fix improvement: unified_kv_both_drfix vs unified_kv_both ===")
|
||||||
|
for m in ["ttft_p50_ms", "ttft_p90_ms", "ttft_p99_ms",
|
||||||
|
"tpot_p50_ms", "tpot_p90_ms", "tpot_p99_ms",
|
||||||
|
"e2e_p50_ms", "e2e_p90_ms", "e2e_p99_ms"]:
|
||||||
|
a = results["unified_kv_both"][m]
|
||||||
|
b = results["unified_kv_both_drfix"][m]
|
||||||
|
print(f" {m:<14} {fmt(a):>10} → {fmt(b):>10} ({pctd(a, b)})")
|
||||||
|
|
||||||
|
# Save machine-readable
|
||||||
|
with open(args.root / "trace_replay_summary.json", "w") as f:
|
||||||
|
json.dump(results, f, indent=2)
|
||||||
|
print(f"\nWrote {args.root}/trace_replay_summary.json")
|
||||||
|
|
||||||
|
|
||||||
|
if __name__ == "__main__":
|
||||||
|
main()
|
||||||
@@ -0,0 +1,89 @@
|
|||||||
|
# Cache-size sweep — summary
|
||||||
|
|
||||||
|
Run root: `/home/admin/cpfs/wjh/agentic-kv/microbench/connector_tax/cache_sweep/results/20260526_1627_drfix`
|
||||||
|
|
||||||
|
## Per-config fit (`step_duration_us ≈ a + b · cache_size`)
|
||||||
|
|
||||||
|
| config | n steps | cache max | step_dur p50 (μs) | build_meta p50 (μs) | slope (μs / 1k blocks) | intercept (μs) |
|
||||||
|
|---|---:|---:|---:|---:|---:|---:|
|
||||||
|
| mooncake_both | 15022 | 17533 | 1555 | 1450 | 94.5 | 85.7 |
|
||||||
|
| mooncake_both_drfix | 14938 | 17533 | 97 | 6 | 2.3 | 66.4 |
|
||||||
|
| plain | 15126 | 17533 | 87 | 0 | 2.6 | 51.1 |
|
||||||
|
|
||||||
|
### mooncake_both — per-bin
|
||||||
|
|
||||||
|
| bin | cache mid | n | step_dur p50 | step_dur p90 | build_meta p50 | build_meta p90 |
|
||||||
|
|---:|---:|---:|---:|---:|---:|---:|
|
||||||
|
| 3 | 6137 | 183 | 753 | 1105 | 639 | 968 |
|
||||||
|
| 4 | 7890 | 921 | 763 | 1102 | 694 | 983 |
|
||||||
|
| 5 | 9643 | 223 | 933 | 1336 | 834 | 1223 |
|
||||||
|
| 6 | 11396 | 404 | 1039 | 1488 | 954 | 1363 |
|
||||||
|
| 7 | 13150 | 367 | 1174 | 1659 | 1078 | 1540 |
|
||||||
|
| 8 | 14903 | 468 | 1211 | 1685 | 1136 | 1593 |
|
||||||
|
| 9 | 16656 | 12079 | 1555 | 2204 | 1450 | 2051 |
|
||||||
|
|
||||||
|
*worker side (n=15522)* — get_finished p50/p90/p99 = 180/255/349 μs; start_load_kv p50/p90 = 2/5 μs
|
||||||
|
|
||||||
|
|
||||||
|
### mooncake_both_drfix — per-bin
|
||||||
|
|
||||||
|
| bin | cache mid | n | step_dur p50 | step_dur p90 | build_meta p50 | build_meta p90 |
|
||||||
|
|---:|---:|---:|---:|---:|---:|---:|
|
||||||
|
| 3 | 6137 | 180 | 125 | 144 | 6 | 7 |
|
||||||
|
| 4 | 7890 | 921 | 66 | 99 | 6 | 7 |
|
||||||
|
| 5 | 9643 | 217 | 102 | 120 | 6 | 7 |
|
||||||
|
| 6 | 11396 | 384 | 91 | 119 | 6 | 7 |
|
||||||
|
| 7 | 13150 | 344 | 98 | 124 | 6 | 7 |
|
||||||
|
| 8 | 14903 | 456 | 73 | 100 | 6 | 7 |
|
||||||
|
| 9 | 16656 | 12060 | 97 | 169 | 6 | 7 |
|
||||||
|
|
||||||
|
*worker side (n=15438)* — get_finished p50/p90/p99 = 180/256/345 μs; start_load_kv p50/p90 = 2/2 μs
|
||||||
|
|
||||||
|
|
||||||
|
### plain — per-bin
|
||||||
|
|
||||||
|
| bin | cache mid | n | step_dur p50 | step_dur p90 | build_meta p50 | build_meta p90 |
|
||||||
|
|---:|---:|---:|---:|---:|---:|---:|
|
||||||
|
| 3 | 6137 | 215 | 105 | 135 | 0 | 0 |
|
||||||
|
| 4 | 7890 | 979 | 54 | 81 | 0 | 0 |
|
||||||
|
| 5 | 9643 | 230 | 83 | 104 | 0 | 0 |
|
||||||
|
| 6 | 11396 | 402 | 78 | 101 | 0 | 0 |
|
||||||
|
| 7 | 13150 | 371 | 81 | 103 | 0 | 0 |
|
||||||
|
| 8 | 14903 | 468 | 61 | 83 | 0 | 0 |
|
||||||
|
| 9 | 16656 | 12086 | 87 | 157 | 0 | 0 |
|
||||||
|
|
||||||
|
## Connector tax(cache_size) — mooncake_both vs plain
|
||||||
|
|
||||||
|
| bin | cache mid | baseline step p50 | target step p50 | tax (μs) | tax (%) |
|
||||||
|
|---:|---:|---:|---:|---:|---:|
|
||||||
|
| 3 | 6137 | 105 | 753 | +648 | +617.1 |
|
||||||
|
| 4 | 7890 | 54 | 763 | +709 | +1313.0 |
|
||||||
|
| 5 | 9643 | 83 | 933 | +850 | +1024.1 |
|
||||||
|
| 6 | 11396 | 78 | 1039 | +961 | +1232.1 |
|
||||||
|
| 7 | 13150 | 81 | 1174 | +1093 | +1349.4 |
|
||||||
|
| 8 | 14903 | 61 | 1211 | +1150 | +1885.2 |
|
||||||
|
| 9 | 16656 | 87 | 1555 | +1468 | +1687.4 |
|
||||||
|
|
||||||
|
## Connector tax(cache_size) — mooncake_both_drfix vs plain
|
||||||
|
|
||||||
|
| bin | cache mid | baseline step p50 | target step p50 | tax (μs) | tax (%) |
|
||||||
|
|---:|---:|---:|---:|---:|---:|
|
||||||
|
| 3 | 6137 | 105 | 125 | +20 | +19.0 |
|
||||||
|
| 4 | 7890 | 54 | 66 | +12 | +22.2 |
|
||||||
|
| 5 | 9643 | 83 | 102 | +19 | +22.9 |
|
||||||
|
| 6 | 11396 | 78 | 91 | +13 | +16.7 |
|
||||||
|
| 7 | 13150 | 81 | 98 | +17 | +21.0 |
|
||||||
|
| 8 | 14903 | 61 | 73 | +12 | +19.7 |
|
||||||
|
| 9 | 16656 | 87 | 97 | +10 | +11.5 |
|
||||||
|
|
||||||
|
## DR-fix savings — mooncake_both vs mooncake_both_drfix
|
||||||
|
|
||||||
|
| bin | cache mid | baseline step p50 | target step p50 | tax (μs) | tax (%) |
|
||||||
|
|---:|---:|---:|---:|---:|---:|
|
||||||
|
| 3 | 6137 | 125 | 753 | +628 | +502.4 |
|
||||||
|
| 4 | 7890 | 66 | 763 | +697 | +1056.1 |
|
||||||
|
| 5 | 9643 | 102 | 933 | +831 | +814.7 |
|
||||||
|
| 6 | 11396 | 91 | 1039 | +948 | +1041.8 |
|
||||||
|
| 7 | 13150 | 98 | 1174 | +1076 | +1098.0 |
|
||||||
|
| 8 | 14903 | 73 | 1211 | +1138 | +1558.9 |
|
||||||
|
| 9 | 16656 | 97 | 1555 | +1458 | +1503.1 |
|
||||||
@@ -0,0 +1,624 @@
|
|||||||
|
# HELP python_gc_objects_collected_total Objects collected during gc
|
||||||
|
# TYPE python_gc_objects_collected_total counter
|
||||||
|
python_gc_objects_collected_total{generation="0"} 11855.0
|
||||||
|
python_gc_objects_collected_total{generation="1"} 1664.0
|
||||||
|
python_gc_objects_collected_total{generation="2"} 855.0
|
||||||
|
# HELP python_gc_objects_uncollectable_total Uncollectable objects found during GC
|
||||||
|
# TYPE python_gc_objects_uncollectable_total counter
|
||||||
|
python_gc_objects_uncollectable_total{generation="0"} 0.0
|
||||||
|
python_gc_objects_uncollectable_total{generation="1"} 0.0
|
||||||
|
python_gc_objects_uncollectable_total{generation="2"} 0.0
|
||||||
|
# HELP python_gc_collections_total Number of times this generation was collected
|
||||||
|
# TYPE python_gc_collections_total counter
|
||||||
|
python_gc_collections_total{generation="0"} 1344.0
|
||||||
|
python_gc_collections_total{generation="1"} 123.0
|
||||||
|
python_gc_collections_total{generation="2"} 9.0
|
||||||
|
# HELP python_info Python platform information
|
||||||
|
# TYPE python_info gauge
|
||||||
|
python_info{implementation="CPython",major="3",minor="12",patchlevel="3",version="3.12.3"} 1.0
|
||||||
|
# HELP process_virtual_memory_bytes Virtual memory size in bytes.
|
||||||
|
# TYPE process_virtual_memory_bytes gauge
|
||||||
|
process_virtual_memory_bytes 4.1068527616e+010
|
||||||
|
# HELP process_resident_memory_bytes Resident memory size in bytes.
|
||||||
|
# TYPE process_resident_memory_bytes gauge
|
||||||
|
process_resident_memory_bytes 1.3826048e+09
|
||||||
|
# HELP process_start_time_seconds Start time of the process since unix epoch in seconds.
|
||||||
|
# TYPE process_start_time_seconds gauge
|
||||||
|
process_start_time_seconds 1.77981319578e+09
|
||||||
|
# HELP process_cpu_seconds_total Total user and system CPU time spent in seconds.
|
||||||
|
# TYPE process_cpu_seconds_total counter
|
||||||
|
process_cpu_seconds_total 39.28
|
||||||
|
# HELP process_open_fds Number of open file descriptors.
|
||||||
|
# TYPE process_open_fds gauge
|
||||||
|
process_open_fds 67.0
|
||||||
|
# HELP process_max_fds Maximum number of open file descriptors.
|
||||||
|
# TYPE process_max_fds gauge
|
||||||
|
process_max_fds 1.048575e+06
|
||||||
|
# HELP vllm:estimated_flops_per_gpu_total Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_flops_per_gpu_total counter
|
||||||
|
vllm:estimated_flops_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:estimated_flops_per_gpu_created Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_flops_per_gpu_created gauge
|
||||||
|
vllm:estimated_flops_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003540344e+09
|
||||||
|
# HELP vllm:estimated_read_bytes_per_gpu_total Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_read_bytes_per_gpu_total counter
|
||||||
|
vllm:estimated_read_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:estimated_read_bytes_per_gpu_created Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_read_bytes_per_gpu_created gauge
|
||||||
|
vllm:estimated_read_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813300354065e+09
|
||||||
|
# HELP vllm:estimated_write_bytes_per_gpu_total Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_write_bytes_per_gpu_total counter
|
||||||
|
vllm:estimated_write_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:estimated_write_bytes_per_gpu_created Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_write_bytes_per_gpu_created gauge
|
||||||
|
vllm:estimated_write_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003540845e+09
|
||||||
|
# HELP vllm:num_requests_running Number of requests in model execution batches.
|
||||||
|
# TYPE vllm:num_requests_running gauge
|
||||||
|
vllm:num_requests_running{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:num_requests_waiting Number of requests waiting to be processed.
|
||||||
|
# TYPE vllm:num_requests_waiting gauge
|
||||||
|
vllm:num_requests_waiting{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:engine_sleep_state Engine sleep state; awake = 0 means engine is sleeping; awake = 1 means engine is awake; weights_offloaded = 1 means sleep level 1; discard_all = 1 means sleep level 2.
|
||||||
|
# TYPE vllm:engine_sleep_state gauge
|
||||||
|
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="awake"} 1.0
|
||||||
|
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="weights_offloaded"} 0.0
|
||||||
|
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="discard_all"} 0.0
|
||||||
|
# HELP vllm:kv_cache_usage_perc KV-cache usage. 1 means 100 percent usage.
|
||||||
|
# TYPE vllm:kv_cache_usage_perc gauge
|
||||||
|
vllm:kv_cache_usage_perc{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prefix_cache_queries_total Prefix cache queries, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:prefix_cache_queries_total counter
|
||||||
|
vllm:prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:prefix_cache_queries_created Prefix cache queries, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:prefix_cache_queries_created gauge
|
||||||
|
vllm:prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003542354e+09
|
||||||
|
# HELP vllm:prefix_cache_hits_total Prefix cache hits, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:prefix_cache_hits_total counter
|
||||||
|
vllm:prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prefix_cache_hits_created Prefix cache hits, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:prefix_cache_hits_created gauge
|
||||||
|
vllm:prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003542495e+09
|
||||||
|
# HELP vllm:external_prefix_cache_queries_total External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_queries_total counter
|
||||||
|
vllm:external_prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:external_prefix_cache_queries_created External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_queries_created gauge
|
||||||
|
vllm:external_prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003542619e+09
|
||||||
|
# HELP vllm:external_prefix_cache_hits_total External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_hits_total counter
|
||||||
|
vllm:external_prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:external_prefix_cache_hits_created External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_hits_created gauge
|
||||||
|
vllm:external_prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813300354274e+09
|
||||||
|
# HELP vllm:mm_cache_queries_total Multi-modal cache queries, in terms of number of queried items.
|
||||||
|
# TYPE vllm:mm_cache_queries_total counter
|
||||||
|
vllm:mm_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:mm_cache_queries_created Multi-modal cache queries, in terms of number of queried items.
|
||||||
|
# TYPE vllm:mm_cache_queries_created gauge
|
||||||
|
vllm:mm_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003542848e+09
|
||||||
|
# HELP vllm:mm_cache_hits_total Multi-modal cache hits, in terms of number of cached items.
|
||||||
|
# TYPE vllm:mm_cache_hits_total counter
|
||||||
|
vllm:mm_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:mm_cache_hits_created Multi-modal cache hits, in terms of number of cached items.
|
||||||
|
# TYPE vllm:mm_cache_hits_created gauge
|
||||||
|
vllm:mm_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003542967e+09
|
||||||
|
# HELP vllm:num_preemptions_total Cumulative number of preemption from the engine.
|
||||||
|
# TYPE vllm:num_preemptions_total counter
|
||||||
|
vllm:num_preemptions_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:num_preemptions_created Cumulative number of preemption from the engine.
|
||||||
|
# TYPE vllm:num_preemptions_created gauge
|
||||||
|
vllm:num_preemptions_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003543088e+09
|
||||||
|
# HELP vllm:prompt_tokens_total Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:prompt_tokens_total counter
|
||||||
|
vllm:prompt_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:prompt_tokens_created Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:prompt_tokens_created gauge
|
||||||
|
vllm:prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003543196e+09
|
||||||
|
# HELP vllm:prompt_tokens_by_source_total Number of prompt tokens by source.
|
||||||
|
# TYPE vllm:prompt_tokens_by_source_total counter
|
||||||
|
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.563446e+06
|
||||||
|
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 0.0
|
||||||
|
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 0.0
|
||||||
|
# HELP vllm:prompt_tokens_by_source_created Number of prompt tokens by source.
|
||||||
|
# TYPE vllm:prompt_tokens_by_source_created gauge
|
||||||
|
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.7798133003543339e+09
|
||||||
|
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 1.7798133003543391e+09
|
||||||
|
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 1.779813300354344e+09
|
||||||
|
# HELP vllm:prompt_tokens_cached_total Number of cached prompt tokens (local + external).
|
||||||
|
# TYPE vllm:prompt_tokens_cached_total counter
|
||||||
|
vllm:prompt_tokens_cached_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prompt_tokens_cached_created Number of cached prompt tokens (local + external).
|
||||||
|
# TYPE vllm:prompt_tokens_cached_created gauge
|
||||||
|
vllm:prompt_tokens_cached_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003543563e+09
|
||||||
|
# HELP vllm:prompt_tokens_recomputed_total Number of cached tokens recomputed for forward pass.
|
||||||
|
# TYPE vllm:prompt_tokens_recomputed_total counter
|
||||||
|
vllm:prompt_tokens_recomputed_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prompt_tokens_recomputed_created Number of cached tokens recomputed for forward pass.
|
||||||
|
# TYPE vllm:prompt_tokens_recomputed_created gauge
|
||||||
|
vllm:prompt_tokens_recomputed_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003543699e+09
|
||||||
|
# HELP vllm:generation_tokens_total Number of generation tokens processed.
|
||||||
|
# TYPE vllm:generation_tokens_total counter
|
||||||
|
vllm:generation_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:generation_tokens_created Number of generation tokens processed.
|
||||||
|
# TYPE vllm:generation_tokens_created gauge
|
||||||
|
vllm:generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003543828e+09
|
||||||
|
# HELP vllm:request_success_total Count of successfully processed requests.
|
||||||
|
# TYPE vllm:request_success_total counter
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:request_success_created Count of successfully processed requests.
|
||||||
|
# TYPE vllm:request_success_created gauge
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003544142e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813300354424e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003544307e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813300354447e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003544533e+09
|
||||||
|
# HELP vllm:request_prompt_tokens Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:request_prompt_tokens histogram
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:request_prompt_tokens_created Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:request_prompt_tokens_created gauge
|
||||||
|
vllm:request_prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813300354504e+09
|
||||||
|
# HELP vllm:request_generation_tokens Number of generation tokens processed.
|
||||||
|
# TYPE vllm:request_generation_tokens histogram
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:request_generation_tokens_created Number of generation tokens processed.
|
||||||
|
# TYPE vllm:request_generation_tokens_created gauge
|
||||||
|
vllm:request_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003545852e+09
|
||||||
|
# HELP vllm:iteration_tokens_total Histogram of number of tokens per engine_step.
|
||||||
|
# TYPE vllm:iteration_tokens_total histogram
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3053.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="8.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 10439.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="16.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 14463.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="32.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15126.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="64.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15126.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="128.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15126.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="256.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15126.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="512.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15126.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="1024.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15126.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="2048.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15126.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="4096.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15474.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="8192.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15496.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="16384.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15496.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15496.0
|
||||||
|
vllm:iteration_tokens_total_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15496.0
|
||||||
|
vllm:iteration_tokens_total_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.663798e+06
|
||||||
|
# HELP vllm:iteration_tokens_total_created Histogram of number of tokens per engine_step.
|
||||||
|
# TYPE vllm:iteration_tokens_total_created gauge
|
||||||
|
vllm:iteration_tokens_total_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003546233e+09
|
||||||
|
# HELP vllm:request_max_num_generation_tokens Histogram of maximum number of requested generation tokens.
|
||||||
|
# TYPE vllm:request_max_num_generation_tokens histogram
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:request_max_num_generation_tokens_created Histogram of maximum number of requested generation tokens.
|
||||||
|
# TYPE vllm:request_max_num_generation_tokens_created gauge
|
||||||
|
vllm:request_max_num_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003546584e+09
|
||||||
|
# HELP vllm:request_params_n Histogram of the n request parameter.
|
||||||
|
# TYPE vllm:request_params_n histogram
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
# HELP vllm:request_params_n_created Histogram of the n request parameter.
|
||||||
|
# TYPE vllm:request_params_n_created gauge
|
||||||
|
vllm:request_params_n_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003546975e+09
|
||||||
|
# HELP vllm:request_params_max_tokens Histogram of the max_tokens request parameter.
|
||||||
|
# TYPE vllm:request_params_max_tokens histogram
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:request_params_max_tokens_created Histogram of the max_tokens request parameter.
|
||||||
|
# TYPE vllm:request_params_max_tokens_created gauge
|
||||||
|
vllm:request_params_max_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813300354721e+09
|
||||||
|
# HELP vllm:time_to_first_token_seconds Histogram of time to first token in seconds.
|
||||||
|
# TYPE vllm:time_to_first_token_seconds histogram
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.001",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.005",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.02",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.04",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.06",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.08",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.25",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 335.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 385.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 391.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="160.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="640.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="2560.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 140.4927158355713
|
||||||
|
# HELP vllm:time_to_first_token_seconds_created Histogram of time to first token in seconds.
|
||||||
|
# TYPE vllm:time_to_first_token_seconds_created gauge
|
||||||
|
vllm:time_to_first_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813300354762e+09
|
||||||
|
# HELP vllm:inter_token_latency_seconds Histogram of inter-token latency in seconds.
|
||||||
|
# TYPE vllm:inter_token_latency_seconds histogram
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 24758.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 93979.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96677.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96768.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96768.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96768.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 97006.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99764.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99764.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2025.8075081515126
|
||||||
|
# HELP vllm:inter_token_latency_seconds_created Histogram of inter-token latency in seconds.
|
||||||
|
# TYPE vllm:inter_token_latency_seconds_created gauge
|
||||||
|
vllm:inter_token_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003548026e+09
|
||||||
|
# HELP vllm:request_time_per_output_token_seconds Histogram of time_per_output_token_seconds per request.
|
||||||
|
# TYPE vllm:request_time_per_output_token_seconds histogram
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 32.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 276.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 7.944343169221613
|
||||||
|
# HELP vllm:request_time_per_output_token_seconds_created Histogram of time_per_output_token_seconds per request.
|
||||||
|
# TYPE vllm:request_time_per_output_token_seconds_created gauge
|
||||||
|
vllm:request_time_per_output_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003548424e+09
|
||||||
|
# HELP vllm:e2e_request_latency_seconds Histogram of e2e request latency in seconds.
|
||||||
|
# TYPE vllm:e2e_request_latency_seconds histogram
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 12.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 16.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 168.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2166.030620574951
|
||||||
|
# HELP vllm:e2e_request_latency_seconds_created Histogram of e2e request latency in seconds.
|
||||||
|
# TYPE vllm:e2e_request_latency_seconds_created gauge
|
||||||
|
vllm:e2e_request_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003549252e+09
|
||||||
|
# HELP vllm:request_queue_time_seconds Histogram of time spent in WAITING phase for request.
|
||||||
|
# TYPE vllm:request_queue_time_seconds histogram
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0029499955708160996
|
||||||
|
# HELP vllm:request_queue_time_seconds_created Histogram of time spent in WAITING phase for request.
|
||||||
|
# TYPE vllm:request_queue_time_seconds_created gauge
|
||||||
|
vllm:request_queue_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003549771e+09
|
||||||
|
# HELP vllm:request_inference_time_seconds Histogram of time spent in RUNNING phase for request.
|
||||||
|
# TYPE vllm:request_inference_time_seconds histogram
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 12.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 19.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 171.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2144.0190685334965
|
||||||
|
# HELP vllm:request_inference_time_seconds_created Histogram of time spent in RUNNING phase for request.
|
||||||
|
# TYPE vllm:request_inference_time_seconds_created gauge
|
||||||
|
vllm:request_inference_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003550127e+09
|
||||||
|
# HELP vllm:request_prefill_time_seconds Histogram of time spent in PREFILL phase for request.
|
||||||
|
# TYPE vllm:request_prefill_time_seconds histogram
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 306.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 378.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 118.21156038198387
|
||||||
|
# HELP vllm:request_prefill_time_seconds_created Histogram of time spent in PREFILL phase for request.
|
||||||
|
# TYPE vllm:request_prefill_time_seconds_created gauge
|
||||||
|
vllm:request_prefill_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003550553e+09
|
||||||
|
# HELP vllm:request_decode_time_seconds Histogram of time spent in DECODE phase for request.
|
||||||
|
# TYPE vllm:request_decode_time_seconds histogram
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 6.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 32.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 188.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2025.8075081515126
|
||||||
|
# HELP vllm:request_decode_time_seconds_created Histogram of time spent in DECODE phase for request.
|
||||||
|
# TYPE vllm:request_decode_time_seconds_created gauge
|
||||||
|
vllm:request_decode_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003550992e+09
|
||||||
|
# HELP vllm:request_prefill_kv_computed_tokens Histogram of new KV tokens computed during prefill (excluding cached tokens).
|
||||||
|
# TYPE vllm:request_prefill_kv_computed_tokens histogram
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:request_prefill_kv_computed_tokens_created Histogram of new KV tokens computed during prefill (excluding cached tokens).
|
||||||
|
# TYPE vllm:request_prefill_kv_computed_tokens_created gauge
|
||||||
|
vllm:request_prefill_kv_computed_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798133003551395e+09
|
||||||
|
# HELP vllm:cache_config_info Information of the LLMEngine CacheConfig
|
||||||
|
# TYPE vllm:cache_config_info gauge
|
||||||
|
vllm:cache_config_info{_block_size_resolved="True",block_size="16",cache_dtype="auto",calculate_kv_scales="False",cpu_kvcache_space_bytes="None",enable_prefix_caching="True",engine="0",gpu_memory_utilization="0.9",is_attention_free="False",kv_cache_memory_bytes="None",kv_offloading_backend="native",kv_offloading_size="None",kv_sharing_fast_prefill="False",mamba_block_size="None",mamba_cache_dtype="auto",mamba_cache_mode="none",mamba_page_size_padded="None",mamba_ssm_cache_dtype="auto",num_cpu_blocks="None",num_gpu_blocks="17590",num_gpu_blocks_override="None",prefix_caching_hash_algo="sha256",sliding_window="None",user_specified_block_size="False"} 1.0
|
||||||
|
# HELP http_requests_total Total number of requests by method, status and handler.
|
||||||
|
# TYPE http_requests_total counter
|
||||||
|
http_requests_total{handler="/v1/models",method="GET",status="2xx"} 1.0
|
||||||
|
http_requests_total{handler="/v1/chat/completions",method="POST",status="2xx"} 392.0
|
||||||
|
# HELP http_requests_created Total number of requests by method, status and handler.
|
||||||
|
# TYPE http_requests_created gauge
|
||||||
|
http_requests_created{handler="/v1/models",method="GET",status="2xx"} 1.7798133015051112e+09
|
||||||
|
http_requests_created{handler="/v1/chat/completions",method="POST",status="2xx"} 1.7798133101895015e+09
|
||||||
|
# HELP http_request_size_bytes Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_request_size_bytes summary
|
||||||
|
http_request_size_bytes_count{handler="/v1/models"} 1.0
|
||||||
|
http_request_size_bytes_sum{handler="/v1/models"} 0.0
|
||||||
|
http_request_size_bytes_count{handler="/v1/chat/completions"} 392.0
|
||||||
|
http_request_size_bytes_sum{handler="/v1/chat/completions"} 2.067408e+06
|
||||||
|
# HELP http_request_size_bytes_created Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_request_size_bytes_created gauge
|
||||||
|
http_request_size_bytes_created{handler="/v1/models"} 1.779813301505138e+09
|
||||||
|
http_request_size_bytes_created{handler="/v1/chat/completions"} 1.779813310189524e+09
|
||||||
|
# HELP http_response_size_bytes Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_response_size_bytes summary
|
||||||
|
http_response_size_bytes_count{handler="/v1/models"} 1.0
|
||||||
|
http_response_size_bytes_sum{handler="/v1/models"} 558.0
|
||||||
|
http_response_size_bytes_count{handler="/v1/chat/completions"} 392.0
|
||||||
|
http_response_size_bytes_sum{handler="/v1/chat/completions"} 0.0
|
||||||
|
# HELP http_response_size_bytes_created Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_response_size_bytes_created gauge
|
||||||
|
http_response_size_bytes_created{handler="/v1/models"} 1.7798133015051622e+09
|
||||||
|
http_response_size_bytes_created{handler="/v1/chat/completions"} 1.779813310189559e+09
|
||||||
|
# HELP http_request_duration_highr_seconds Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
|
||||||
|
# TYPE http_request_duration_highr_seconds histogram
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.01"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.025"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.05"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.075"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.1"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.25"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.5"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.75"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="1.0"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="1.5"} 4.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="2.0"} 13.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="2.5"} 17.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="3.0"} 38.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="3.5"} 64.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="4.0"} 97.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="4.5"} 137.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="5.0"} 168.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="7.5"} 317.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="10.0"} 393.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="30.0"} 393.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="60.0"} 393.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="+Inf"} 393.0
|
||||||
|
http_request_duration_highr_seconds_count 393.0
|
||||||
|
http_request_duration_highr_seconds_sum 2166.610084610991
|
||||||
|
# HELP http_request_duration_highr_seconds_created Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
|
||||||
|
# TYPE http_request_duration_highr_seconds_created gauge
|
||||||
|
http_request_duration_highr_seconds_created 1.7798133008326678e+09
|
||||||
|
# HELP http_request_duration_seconds Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
|
||||||
|
# TYPE http_request_duration_seconds histogram
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="0.1",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="0.5",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="1.0",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="+Inf",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_count{handler="/v1/models",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_sum{handler="/v1/models",method="GET"} 0.002584011002909392
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.1",method="POST"} 0.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.5",method="POST"} 0.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="1.0",method="POST"} 0.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="+Inf",method="POST"} 392.0
|
||||||
|
http_request_duration_seconds_count{handler="/v1/chat/completions",method="POST"} 392.0
|
||||||
|
http_request_duration_seconds_sum{handler="/v1/chat/completions",method="POST"} 2166.6075005999883
|
||||||
|
# HELP http_request_duration_seconds_created Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
|
||||||
|
# TYPE http_request_duration_seconds_created gauge
|
||||||
|
http_request_duration_seconds_created{handler="/v1/models",method="GET"} 1.7798133015051963e+09
|
||||||
|
http_request_duration_seconds_created{handler="/v1/chat/completions",method="POST"} 1.7798133101895926e+09
|
||||||
@@ -0,0 +1,392 @@
|
|||||||
|
{"req_id": "f9f8f44fd4964565", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382437389577401, "t_first_token_ns": 382438624486558, "t_last_token_ns": 382445176383671, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "36bbb6c17a024be2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382437758622379, "t_first_token_ns": 382438628930287, "t_last_token_ns": 382445191700409, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "accbc730595b4e03", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382437750767014, "t_first_token_ns": 382438628800887, "t_last_token_ns": 382445192711752, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "95ac8f889eb69b61", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382438922597478, "t_first_token_ns": 382439185252921, "t_last_token_ns": 382445905930298, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "70cae603829190e5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382439160927208, "t_first_token_ns": 382439428846049, "t_last_token_ns": 382445935280706, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "2769a82161526fa2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382439467984936, "t_first_token_ns": 382439737177607, "t_last_token_ns": 382446054712022, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "7a3ae284fd025e54", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382439612710610, "t_first_token_ns": 382439979195013, "t_last_token_ns": 382446083950577, "prompt_tokens": 3951, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8c7ebece6fc8ef96", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382440170206384, "t_first_token_ns": 382440442066324, "t_last_token_ns": 382446377633851, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8a6e9ed8388bab88", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382440289587605, "t_first_token_ns": 382440950969850, "t_last_token_ns": 382446401628484, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "291192528bfcd0d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382440378392441, "t_first_token_ns": 382440950474266, "t_last_token_ns": 382446401778723, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "29aa5369f15ef8ed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382440757795029, "t_first_token_ns": 382441144284197, "t_last_token_ns": 382446412736549, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "7d168f7f34914df8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382441309388810, "t_first_token_ns": 382441592635961, "t_last_token_ns": 382446570064080, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "5949b30b5660db44", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382441438015460, "t_first_token_ns": 382441859834831, "t_last_token_ns": 382446590096511, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "dee2814e53c1133c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382441975959999, "t_first_token_ns": 382442256420634, "t_last_token_ns": 382446701824931, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "d3d6d687b1270d0c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382442269107707, "t_first_token_ns": 382442541951337, "t_last_token_ns": 382446731430437, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "0114c7c017f510dc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382444384297948, "t_first_token_ns": 382444666814468, "t_last_token_ns": 382448570089387, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "6b23ad60e8365056", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382444448698891, "t_first_token_ns": 382444928950527, "t_last_token_ns": 382448590598469, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "c147b43b07e42731", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382447013425270, "t_first_token_ns": 382447266047466, "t_last_token_ns": 382451710024007, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "d6e4d018b0cbaea9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382447368720581, "t_first_token_ns": 382447631457694, "t_last_token_ns": 382452146409124, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "6a3c0ae558f764fa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382447837183982, "t_first_token_ns": 382448097688268, "t_last_token_ns": 382452506332984, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "6f4afc5b3b7d8068", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382447945273506, "t_first_token_ns": 382448341929293, "t_last_token_ns": 382452530649478, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "e2a3bc5ca5dd8e06", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382448792375180, "t_first_token_ns": 382449057720674, "t_last_token_ns": 382453092189001, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "750b6e6f651a9619", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382448933775290, "t_first_token_ns": 382449304107276, "t_last_token_ns": 382453112039595, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "088884483a9971d8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382449212908720, "t_first_token_ns": 382449765983825, "t_last_token_ns": 382453127777284, "prompt_tokens": 4025, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "318a0f6dc8a5806d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382449229982668, "t_first_token_ns": 382449766179480, "t_last_token_ns": 382453127902117, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "60fa44dd296e868f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382449506948011, "t_first_token_ns": 382450014265300, "t_last_token_ns": 382453140448338, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "be5160126f3a14e8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382451792046929, "t_first_token_ns": 382452064186965, "t_last_token_ns": 382453925982691, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "31994709d9d43ecc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382454362144895, "t_first_token_ns": 382454617475530, "t_last_token_ns": 382456677101078, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "ebb3f3698c984adb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382455272303246, "t_first_token_ns": 382455529012501, "t_last_token_ns": 382457792269315, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "a0f7008353f8beb2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382455275527482, "t_first_token_ns": 382455763865191, "t_last_token_ns": 382457798590489, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "7eac0cde0fd34276", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382457154279527, "t_first_token_ns": 382457415663145, "t_last_token_ns": 382459136315954, "prompt_tokens": 4019, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "95601ae65eae9166", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382458519014415, "t_first_token_ns": 382458775363414, "t_last_token_ns": 382461775777394, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "189d08c7b9ae2624", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382459502179916, "t_first_token_ns": 382459760837769, "t_last_token_ns": 382463993698558, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "607beffbb7acca39", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382459634061552, "t_first_token_ns": 382460219885361, "t_last_token_ns": 382464016879370, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "93918e85acdf8b5d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382459705304225, "t_first_token_ns": 382460220190030, "t_last_token_ns": 382464017362528, "prompt_tokens": 4052, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "de6c910bd8be862f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382460062778526, "t_first_token_ns": 382460462139348, "t_last_token_ns": 382464034152066, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "776a9724799617aa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382461508843542, "t_first_token_ns": 382461775693883, "t_last_token_ns": 382465384806618, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "b6357510a726d24b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382462084901575, "t_first_token_ns": 382462357246820, "t_last_token_ns": 382465970711712, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "7814350e435d2264", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382462974956284, "t_first_token_ns": 382463244583335, "t_last_token_ns": 382466653527027, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "c354ffa98aea604a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382463152644356, "t_first_token_ns": 382463495303606, "t_last_token_ns": 382466668212875, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "b95654a80317948f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382463647248147, "t_first_token_ns": 382463916896184, "t_last_token_ns": 382467265576288, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "d3983c9d6e42e3df", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382464472111144, "t_first_token_ns": 382464734089563, "t_last_token_ns": 382467738975882, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "f12e1b2630a2dfcc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382465634355094, "t_first_token_ns": 382465897675055, "t_last_token_ns": 382468970527033, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "26aa2698c0cddf5a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382466732208704, "t_first_token_ns": 382466994108335, "t_last_token_ns": 382469959668954, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "1dd5dcf6889436bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382466909538981, "t_first_token_ns": 382467238570993, "t_last_token_ns": 382469974719268, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "55f20a37c1e89743", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382468287338760, "t_first_token_ns": 382468551454768, "t_last_token_ns": 382472309585383, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "e013af03a51488e2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382468450592952, "t_first_token_ns": 382468795594395, "t_last_token_ns": 382472328366364, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4fefe92b23621583", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382469530792393, "t_first_token_ns": 382469795228681, "t_last_token_ns": 382473260671312, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "5af9d220c4dd9318", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382470071632420, "t_first_token_ns": 382470331206993, "t_last_token_ns": 382474272253525, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "a3ff67ad251967cf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382470210283982, "t_first_token_ns": 382470573707444, "t_last_token_ns": 382474291068276, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "c9258f74cc827511", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382470803017339, "t_first_token_ns": 382471065284026, "t_last_token_ns": 382474556321540, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "140eb88bb615ba37", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382471290160497, "t_first_token_ns": 382471559753754, "t_last_token_ns": 382474770567684, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "d4a164b2ff3b3509", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382473414542570, "t_first_token_ns": 382473679724791, "t_last_token_ns": 382476127118405, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "9f1266f29a9e8b28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382473445005968, "t_first_token_ns": 382473916338608, "t_last_token_ns": 382476132562562, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "57c78bedb1e5433a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382473566563150, "t_first_token_ns": 382474151063736, "t_last_token_ns": 382476136336060, "prompt_tokens": 3957, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "08ff3938b9e3ae71", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382476295109479, "t_first_token_ns": 382476550216865, "t_last_token_ns": 382480084840128, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "fc976d1e97a41cca", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382477486016430, "t_first_token_ns": 382477745650670, "t_last_token_ns": 382482414526694, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "6c45ae3b99de1e62", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382477594839270, "t_first_token_ns": 382478026788300, "t_last_token_ns": 382482437249163, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "417d200da6fc7d82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382477768996297, "t_first_token_ns": 382478224680993, "t_last_token_ns": 382482447676552, "prompt_tokens": 4042, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "2b45e140974d73b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382478286209490, "t_first_token_ns": 382478546159467, "t_last_token_ns": 382482571618890, "prompt_tokens": 3948, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4809e923113cfa22", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382478400750739, "t_first_token_ns": 382478794183956, "t_last_token_ns": 382482591954567, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "b80243b2ea980e07", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382478697175974, "t_first_token_ns": 382479253383133, "t_last_token_ns": 382482607560641, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "1baef27c84916aa0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382478662567849, "t_first_token_ns": 382479253272657, "t_last_token_ns": 382482607838615, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "efd5183f824026cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382479526718914, "t_first_token_ns": 382479803334785, "t_last_token_ns": 382482781775044, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "df3338f95a55a940", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382479582470896, "t_first_token_ns": 382480047354178, "t_last_token_ns": 382482790919117, "prompt_tokens": 3921, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "973d64287adcd8a7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382483064886146, "t_first_token_ns": 382483315719026, "t_last_token_ns": 382484536420131, "prompt_tokens": 3934, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "19674684ced3de91", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382484754332925, "t_first_token_ns": 382485009723647, "t_last_token_ns": 382486562576231, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "e6fa68dd3de56239", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382485791833395, "t_first_token_ns": 382486050153133, "t_last_token_ns": 382488348675781, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "ec06f5beb70436d9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382486778895482, "t_first_token_ns": 382487037893477, "t_last_token_ns": 382490624167990, "prompt_tokens": 4040, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "034f1151aa2d2bdd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382487085210739, "t_first_token_ns": 382487343912089, "t_last_token_ns": 382490772830448, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "d156973f3cae174d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382487842932072, "t_first_token_ns": 382488107761289, "t_last_token_ns": 382492064647727, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "c2a0107acab59bd0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382488798198258, "t_first_token_ns": 382489061893733, "t_last_token_ns": 382493232248598, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "25faaab088637291", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382489235731063, "t_first_token_ns": 382489497882232, "t_last_token_ns": 382493975534288, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4a6ff2135636c100", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382489280247392, "t_first_token_ns": 382489735263829, "t_last_token_ns": 382493986443913, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "70f1b6f6bad6005a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382489936547679, "t_first_token_ns": 382490201628971, "t_last_token_ns": 382494219336248, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "485e723c2b84eb6b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382490200994411, "t_first_token_ns": 382490474208550, "t_last_token_ns": 382494255288383, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "6be120b83e225aa3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382490829087298, "t_first_token_ns": 382491096050472, "t_last_token_ns": 382494799554578, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "291a0df34911423c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382491194007215, "t_first_token_ns": 382491460992366, "t_last_token_ns": 382494890810440, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "9a771254d6418297", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382493226986312, "t_first_token_ns": 382493499283891, "t_last_token_ns": 382497243595179, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "3d63e1d817747a07", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382493377569753, "t_first_token_ns": 382493749298011, "t_last_token_ns": 382497262616849, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "a85143fddaf4a5d9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382494354993420, "t_first_token_ns": 382494615993462, "t_last_token_ns": 382497929637802, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "ed998b60c04d0117", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382495155354715, "t_first_token_ns": 382495421665947, "t_last_token_ns": 382500920313094, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "10c050e2cad99a17", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382495697920101, "t_first_token_ns": 382495963228080, "t_last_token_ns": 382501748435860, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "03bc3ccba18fc900", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382495754699453, "t_first_token_ns": 382496426353180, "t_last_token_ns": 382501776136824, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "6ca800f53f978a58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382495875735543, "t_first_token_ns": 382496426442487, "t_last_token_ns": 382501776330229, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "0d6ebbadc6b6f44a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382498155875052, "t_first_token_ns": 382498446433637, "t_last_token_ns": 382505462079780, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "2225316b65c25005", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382498332760132, "t_first_token_ns": 382498690592166, "t_last_token_ns": 382505494237853, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "48f171c96419c86f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382498452220455, "t_first_token_ns": 382498929296884, "t_last_token_ns": 382505511018130, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "53e2a3f9153c46d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382498680321517, "t_first_token_ns": 382499174378195, "t_last_token_ns": 382505539491266, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8f000c5b2b8ac58e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382499194643124, "t_first_token_ns": 382499468350138, "t_last_token_ns": 382505626778294, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "601066832f30b428", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382499684243574, "t_first_token_ns": 382499960670296, "t_last_token_ns": 382505955281317, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "bf6ab969a046d65b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382499690680877, "t_first_token_ns": 382500413097914, "t_last_token_ns": 382505969596833, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "d417d380a4f40330", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382499687588662, "t_first_token_ns": 382500413986604, "t_last_token_ns": 382505970138023, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "d23fe5d3ae909682", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382500148946424, "t_first_token_ns": 382500880741948, "t_last_token_ns": 382505993874864, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "08d7416f17740d2e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382500285778410, "t_first_token_ns": 382500881197127, "t_last_token_ns": 382505994237693, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "f63c4d00aca7e8d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382500910329182, "t_first_token_ns": 382501189339488, "t_last_token_ns": 382506284106012, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "72e028e252e47e71", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382501978976396, "t_first_token_ns": 382502261558163, "t_last_token_ns": 382507367884139, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "90e18fcccb2043f7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382502046453211, "t_first_token_ns": 382502511742224, "t_last_token_ns": 382507390351589, "prompt_tokens": 3956, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "7e9fdd59228f0876", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382502516188899, "t_first_token_ns": 382502795910463, "t_last_token_ns": 382507429090512, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "245bf7e5865d03d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382502865829021, "t_first_token_ns": 382503143740173, "t_last_token_ns": 382507507343488, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "2033b1bb6c32c79a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382503908780746, "t_first_token_ns": 382504191323242, "t_last_token_ns": 382507936069608, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "0f0a89285f54e2c1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382505974184533, "t_first_token_ns": 382506240719957, "t_last_token_ns": 382512575213051, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "da6f35b293578e17", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382506334792956, "t_first_token_ns": 382506597397852, "t_last_token_ns": 382512754521545, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "6c160ab03b66d81b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382506521102124, "t_first_token_ns": 382506844462678, "t_last_token_ns": 382512783409796, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "acad4eb8b9a0e9f6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382507927324678, "t_first_token_ns": 382508184796558, "t_last_token_ns": 382516793729695, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "502a12aa8ba51234", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382507946485977, "t_first_token_ns": 382508420349646, "t_last_token_ns": 382516813996890, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "c0e0a7d3c8cf4b95", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382508215141384, "t_first_token_ns": 382508664942341, "t_last_token_ns": 382516852847142, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "541738310001dd1c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382508550871264, "t_first_token_ns": 382508908224668, "t_last_token_ns": 382516887643500, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "525463c05dc42958", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382508682327065, "t_first_token_ns": 382509142379808, "t_last_token_ns": 382516903692291, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "3926bdb966d6e395", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382509442789565, "t_first_token_ns": 382509715139829, "t_last_token_ns": 382517692214247, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "586a0d3540151470", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382509931249037, "t_first_token_ns": 382510200647732, "t_last_token_ns": 382518077571210, "prompt_tokens": 3929, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "37c5cdb56477a163", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382510122966465, "t_first_token_ns": 382510451647261, "t_last_token_ns": 382518107973431, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "5947be4f56ddbab5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382510479547209, "t_first_token_ns": 382510757565536, "t_last_token_ns": 382518430502259, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "2fe62cb82ee987a4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382511265191378, "t_first_token_ns": 382511544026473, "t_last_token_ns": 382520010758384, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "8c278419850c8b5e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382511528235553, "t_first_token_ns": 382511812071381, "t_last_token_ns": 382520064199218, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "021ff28eaacd14c5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382511576266103, "t_first_token_ns": 382512054538001, "t_last_token_ns": 382520081980815, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "faaf800e58c8f85f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382511988602843, "t_first_token_ns": 382512305847801, "t_last_token_ns": 382520138797371, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "744f358ad7f63e97", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382512134262340, "t_first_token_ns": 382512558259585, "t_last_token_ns": 382520368976103, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "927918347bcc40c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382512782697171, "t_first_token_ns": 382513072882974, "t_last_token_ns": 382520676841536, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "2fcef28ebf707351", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382512933430842, "t_first_token_ns": 382513539463682, "t_last_token_ns": 382520707349885, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "fe233f2c57e680fd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382513030737352, "t_first_token_ns": 382513539792576, "t_last_token_ns": 382520707639984, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "b95acf9eb3705d28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382514527718437, "t_first_token_ns": 382514811760873, "t_last_token_ns": 382522795028695, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "96b58f314a838835", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382514795552424, "t_first_token_ns": 382515090939441, "t_last_token_ns": 382522839641262, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "08456882f8d362d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382515026862060, "t_first_token_ns": 382515348875802, "t_last_token_ns": 382522870704951, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "4c8bd15a715f7720", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382515543192642, "t_first_token_ns": 382515845820383, "t_last_token_ns": 382523290208732, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "6949ec56d00ce8b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382516331822228, "t_first_token_ns": 382516614480687, "t_last_token_ns": 382523929889233, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 22, "error": null}
|
||||||
|
{"req_id": "0fce2da54ec51dfd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382517073924545, "t_first_token_ns": 382517351952672, "t_last_token_ns": 382525062155518, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "4238f6957ae36e57", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382518142696218, "t_first_token_ns": 382518429368359, "t_last_token_ns": 382526993670530, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "943ff34258d40d28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382518344386201, "t_first_token_ns": 382518682842460, "t_last_token_ns": 382527033833241, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "dfca62202759aa8a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382518716450255, "t_first_token_ns": 382518998315088, "t_last_token_ns": 382527133404845, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "4397f621098474f5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382519019117753, "t_first_token_ns": 382519313348320, "t_last_token_ns": 382527216881599, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "beebd08c7cc682ce", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382519504991884, "t_first_token_ns": 382519793548537, "t_last_token_ns": 382527496248745, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "7c168a9454ca5620", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382520074984992, "t_first_token_ns": 382520355215427, "t_last_token_ns": 382527794548337, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "affa6c57b9777216", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382520737406251, "t_first_token_ns": 382521021190792, "t_last_token_ns": 382528454442242, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "e5ffcb3c7f664dc7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382520785147939, "t_first_token_ns": 382521263319061, "t_last_token_ns": 382528470519248, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "a024c6b1441ebbd3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382521095248115, "t_first_token_ns": 382521517865045, "t_last_token_ns": 382528500130872, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "17af21e67df524a9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382521917465814, "t_first_token_ns": 382522214574313, "t_last_token_ns": 382529162055877, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "80ff547317c08b98", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382521942946875, "t_first_token_ns": 382522446209121, "t_last_token_ns": 382529175209725, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "70a5b87cfadecff6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382522858195956, "t_first_token_ns": 382523139238024, "t_last_token_ns": 382529549448010, "prompt_tokens": 3945, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "843c0165f72929a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382523630626089, "t_first_token_ns": 382523916049615, "t_last_token_ns": 382530284140975, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "73b4128afce8e243", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382523975291321, "t_first_token_ns": 382524252905006, "t_last_token_ns": 382530385318937, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "205e89d222fdda00", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382524374438289, "t_first_token_ns": 382524661860477, "t_last_token_ns": 382530557660267, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "7117082172557ca5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382524579480604, "t_first_token_ns": 382524918039124, "t_last_token_ns": 382530581854788, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "7cc19a30a453f5f4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382525471524807, "t_first_token_ns": 382525754941155, "t_last_token_ns": 382531044787077, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "8ba8086b8eec10d6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382525909213754, "t_first_token_ns": 382526201111035, "t_last_token_ns": 382531185251712, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "ec1a4c4a2ea7ed12", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382526137329232, "t_first_token_ns": 382526672910762, "t_last_token_ns": 382531204541312, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "6b0f100f2f9945ac", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382526115532501, "t_first_token_ns": 382526672077235, "t_last_token_ns": 382531204891242, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "768828307182d0ad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382526420089946, "t_first_token_ns": 382526934569338, "t_last_token_ns": 382531220875941, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "034e749c721e456f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382528112549250, "t_first_token_ns": 382528398152142, "t_last_token_ns": 382533588060933, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "73a8827fa8847e13", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382528865442733, "t_first_token_ns": 382529149305356, "t_last_token_ns": 382534012442599, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "bbcdf2009f998102", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382529521405698, "t_first_token_ns": 382529805468501, "t_last_token_ns": 382534359967723, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "9fa8ae1957c0f1d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382531332708291, "t_first_token_ns": 382531589103058, "t_last_token_ns": 382536344777554, "prompt_tokens": 3937, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "b11891ac95fc3b64", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382531354393380, "t_first_token_ns": 382531823645133, "t_last_token_ns": 382536356702087, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "59a9a00a0d3a1563", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382531969815276, "t_first_token_ns": 382532239742398, "t_last_token_ns": 382537050425487, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "d8bfc0767d724848", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382532109765524, "t_first_token_ns": 382532480545077, "t_last_token_ns": 382537303115064, "prompt_tokens": 3937, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "345d39ae3777d415", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382532447480238, "t_first_token_ns": 382532729127077, "t_last_token_ns": 382537324636915, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "9866cbc828a4fca8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382532582135674, "t_first_token_ns": 382532977638830, "t_last_token_ns": 382537345989837, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "af397ebf951c7719", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382532868973643, "t_first_token_ns": 382533224383475, "t_last_token_ns": 382537364615236, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "65c0b4265eb5dd19", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382534414068559, "t_first_token_ns": 382534680950229, "t_last_token_ns": 382539810900573, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "89dffa88c862c5b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382535276693870, "t_first_token_ns": 382535553025833, "t_last_token_ns": 382542180226043, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "dc476408b33fbf5a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382536525865820, "t_first_token_ns": 382536802140386, "t_last_token_ns": 382545196588445, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "319a82c9b4681316", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382536578569866, "t_first_token_ns": 382537050342553, "t_last_token_ns": 382545238586382, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "71d7c157bf1ea669", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382536817959306, "t_first_token_ns": 382537292206399, "t_last_token_ns": 382545258834833, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "2cc13ac9d11806a5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382537840579422, "t_first_token_ns": 382538112365538, "t_last_token_ns": 382546901356847, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "129148915a429e01", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382537868701528, "t_first_token_ns": 382538348065041, "t_last_token_ns": 382546923475828, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8b0e7810ec937f82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382538067394897, "t_first_token_ns": 382538796559997, "t_last_token_ns": 382546939714805, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "df7587e4a3d50532", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382538009748837, "t_first_token_ns": 382538797411685, "t_last_token_ns": 382546940241100, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "fb520fc29874b954", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382539202081711, "t_first_token_ns": 382539477823824, "t_last_token_ns": 382548591985918, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "dd803dc1b0f0a741", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382539379029004, "t_first_token_ns": 382539727913056, "t_last_token_ns": 382548633670640, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "93327e2e45a9ea52", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382539747532476, "t_first_token_ns": 382540028246626, "t_last_token_ns": 382548719023767, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "f37f16b87565dc53", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382539757628771, "t_first_token_ns": 382540265122286, "t_last_token_ns": 382548735706172, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "3bcfa41060f2dd80", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382539839084215, "t_first_token_ns": 382540505269875, "t_last_token_ns": 382548751340308, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "5badec87742c1dc2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382540320649291, "t_first_token_ns": 382540758594664, "t_last_token_ns": 382548781037008, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "d1e7b052e89f38f0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382541366643409, "t_first_token_ns": 382541641863947, "t_last_token_ns": 382550199617221, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "36a41522c4af841a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382541645803795, "t_first_token_ns": 382541926952432, "t_last_token_ns": 382550273163689, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "56afd7a006446ce0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382541903505384, "t_first_token_ns": 382542179827927, "t_last_token_ns": 382550300301587, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "1ec26802903d0510", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382541931639474, "t_first_token_ns": 382542419352459, "t_last_token_ns": 382550315423716, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "a642c8cc74bcd308", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382542002654835, "t_first_token_ns": 382542872366762, "t_last_token_ns": 382550329902956, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "16ab6647bde110e0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382542110595693, "t_first_token_ns": 382542871172933, "t_last_token_ns": 382550330042072, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "ab5eaa452796e93b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382542265151880, "t_first_token_ns": 382543112170170, "t_last_token_ns": 382550343382527, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "57388896a658fda9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382542829871132, "t_first_token_ns": 382543356622004, "t_last_token_ns": 382550357334738, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "74e18421d9a9571b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382543127745832, "t_first_token_ns": 382543617624562, "t_last_token_ns": 382550383309596, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 22, "error": null}
|
||||||
|
{"req_id": "51627206c78da939", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382545664613837, "t_first_token_ns": 382545953714992, "t_last_token_ns": 382552728980875, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "ce0e4f081e4c436a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382545888426694, "t_first_token_ns": 382546215268879, "t_last_token_ns": 382552757199943, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "9826eaf60529d07a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382547183633361, "t_first_token_ns": 382547468425600, "t_last_token_ns": 382554628407483, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "f06c8516cb43f569", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382547248159920, "t_first_token_ns": 382547727188400, "t_last_token_ns": 382554657382891, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "a2fcaaf1ad76b0b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382547749305957, "t_first_token_ns": 382548035998553, "t_last_token_ns": 382554731970397, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "0edc9b51f130a612", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382548281546192, "t_first_token_ns": 382548572326827, "t_last_token_ns": 382554946612409, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "ab79c0a9dc372db9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382548996595119, "t_first_token_ns": 382549285030314, "t_last_token_ns": 382555384059798, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "284610eaafd0e42a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382549349094595, "t_first_token_ns": 382549631425279, "t_last_token_ns": 382555501894184, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "89cb2a334a9f6238", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382549844906315, "t_first_token_ns": 382550135311004, "t_last_token_ns": 382555939724336, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "7ea2d46c75d8fe9a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382550848584015, "t_first_token_ns": 382551122653883, "t_last_token_ns": 382557020885060, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "024f2a25db9d388f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382550958759520, "t_first_token_ns": 382551373648056, "t_last_token_ns": 382557049054869, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "4bff469af089d526", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382551464967050, "t_first_token_ns": 382551739366012, "t_last_token_ns": 382557407256291, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "700bf9b5ddd3a040", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382552057083001, "t_first_token_ns": 382552337968074, "t_last_token_ns": 382557717340011, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "56c8618490819588", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382553072592691, "t_first_token_ns": 382553349038330, "t_last_token_ns": 382559586778068, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "897a93d4897b76cf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382553192957626, "t_first_token_ns": 382553605272662, "t_last_token_ns": 382559645432890, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "fa28b6fdceb04b0a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382553568184682, "t_first_token_ns": 382554075554532, "t_last_token_ns": 382559904878853, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "9d6d324a99e130fb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382553550527913, "t_first_token_ns": 382554075833888, "t_last_token_ns": 382559905146499, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "2b3fcd354a2135a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382554293518390, "t_first_token_ns": 382554576785937, "t_last_token_ns": 382561034799423, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "e29b1c49a4e6562a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382555644551640, "t_first_token_ns": 382555927355314, "t_last_token_ns": 382562123804150, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "729dbd22ce5934ca", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382555935793265, "t_first_token_ns": 382556207273448, "t_last_token_ns": 382562181599030, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "150d24a23cc28808", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382557066488596, "t_first_token_ns": 382557336207968, "t_last_token_ns": 382563798671707, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "1247310989ae8f13", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382557782459484, "t_first_token_ns": 382558048740890, "t_last_token_ns": 382564376890772, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "e40763fc59ed0799", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382557794629036, "t_first_token_ns": 382558288738142, "t_last_token_ns": 382564391218633, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "7db7d42bc0b6e636", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382558251041036, "t_first_token_ns": 382558757058302, "t_last_token_ns": 382564417783559, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "a0ed26ee4cdfed05", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382558218648750, "t_first_token_ns": 382558757471640, "t_last_token_ns": 382564418328997, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "c81fd9769ec09a5f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382558502828703, "t_first_token_ns": 382559008057576, "t_last_token_ns": 382564441400816, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "8a5e9c377ef0f53c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382559578902820, "t_first_token_ns": 382559859996023, "t_last_token_ns": 382567308062492, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "3bb8df688946d953", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382559669094788, "t_first_token_ns": 382560114470598, "t_last_token_ns": 382567344245123, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "448dfb7bcf90f1f3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382560225661693, "t_first_token_ns": 382560501719488, "t_last_token_ns": 382567993931385, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "33ee8dfa5d511f36", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382560266757746, "t_first_token_ns": 382560737065548, "t_last_token_ns": 382568011710697, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "fe73928ec06f2ffd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382560345407185, "t_first_token_ns": 382560975847633, "t_last_token_ns": 382568025666052, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "b3b98aab260b94f5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382562373129078, "t_first_token_ns": 382562648740542, "t_last_token_ns": 382569588593134, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "ea50a73400df1150", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382562571701115, "t_first_token_ns": 382563117337441, "t_last_token_ns": 382569617372914, "prompt_tokens": 4063, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "0616a2c3128fe7ad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382562441514880, "t_first_token_ns": 382563117511254, "t_last_token_ns": 382569617663398, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "4608dc5a783e53a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382564449347157, "t_first_token_ns": 382564725894402, "t_last_token_ns": 382570925571422, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "d2629c04025d1ad2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382564696510896, "t_first_token_ns": 382564973569712, "t_last_token_ns": 382570953214148, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "871a29b70510a5f2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382564860143833, "t_first_token_ns": 382565437437664, "t_last_token_ns": 382570980983956, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "78ec14117dc5c5e0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382564928741655, "t_first_token_ns": 382565437734558, "t_last_token_ns": 382570981215086, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "83b69b3a671f955d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382565390713102, "t_first_token_ns": 382565691330741, "t_last_token_ns": 382571034768598, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "7174a671812eb138", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382565431953109, "t_first_token_ns": 382565931902618, "t_last_token_ns": 382571249452883, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "51ff885b54684fc5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382565598437190, "t_first_token_ns": 382566171529197, "t_last_token_ns": 382571257834811, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "37f77dec01a5a091", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382565766199083, "t_first_token_ns": 382566411909532, "t_last_token_ns": 382571266591490, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "280e5ba3748db94d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382566678906660, "t_first_token_ns": 382566968474355, "t_last_token_ns": 382571486280077, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "e4a29ae517fc90cb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382566797223086, "t_first_token_ns": 382567225346580, "t_last_token_ns": 382571501103915, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "191f846cb9f66a8d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382567389123840, "t_first_token_ns": 382567676808515, "t_last_token_ns": 382571603445113, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "35f7d27c1cba45d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382567574174075, "t_first_token_ns": 382567936852443, "t_last_token_ns": 382571615902759, "prompt_tokens": 4008, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "7c00e83d6e1dd0e6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382570966429048, "t_first_token_ns": 382571249315667, "t_last_token_ns": 382572627166715, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "3d4d2bb9bff4449a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382574080931525, "t_first_token_ns": 382574338697970, "t_last_token_ns": 382575560814792, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "7272725ee8fd0540", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382575799750264, "t_first_token_ns": 382576055222955, "t_last_token_ns": 382577278144728, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "a2fea85d23f1bda3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382577335846602, "t_first_token_ns": 382577590433097, "t_last_token_ns": 382579243658877, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "011158111ca19afe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382577874140165, "t_first_token_ns": 382578130799988, "t_last_token_ns": 382579553343952, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "8da37110844555c6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382579656166374, "t_first_token_ns": 382579912533024, "t_last_token_ns": 382582250437266, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "d8b99c661f516313", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382580337338311, "t_first_token_ns": 382580597104657, "t_last_token_ns": 382583718961424, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "a02c08c718952117", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382580453669521, "t_first_token_ns": 382580835793972, "t_last_token_ns": 382583739085671, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "85285854a309da38", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382581332761994, "t_first_token_ns": 382581592624269, "t_last_token_ns": 382584661887497, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "7ddc31684732a450", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382582595348050, "t_first_token_ns": 382582861458895, "t_last_token_ns": 382587175204698, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "536791c815121f58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382582605773245, "t_first_token_ns": 382583097875675, "t_last_token_ns": 382587185679221, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "8576f22a0c427bbf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382582751115895, "t_first_token_ns": 382583331982632, "t_last_token_ns": 382587196964924, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "ca99a4a5056f40d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382583774507189, "t_first_token_ns": 382584037681433, "t_last_token_ns": 382587698640192, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "5838d0fe27e1457f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382584749198686, "t_first_token_ns": 382585018681905, "t_last_token_ns": 382588960257822, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "816823128084eebd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382585298111091, "t_first_token_ns": 382585562600934, "t_last_token_ns": 382589553589211, "prompt_tokens": 3932, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "bb0921b5e49295ee", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382585385854383, "t_first_token_ns": 382585808354347, "t_last_token_ns": 382589572713505, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "346cf978d3834bc0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382585592478956, "t_first_token_ns": 382586271411803, "t_last_token_ns": 382589592712137, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "c886d982435860b7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382585772484656, "t_first_token_ns": 382586271515796, "t_last_token_ns": 382589592845374, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "fdbc4ac00b838337", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382588401071195, "t_first_token_ns": 382588673725478, "t_last_token_ns": 382592472661119, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "fa46f37dea905ba5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382588571935919, "t_first_token_ns": 382588919256100, "t_last_token_ns": 382592491429134, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "3e44df05b71a3966", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382588881307614, "t_first_token_ns": 382589173162091, "t_last_token_ns": 382592510106245, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8ee00ef0ad273d5c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382589660512707, "t_first_token_ns": 382589919211203, "t_last_token_ns": 382592910991737, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "abfce48fa7fd10f9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382590386545936, "t_first_token_ns": 382590641286113, "t_last_token_ns": 382593640080068, "prompt_tokens": 3927, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4f100560326929ab", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382590937795484, "t_first_token_ns": 382591204098999, "t_last_token_ns": 382593899241616, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "f83672c4afe7b491", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382591640783966, "t_first_token_ns": 382591907718982, "t_last_token_ns": 382594204090484, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8683501cd7f36932", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382592976712053, "t_first_token_ns": 382593237180182, "t_last_token_ns": 382595488124510, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "3ce62a1f03d2a106", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382594612744689, "t_first_token_ns": 382594871240230, "t_last_token_ns": 382597428559484, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "34360b76a6f05a1c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382594710568633, "t_first_token_ns": 382595322953692, "t_last_token_ns": 382597442699065, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "0b7dbbfb7b402e80", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382594689869866, "t_first_token_ns": 382595323250425, "t_last_token_ns": 382597442838083, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "9ca5d6660027739d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382596343868776, "t_first_token_ns": 382596603679810, "t_last_token_ns": 382598898246971, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "1fe7cbb011f5427a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382597951767532, "t_first_token_ns": 382598209244942, "t_last_token_ns": 382603111407515, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "dffdc06a4b36683c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382598258107446, "t_first_token_ns": 382598518078461, "t_last_token_ns": 382603302383564, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "caced4db24115828", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382598598969736, "t_first_token_ns": 382598862406811, "t_last_token_ns": 382603483417617, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "9b16f710bda9505d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382599369464812, "t_first_token_ns": 382599630674292, "t_last_token_ns": 382604776662918, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "cb7c10754f5ce5fe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382599734831951, "t_first_token_ns": 382599996914894, "t_last_token_ns": 382604986444551, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "88ffbcb6bc7f56b1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382599813931783, "t_first_token_ns": 382600241912454, "t_last_token_ns": 382605009405734, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "78e331e87c45a189", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382600007211604, "t_first_token_ns": 382600481649840, "t_last_token_ns": 382605020451011, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "456ef7ed125277b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382600249709097, "t_first_token_ns": 382600728664289, "t_last_token_ns": 382605039352293, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "89f55bed6ffb548a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382600807260847, "t_first_token_ns": 382601082572196, "t_last_token_ns": 382605378318623, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "3d3ab1fae05f0d2a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382600934668608, "t_first_token_ns": 382601329509161, "t_last_token_ns": 382605398328110, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "335bb26db7a2003b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382601804697441, "t_first_token_ns": 382602081057022, "t_last_token_ns": 382605703794103, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "dfbdddceda574c5e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382604138808006, "t_first_token_ns": 382604417109275, "t_last_token_ns": 382607780724734, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "59fcfa5724d8e7b3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382604463399434, "t_first_token_ns": 382604737711650, "t_last_token_ns": 382607858135168, "prompt_tokens": 4019, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "cfe7772de6d17d6b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382605027417738, "t_first_token_ns": 382605296610285, "t_last_token_ns": 382608341548119, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "59bba69a8d667e3e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382605702810796, "t_first_token_ns": 382605962106250, "t_last_token_ns": 382608761880958, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "108739472aeef7cd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382606769506495, "t_first_token_ns": 382607031901233, "t_last_token_ns": 382610251468021, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "7912548face9904f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382607431429213, "t_first_token_ns": 382607701453591, "t_last_token_ns": 382610597334965, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "9ad61f308977eb83", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382607916873006, "t_first_token_ns": 382608179456333, "t_last_token_ns": 382611038121273, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "3491e86c0fd93ddb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382608767732849, "t_first_token_ns": 382609029865383, "t_last_token_ns": 382611584171990, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "48e9daddae7ddfde", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382608784823455, "t_first_token_ns": 382609259091611, "t_last_token_ns": 382611590752625, "prompt_tokens": 3921, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "30336ab3d8b85b24", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382610658711582, "t_first_token_ns": 382610920092059, "t_last_token_ns": 382612627197968, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "9a3fd26f54c1be0a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382612185934681, "t_first_token_ns": 382612441911073, "t_last_token_ns": 382617752860247, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "6b4b03f7f400b478", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382612975456832, "t_first_token_ns": 382613236139936, "t_last_token_ns": 382619830767298, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "79c09b32b572e89b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382613126145106, "t_first_token_ns": 382613474638939, "t_last_token_ns": 382620088647314, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "96d046514314461d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382613781568005, "t_first_token_ns": 382614046072206, "t_last_token_ns": 382622324692434, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "185f1dbdea6d23c0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382613801771400, "t_first_token_ns": 382614282813567, "t_last_token_ns": 382622546584608, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "1d800c9aabc32617", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382614140534238, "t_first_token_ns": 382614528629372, "t_last_token_ns": 382622586903252, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "6c8d620872c9187d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382614303396558, "t_first_token_ns": 382614766305061, "t_last_token_ns": 382622603304503, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "48598702f53ca2f3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382614853586256, "t_first_token_ns": 382615127444049, "t_last_token_ns": 382623035928644, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "f2db9e461e72ad78", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382615047039537, "t_first_token_ns": 382615375390439, "t_last_token_ns": 382623070540173, "prompt_tokens": 4022, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "f80c2c460ec65e32", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382615112254296, "t_first_token_ns": 382615612938994, "t_last_token_ns": 382623090782483, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "ab3000c06ac776bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382616166292054, "t_first_token_ns": 382616446936306, "t_last_token_ns": 382624111749835, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "0bf5905c1e8121bd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382616359836484, "t_first_token_ns": 382616695469738, "t_last_token_ns": 382624138513963, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "1098d56d526989cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382616907328585, "t_first_token_ns": 382617184597708, "t_last_token_ns": 382624646061823, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "137d70d1f46c03a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382616918886513, "t_first_token_ns": 382617424718378, "t_last_token_ns": 382624660854127, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "cebd470b51f7fe4e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382617471888185, "t_first_token_ns": 382617753286606, "t_last_token_ns": 382624766884824, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "a151e7b5b68ad9ae", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382618510166978, "t_first_token_ns": 382618798558045, "t_last_token_ns": 382625794212452, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "4eca794d265710c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382619535415276, "t_first_token_ns": 382619817483411, "t_last_token_ns": 382626839078789, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "dff220b2b2e49d3a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382619801849885, "t_first_token_ns": 382620089116455, "t_last_token_ns": 382626884459900, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "82efcb8a8e11e6c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382619893994646, "t_first_token_ns": 382620343025535, "t_last_token_ns": 382626913631946, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "2b2b27c2bbba748c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382620522125017, "t_first_token_ns": 382620809172432, "t_last_token_ns": 382627155310471, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "22232cfe893b9166", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382620766452730, "t_first_token_ns": 382621067966093, "t_last_token_ns": 382627184013886, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "139fa79bf7516bed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382620992774617, "t_first_token_ns": 382621326474415, "t_last_token_ns": 382627206539923, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "d725af190f423c10", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382621151041603, "t_first_token_ns": 382621585834867, "t_last_token_ns": 382627229353932, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "1d3e84737b4620da", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382621484792832, "t_first_token_ns": 382621848389552, "t_last_token_ns": 382627253438175, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "2375944c4224b637", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382622250265387, "t_first_token_ns": 382622546427396, "t_last_token_ns": 382627472390105, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 22, "error": null}
|
||||||
|
{"req_id": "657154f7593762d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382622593318594, "t_first_token_ns": 382622885049345, "t_last_token_ns": 382627533520468, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "4e50ce56d0618fec", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382623287183845, "t_first_token_ns": 382623567840547, "t_last_token_ns": 382627784378581, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "cea59338a3b0a9b3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382624155490889, "t_first_token_ns": 382624441285825, "t_last_token_ns": 382628052105141, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "f4dd02509a77bddd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382625265836301, "t_first_token_ns": 382625552422283, "t_last_token_ns": 382628428544692, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "0e7beafea7584ff7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382625830855653, "t_first_token_ns": 382626107186652, "t_last_token_ns": 382628533811478, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "b97b84e328b625d3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382628923914909, "t_first_token_ns": 382629180169679, "t_last_token_ns": 382631589018567, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "651186d1dd6c15a6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382629389956720, "t_first_token_ns": 382629650650655, "t_last_token_ns": 382631952646134, "prompt_tokens": 4033, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "b6350e463220e4e6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382629904813735, "t_first_token_ns": 382630164883485, "t_last_token_ns": 382632244227486, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "358015cbb66d10de", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382630621381850, "t_first_token_ns": 382630887387303, "t_last_token_ns": 382632568227979, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "50705bd15064c6d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382632990383086, "t_first_token_ns": 382633247972576, "t_last_token_ns": 382634850918943, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "ecb80a17721044a3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382633798340499, "t_first_token_ns": 382634057916373, "t_last_token_ns": 382635439704921, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "41abb79477b47f91", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382635457515427, "t_first_token_ns": 382635713454340, "t_last_token_ns": 382638716266599, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "721ebf229a030a95", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382635574712257, "t_first_token_ns": 382635948997253, "t_last_token_ns": 382638732869989, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "3a340945700f2c58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382635941067708, "t_first_token_ns": 382636201410606, "t_last_token_ns": 382638761450073, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "2170390234b5ae2d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382636752761436, "t_first_token_ns": 382637010645268, "t_last_token_ns": 382641373829182, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "076476fea8382967", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382637633177994, "t_first_token_ns": 382637898506646, "t_last_token_ns": 382643094635091, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "ff52490f95172d67", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382638815463934, "t_first_token_ns": 382639076874831, "t_last_token_ns": 382645669974580, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "e924d3bcedd387fb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382639023211601, "t_first_token_ns": 382639318160388, "t_last_token_ns": 382645700044826, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "b877892f91ecec34", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382639276272651, "t_first_token_ns": 382639560938630, "t_last_token_ns": 382645731114800, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "1caf880a2fbf12e5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382639631702127, "t_first_token_ns": 382639900222621, "t_last_token_ns": 382645908429837, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "9a1cf60f1b871ded", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382639964919019, "t_first_token_ns": 382640237664723, "t_last_token_ns": 382646078192685, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "a95165328c12522a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382640030166019, "t_first_token_ns": 382640485104982, "t_last_token_ns": 382646106175075, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "db3cb451a123fd7b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382640359872632, "t_first_token_ns": 382640732399370, "t_last_token_ns": 382646134528413, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "ec19d55d7bc56117", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382641045919245, "t_first_token_ns": 382641320145087, "t_last_token_ns": 382647212193436, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "45f17bcf193b6eb1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382641943386257, "t_first_token_ns": 382642214367593, "t_last_token_ns": 382649410290192, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "0a1426ca17d726a2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382642118594502, "t_first_token_ns": 382642465926278, "t_last_token_ns": 382649648227693, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "85f92136d0577f5c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382642765354378, "t_first_token_ns": 382643041491261, "t_last_token_ns": 382650521986283, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "c68c8ac2cefadeda", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382643694656559, "t_first_token_ns": 382643979069890, "t_last_token_ns": 382651818444024, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "39ec2bfef2cc660c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382643819274870, "t_first_token_ns": 382644495852057, "t_last_token_ns": 382651851998171, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "5c5ae6e010163893", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382643807866370, "t_first_token_ns": 382644495740634, "t_last_token_ns": 382651852134928, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "bc830bbd57c31dfd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382643936258268, "t_first_token_ns": 382644683876827, "t_last_token_ns": 382651866994160, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "f8dfa76d3272924c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382644694178075, "t_first_token_ns": 382644981867664, "t_last_token_ns": 382652162169643, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "6a0616efb0ee2261", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382646325160423, "t_first_token_ns": 382646598800794, "t_last_token_ns": 382653847139926, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "9be8cd07b75dc5e7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382646652025676, "t_first_token_ns": 382646934746011, "t_last_token_ns": 382653967232416, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "213c753a2902d374", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382646902773735, "t_first_token_ns": 382647185543008, "t_last_token_ns": 382654222739778, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "4e27c91846d4e17d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382647562720414, "t_first_token_ns": 382647847796746, "t_last_token_ns": 382654699488115, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "1b02d1736bed2783", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382647816876012, "t_first_token_ns": 382648099191597, "t_last_token_ns": 382654728705328, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "044505f55ee8eb29", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382647844155053, "t_first_token_ns": 382648339645763, "t_last_token_ns": 382654737811643, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "9b6468d2f524b689", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382648033644327, "t_first_token_ns": 382648581315896, "t_last_token_ns": 382654751281718, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "0576ac2b072d4f41", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382648646217023, "t_first_token_ns": 382648925414862, "t_last_token_ns": 382654866453891, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "1904f89e3ffa195e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382649055063227, "t_first_token_ns": 382649334800922, "t_last_token_ns": 382655022297548, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "278330e7e052f152", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382649339549243, "t_first_token_ns": 382649632097745, "t_last_token_ns": 382655518210936, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "783382169706baf2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382649671953950, "t_first_token_ns": 382649949800213, "t_last_token_ns": 382656044365014, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "fd3f7b073d883ae7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382649771409823, "t_first_token_ns": 382650207119581, "t_last_token_ns": 382656070426675, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "16236450faab7663", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382650744705993, "t_first_token_ns": 382651026020413, "t_last_token_ns": 382656506025336, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "f702da4e623d7579", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382651057413341, "t_first_token_ns": 382651348808768, "t_last_token_ns": 382657009927202, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "d9555469adb82bd6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382651839186630, "t_first_token_ns": 382652123180589, "t_last_token_ns": 382657392297966, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "47bdac1c5663cca9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382652400437898, "t_first_token_ns": 382652680543786, "t_last_token_ns": 382657642326492, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "af1d9e8359f64baa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382653937346143, "t_first_token_ns": 382654223640193, "t_last_token_ns": 382658708662957, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "b27e960445b0e807", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382654999326504, "t_first_token_ns": 382655272338666, "t_last_token_ns": 382659741984406, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "3877cb03f6ea4711", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382655095076921, "t_first_token_ns": 382655518139966, "t_last_token_ns": 382659764392054, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "cee027f56ae5f140", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382655496234489, "t_first_token_ns": 382655767133736, "t_last_token_ns": 382659784267972, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "336f49e040184b00", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382655504658967, "t_first_token_ns": 382656007212163, "t_last_token_ns": 382659827147960, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "9cc7dafc187ac5a5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382656443549536, "t_first_token_ns": 382656720871279, "t_last_token_ns": 382660679510823, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "8363103077032c09", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382656619133639, "t_first_token_ns": 382656971953988, "t_last_token_ns": 382660699301265, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "288e93bd46cfc9ea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382659122633624, "t_first_token_ns": 382659388576647, "t_last_token_ns": 382662855694897, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "9e4a220b3cf1ed89", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382659294185950, "t_first_token_ns": 382659636713044, "t_last_token_ns": 382662876486229, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "d553f356441ca287", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382659768127486, "t_first_token_ns": 382660038759405, "t_last_token_ns": 382663006438202, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "5c8b375f370a610c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382660219887225, "t_first_token_ns": 382660487772037, "t_last_token_ns": 382663174184970, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "c8b0fc76cafef753", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382661391829786, "t_first_token_ns": 382661654846923, "t_last_token_ns": 382665778942135, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "72e7e5c0e7f2b84f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382661995811822, "t_first_token_ns": 382662268455557, "t_last_token_ns": 382666241877475, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "9e7ed7f0ec538c3f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382663416482197, "t_first_token_ns": 382663678627359, "t_last_token_ns": 382669818556992, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "ff133c5d58ab656f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382663674182007, "t_first_token_ns": 382663940191582, "t_last_token_ns": 382669878267913, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "a729629242e95560", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382663744469504, "t_first_token_ns": 382664404831691, "t_last_token_ns": 382669908251579, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "0b005fe06300469f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382663882620010, "t_first_token_ns": 382664405294993, "t_last_token_ns": 382669909157472, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "f9fef77febb3a101", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382663914995073, "t_first_token_ns": 382664630966929, "t_last_token_ns": 382669955072489, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "3a56a9b9fbaf16ef", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382664735323983, "t_first_token_ns": 382665014060740, "t_last_token_ns": 382670343961146, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "a4b8f1dd0a833ac0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382665508920092, "t_first_token_ns": 382665779666477, "t_last_token_ns": 382671239051711, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "bc1ce14acfb8f5b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382666446333308, "t_first_token_ns": 382666722183364, "t_last_token_ns": 382672080662324, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "550870cf62cf7dc8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382667459434773, "t_first_token_ns": 382667733545775, "t_last_token_ns": 382673216377615, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "2affef463ea83fb5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382667549515357, "t_first_token_ns": 382668193431746, "t_last_token_ns": 382673241015066, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "3bed9c08ecc0735e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382667594478674, "t_first_token_ns": 382668193651132, "t_last_token_ns": 382673241228350, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "aebd728fac5eb7c9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382668312966747, "t_first_token_ns": 382668803340245, "t_last_token_ns": 382673388368613, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "80bee31a1295953e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382668314082437, "t_first_token_ns": 382668803737279, "t_last_token_ns": 382673388833168, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "fc6cfd2270ab2581", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382668590269702, "t_first_token_ns": 382669054973519, "t_last_token_ns": 382673408464474, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "f4a15b0e0da21586", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382669181937394, "t_first_token_ns": 382669465704116, "t_last_token_ns": 382673506939336, "prompt_tokens": 3916, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "1ff7c31e57e00bcb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382669887634415, "t_first_token_ns": 382670164098978, "t_last_token_ns": 382673743701891, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "c1fecd97b2361f3e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382670525721204, "t_first_token_ns": 382670805068704, "t_last_token_ns": 382673920346517, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "d167f8efb80215a9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382672515196376, "t_first_token_ns": 382672788378920, "t_last_token_ns": 382674557882485, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "c52bced31247e27d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382674966056409, "t_first_token_ns": 382675219557327, "t_last_token_ns": 382679820887381, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "431a39693928917a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382675215727203, "t_first_token_ns": 382675477963765, "t_last_token_ns": 382679884781332, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "bf8d7bba7096d325", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382675468689698, "t_first_token_ns": 382675732134076, "t_last_token_ns": 382679931209303, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "abfb5bfd5d68dc8d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382675483549852, "t_first_token_ns": 382675967814585, "t_last_token_ns": 382679940307805, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "5142ba1e759d7807", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382675570390467, "t_first_token_ns": 382676205112673, "t_last_token_ns": 382679949383699, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "911dfbd9a5da2f3a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382675955694748, "t_first_token_ns": 382676450059359, "t_last_token_ns": 382679966150192, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "d4ec1ed768d3128b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382676640431240, "t_first_token_ns": 382676906275613, "t_last_token_ns": 382680152652520, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8ecca9914ec9afd1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382676821435761, "t_first_token_ns": 382677153704976, "t_last_token_ns": 382680164044629, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "94e98c79036fdaf0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382676893138484, "t_first_token_ns": 382677391558538, "t_last_token_ns": 382680168053920, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
@@ -0,0 +1,8 @@
|
|||||||
|
{
|
||||||
|
"rate": 1.5,
|
||||||
|
"input_tokens": 4096,
|
||||||
|
"output_tokens": 256,
|
||||||
|
"duration_target_s": 240.0,
|
||||||
|
"duration_actual_s": 243.1126050199964,
|
||||||
|
"n_requests": 392
|
||||||
|
}
|
||||||
File diff suppressed because it is too large
Load Diff
@@ -0,0 +1,624 @@
|
|||||||
|
# HELP python_gc_objects_collected_total Objects collected during gc
|
||||||
|
# TYPE python_gc_objects_collected_total counter
|
||||||
|
python_gc_objects_collected_total{generation="0"} 11970.0
|
||||||
|
python_gc_objects_collected_total{generation="1"} 1549.0
|
||||||
|
python_gc_objects_collected_total{generation="2"} 855.0
|
||||||
|
# HELP python_gc_objects_uncollectable_total Uncollectable objects found during GC
|
||||||
|
# TYPE python_gc_objects_uncollectable_total counter
|
||||||
|
python_gc_objects_uncollectable_total{generation="0"} 0.0
|
||||||
|
python_gc_objects_uncollectable_total{generation="1"} 0.0
|
||||||
|
python_gc_objects_uncollectable_total{generation="2"} 0.0
|
||||||
|
# HELP python_gc_collections_total Number of times this generation was collected
|
||||||
|
# TYPE python_gc_collections_total counter
|
||||||
|
python_gc_collections_total{generation="0"} 1349.0
|
||||||
|
python_gc_collections_total{generation="1"} 123.0
|
||||||
|
python_gc_collections_total{generation="2"} 9.0
|
||||||
|
# HELP python_info Python platform information
|
||||||
|
# TYPE python_info gauge
|
||||||
|
python_info{implementation="CPython",major="3",minor="12",patchlevel="3",version="3.12.3"} 1.0
|
||||||
|
# HELP process_virtual_memory_bytes Virtual memory size in bytes.
|
||||||
|
# TYPE process_virtual_memory_bytes gauge
|
||||||
|
process_virtual_memory_bytes 4.099221504e+010
|
||||||
|
# HELP process_resident_memory_bytes Resident memory size in bytes.
|
||||||
|
# TYPE process_resident_memory_bytes gauge
|
||||||
|
process_resident_memory_bytes 1.380847616e+09
|
||||||
|
# HELP process_start_time_seconds Start time of the process since unix epoch in seconds.
|
||||||
|
# TYPE process_start_time_seconds gauge
|
||||||
|
process_start_time_seconds 1.77981354873e+09
|
||||||
|
# HELP process_cpu_seconds_total Total user and system CPU time spent in seconds.
|
||||||
|
# TYPE process_cpu_seconds_total counter
|
||||||
|
process_cpu_seconds_total 41.89
|
||||||
|
# HELP process_open_fds Number of open file descriptors.
|
||||||
|
# TYPE process_open_fds gauge
|
||||||
|
process_open_fds 67.0
|
||||||
|
# HELP process_max_fds Maximum number of open file descriptors.
|
||||||
|
# TYPE process_max_fds gauge
|
||||||
|
process_max_fds 1.048575e+06
|
||||||
|
# HELP vllm:estimated_flops_per_gpu_total Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_flops_per_gpu_total counter
|
||||||
|
vllm:estimated_flops_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:estimated_flops_per_gpu_created Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_flops_per_gpu_created gauge
|
||||||
|
vllm:estimated_flops_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516264124e+09
|
||||||
|
# HELP vllm:estimated_read_bytes_per_gpu_total Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_read_bytes_per_gpu_total counter
|
||||||
|
vllm:estimated_read_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:estimated_read_bytes_per_gpu_created Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_read_bytes_per_gpu_created gauge
|
||||||
|
vllm:estimated_read_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516264498e+09
|
||||||
|
# HELP vllm:estimated_write_bytes_per_gpu_total Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_write_bytes_per_gpu_total counter
|
||||||
|
vllm:estimated_write_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:estimated_write_bytes_per_gpu_created Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_write_bytes_per_gpu_created gauge
|
||||||
|
vllm:estimated_write_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.77981365162647e+09
|
||||||
|
# HELP vllm:num_requests_running Number of requests in model execution batches.
|
||||||
|
# TYPE vllm:num_requests_running gauge
|
||||||
|
vllm:num_requests_running{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:num_requests_waiting Number of requests waiting to be processed.
|
||||||
|
# TYPE vllm:num_requests_waiting gauge
|
||||||
|
vllm:num_requests_waiting{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:engine_sleep_state Engine sleep state; awake = 0 means engine is sleeping; awake = 1 means engine is awake; weights_offloaded = 1 means sleep level 1; discard_all = 1 means sleep level 2.
|
||||||
|
# TYPE vllm:engine_sleep_state gauge
|
||||||
|
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="awake"} 1.0
|
||||||
|
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="weights_offloaded"} 0.0
|
||||||
|
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="discard_all"} 0.0
|
||||||
|
# HELP vllm:kv_cache_usage_perc KV-cache usage. 1 means 100 percent usage.
|
||||||
|
# TYPE vllm:kv_cache_usage_perc gauge
|
||||||
|
vllm:kv_cache_usage_perc{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prefix_cache_queries_total Prefix cache queries, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:prefix_cache_queries_total counter
|
||||||
|
vllm:prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:prefix_cache_queries_created Prefix cache queries, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:prefix_cache_queries_created gauge
|
||||||
|
vllm:prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516266289e+09
|
||||||
|
# HELP vllm:prefix_cache_hits_total Prefix cache hits, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:prefix_cache_hits_total counter
|
||||||
|
vllm:prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prefix_cache_hits_created Prefix cache hits, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:prefix_cache_hits_created gauge
|
||||||
|
vllm:prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516266432e+09
|
||||||
|
# HELP vllm:external_prefix_cache_queries_total External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_queries_total counter
|
||||||
|
vllm:external_prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:external_prefix_cache_queries_created External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_queries_created gauge
|
||||||
|
vllm:external_prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516266596e+09
|
||||||
|
# HELP vllm:external_prefix_cache_hits_total External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_hits_total counter
|
||||||
|
vllm:external_prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:external_prefix_cache_hits_created External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_hits_created gauge
|
||||||
|
vllm:external_prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516266723e+09
|
||||||
|
# HELP vllm:mm_cache_queries_total Multi-modal cache queries, in terms of number of queried items.
|
||||||
|
# TYPE vllm:mm_cache_queries_total counter
|
||||||
|
vllm:mm_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:mm_cache_queries_created Multi-modal cache queries, in terms of number of queried items.
|
||||||
|
# TYPE vllm:mm_cache_queries_created gauge
|
||||||
|
vllm:mm_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516266844e+09
|
||||||
|
# HELP vllm:mm_cache_hits_total Multi-modal cache hits, in terms of number of cached items.
|
||||||
|
# TYPE vllm:mm_cache_hits_total counter
|
||||||
|
vllm:mm_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:mm_cache_hits_created Multi-modal cache hits, in terms of number of cached items.
|
||||||
|
# TYPE vllm:mm_cache_hits_created gauge
|
||||||
|
vllm:mm_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516266956e+09
|
||||||
|
# HELP vllm:num_preemptions_total Cumulative number of preemption from the engine.
|
||||||
|
# TYPE vllm:num_preemptions_total counter
|
||||||
|
vllm:num_preemptions_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:num_preemptions_created Cumulative number of preemption from the engine.
|
||||||
|
# TYPE vllm:num_preemptions_created gauge
|
||||||
|
vllm:num_preemptions_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516267076e+09
|
||||||
|
# HELP vllm:prompt_tokens_total Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:prompt_tokens_total counter
|
||||||
|
vllm:prompt_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:prompt_tokens_created Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:prompt_tokens_created gauge
|
||||||
|
vllm:prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813651626718e+09
|
||||||
|
# HELP vllm:prompt_tokens_by_source_total Number of prompt tokens by source.
|
||||||
|
# TYPE vllm:prompt_tokens_by_source_total counter
|
||||||
|
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.563446e+06
|
||||||
|
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 0.0
|
||||||
|
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 0.0
|
||||||
|
# HELP vllm:prompt_tokens_by_source_created Number of prompt tokens by source.
|
||||||
|
# TYPE vllm:prompt_tokens_by_source_created gauge
|
||||||
|
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.7798136516267326e+09
|
||||||
|
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 1.7798136516267376e+09
|
||||||
|
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 1.7798136516267424e+09
|
||||||
|
# HELP vllm:prompt_tokens_cached_total Number of cached prompt tokens (local + external).
|
||||||
|
# TYPE vllm:prompt_tokens_cached_total counter
|
||||||
|
vllm:prompt_tokens_cached_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prompt_tokens_cached_created Number of cached prompt tokens (local + external).
|
||||||
|
# TYPE vllm:prompt_tokens_cached_created gauge
|
||||||
|
vllm:prompt_tokens_cached_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813651626754e+09
|
||||||
|
# HELP vllm:prompt_tokens_recomputed_total Number of cached tokens recomputed for forward pass.
|
||||||
|
# TYPE vllm:prompt_tokens_recomputed_total counter
|
||||||
|
vllm:prompt_tokens_recomputed_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prompt_tokens_recomputed_created Number of cached tokens recomputed for forward pass.
|
||||||
|
# TYPE vllm:prompt_tokens_recomputed_created gauge
|
||||||
|
vllm:prompt_tokens_recomputed_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813651626765e+09
|
||||||
|
# HELP vllm:generation_tokens_total Number of generation tokens processed.
|
||||||
|
# TYPE vllm:generation_tokens_total counter
|
||||||
|
vllm:generation_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:generation_tokens_created Number of generation tokens processed.
|
||||||
|
# TYPE vllm:generation_tokens_created gauge
|
||||||
|
vllm:generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516267776e+09
|
||||||
|
# HELP vllm:request_success_total Count of successfully processed requests.
|
||||||
|
# TYPE vllm:request_success_total counter
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:request_success_created Count of successfully processed requests.
|
||||||
|
# TYPE vllm:request_success_created gauge
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516268065e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516268144e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516268213e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516268265e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516268313e+09
|
||||||
|
# HELP vllm:request_prompt_tokens Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:request_prompt_tokens histogram
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:request_prompt_tokens_created Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:request_prompt_tokens_created gauge
|
||||||
|
vllm:request_prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813651626876e+09
|
||||||
|
# HELP vllm:request_generation_tokens Number of generation tokens processed.
|
||||||
|
# TYPE vllm:request_generation_tokens histogram
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:request_generation_tokens_created Number of generation tokens processed.
|
||||||
|
# TYPE vllm:request_generation_tokens_created gauge
|
||||||
|
vllm:request_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516269345e+09
|
||||||
|
# HELP vllm:iteration_tokens_total Histogram of number of tokens per engine_step.
|
||||||
|
# TYPE vllm:iteration_tokens_total histogram
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3027.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="8.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 10335.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="16.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 14359.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="32.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15043.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="64.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15043.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="128.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15043.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="256.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15043.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="512.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15043.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="1024.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15043.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="2048.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15043.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="4096.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15389.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="8192.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15412.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="16384.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15412.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15412.0
|
||||||
|
vllm:iteration_tokens_total_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15412.0
|
||||||
|
vllm:iteration_tokens_total_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.663798e+06
|
||||||
|
# HELP vllm:iteration_tokens_total_created Histogram of number of tokens per engine_step.
|
||||||
|
# TYPE vllm:iteration_tokens_total_created gauge
|
||||||
|
vllm:iteration_tokens_total_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516270025e+09
|
||||||
|
# HELP vllm:request_max_num_generation_tokens Histogram of maximum number of requested generation tokens.
|
||||||
|
# TYPE vllm:request_max_num_generation_tokens histogram
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:request_max_num_generation_tokens_created Histogram of maximum number of requested generation tokens.
|
||||||
|
# TYPE vllm:request_max_num_generation_tokens_created gauge
|
||||||
|
vllm:request_max_num_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516270401e+09
|
||||||
|
# HELP vllm:request_params_n Histogram of the n request parameter.
|
||||||
|
# TYPE vllm:request_params_n histogram
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
# HELP vllm:request_params_n_created Histogram of the n request parameter.
|
||||||
|
# TYPE vllm:request_params_n_created gauge
|
||||||
|
vllm:request_params_n_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813651627073e+09
|
||||||
|
# HELP vllm:request_params_max_tokens Histogram of the max_tokens request parameter.
|
||||||
|
# TYPE vllm:request_params_max_tokens histogram
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:request_params_max_tokens_created Histogram of the max_tokens request parameter.
|
||||||
|
# TYPE vllm:request_params_max_tokens_created gauge
|
||||||
|
vllm:request_params_max_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813651627101e+09
|
||||||
|
# HELP vllm:time_to_first_token_seconds Histogram of time to first token in seconds.
|
||||||
|
# TYPE vllm:time_to_first_token_seconds histogram
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.001",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.005",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.02",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.04",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.06",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.08",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.25",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 13.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 335.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 385.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 391.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="160.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="640.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="2560.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 139.64915657043457
|
||||||
|
# HELP vllm:time_to_first_token_seconds_created Histogram of time to first token in seconds.
|
||||||
|
# TYPE vllm:time_to_first_token_seconds_created gauge
|
||||||
|
vllm:time_to_first_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516271358e+09
|
||||||
|
# HELP vllm:inter_token_latency_seconds Histogram of inter-token latency in seconds.
|
||||||
|
# TYPE vllm:inter_token_latency_seconds histogram
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 23661.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 93949.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96702.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96748.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96748.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96748.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96904.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99760.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99760.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2037.7261010148213
|
||||||
|
# HELP vllm:inter_token_latency_seconds_created Histogram of inter-token latency in seconds.
|
||||||
|
# TYPE vllm:inter_token_latency_seconds_created gauge
|
||||||
|
vllm:inter_token_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516271784e+09
|
||||||
|
# HELP vllm:request_time_per_output_token_seconds Histogram of time_per_output_token_seconds per request.
|
||||||
|
# TYPE vllm:request_time_per_output_token_seconds histogram
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 32.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 277.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 7.991082749077725
|
||||||
|
# HELP vllm:request_time_per_output_token_seconds_created Histogram of time_per_output_token_seconds per request.
|
||||||
|
# TYPE vllm:request_time_per_output_token_seconds_created gauge
|
||||||
|
vllm:request_time_per_output_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516272125e+09
|
||||||
|
# HELP vllm:e2e_request_latency_seconds Histogram of e2e request latency in seconds.
|
||||||
|
# TYPE vllm:e2e_request_latency_seconds histogram
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 11.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 165.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2177.253833055496
|
||||||
|
# HELP vllm:e2e_request_latency_seconds_created Histogram of e2e request latency in seconds.
|
||||||
|
# TYPE vllm:e2e_request_latency_seconds_created gauge
|
||||||
|
vllm:e2e_request_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779813651627295e+09
|
||||||
|
# HELP vllm:request_queue_time_seconds Histogram of time spent in WAITING phase for request.
|
||||||
|
# TYPE vllm:request_queue_time_seconds histogram
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0029381027561612427
|
||||||
|
# HELP vllm:request_queue_time_seconds_created Histogram of time spent in WAITING phase for request.
|
||||||
|
# TYPE vllm:request_queue_time_seconds_created gauge
|
||||||
|
vllm:request_queue_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516273322e+09
|
||||||
|
# HELP vllm:request_inference_time_seconds Histogram of time spent in RUNNING phase for request.
|
||||||
|
# TYPE vllm:request_inference_time_seconds histogram
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 12.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 18.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 168.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2156.0547709063394
|
||||||
|
# HELP vllm:request_inference_time_seconds_created Histogram of time spent in RUNNING phase for request.
|
||||||
|
# TYPE vllm:request_inference_time_seconds_created gauge
|
||||||
|
vllm:request_inference_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516273665e+09
|
||||||
|
# HELP vllm:request_prefill_time_seconds Histogram of time spent in PREFILL phase for request.
|
||||||
|
# TYPE vllm:request_prefill_time_seconds histogram
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 305.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 379.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 118.32866989151808
|
||||||
|
# HELP vllm:request_prefill_time_seconds_created Histogram of time spent in PREFILL phase for request.
|
||||||
|
# TYPE vllm:request_prefill_time_seconds_created gauge
|
||||||
|
vllm:request_prefill_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516274056e+09
|
||||||
|
# HELP vllm:request_decode_time_seconds Histogram of time spent in DECODE phase for request.
|
||||||
|
# TYPE vllm:request_decode_time_seconds histogram
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 6.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 14.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 31.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 189.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2037.7261010148213
|
||||||
|
# HELP vllm:request_decode_time_seconds_created Histogram of time spent in DECODE phase for request.
|
||||||
|
# TYPE vllm:request_decode_time_seconds_created gauge
|
||||||
|
vllm:request_decode_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516274517e+09
|
||||||
|
# HELP vllm:request_prefill_kv_computed_tokens Histogram of new KV tokens computed during prefill (excluding cached tokens).
|
||||||
|
# TYPE vllm:request_prefill_kv_computed_tokens histogram
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:request_prefill_kv_computed_tokens_created Histogram of new KV tokens computed during prefill (excluding cached tokens).
|
||||||
|
# TYPE vllm:request_prefill_kv_computed_tokens_created gauge
|
||||||
|
vllm:request_prefill_kv_computed_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798136516274922e+09
|
||||||
|
# HELP vllm:cache_config_info Information of the LLMEngine CacheConfig
|
||||||
|
# TYPE vllm:cache_config_info gauge
|
||||||
|
vllm:cache_config_info{_block_size_resolved="True",block_size="16",cache_dtype="auto",calculate_kv_scales="False",cpu_kvcache_space_bytes="None",enable_prefix_caching="True",engine="0",gpu_memory_utilization="0.9",is_attention_free="False",kv_cache_memory_bytes="None",kv_offloading_backend="native",kv_offloading_size="None",kv_sharing_fast_prefill="False",mamba_block_size="None",mamba_cache_dtype="auto",mamba_cache_mode="none",mamba_page_size_padded="None",mamba_ssm_cache_dtype="auto",num_cpu_blocks="None",num_gpu_blocks="17590",num_gpu_blocks_override="None",prefix_caching_hash_algo="sha256",sliding_window="None",user_specified_block_size="False"} 1.0
|
||||||
|
# HELP http_requests_total Total number of requests by method, status and handler.
|
||||||
|
# TYPE http_requests_total counter
|
||||||
|
http_requests_total{handler="/v1/models",method="GET",status="2xx"} 1.0
|
||||||
|
http_requests_total{handler="/v1/chat/completions",method="POST",status="2xx"} 392.0
|
||||||
|
# HELP http_requests_created Total number of requests by method, status and handler.
|
||||||
|
# TYPE http_requests_created gauge
|
||||||
|
http_requests_created{handler="/v1/models",method="GET",status="2xx"} 1.7798136534283202e+09
|
||||||
|
http_requests_created{handler="/v1/chat/completions",method="POST",status="2xx"} 1.7798136621262043e+09
|
||||||
|
# HELP http_request_size_bytes Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_request_size_bytes summary
|
||||||
|
http_request_size_bytes_count{handler="/v1/models"} 1.0
|
||||||
|
http_request_size_bytes_sum{handler="/v1/models"} 0.0
|
||||||
|
http_request_size_bytes_count{handler="/v1/chat/completions"} 392.0
|
||||||
|
http_request_size_bytes_sum{handler="/v1/chat/completions"} 2.067408e+06
|
||||||
|
# HELP http_request_size_bytes_created Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_request_size_bytes_created gauge
|
||||||
|
http_request_size_bytes_created{handler="/v1/models"} 1.779813653428346e+09
|
||||||
|
http_request_size_bytes_created{handler="/v1/chat/completions"} 1.7798136621262243e+09
|
||||||
|
# HELP http_response_size_bytes Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_response_size_bytes summary
|
||||||
|
http_response_size_bytes_count{handler="/v1/models"} 1.0
|
||||||
|
http_response_size_bytes_sum{handler="/v1/models"} 558.0
|
||||||
|
http_response_size_bytes_count{handler="/v1/chat/completions"} 392.0
|
||||||
|
http_response_size_bytes_sum{handler="/v1/chat/completions"} 0.0
|
||||||
|
# HELP http_response_size_bytes_created Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_response_size_bytes_created gauge
|
||||||
|
http_response_size_bytes_created{handler="/v1/models"} 1.7798136534283705e+09
|
||||||
|
http_response_size_bytes_created{handler="/v1/chat/completions"} 1.7798136621262496e+09
|
||||||
|
# HELP http_request_duration_highr_seconds Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
|
||||||
|
# TYPE http_request_duration_highr_seconds histogram
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.01"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.025"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.05"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.075"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.1"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.25"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.5"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.75"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="1.0"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="1.5"} 3.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="2.0"} 12.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="2.5"} 18.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="3.0"} 38.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="3.5"} 59.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="4.0"} 92.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="4.5"} 134.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="5.0"} 166.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="7.5"} 317.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="10.0"} 393.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="30.0"} 393.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="60.0"} 393.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="+Inf"} 393.0
|
||||||
|
http_request_duration_highr_seconds_count 393.0
|
||||||
|
http_request_duration_highr_seconds_sum 2177.854818835389
|
||||||
|
# HELP http_request_duration_highr_seconds_created Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
|
||||||
|
# TYPE http_request_duration_highr_seconds_created gauge
|
||||||
|
http_request_duration_highr_seconds_created 1.779813652123545e+09
|
||||||
|
# HELP http_request_duration_seconds Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
|
||||||
|
# TYPE http_request_duration_seconds histogram
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="0.1",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="0.5",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="1.0",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="+Inf",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_count{handler="/v1/models",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_sum{handler="/v1/models",method="GET"} 0.0021718639764003456
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.1",method="POST"} 0.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.5",method="POST"} 0.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="1.0",method="POST"} 0.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="+Inf",method="POST"} 392.0
|
||||||
|
http_request_duration_seconds_count{handler="/v1/chat/completions",method="POST"} 392.0
|
||||||
|
http_request_duration_seconds_sum{handler="/v1/chat/completions",method="POST"} 2177.8526469714125
|
||||||
|
# HELP http_request_duration_seconds_created Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
|
||||||
|
# TYPE http_request_duration_seconds_created gauge
|
||||||
|
http_request_duration_seconds_created{handler="/v1/models",method="GET"} 1.7798136534284024e+09
|
||||||
|
http_request_duration_seconds_created{handler="/v1/chat/completions",method="POST"} 1.7798136621262796e+09
|
||||||
@@ -0,0 +1,392 @@
|
|||||||
|
{"req_id": "f9f8f44fd4964565", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382789290926849, "t_first_token_ns": 382790558826455, "t_last_token_ns": 382797113514317, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "36bbb6c17a024be2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382789659901922, "t_first_token_ns": 382790562489439, "t_last_token_ns": 382797128313029, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "accbc730595b4e03", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382789652047816, "t_first_token_ns": 382790562628888, "t_last_token_ns": 382797128921692, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "95ac8f889eb69b61", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382790823166204, "t_first_token_ns": 382791088097622, "t_last_token_ns": 382797769087654, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "70cae603829190e5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382791061474334, "t_first_token_ns": 382791331557838, "t_last_token_ns": 382797799038906, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "2769a82161526fa2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382791368904865, "t_first_token_ns": 382791639708760, "t_last_token_ns": 382797915973240, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "7a3ae284fd025e54", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382791513603042, "t_first_token_ns": 382791884362823, "t_last_token_ns": 382797945325656, "prompt_tokens": 3951, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8c7ebece6fc8ef96", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382792070360683, "t_first_token_ns": 382792333281514, "t_last_token_ns": 382798220523639, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8a6e9ed8388bab88", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382792188423536, "t_first_token_ns": 382792837925726, "t_last_token_ns": 382798245247200, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "291192528bfcd0d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382792278152712, "t_first_token_ns": 382792838878037, "t_last_token_ns": 382798245386265, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "29aa5369f15ef8ed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382792657735480, "t_first_token_ns": 382793036280094, "t_last_token_ns": 382798257351743, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "7d168f7f34914df8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382793209071934, "t_first_token_ns": 382793475213444, "t_last_token_ns": 382798419759324, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "5949b30b5660db44", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382793337546159, "t_first_token_ns": 382793727514866, "t_last_token_ns": 382798439961212, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "dee2814e53c1133c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382793875482551, "t_first_token_ns": 382794149403210, "t_last_token_ns": 382798557234662, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "d3d6d687b1270d0c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382794168792434, "t_first_token_ns": 382794449869425, "t_last_token_ns": 382798593462406, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "0114c7c017f510dc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382796283672313, "t_first_token_ns": 382796567992768, "t_last_token_ns": 382800456325397, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "6b23ad60e8365056", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382796348717041, "t_first_token_ns": 382796826715112, "t_last_token_ns": 382800476852194, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "c147b43b07e42731", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382798913190946, "t_first_token_ns": 382799167265892, "t_last_token_ns": 382803660590230, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "d6e4d018b0cbaea9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382799268533212, "t_first_token_ns": 382799527223812, "t_last_token_ns": 382804095615169, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "6a3c0ae558f764fa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382799736444187, "t_first_token_ns": 382799996528165, "t_last_token_ns": 382804458206793, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "6f4afc5b3b7d8068", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382799843744360, "t_first_token_ns": 382800241058383, "t_last_token_ns": 382804483541758, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "e2a3bc5ca5dd8e06", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382800690287000, "t_first_token_ns": 382800949330830, "t_last_token_ns": 382805021322275, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "750b6e6f651a9619", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382800831527654, "t_first_token_ns": 382801196613852, "t_last_token_ns": 382805042270103, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "088884483a9971d8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382801111099661, "t_first_token_ns": 382801659580614, "t_last_token_ns": 382805058217742, "prompt_tokens": 4025, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "318a0f6dc8a5806d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382801128030199, "t_first_token_ns": 382801659764792, "t_last_token_ns": 382805058354556, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "60fa44dd296e868f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382801405238455, "t_first_token_ns": 382801906540087, "t_last_token_ns": 382805069893089, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "be5160126f3a14e8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382803689801416, "t_first_token_ns": 382803961955550, "t_last_token_ns": 382805850433970, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "31994709d9d43ecc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382806259759968, "t_first_token_ns": 382806513529094, "t_last_token_ns": 382808568722584, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "ebb3f3698c984adb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382807170612926, "t_first_token_ns": 382807425733023, "t_last_token_ns": 382809699885930, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "a0f7008353f8beb2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382807174114959, "t_first_token_ns": 382807660007935, "t_last_token_ns": 382809705901083, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "7eac0cde0fd34276", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382809053268435, "t_first_token_ns": 382809309839001, "t_last_token_ns": 382811034295669, "prompt_tokens": 4019, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "95601ae65eae9166", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382810418224589, "t_first_token_ns": 382810671017246, "t_last_token_ns": 382813683360459, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "189d08c7b9ae2624", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382811401411411, "t_first_token_ns": 382811657826254, "t_last_token_ns": 382815978585583, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "607beffbb7acca39", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382811533980256, "t_first_token_ns": 382812116764826, "t_last_token_ns": 382816003041668, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "93918e85acdf8b5d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382811604869353, "t_first_token_ns": 382812116876489, "t_last_token_ns": 382816003440515, "prompt_tokens": 4052, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "de6c910bd8be862f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382811962671979, "t_first_token_ns": 382812360247384, "t_last_token_ns": 382816021579516, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "776a9724799617aa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382813408348118, "t_first_token_ns": 382813675025705, "t_last_token_ns": 382817366613815, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "b6357510a726d24b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382813984655364, "t_first_token_ns": 382814246694913, "t_last_token_ns": 382817935981686, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "7814350e435d2264", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382814874412902, "t_first_token_ns": 382815141842094, "t_last_token_ns": 382818624922719, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "c354ffa98aea604a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382815052261057, "t_first_token_ns": 382815392116621, "t_last_token_ns": 382818641708933, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "b95654a80317948f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382815547784624, "t_first_token_ns": 382815822622438, "t_last_token_ns": 382819270756340, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "d3983c9d6e42e3df", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382816371960659, "t_first_token_ns": 382816637192064, "t_last_token_ns": 382819734625647, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "f12e1b2630a2dfcc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382817535729606, "t_first_token_ns": 382817804214925, "t_last_token_ns": 382821032681629, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "26aa2698c0cddf5a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382818633148946, "t_first_token_ns": 382818894811465, "t_last_token_ns": 382822484649758, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "1dd5dcf6889436bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382818810752574, "t_first_token_ns": 382819139973477, "t_last_token_ns": 382822501695039, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "55f20a37c1e89743", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382820188413880, "t_first_token_ns": 382820443229664, "t_last_token_ns": 382824384135386, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "e013af03a51488e2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382820351769226, "t_first_token_ns": 382820686629031, "t_last_token_ns": 382824402041904, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4fefe92b23621583", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382821431689779, "t_first_token_ns": 382821690537145, "t_last_token_ns": 382825296434094, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "5af9d220c4dd9318", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382821972204625, "t_first_token_ns": 382822242915938, "t_last_token_ns": 382826325701865, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "a3ff67ad251967cf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382822111239665, "t_first_token_ns": 382822485582985, "t_last_token_ns": 382826345015943, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "c9258f74cc827511", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382822704557309, "t_first_token_ns": 382822968803272, "t_last_token_ns": 382826603860618, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "140eb88bb615ba37", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382823192398108, "t_first_token_ns": 382823466335994, "t_last_token_ns": 382826833236104, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "d4a164b2ff3b3509", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382825316782113, "t_first_token_ns": 382825585321098, "t_last_token_ns": 382828175467784, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "9f1266f29a9e8b28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382825345530509, "t_first_token_ns": 382825822792095, "t_last_token_ns": 382828181950839, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "57c78bedb1e5433a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382825466019551, "t_first_token_ns": 382826057756975, "t_last_token_ns": 382828185947405, "prompt_tokens": 3957, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "08ff3938b9e3ae71", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382828194709571, "t_first_token_ns": 382828446723843, "t_last_token_ns": 382832162465529, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "fc976d1e97a41cca", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382829385684628, "t_first_token_ns": 382829646522416, "t_last_token_ns": 382834471686551, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "6c45ae3b99de1e62", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382829495127172, "t_first_token_ns": 382829927358868, "t_last_token_ns": 382834495333020, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "417d200da6fc7d82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382829669860793, "t_first_token_ns": 382830127234785, "t_last_token_ns": 382834509541573, "prompt_tokens": 4042, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "2b45e140974d73b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382830186691755, "t_first_token_ns": 382830445762600, "t_last_token_ns": 382834622035412, "prompt_tokens": 3948, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4809e923113cfa22", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382830302831191, "t_first_token_ns": 382830694144423, "t_last_token_ns": 382834642051937, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "1baef27c84916aa0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382830564505214, "t_first_token_ns": 382831154694683, "t_last_token_ns": 382834659162390, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "b80243b2ea980e07", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382830599299279, "t_first_token_ns": 382831155127199, "t_last_token_ns": 382834659294973, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "efd5183f824026cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382831429833827, "t_first_token_ns": 382831745920808, "t_last_token_ns": 382834842531079, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "df3338f95a55a940", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382831484728095, "t_first_token_ns": 382831939656083, "t_last_token_ns": 382834846800173, "prompt_tokens": 3921, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "973d64287adcd8a7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382834967519234, "t_first_token_ns": 382835214621912, "t_last_token_ns": 382836480235337, "prompt_tokens": 3934, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "19674684ced3de91", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382836657139633, "t_first_token_ns": 382836908283705, "t_last_token_ns": 382838455104068, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "e6fa68dd3de56239", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382837694960949, "t_first_token_ns": 382837953336634, "t_last_token_ns": 382840239626082, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "ec06f5beb70436d9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382838682780619, "t_first_token_ns": 382838942724095, "t_last_token_ns": 382842521760568, "prompt_tokens": 4040, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "034f1151aa2d2bdd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382838989121879, "t_first_token_ns": 382839249141767, "t_last_token_ns": 382842673043655, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "d156973f3cae174d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382839746547888, "t_first_token_ns": 382840006204111, "t_last_token_ns": 382843960946040, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "c2a0107acab59bd0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382840701983439, "t_first_token_ns": 382840962872878, "t_last_token_ns": 382845132955301, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "25faaab088637291", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382841139347213, "t_first_token_ns": 382841400700080, "t_last_token_ns": 382845897226650, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4a6ff2135636c100", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382841182667230, "t_first_token_ns": 382841638190389, "t_last_token_ns": 382845909872365, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "70f1b6f6bad6005a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382841838766456, "t_first_token_ns": 382842109544859, "t_last_token_ns": 382846163150469, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "485e723c2b84eb6b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382842102247520, "t_first_token_ns": 382842370068440, "t_last_token_ns": 382846190836304, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "6be120b83e225aa3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382842731351181, "t_first_token_ns": 382843002689497, "t_last_token_ns": 382846744717830, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "291a0df34911423c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382843095908125, "t_first_token_ns": 382843368685031, "t_last_token_ns": 382846834854612, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "9a771254d6418297", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382845128062915, "t_first_token_ns": 382845391475443, "t_last_token_ns": 382849185873329, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "3d63e1d817747a07", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382845278716614, "t_first_token_ns": 382845640933297, "t_last_token_ns": 382849204892880, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "a85143fddaf4a5d9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382846256383535, "t_first_token_ns": 382846514122606, "t_last_token_ns": 382849833534887, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "ed998b60c04d0117", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382847056406245, "t_first_token_ns": 382847317163260, "t_last_token_ns": 382852798088504, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "10c050e2cad99a17", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382847598877988, "t_first_token_ns": 382847855290527, "t_last_token_ns": 382853622632361, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "03bc3ccba18fc900", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382847655097289, "t_first_token_ns": 382848319845905, "t_last_token_ns": 382853650665548, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "6ca800f53f978a58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382847776227657, "t_first_token_ns": 382848320038841, "t_last_token_ns": 382853651094626, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "0d6ebbadc6b6f44a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382850056584059, "t_first_token_ns": 382850323034071, "t_last_token_ns": 382857353686173, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "2225316b65c25005", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382850232294933, "t_first_token_ns": 382850571523701, "t_last_token_ns": 382857383430636, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "48f171c96419c86f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382850351711103, "t_first_token_ns": 382850817436479, "t_last_token_ns": 382857412103847, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "53e2a3f9153c46d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382850579065344, "t_first_token_ns": 382851053333340, "t_last_token_ns": 382857426155741, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8f000c5b2b8ac58e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382851094382993, "t_first_token_ns": 382851369785577, "t_last_token_ns": 382857537754515, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "601066832f30b428", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382851584319567, "t_first_token_ns": 382852126037728, "t_last_token_ns": 382857864315096, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "d417d380a4f40330", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382851587673332, "t_first_token_ns": 382852126146415, "t_last_token_ns": 382857864450989, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "bf6ab969a046d65b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382851590294191, "t_first_token_ns": 382852315781915, "t_last_token_ns": 382857875415449, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "d23fe5d3ae909682", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382852048384761, "t_first_token_ns": 382852556150704, "t_last_token_ns": 382857887345708, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "08d7416f17740d2e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382852186903555, "t_first_token_ns": 382852797844212, "t_last_token_ns": 382857898692126, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "f63c4d00aca7e8d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382852810963301, "t_first_token_ns": 382853090070347, "t_last_token_ns": 382858179268374, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "72e028e252e47e71", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382853879518176, "t_first_token_ns": 382854153977313, "t_last_token_ns": 382859251336870, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "90e18fcccb2043f7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382853947436690, "t_first_token_ns": 382854404877233, "t_last_token_ns": 382859274451727, "prompt_tokens": 3956, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "7e9fdd59228f0876", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382854417145965, "t_first_token_ns": 382854689344913, "t_last_token_ns": 382859311898867, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "245bf7e5865d03d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382854765184032, "t_first_token_ns": 382855038787458, "t_last_token_ns": 382859389368721, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "2033b1bb6c32c79a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382855809036268, "t_first_token_ns": 382856084747979, "t_last_token_ns": 382859808046805, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "0f0a89285f54e2c1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382857873857979, "t_first_token_ns": 382858146228833, "t_last_token_ns": 382864462319266, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "da6f35b293578e17", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382858233532283, "t_first_token_ns": 382858500337935, "t_last_token_ns": 382864641646276, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "6c160ab03b66d81b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382858420458286, "t_first_token_ns": 382858748307913, "t_last_token_ns": 382864670257950, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "acad4eb8b9a0e9f6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382859826767372, "t_first_token_ns": 382860085724367, "t_last_token_ns": 382868681541925, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "502a12aa8ba51234", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382859846364510, "t_first_token_ns": 382860321203574, "t_last_token_ns": 382868703310069, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "c0e0a7d3c8cf4b95", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382860114160506, "t_first_token_ns": 382860565320975, "t_last_token_ns": 382868745107387, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "541738310001dd1c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382860450336588, "t_first_token_ns": 382860809247681, "t_last_token_ns": 382868779420476, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "525463c05dc42958", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382860582042794, "t_first_token_ns": 382861044115067, "t_last_token_ns": 382868797192942, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "3926bdb966d6e395", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382861341888226, "t_first_token_ns": 382861603737627, "t_last_token_ns": 382869576571637, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "586a0d3540151470", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382861830424897, "t_first_token_ns": 382862101157718, "t_last_token_ns": 382869981021959, "prompt_tokens": 3929, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "37c5cdb56477a163", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382862022501733, "t_first_token_ns": 382862351891878, "t_last_token_ns": 382870009359321, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "5947be4f56ddbab5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382862378986939, "t_first_token_ns": 382862658213482, "t_last_token_ns": 382870329157278, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "2fe62cb82ee987a4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382863164667762, "t_first_token_ns": 382863444443567, "t_last_token_ns": 382871912625813, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "8c278419850c8b5e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382863427535361, "t_first_token_ns": 382863699218992, "t_last_token_ns": 382871946695232, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "021ff28eaacd14c5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382863475128757, "t_first_token_ns": 382863951960108, "t_last_token_ns": 382871981995583, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "faaf800e58c8f85f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382863887390103, "t_first_token_ns": 382864207729489, "t_last_token_ns": 382872042538759, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "744f358ad7f63e97", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382864032910246, "t_first_token_ns": 382864462787591, "t_last_token_ns": 382872267709139, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "927918347bcc40c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382864681727723, "t_first_token_ns": 382864958793520, "t_last_token_ns": 382872558081539, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "2fcef28ebf707351", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382864833150572, "t_first_token_ns": 382865425357745, "t_last_token_ns": 382872589580641, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "fe233f2c57e680fd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382864930557673, "t_first_token_ns": 382865425535074, "t_last_token_ns": 382872589739215, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "b95acf9eb3705d28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382866427825604, "t_first_token_ns": 382866709270487, "t_last_token_ns": 382874684576669, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "96b58f314a838835", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382866696755497, "t_first_token_ns": 382866986395403, "t_last_token_ns": 382874726808228, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "08456882f8d362d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382866928601737, "t_first_token_ns": 382867245428560, "t_last_token_ns": 382874757221468, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "4c8bd15a715f7720", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382867444993419, "t_first_token_ns": 382867733050775, "t_last_token_ns": 382875177832988, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "6949ec56d00ce8b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382868234126538, "t_first_token_ns": 382868517342679, "t_last_token_ns": 382875833519236, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 22, "error": null}
|
||||||
|
{"req_id": "0fce2da54ec51dfd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382868976012250, "t_first_token_ns": 382869250331927, "t_last_token_ns": 382876950728798, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "4238f6957ae36e57", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382870044017810, "t_first_token_ns": 382870328568643, "t_last_token_ns": 382878891622425, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "943ff34258d40d28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382870245262995, "t_first_token_ns": 382870582273142, "t_last_token_ns": 382878932090192, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "dfca62202759aa8a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382870618200019, "t_first_token_ns": 382870893342036, "t_last_token_ns": 382879030975977, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "4397f621098474f5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382870920664483, "t_first_token_ns": 382871209958417, "t_last_token_ns": 382879111894670, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "beebd08c7cc682ce", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382871407256036, "t_first_token_ns": 382871692238206, "t_last_token_ns": 382879382380843, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "7c168a9454ca5620", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382871976699659, "t_first_token_ns": 382872254268322, "t_last_token_ns": 382879676817984, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "affa6c57b9777216", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382872638803711, "t_first_token_ns": 382872919300104, "t_last_token_ns": 382880330872448, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "e5ffcb3c7f664dc7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382872686892734, "t_first_token_ns": 382873160637446, "t_last_token_ns": 382880344519928, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "a024c6b1441ebbd3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382872996907515, "t_first_token_ns": 382873414089019, "t_last_token_ns": 382880373013069, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "17af21e67df524a9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382873819669867, "t_first_token_ns": 382874099082834, "t_last_token_ns": 382880831950343, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "80ff547317c08b98", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382873844863211, "t_first_token_ns": 382874340211893, "t_last_token_ns": 382881051582722, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "70a5b87cfadecff6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382874759726277, "t_first_token_ns": 382875042630169, "t_last_token_ns": 382881481649108, "prompt_tokens": 3945, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "843c0165f72929a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382875532937845, "t_first_token_ns": 382875819586615, "t_last_token_ns": 382882184894285, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "73b4128afce8e243", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382875876473994, "t_first_token_ns": 382876155872298, "t_last_token_ns": 382882285464114, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "205e89d222fdda00", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382876274302126, "t_first_token_ns": 382876549749715, "t_last_token_ns": 382882445452734, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "7117082172557ca5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382876479204118, "t_first_token_ns": 382876805513213, "t_last_token_ns": 382882470129563, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "7cc19a30a453f5f4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382877371598529, "t_first_token_ns": 382877647353373, "t_last_token_ns": 382882945802699, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "8ba8086b8eec10d6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382877808717969, "t_first_token_ns": 382878099388879, "t_last_token_ns": 382883089311819, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "6b0f100f2f9945ac", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382878014780380, "t_first_token_ns": 382878570885120, "t_last_token_ns": 382883109319860, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "ec1a4c4a2ea7ed12", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382878036339967, "t_first_token_ns": 382878570459954, "t_last_token_ns": 382883109598159, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "768828307182d0ad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382878319282993, "t_first_token_ns": 382878832080274, "t_last_token_ns": 382883126279116, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "034e749c721e456f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382880011988799, "t_first_token_ns": 382880291437318, "t_last_token_ns": 382885530837368, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "73a8827fa8847e13", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382880764408758, "t_first_token_ns": 382881051926191, "t_last_token_ns": 382885967259409, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "bbcdf2009f998102", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382881419700884, "t_first_token_ns": 382881700250856, "t_last_token_ns": 382886291410058, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "9fa8ae1957c0f1d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382883231057982, "t_first_token_ns": 382883485881853, "t_last_token_ns": 382888255739473, "prompt_tokens": 3937, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "b11891ac95fc3b64", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382883252874937, "t_first_token_ns": 382883719648085, "t_last_token_ns": 382888267939834, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "59a9a00a0d3a1563", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382883868184332, "t_first_token_ns": 382884139144747, "t_last_token_ns": 382889186590779, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "d8bfc0767d724848", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382884007352515, "t_first_token_ns": 382884379224633, "t_last_token_ns": 382889207548184, "prompt_tokens": 3937, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "345d39ae3777d415", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382884345576010, "t_first_token_ns": 382884627512514, "t_last_token_ns": 382889227382063, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "9866cbc828a4fca8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382884480501887, "t_first_token_ns": 382884876230583, "t_last_token_ns": 382889249997835, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "af397ebf951c7719", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382884767739861, "t_first_token_ns": 382885124472338, "t_last_token_ns": 382889268536602, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "65c0b4265eb5dd19", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382886313573221, "t_first_token_ns": 382886588229127, "t_last_token_ns": 382891715820424, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "89dffa88c862c5b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382887176437803, "t_first_token_ns": 382887448513348, "t_last_token_ns": 382894077900115, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "dc476408b33fbf5a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382888425178525, "t_first_token_ns": 382888699968102, "t_last_token_ns": 382897116774074, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "319a82c9b4681316", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382888478107074, "t_first_token_ns": 382888948423442, "t_last_token_ns": 382897157363767, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "71d7c157bf1ea669", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382888717688914, "t_first_token_ns": 382889186312932, "t_last_token_ns": 382897178943570, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "2cc13ac9d11806a5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382889740622932, "t_first_token_ns": 382890009976557, "t_last_token_ns": 382898796692701, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "129148915a429e01", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382889768950727, "t_first_token_ns": 382890245592340, "t_last_token_ns": 382898817573117, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "df7587e4a3d50532", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382889910351533, "t_first_token_ns": 382890693590475, "t_last_token_ns": 382898834124575, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8b0e7810ec937f82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382889967753775, "t_first_token_ns": 382890694173331, "t_last_token_ns": 382898835455142, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "fb520fc29874b954", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382891102870746, "t_first_token_ns": 382891370429992, "t_last_token_ns": 382900485553370, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "dd803dc1b0f0a741", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382891279933439, "t_first_token_ns": 382891622900966, "t_last_token_ns": 382900527320519, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "93327e2e45a9ea52", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382891648864921, "t_first_token_ns": 382892142430809, "t_last_token_ns": 382900630552894, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "f37f16b87565dc53", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382891660188365, "t_first_token_ns": 382892142544191, "t_last_token_ns": 382900630704880, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "3bcfa41060f2dd80", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382891741142874, "t_first_token_ns": 382892392763773, "t_last_token_ns": 382900662051752, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "5badec87742c1dc2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382892222558392, "t_first_token_ns": 382892645617021, "t_last_token_ns": 382900691779223, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "d1e7b052e89f38f0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382893267995348, "t_first_token_ns": 382893537558856, "t_last_token_ns": 382902110448308, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "36a41522c4af841a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382893547106929, "t_first_token_ns": 382893825258694, "t_last_token_ns": 382902183264392, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "56afd7a006446ce0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382893804824704, "t_first_token_ns": 382894077626259, "t_last_token_ns": 382902210704832, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "1ec26802903d0510", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382893833029795, "t_first_token_ns": 382894315324961, "t_last_token_ns": 382902223880583, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "16ab6647bde110e0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382894011275587, "t_first_token_ns": 382894769095241, "t_last_token_ns": 382902239042630, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "a642c8cc74bcd308", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382893903766030, "t_first_token_ns": 382894769530699, "t_last_token_ns": 382902239612941, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "ab5eaa452796e93b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382894166873960, "t_first_token_ns": 382895011250802, "t_last_token_ns": 382902253518626, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "57388896a658fda9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382894731523174, "t_first_token_ns": 382895254177278, "t_last_token_ns": 382902267079113, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "74e18421d9a9571b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382895028786147, "t_first_token_ns": 382895517301218, "t_last_token_ns": 382902293613778, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 22, "error": null}
|
||||||
|
{"req_id": "51627206c78da939", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382897566234197, "t_first_token_ns": 382897845937130, "t_last_token_ns": 382904615902188, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "ce0e4f081e4c436a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382897789698193, "t_first_token_ns": 382898109862459, "t_last_token_ns": 382904638830708, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "9826eaf60529d07a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382899083621647, "t_first_token_ns": 382899362237981, "t_last_token_ns": 382906518716106, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "f06c8516cb43f569", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382899148196495, "t_first_token_ns": 382899618298353, "t_last_token_ns": 382906547147628, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "a2fcaaf1ad76b0b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382899648898358, "t_first_token_ns": 382899929480205, "t_last_token_ns": 382906624841086, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "0edc9b51f130a612", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382900181437516, "t_first_token_ns": 382900467327062, "t_last_token_ns": 382906842242453, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "ab79c0a9dc372db9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382900896447338, "t_first_token_ns": 382901179686925, "t_last_token_ns": 382907276990728, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "284610eaafd0e42a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382901249884695, "t_first_token_ns": 382901524108593, "t_last_token_ns": 382907394146001, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "89cb2a334a9f6238", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382901744929129, "t_first_token_ns": 382902028948698, "t_last_token_ns": 382907835343717, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "7ea2d46c75d8fe9a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382902748306721, "t_first_token_ns": 382903020739499, "t_last_token_ns": 382908923484811, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "024f2a25db9d388f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382902859154774, "t_first_token_ns": 382903271922131, "t_last_token_ns": 382908951483118, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "4bff469af089d526", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382903366339263, "t_first_token_ns": 382903647927291, "t_last_token_ns": 382909325335834, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "700bf9b5ddd3a040", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382903958595364, "t_first_token_ns": 382904231427622, "t_last_token_ns": 382909620458796, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "56c8618490819588", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382904974089091, "t_first_token_ns": 382905249433034, "t_last_token_ns": 382911499962745, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "897a93d4897b76cf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382905094226481, "t_first_token_ns": 382905504673637, "t_last_token_ns": 382911755923446, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "fa28b6fdceb04b0a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382905469868594, "t_first_token_ns": 382905971649674, "t_last_token_ns": 382912009019036, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "9d6d324a99e130fb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382905452504359, "t_first_token_ns": 382905971887208, "t_last_token_ns": 382912009178825, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "2b3fcd354a2135a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382906194698977, "t_first_token_ns": 382906477072813, "t_last_token_ns": 382912947657660, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "e29b1c49a4e6562a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382907545906785, "t_first_token_ns": 382907820955644, "t_last_token_ns": 382914026466704, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "729dbd22ce5934ca", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382907837036083, "t_first_token_ns": 382908116674853, "t_last_token_ns": 382914097885017, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "150d24a23cc28808", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382908967063936, "t_first_token_ns": 382909238669145, "t_last_token_ns": 382915701561511, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "1247310989ae8f13", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382909682801396, "t_first_token_ns": 382909951690012, "t_last_token_ns": 382916280964212, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "e40763fc59ed0799", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382909694527096, "t_first_token_ns": 382910191468305, "t_last_token_ns": 382916294845999, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "7db7d42bc0b6e636", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382910151148111, "t_first_token_ns": 382910657880040, "t_last_token_ns": 382916321401849, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "a0ed26ee4cdfed05", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382910119282882, "t_first_token_ns": 382910658550738, "t_last_token_ns": 382916322077729, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "c81fd9769ec09a5f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382910403029863, "t_first_token_ns": 382910909076097, "t_last_token_ns": 382916344295038, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "8a5e9c377ef0f53c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382911478442216, "t_first_token_ns": 382911755585640, "t_last_token_ns": 382919215026487, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "3bb8df688946d953", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382911568804259, "t_first_token_ns": 382912008790824, "t_last_token_ns": 382919250705350, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "448dfb7bcf90f1f3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382912124789573, "t_first_token_ns": 382912401294524, "t_last_token_ns": 382919907040123, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "33ee8dfa5d511f36", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382912164228258, "t_first_token_ns": 382912636831634, "t_last_token_ns": 382919925491073, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "fe73928ec06f2ffd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382912242688487, "t_first_token_ns": 382912876922081, "t_last_token_ns": 382919939997218, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "b3b98aab260b94f5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382914270206701, "t_first_token_ns": 382914550808586, "t_last_token_ns": 382921483877194, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "0616a2c3128fe7ad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382914338908210, "t_first_token_ns": 382915022293033, "t_last_token_ns": 382921513419471, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "ea50a73400df1150", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382914468450089, "t_first_token_ns": 382915022466955, "t_last_token_ns": 382921513936829, "prompt_tokens": 4063, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "4608dc5a783e53a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382916345077232, "t_first_token_ns": 382916616746154, "t_last_token_ns": 382922794737760, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "d2629c04025d1ad2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382916592047127, "t_first_token_ns": 382916864680409, "t_last_token_ns": 382922821976994, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "871a29b70510a5f2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382916755438932, "t_first_token_ns": 382917328602586, "t_last_token_ns": 382922848341325, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "78ec14117dc5c5e0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382916823513977, "t_first_token_ns": 382917328486984, "t_last_token_ns": 382922851100152, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "83b69b3a671f955d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382917285817908, "t_first_token_ns": 382917582322995, "t_last_token_ns": 382922869151641, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "7174a671812eb138", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382917326878953, "t_first_token_ns": 382917822985310, "t_last_token_ns": 382922881121014, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "51ff885b54684fc5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382917493106734, "t_first_token_ns": 382918063504101, "t_last_token_ns": 382922922360327, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "37f77dec01a5a091", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382917661010779, "t_first_token_ns": 382918304544283, "t_last_token_ns": 382923134568047, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "280e5ba3748db94d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382918573377556, "t_first_token_ns": 382918856446952, "t_last_token_ns": 382923351992551, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "e4a29ae517fc90cb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382918692723590, "t_first_token_ns": 382919115739195, "t_last_token_ns": 382923367141481, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "191f846cb9f66a8d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382919284856966, "t_first_token_ns": 382919568613062, "t_last_token_ns": 382923468811444, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "35f7d27c1cba45d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382919469966584, "t_first_token_ns": 382919830779037, "t_last_token_ns": 382923480301484, "prompt_tokens": 4008, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "7c00e83d6e1dd0e6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382922861791438, "t_first_token_ns": 382923135112680, "t_last_token_ns": 382924503091456, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "3d4d2bb9bff4449a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382925977287833, "t_first_token_ns": 382926229416220, "t_last_token_ns": 382927443967980, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "7272725ee8fd0540", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382927694895232, "t_first_token_ns": 382927946535112, "t_last_token_ns": 382929162206031, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "a2fea85d23f1bda3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382929230966978, "t_first_token_ns": 382929482152853, "t_last_token_ns": 382931140875021, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "011158111ca19afe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382929769173364, "t_first_token_ns": 382930026850438, "t_last_token_ns": 382931455191149, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "8da37110844555c6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382931550982739, "t_first_token_ns": 382931801970406, "t_last_token_ns": 382934132119325, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "d8b99c661f516313", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382932232434189, "t_first_token_ns": 382932489066674, "t_last_token_ns": 382935585922128, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "a02c08c718952117", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382932349367111, "t_first_token_ns": 382932727628078, "t_last_token_ns": 382935604839933, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "85285854a309da38", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382933228632964, "t_first_token_ns": 382933490787602, "t_last_token_ns": 382936531459672, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "7ddc31684732a450", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382934490449945, "t_first_token_ns": 382934753109500, "t_last_token_ns": 382939062928021, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "536791c815121f58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382934500858259, "t_first_token_ns": 382934989960495, "t_last_token_ns": 382939074298946, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "8576f22a0c427bbf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382934646693281, "t_first_token_ns": 382935226352486, "t_last_token_ns": 382939086472061, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "ca99a4a5056f40d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382935670031514, "t_first_token_ns": 382935935402524, "t_last_token_ns": 382939594939297, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "5838d0fe27e1457f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382936644778672, "t_first_token_ns": 382936909549530, "t_last_token_ns": 382941059923067, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "816823128084eebd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382937193484770, "t_first_token_ns": 382937455521041, "t_last_token_ns": 382941454793009, "prompt_tokens": 3932, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "bb0921b5e49295ee", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382937282173431, "t_first_token_ns": 382937699735468, "t_last_token_ns": 382941473320522, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "346cf978d3834bc0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382937488037072, "t_first_token_ns": 382938162347825, "t_last_token_ns": 382941493290404, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "c886d982435860b7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382937668517005, "t_first_token_ns": 382938161971656, "t_last_token_ns": 382941493409534, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "fdbc4ac00b838337", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382940296695249, "t_first_token_ns": 382940563089249, "t_last_token_ns": 382944361465993, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "fa46f37dea905ba5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382940467467112, "t_first_token_ns": 382940810371498, "t_last_token_ns": 382944381392626, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "3e44df05b71a3966", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382940777907007, "t_first_token_ns": 382941059757008, "t_last_token_ns": 382944400476315, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8ee00ef0ad273d5c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382941557356645, "t_first_token_ns": 382941818708637, "t_last_token_ns": 382944806309436, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "abfce48fa7fd10f9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382942283276208, "t_first_token_ns": 382942540378736, "t_last_token_ns": 382945532923940, "prompt_tokens": 3927, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4f100560326929ab", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382942835097381, "t_first_token_ns": 382943102441575, "t_last_token_ns": 382945784097900, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "f83672c4afe7b491", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382943536448721, "t_first_token_ns": 382943802381831, "t_last_token_ns": 382946091237617, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8683501cd7f36932", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382944871994870, "t_first_token_ns": 382945131767468, "t_last_token_ns": 382947371298281, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "3ce62a1f03d2a106", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382946507732925, "t_first_token_ns": 382946763114679, "t_last_token_ns": 382949315979358, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "0b7dbbfb7b402e80", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382946585180971, "t_first_token_ns": 382947214557439, "t_last_token_ns": 382949331098873, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "34360b76a6f05a1c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382946605794034, "t_first_token_ns": 382947214884676, "t_last_token_ns": 382949331303486, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "9ca5d6660027739d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382948238795534, "t_first_token_ns": 382948501134769, "t_last_token_ns": 382950783595795, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "1fe7cbb011f5427a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382949847086241, "t_first_token_ns": 382950101614251, "t_last_token_ns": 382954977935175, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "dffdc06a4b36683c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382950153271451, "t_first_token_ns": 382950410171573, "t_last_token_ns": 382955176679267, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "caced4db24115828", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382950494401040, "t_first_token_ns": 382950754222077, "t_last_token_ns": 382955368458696, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "9b16f710bda9505d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382951265486654, "t_first_token_ns": 382951525029268, "t_last_token_ns": 382956726759990, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "cb7c10754f5ce5fe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382951630844552, "t_first_token_ns": 382951905781927, "t_last_token_ns": 382957192870722, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "88ffbcb6bc7f56b1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382951711172765, "t_first_token_ns": 382952154698248, "t_last_token_ns": 382957212311988, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "78e331e87c45a189", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382951903480165, "t_first_token_ns": 382952391676452, "t_last_token_ns": 382957223112182, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "456ef7ed125277b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382952145673760, "t_first_token_ns": 382952638086504, "t_last_token_ns": 382957247136845, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "89f55bed6ffb548a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382952702824754, "t_first_token_ns": 382952972212278, "t_last_token_ns": 382957343461682, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "3d3ab1fae05f0d2a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382952829129450, "t_first_token_ns": 382953218114548, "t_last_token_ns": 382957362654919, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "335bb26db7a2003b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382953699622150, "t_first_token_ns": 382953979913404, "t_last_token_ns": 382957912450915, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "dfbdddceda574c5e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382956033443839, "t_first_token_ns": 382956301270498, "t_last_token_ns": 382959751761247, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "59fcfa5724d8e7b3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382956359374462, "t_first_token_ns": 382956627140565, "t_last_token_ns": 382959834054727, "prompt_tokens": 4019, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "cfe7772de6d17d6b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382956923401831, "t_first_token_ns": 382957192541857, "t_last_token_ns": 382960328155901, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "59bba69a8d667e3e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382957597982614, "t_first_token_ns": 382957857550912, "t_last_token_ns": 382961183047739, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "108739472aeef7cd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382958664670752, "t_first_token_ns": 382958921507089, "t_last_token_ns": 382962218143327, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "7912548face9904f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382959327521659, "t_first_token_ns": 382959594099935, "t_last_token_ns": 382962818584039, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "9ad61f308977eb83", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382959813253530, "t_first_token_ns": 382960078042536, "t_last_token_ns": 382963024336825, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "3491e86c0fd93ddb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382960664110723, "t_first_token_ns": 382960925346876, "t_last_token_ns": 382963557439280, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "48e9daddae7ddfde", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382960681318934, "t_first_token_ns": 382961155976039, "t_last_token_ns": 382963563540337, "prompt_tokens": 3921, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "30336ab3d8b85b24", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382962555824470, "t_first_token_ns": 382962819202906, "t_last_token_ns": 382964586083884, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "9a3fd26f54c1be0a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382964082411460, "t_first_token_ns": 382964337877173, "t_last_token_ns": 382969707970982, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "6b4b03f7f400b478", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382964871974999, "t_first_token_ns": 382965130747141, "t_last_token_ns": 382972253410118, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "79c09b32b572e89b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382965023250275, "t_first_token_ns": 382965368790616, "t_last_token_ns": 382972291095009, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "96d046514314461d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382965678768077, "t_first_token_ns": 382965935574188, "t_last_token_ns": 382974521550074, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "185f1dbdea6d23c0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382965698768537, "t_first_token_ns": 382966173182946, "t_last_token_ns": 382974565651702, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "1d800c9aabc32617", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382966037273044, "t_first_token_ns": 382966419747219, "t_last_token_ns": 382974807676197, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "6c8d620872c9187d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382966200511162, "t_first_token_ns": 382966665211292, "t_last_token_ns": 382974847518918, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "48598702f53ca2f3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382966750193640, "t_first_token_ns": 382967014976664, "t_last_token_ns": 382975032643076, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "f2db9e461e72ad78", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382966942869618, "t_first_token_ns": 382967264954501, "t_last_token_ns": 382975067203317, "prompt_tokens": 4022, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "f80c2c460ec65e32", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382967008273755, "t_first_token_ns": 382967501210814, "t_last_token_ns": 382975086214056, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "ab3000c06ac776bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382968061829510, "t_first_token_ns": 382968328879133, "t_last_token_ns": 382976345897697, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "0bf5905c1e8121bd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382968255828785, "t_first_token_ns": 382968579400257, "t_last_token_ns": 382976381189354, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "1098d56d526989cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382968803557889, "t_first_token_ns": 382969081531405, "t_last_token_ns": 382976663356019, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "137d70d1f46c03a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382968815464531, "t_first_token_ns": 382969321217983, "t_last_token_ns": 382976677186139, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "cebd470b51f7fe4e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382969368811945, "t_first_token_ns": 382969649467716, "t_last_token_ns": 382976780104681, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "a151e7b5b68ad9ae", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382970408712279, "t_first_token_ns": 382970685708239, "t_last_token_ns": 382977789107764, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "4eca794d265710c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382971434060756, "t_first_token_ns": 382971706870064, "t_last_token_ns": 382978779282915, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "dff220b2b2e49d3a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382971701668076, "t_first_token_ns": 382971976563040, "t_last_token_ns": 382978821705535, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "82efcb8a8e11e6c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382971794787189, "t_first_token_ns": 382972236783995, "t_last_token_ns": 382978850142874, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "2b2b27c2bbba748c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382972422695171, "t_first_token_ns": 382972698778991, "t_last_token_ns": 382979059872449, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "22232cfe893b9166", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382972667902866, "t_first_token_ns": 382972958069954, "t_last_token_ns": 382979086547479, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "139fa79bf7516bed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382972893271136, "t_first_token_ns": 382973216336359, "t_last_token_ns": 382979108832071, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "d725af190f423c10", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382973051402584, "t_first_token_ns": 382973475218977, "t_last_token_ns": 382979131748405, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "1d3e84737b4620da", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382973385519328, "t_first_token_ns": 382973738667735, "t_last_token_ns": 382979155563268, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "2375944c4224b637", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382974150825083, "t_first_token_ns": 382974442895888, "t_last_token_ns": 382979391080369, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 22, "error": null}
|
||||||
|
{"req_id": "657154f7593762d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382974493886898, "t_first_token_ns": 382974788861859, "t_last_token_ns": 382979452712066, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 23, "error": null}
|
||||||
|
{"req_id": "4e50ce56d0618fec", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382975188421027, "t_first_token_ns": 382975461931858, "t_last_token_ns": 382979687741052, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "cea59338a3b0a9b3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382976056272032, "t_first_token_ns": 382976346263881, "t_last_token_ns": 382979936373009, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "f4dd02509a77bddd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382977165528339, "t_first_token_ns": 382977439885179, "t_last_token_ns": 382980290529320, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "0e7beafea7584ff7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382977731207395, "t_first_token_ns": 382978006850893, "t_last_token_ns": 382980399529403, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "b97b84e328b625d3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382980823689281, "t_first_token_ns": 382981075688086, "t_last_token_ns": 382983476634454, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "651186d1dd6c15a6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382981289258632, "t_first_token_ns": 382981548939419, "t_last_token_ns": 382983853578046, "prompt_tokens": 4033, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "b6350e463220e4e6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382981803870265, "t_first_token_ns": 382982060697967, "t_last_token_ns": 382984141231232, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "358015cbb66d10de", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382982520271690, "t_first_token_ns": 382982782950860, "t_last_token_ns": 382984464938783, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "50705bd15064c6d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382984889117196, "t_first_token_ns": 382985142742040, "t_last_token_ns": 382986739578814, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "ecb80a17721044a3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382985696775295, "t_first_token_ns": 382985955904751, "t_last_token_ns": 382987321670092, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "41abb79477b47f91", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382987355349469, "t_first_token_ns": 382987605708598, "t_last_token_ns": 382990574895939, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "721ebf229a030a95", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382987472519494, "t_first_token_ns": 382987840997804, "t_last_token_ns": 382990590088544, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "3a340945700f2c58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382987839061395, "t_first_token_ns": 382988099169498, "t_last_token_ns": 382990627092689, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "2170390234b5ae2d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382988649987556, "t_first_token_ns": 382988909167022, "t_last_token_ns": 382993001662646, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "076476fea8382967", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382989530470958, "t_first_token_ns": 382989792217274, "t_last_token_ns": 382994679266784, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "ff52490f95172d67", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382990712791716, "t_first_token_ns": 382990971200319, "t_last_token_ns": 382997553642879, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "e924d3bcedd387fb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382990920043591, "t_first_token_ns": 382991212182096, "t_last_token_ns": 382997584314503, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "b877892f91ecec34", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382991172767836, "t_first_token_ns": 382991454716177, "t_last_token_ns": 382997615249931, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "1caf880a2fbf12e5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382991527941321, "t_first_token_ns": 382991792900555, "t_last_token_ns": 382997791893345, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "9a1cf60f1b871ded", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382991861045452, "t_first_token_ns": 382992130204783, "t_last_token_ns": 382997961286607, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "a95165328c12522a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382991926192021, "t_first_token_ns": 382992377868985, "t_last_token_ns": 382997989217438, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "db3cb451a123fd7b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382992256767425, "t_first_token_ns": 382992625017685, "t_last_token_ns": 382998016828719, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "ec19d55d7bc56117", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382992942558055, "t_first_token_ns": 382993215701630, "t_last_token_ns": 382999096964079, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "45f17bcf193b6eb1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382993840954274, "t_first_token_ns": 382994119861588, "t_last_token_ns": 383001306366832, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "0a1426ca17d726a2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382994015777751, "t_first_token_ns": 382994370980435, "t_last_token_ns": 383001544419881, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "85f92136d0577f5c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382994662842950, "t_first_token_ns": 382994932478193, "t_last_token_ns": 383002402402567, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "c68c8ac2cefadeda", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382995590949195, "t_first_token_ns": 382995872537866, "t_last_token_ns": 383003709060859, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "5c5ae6e010163893", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382995704859857, "t_first_token_ns": 382996348969634, "t_last_token_ns": 383003743273870, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "39ec2bfef2cc660c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382995716668906, "t_first_token_ns": 382996348634157, "t_last_token_ns": 383003744205858, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "bc830bbd57c31dfd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382995834115999, "t_first_token_ns": 382996576548299, "t_last_token_ns": 383003757992635, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "f8dfa76d3272924c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382996592526432, "t_first_token_ns": 382996875178131, "t_last_token_ns": 383004057626431, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "6a0616efb0ee2261", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382998223428771, "t_first_token_ns": 382998493950952, "t_last_token_ns": 383005754297147, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "9be8cd07b75dc5e7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382998551019335, "t_first_token_ns": 382998831807421, "t_last_token_ns": 383005901218597, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "213c753a2902d374", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382998801335860, "t_first_token_ns": 382999083678594, "t_last_token_ns": 383006136157518, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "4e27c91846d4e17d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382999459596869, "t_first_token_ns": 382999732143017, "t_last_token_ns": 383006598191889, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "1b02d1736bed2783", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382999713878673, "t_first_token_ns": 382999984232406, "t_last_token_ns": 383006629308877, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "044505f55ee8eb29", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382999740776549, "t_first_token_ns": 383000224556617, "t_last_token_ns": 383006643506063, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "9b6468d2f524b689", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382999929759433, "t_first_token_ns": 383000465788287, "t_last_token_ns": 383006657148086, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "0576ac2b072d4f41", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383000541997745, "t_first_token_ns": 383000823717795, "t_last_token_ns": 383006783955862, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "1904f89e3ffa195e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383000950707836, "t_first_token_ns": 383001230172094, "t_last_token_ns": 383007165326410, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "278330e7e052f152", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383001235892397, "t_first_token_ns": 383001528424082, "t_last_token_ns": 383007663661606, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "783382169706baf2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383001569684788, "t_first_token_ns": 383001844968737, "t_last_token_ns": 383007973140501, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "fd3f7b073d883ae7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383001668536397, "t_first_token_ns": 383002102721237, "t_last_token_ns": 383008001539889, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "16236450faab7663", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383002641194606, "t_first_token_ns": 383002920644684, "t_last_token_ns": 383008878449759, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "f702da4e623d7579", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383002953745602, "t_first_token_ns": 383003230034676, "t_last_token_ns": 383008945478826, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "d9555469adb82bd6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383003736685686, "t_first_token_ns": 383004014553614, "t_last_token_ns": 383009317951479, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "47bdac1c5663cca9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383004296174963, "t_first_token_ns": 383004580018304, "t_last_token_ns": 383009574570549, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "af1d9e8359f64baa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383005832742661, "t_first_token_ns": 383006121843023, "t_last_token_ns": 383010633961300, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "b27e960445b0e807", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383006894781834, "t_first_token_ns": 383007166179558, "t_last_token_ns": 383011665380526, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "3877cb03f6ea4711", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383006990199450, "t_first_token_ns": 383007410572047, "t_last_token_ns": 383011688780131, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "cee027f56ae5f140", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383007391297659, "t_first_token_ns": 383007663836597, "t_last_token_ns": 383011938063080, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "336f49e040184b00", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383007399829640, "t_first_token_ns": 383007908118781, "t_last_token_ns": 383011945522376, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "9cc7dafc187ac5a5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383008338941323, "t_first_token_ns": 383008614078777, "t_last_token_ns": 383012554380367, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "8363103077032c09", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383008515161934, "t_first_token_ns": 383008866787486, "t_last_token_ns": 383012573414985, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "288e93bd46cfc9ea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383011018721810, "t_first_token_ns": 383011285846608, "t_last_token_ns": 383014720127104, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "9e4a220b3cf1ed89", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383011189805105, "t_first_token_ns": 383011534697539, "t_last_token_ns": 383014740652197, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "d553f356441ca287", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383011664224303, "t_first_token_ns": 383011937969249, "t_last_token_ns": 383014870442913, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "5c8b375f370a610c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383012116055141, "t_first_token_ns": 383012385155666, "t_last_token_ns": 383015038599250, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "c8b0fc76cafef753", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383013287964256, "t_first_token_ns": 383013553953406, "t_last_token_ns": 383017392254557, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "72e7e5c0e7f2b84f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383013891713358, "t_first_token_ns": 383014160649590, "t_last_token_ns": 383018064955282, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "9e7ed7f0ec538c3f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383015312019843, "t_first_token_ns": 383015567966387, "t_last_token_ns": 383021670625639, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "ff133c5d58ab656f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383015569465518, "t_first_token_ns": 383015834678890, "t_last_token_ns": 383021745660658, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "a729629242e95560", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383015638483717, "t_first_token_ns": 383016300705127, "t_last_token_ns": 383021771689746, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "0b005fe06300469f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383015776199335, "t_first_token_ns": 383016301094614, "t_last_token_ns": 383021772295924, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "f9fef77febb3a101", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383015808407003, "t_first_token_ns": 383016526380049, "t_last_token_ns": 383021785516174, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "3a56a9b9fbaf16ef", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383016627925772, "t_first_token_ns": 383016895163818, "t_last_token_ns": 383022187453466, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "a4b8f1dd0a833ac0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383017400989473, "t_first_token_ns": 383017678828983, "t_last_token_ns": 383023098524275, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "bc1ce14acfb8f5b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383018338656614, "t_first_token_ns": 383018611454998, "t_last_token_ns": 383023927274975, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "550870cf62cf7dc8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383019351419913, "t_first_token_ns": 383019619186256, "t_last_token_ns": 383025053466820, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "3bed9c08ecc0735e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383019485675318, "t_first_token_ns": 383020079935171, "t_last_token_ns": 383025077063089, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "2affef463ea83fb5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383019440881820, "t_first_token_ns": 383020080727063, "t_last_token_ns": 383025077844451, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "80bee31a1295953e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383020204829783, "t_first_token_ns": 383020694573197, "t_last_token_ns": 383025229719769, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "aebd728fac5eb7c9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383020203647292, "t_first_token_ns": 383020693948715, "t_last_token_ns": 383025230376503, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "fc6cfd2270ab2581", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383020480649855, "t_first_token_ns": 383020946529158, "t_last_token_ns": 383025249446040, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "f4a15b0e0da21586", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383021072114639, "t_first_token_ns": 383021342159212, "t_last_token_ns": 383025342329830, "prompt_tokens": 3916, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "1ff7c31e57e00bcb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383021777330011, "t_first_token_ns": 383022051410261, "t_last_token_ns": 383025595705672, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "c1fecd97b2361f3e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383022415020732, "t_first_token_ns": 383022690188916, "t_last_token_ns": 383025769684775, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "d167f8efb80215a9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383024405106159, "t_first_token_ns": 383024677982876, "t_last_token_ns": 383026413568615, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "c52bced31247e27d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383026855738000, "t_first_token_ns": 383027104245357, "t_last_token_ns": 383031698453402, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "431a39693928917a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383027104473723, "t_first_token_ns": 383027362663482, "t_last_token_ns": 383031762943886, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "bf8d7bba7096d325", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383027357693904, "t_first_token_ns": 383027615410216, "t_last_token_ns": 383031809521168, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "abfb5bfd5d68dc8d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383027373043855, "t_first_token_ns": 383027851510799, "t_last_token_ns": 383031819724401, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "5142ba1e759d7807", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383027459898148, "t_first_token_ns": 383028089105308, "t_last_token_ns": 383031829219652, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "911dfbd9a5da2f3a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383027845178171, "t_first_token_ns": 383028335148320, "t_last_token_ns": 383031845592361, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "d4ec1ed768d3128b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383028529926037, "t_first_token_ns": 383028799403610, "t_last_token_ns": 383032037654779, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8ecca9914ec9afd1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383028710045870, "t_first_token_ns": 383029259886354, "t_last_token_ns": 383032048417024, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "94e98c79036fdaf0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 383028782077174, "t_first_token_ns": 383029259643712, "t_last_token_ns": 383032048584805, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
@@ -0,0 +1,8 @@
|
|||||||
|
{
|
||||||
|
"rate": 1.5,
|
||||||
|
"input_tokens": 4096,
|
||||||
|
"output_tokens": 256,
|
||||||
|
"duration_target_s": 240.0,
|
||||||
|
"duration_actual_s": 243.08491666900227,
|
||||||
|
"n_requests": 392
|
||||||
|
}
|
||||||
File diff suppressed because it is too large
Load Diff
@@ -0,0 +1,286 @@
|
|||||||
|
{
|
||||||
|
"mooncake_both": {
|
||||||
|
"config": "mooncake_both",
|
||||||
|
"n_steps_total": 15522,
|
||||||
|
"n_steps_after_warmup": 15022,
|
||||||
|
"n_steps_decode_only": 14645,
|
||||||
|
"decode_share": 0.974903474903475,
|
||||||
|
"rows_used_for_fit": "decode_only",
|
||||||
|
"cache_size_max": 17533,
|
||||||
|
"per_bin": [
|
||||||
|
{
|
||||||
|
"bin_id": 3,
|
||||||
|
"cache_size_mid": 6136.55,
|
||||||
|
"n": 183,
|
||||||
|
"cache_size_p50": 6794,
|
||||||
|
"step_duration_us_p50": 753,
|
||||||
|
"step_duration_us_p90": 1105,
|
||||||
|
"build_meta_us_p50": 639,
|
||||||
|
"build_meta_us_p90": 968
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 4,
|
||||||
|
"cache_size_mid": 7889.849999999999,
|
||||||
|
"n": 921,
|
||||||
|
"cache_size_p50": 7916,
|
||||||
|
"step_duration_us_p50": 763,
|
||||||
|
"step_duration_us_p90": 1102,
|
||||||
|
"build_meta_us_p50": 694,
|
||||||
|
"build_meta_us_p90": 983
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 5,
|
||||||
|
"cache_size_mid": 9643.15,
|
||||||
|
"n": 223,
|
||||||
|
"cache_size_p50": 9500,
|
||||||
|
"step_duration_us_p50": 933,
|
||||||
|
"step_duration_us_p90": 1336,
|
||||||
|
"build_meta_us_p50": 834,
|
||||||
|
"build_meta_us_p90": 1223
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 6,
|
||||||
|
"cache_size_mid": 11396.449999999999,
|
||||||
|
"n": 404,
|
||||||
|
"cache_size_p50": 11355,
|
||||||
|
"step_duration_us_p50": 1039,
|
||||||
|
"step_duration_us_p90": 1488,
|
||||||
|
"build_meta_us_p50": 954,
|
||||||
|
"build_meta_us_p90": 1363
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 7,
|
||||||
|
"cache_size_mid": 13149.75,
|
||||||
|
"n": 367,
|
||||||
|
"cache_size_p50": 13443,
|
||||||
|
"step_duration_us_p50": 1174,
|
||||||
|
"step_duration_us_p90": 1659,
|
||||||
|
"build_meta_us_p50": 1078,
|
||||||
|
"build_meta_us_p90": 1540
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 8,
|
||||||
|
"cache_size_mid": 14903.05,
|
||||||
|
"n": 468,
|
||||||
|
"cache_size_p50": 14554,
|
||||||
|
"step_duration_us_p50": 1211,
|
||||||
|
"step_duration_us_p90": 1685,
|
||||||
|
"build_meta_us_p50": 1136,
|
||||||
|
"build_meta_us_p90": 1593
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 9,
|
||||||
|
"cache_size_mid": 16656.35,
|
||||||
|
"n": 12079,
|
||||||
|
"cache_size_p50": 17528,
|
||||||
|
"step_duration_us_p50": 1555,
|
||||||
|
"step_duration_us_p90": 2204,
|
||||||
|
"build_meta_us_p50": 1450,
|
||||||
|
"build_meta_us_p90": 2051
|
||||||
|
}
|
||||||
|
],
|
||||||
|
"fit_step_duration": {
|
||||||
|
"slope_us_per_block": 0.0944873921263782,
|
||||||
|
"intercept_us": 85.69888970284087
|
||||||
|
},
|
||||||
|
"fit_build_meta": {
|
||||||
|
"slope_us_per_block": 0.09173081765176755,
|
||||||
|
"intercept_us": 24.232639769930234
|
||||||
|
},
|
||||||
|
"worker_summary": {
|
||||||
|
"n": 15522,
|
||||||
|
"get_finished_us_p50": 180,
|
||||||
|
"get_finished_us_p90": 255,
|
||||||
|
"get_finished_us_p99": 349,
|
||||||
|
"start_load_kv_us_p50": 2,
|
||||||
|
"start_load_kv_us_p90": 5
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"mooncake_both_drfix": {
|
||||||
|
"config": "mooncake_both_drfix",
|
||||||
|
"n_steps_total": 15438,
|
||||||
|
"n_steps_after_warmup": 14938,
|
||||||
|
"n_steps_decode_only": 14562,
|
||||||
|
"decode_share": 0.9748292944169232,
|
||||||
|
"rows_used_for_fit": "decode_only",
|
||||||
|
"cache_size_max": 17533,
|
||||||
|
"per_bin": [
|
||||||
|
{
|
||||||
|
"bin_id": 3,
|
||||||
|
"cache_size_mid": 6136.55,
|
||||||
|
"n": 180,
|
||||||
|
"cache_size_p50": 6793,
|
||||||
|
"step_duration_us_p50": 125,
|
||||||
|
"step_duration_us_p90": 144,
|
||||||
|
"build_meta_us_p50": 6,
|
||||||
|
"build_meta_us_p90": 7
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 4,
|
||||||
|
"cache_size_mid": 7889.849999999999,
|
||||||
|
"n": 921,
|
||||||
|
"cache_size_p50": 7916,
|
||||||
|
"step_duration_us_p50": 66,
|
||||||
|
"step_duration_us_p90": 99,
|
||||||
|
"build_meta_us_p50": 6,
|
||||||
|
"build_meta_us_p90": 7
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 5,
|
||||||
|
"cache_size_mid": 9643.15,
|
||||||
|
"n": 217,
|
||||||
|
"cache_size_p50": 9499,
|
||||||
|
"step_duration_us_p50": 102,
|
||||||
|
"step_duration_us_p90": 120,
|
||||||
|
"build_meta_us_p50": 6,
|
||||||
|
"build_meta_us_p90": 7
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 6,
|
||||||
|
"cache_size_mid": 11396.449999999999,
|
||||||
|
"n": 384,
|
||||||
|
"cache_size_p50": 11353,
|
||||||
|
"step_duration_us_p50": 91,
|
||||||
|
"step_duration_us_p90": 119,
|
||||||
|
"build_meta_us_p50": 6,
|
||||||
|
"build_meta_us_p90": 7
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 7,
|
||||||
|
"cache_size_mid": 13149.75,
|
||||||
|
"n": 344,
|
||||||
|
"cache_size_p50": 13690,
|
||||||
|
"step_duration_us_p50": 98,
|
||||||
|
"step_duration_us_p90": 124,
|
||||||
|
"build_meta_us_p50": 6,
|
||||||
|
"build_meta_us_p90": 7
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 8,
|
||||||
|
"cache_size_mid": 14903.05,
|
||||||
|
"n": 456,
|
||||||
|
"cache_size_p50": 14554,
|
||||||
|
"step_duration_us_p50": 73,
|
||||||
|
"step_duration_us_p90": 100,
|
||||||
|
"build_meta_us_p50": 6,
|
||||||
|
"build_meta_us_p90": 7
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 9,
|
||||||
|
"cache_size_mid": 16656.35,
|
||||||
|
"n": 12060,
|
||||||
|
"cache_size_p50": 17528,
|
||||||
|
"step_duration_us_p50": 97,
|
||||||
|
"step_duration_us_p90": 169,
|
||||||
|
"build_meta_us_p50": 6,
|
||||||
|
"build_meta_us_p90": 7
|
||||||
|
}
|
||||||
|
],
|
||||||
|
"fit_step_duration": {
|
||||||
|
"slope_us_per_block": 0.0023383689406900362,
|
||||||
|
"intercept_us": 66.43277951652531
|
||||||
|
},
|
||||||
|
"fit_build_meta": {
|
||||||
|
"slope_us_per_block": -2.126549423120513e-05,
|
||||||
|
"intercept_us": 6.582759051897582
|
||||||
|
},
|
||||||
|
"worker_summary": {
|
||||||
|
"n": 15438,
|
||||||
|
"get_finished_us_p50": 180,
|
||||||
|
"get_finished_us_p90": 256,
|
||||||
|
"get_finished_us_p99": 345,
|
||||||
|
"start_load_kv_us_p50": 2,
|
||||||
|
"start_load_kv_us_p90": 2
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"plain": {
|
||||||
|
"config": "plain",
|
||||||
|
"n_steps_total": 15626,
|
||||||
|
"n_steps_after_warmup": 15126,
|
||||||
|
"n_steps_decode_only": 14751,
|
||||||
|
"decode_share": 0.9752082506941689,
|
||||||
|
"rows_used_for_fit": "decode_only",
|
||||||
|
"cache_size_max": 17533,
|
||||||
|
"per_bin": [
|
||||||
|
{
|
||||||
|
"bin_id": 3,
|
||||||
|
"cache_size_mid": 6136.55,
|
||||||
|
"n": 215,
|
||||||
|
"cache_size_p50": 6791,
|
||||||
|
"step_duration_us_p50": 105,
|
||||||
|
"step_duration_us_p90": 135,
|
||||||
|
"build_meta_us_p50": 0,
|
||||||
|
"build_meta_us_p90": 0
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 4,
|
||||||
|
"cache_size_mid": 7889.849999999999,
|
||||||
|
"n": 979,
|
||||||
|
"cache_size_p50": 7920,
|
||||||
|
"step_duration_us_p50": 54,
|
||||||
|
"step_duration_us_p90": 81,
|
||||||
|
"build_meta_us_p50": 0,
|
||||||
|
"build_meta_us_p90": 0
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 5,
|
||||||
|
"cache_size_mid": 9643.15,
|
||||||
|
"n": 230,
|
||||||
|
"cache_size_p50": 9500,
|
||||||
|
"step_duration_us_p50": 83,
|
||||||
|
"step_duration_us_p90": 104,
|
||||||
|
"build_meta_us_p50": 0,
|
||||||
|
"build_meta_us_p90": 0
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 6,
|
||||||
|
"cache_size_mid": 11396.449999999999,
|
||||||
|
"n": 402,
|
||||||
|
"cache_size_p50": 11355,
|
||||||
|
"step_duration_us_p50": 78,
|
||||||
|
"step_duration_us_p90": 101,
|
||||||
|
"build_meta_us_p50": 0,
|
||||||
|
"build_meta_us_p90": 0
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 7,
|
||||||
|
"cache_size_mid": 13149.75,
|
||||||
|
"n": 371,
|
||||||
|
"cache_size_p50": 13693,
|
||||||
|
"step_duration_us_p50": 81,
|
||||||
|
"step_duration_us_p90": 103,
|
||||||
|
"build_meta_us_p50": 0,
|
||||||
|
"build_meta_us_p90": 0
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 8,
|
||||||
|
"cache_size_mid": 14903.05,
|
||||||
|
"n": 468,
|
||||||
|
"cache_size_p50": 14554,
|
||||||
|
"step_duration_us_p50": 61,
|
||||||
|
"step_duration_us_p90": 83,
|
||||||
|
"build_meta_us_p50": 0,
|
||||||
|
"build_meta_us_p90": 0
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"bin_id": 9,
|
||||||
|
"cache_size_mid": 16656.35,
|
||||||
|
"n": 12086,
|
||||||
|
"cache_size_p50": 17528,
|
||||||
|
"step_duration_us_p50": 87,
|
||||||
|
"step_duration_us_p90": 157,
|
||||||
|
"build_meta_us_p50": 0,
|
||||||
|
"build_meta_us_p90": 0
|
||||||
|
}
|
||||||
|
],
|
||||||
|
"fit_step_duration": {
|
||||||
|
"slope_us_per_block": 0.0026079754841429087,
|
||||||
|
"intercept_us": 51.12171169280091
|
||||||
|
},
|
||||||
|
"fit_build_meta": {
|
||||||
|
"slope_us_per_block": 0.0,
|
||||||
|
"intercept_us": 0.0
|
||||||
|
},
|
||||||
|
"worker_summary": null
|
||||||
|
}
|
||||||
|
}
|
||||||
@@ -0,0 +1,624 @@
|
|||||||
|
# HELP python_gc_objects_collected_total Objects collected during gc
|
||||||
|
# TYPE python_gc_objects_collected_total counter
|
||||||
|
python_gc_objects_collected_total{generation="0"} 11967.0
|
||||||
|
python_gc_objects_collected_total{generation="1"} 1552.0
|
||||||
|
python_gc_objects_collected_total{generation="2"} 855.0
|
||||||
|
# HELP python_gc_objects_uncollectable_total Uncollectable objects found during GC
|
||||||
|
# TYPE python_gc_objects_uncollectable_total counter
|
||||||
|
python_gc_objects_uncollectable_total{generation="0"} 0.0
|
||||||
|
python_gc_objects_uncollectable_total{generation="1"} 0.0
|
||||||
|
python_gc_objects_uncollectable_total{generation="2"} 0.0
|
||||||
|
# HELP python_gc_collections_total Number of times this generation was collected
|
||||||
|
# TYPE python_gc_collections_total counter
|
||||||
|
python_gc_collections_total{generation="0"} 1340.0
|
||||||
|
python_gc_collections_total{generation="1"} 121.0
|
||||||
|
python_gc_collections_total{generation="2"} 9.0
|
||||||
|
# HELP python_info Python platform information
|
||||||
|
# TYPE python_info gauge
|
||||||
|
python_info{implementation="CPython",major="3",minor="12",patchlevel="3",version="3.12.3"} 1.0
|
||||||
|
# HELP process_virtual_memory_bytes Virtual memory size in bytes.
|
||||||
|
# TYPE process_virtual_memory_bytes gauge
|
||||||
|
process_virtual_memory_bytes 3.8945964032e+010
|
||||||
|
# HELP process_resident_memory_bytes Resident memory size in bytes.
|
||||||
|
# TYPE process_resident_memory_bytes gauge
|
||||||
|
process_resident_memory_bytes 1.350537216e+09
|
||||||
|
# HELP process_start_time_seconds Start time of the process since unix epoch in seconds.
|
||||||
|
# TYPE process_start_time_seconds gauge
|
||||||
|
process_start_time_seconds 1.77981284373e+09
|
||||||
|
# HELP process_cpu_seconds_total Total user and system CPU time spent in seconds.
|
||||||
|
# TYPE process_cpu_seconds_total counter
|
||||||
|
process_cpu_seconds_total 41.480000000000004
|
||||||
|
# HELP process_open_fds Number of open file descriptors.
|
||||||
|
# TYPE process_open_fds gauge
|
||||||
|
process_open_fds 64.0
|
||||||
|
# HELP process_max_fds Maximum number of open file descriptors.
|
||||||
|
# TYPE process_max_fds gauge
|
||||||
|
process_max_fds 65535.0
|
||||||
|
# HELP vllm:estimated_flops_per_gpu_total Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_flops_per_gpu_total counter
|
||||||
|
vllm:estimated_flops_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:estimated_flops_per_gpu_created Estimated number of floating point operations per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_flops_per_gpu_created gauge
|
||||||
|
vllm:estimated_flops_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468888192e+09
|
||||||
|
# HELP vllm:estimated_read_bytes_per_gpu_total Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_read_bytes_per_gpu_total counter
|
||||||
|
vllm:estimated_read_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:estimated_read_bytes_per_gpu_created Estimated number of bytes read from memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_read_bytes_per_gpu_created gauge
|
||||||
|
vllm:estimated_read_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779812946888848e+09
|
||||||
|
# HELP vllm:estimated_write_bytes_per_gpu_total Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_write_bytes_per_gpu_total counter
|
||||||
|
vllm:estimated_write_bytes_per_gpu_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:estimated_write_bytes_per_gpu_created Estimated number of bytes written to memory per GPU (for Model Flops Utilization calculations).
|
||||||
|
# TYPE vllm:estimated_write_bytes_per_gpu_created gauge
|
||||||
|
vllm:estimated_write_bytes_per_gpu_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468888671e+09
|
||||||
|
# HELP vllm:num_requests_running Number of requests in model execution batches.
|
||||||
|
# TYPE vllm:num_requests_running gauge
|
||||||
|
vllm:num_requests_running{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:num_requests_waiting Number of requests waiting to be processed.
|
||||||
|
# TYPE vllm:num_requests_waiting gauge
|
||||||
|
vllm:num_requests_waiting{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:engine_sleep_state Engine sleep state; awake = 0 means engine is sleeping; awake = 1 means engine is awake; weights_offloaded = 1 means sleep level 1; discard_all = 1 means sleep level 2.
|
||||||
|
# TYPE vllm:engine_sleep_state gauge
|
||||||
|
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="awake"} 1.0
|
||||||
|
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="weights_offloaded"} 0.0
|
||||||
|
vllm:engine_sleep_state{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",sleep_state="discard_all"} 0.0
|
||||||
|
# HELP vllm:kv_cache_usage_perc KV-cache usage. 1 means 100 percent usage.
|
||||||
|
# TYPE vllm:kv_cache_usage_perc gauge
|
||||||
|
vllm:kv_cache_usage_perc{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prefix_cache_queries_total Prefix cache queries, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:prefix_cache_queries_total counter
|
||||||
|
vllm:prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:prefix_cache_queries_created Prefix cache queries, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:prefix_cache_queries_created gauge
|
||||||
|
vllm:prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468890276e+09
|
||||||
|
# HELP vllm:prefix_cache_hits_total Prefix cache hits, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:prefix_cache_hits_total counter
|
||||||
|
vllm:prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prefix_cache_hits_created Prefix cache hits, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:prefix_cache_hits_created gauge
|
||||||
|
vllm:prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468890412e+09
|
||||||
|
# HELP vllm:external_prefix_cache_queries_total External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_queries_total counter
|
||||||
|
vllm:external_prefix_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:external_prefix_cache_queries_created External prefix cache queries from KV connector cross-instance cache sharing, in terms of number of queried tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_queries_created gauge
|
||||||
|
vllm:external_prefix_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468890548e+09
|
||||||
|
# HELP vllm:external_prefix_cache_hits_total External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_hits_total counter
|
||||||
|
vllm:external_prefix_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:external_prefix_cache_hits_created External prefix cache hits from KV connector cross-instance cache sharing, in terms of number of cached tokens.
|
||||||
|
# TYPE vllm:external_prefix_cache_hits_created gauge
|
||||||
|
vllm:external_prefix_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468890667e+09
|
||||||
|
# HELP vllm:mm_cache_queries_total Multi-modal cache queries, in terms of number of queried items.
|
||||||
|
# TYPE vllm:mm_cache_queries_total counter
|
||||||
|
vllm:mm_cache_queries_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:mm_cache_queries_created Multi-modal cache queries, in terms of number of queried items.
|
||||||
|
# TYPE vllm:mm_cache_queries_created gauge
|
||||||
|
vllm:mm_cache_queries_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468890786e+09
|
||||||
|
# HELP vllm:mm_cache_hits_total Multi-modal cache hits, in terms of number of cached items.
|
||||||
|
# TYPE vllm:mm_cache_hits_total counter
|
||||||
|
vllm:mm_cache_hits_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:mm_cache_hits_created Multi-modal cache hits, in terms of number of cached items.
|
||||||
|
# TYPE vllm:mm_cache_hits_created gauge
|
||||||
|
vllm:mm_cache_hits_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468890915e+09
|
||||||
|
# HELP vllm:num_preemptions_total Cumulative number of preemption from the engine.
|
||||||
|
# TYPE vllm:num_preemptions_total counter
|
||||||
|
vllm:num_preemptions_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:num_preemptions_created Cumulative number of preemption from the engine.
|
||||||
|
# TYPE vllm:num_preemptions_created gauge
|
||||||
|
vllm:num_preemptions_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468891037e+09
|
||||||
|
# HELP vllm:prompt_tokens_total Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:prompt_tokens_total counter
|
||||||
|
vllm:prompt_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:prompt_tokens_created Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:prompt_tokens_created gauge
|
||||||
|
vllm:prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468891146e+09
|
||||||
|
# HELP vllm:prompt_tokens_by_source_total Number of prompt tokens by source.
|
||||||
|
# TYPE vllm:prompt_tokens_by_source_total counter
|
||||||
|
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.563446e+06
|
||||||
|
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 0.0
|
||||||
|
vllm:prompt_tokens_by_source_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 0.0
|
||||||
|
# HELP vllm:prompt_tokens_by_source_created Number of prompt tokens by source.
|
||||||
|
# TYPE vllm:prompt_tokens_by_source_created gauge
|
||||||
|
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_compute"} 1.7798129468891292e+09
|
||||||
|
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="local_cache_hit"} 1.7798129468891344e+09
|
||||||
|
vllm:prompt_tokens_by_source_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct",source="external_kv_transfer"} 1.779812946889139e+09
|
||||||
|
# HELP vllm:prompt_tokens_cached_total Number of cached prompt tokens (local + external).
|
||||||
|
# TYPE vllm:prompt_tokens_cached_total counter
|
||||||
|
vllm:prompt_tokens_cached_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prompt_tokens_cached_created Number of cached prompt tokens (local + external).
|
||||||
|
# TYPE vllm:prompt_tokens_cached_created gauge
|
||||||
|
vllm:prompt_tokens_cached_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468891513e+09
|
||||||
|
# HELP vllm:prompt_tokens_recomputed_total Number of cached tokens recomputed for forward pass.
|
||||||
|
# TYPE vllm:prompt_tokens_recomputed_total counter
|
||||||
|
vllm:prompt_tokens_recomputed_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:prompt_tokens_recomputed_created Number of cached tokens recomputed for forward pass.
|
||||||
|
# TYPE vllm:prompt_tokens_recomputed_created gauge
|
||||||
|
vllm:prompt_tokens_recomputed_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468891625e+09
|
||||||
|
# HELP vllm:generation_tokens_total Number of generation tokens processed.
|
||||||
|
# TYPE vllm:generation_tokens_total counter
|
||||||
|
vllm:generation_tokens_total{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:generation_tokens_created Number of generation tokens processed.
|
||||||
|
# TYPE vllm:generation_tokens_created gauge
|
||||||
|
vllm:generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468891773e+09
|
||||||
|
# HELP vllm:request_success_total Count of successfully processed requests.
|
||||||
|
# TYPE vllm:request_success_total counter
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_success_total{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
# HELP vllm:request_success_created Count of successfully processed requests.
|
||||||
|
# TYPE vllm:request_success_created gauge
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="stop",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779812946889204e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="length",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468892212e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="abort",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468892279e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="error",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468892329e+09
|
||||||
|
vllm:request_success_created{engine="0",finished_reason="repetition",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468892384e+09
|
||||||
|
# HELP vllm:request_prompt_tokens Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:request_prompt_tokens histogram
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prompt_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:request_prompt_tokens_created Number of prefill tokens processed.
|
||||||
|
# TYPE vllm:request_prompt_tokens_created gauge
|
||||||
|
vllm:request_prompt_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468892868e+09
|
||||||
|
# HELP vllm:request_generation_tokens Number of generation tokens processed.
|
||||||
|
# TYPE vllm:request_generation_tokens histogram
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:request_generation_tokens_created Number of generation tokens processed.
|
||||||
|
# TYPE vllm:request_generation_tokens_created gauge
|
||||||
|
vllm:request_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468893485e+09
|
||||||
|
# HELP vllm:iteration_tokens_total Histogram of number of tokens per engine_step.
|
||||||
|
# TYPE vllm:iteration_tokens_total histogram
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3152.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="8.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 10567.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="16.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 14535.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="32.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15232.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="64.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15232.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="128.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15232.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="256.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15232.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="512.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15232.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="1024.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15232.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="2048.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15232.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="4096.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15576.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="8192.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15600.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="16384.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15600.0
|
||||||
|
vllm:iteration_tokens_total_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15600.0
|
||||||
|
vllm:iteration_tokens_total_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 15600.0
|
||||||
|
vllm:iteration_tokens_total_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.663798e+06
|
||||||
|
# HELP vllm:iteration_tokens_total_created Histogram of number of tokens per engine_step.
|
||||||
|
# TYPE vllm:iteration_tokens_total_created gauge
|
||||||
|
vllm:iteration_tokens_total_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468893907e+09
|
||||||
|
# HELP vllm:request_max_num_generation_tokens Histogram of maximum number of requested generation tokens.
|
||||||
|
# TYPE vllm:request_max_num_generation_tokens histogram
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_max_num_generation_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:request_max_num_generation_tokens_created Histogram of maximum number of requested generation tokens.
|
||||||
|
# TYPE vllm:request_max_num_generation_tokens_created gauge
|
||||||
|
vllm:request_max_num_generation_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779812946889425e+09
|
||||||
|
# HELP vllm:request_params_n Histogram of the n request parameter.
|
||||||
|
# TYPE vllm:request_params_n histogram
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_n_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
# HELP vllm:request_params_n_created Histogram of the n request parameter.
|
||||||
|
# TYPE vllm:request_params_n_created gauge
|
||||||
|
vllm:request_params_n_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468894763e+09
|
||||||
|
# HELP vllm:request_params_max_tokens Histogram of the max_tokens request parameter.
|
||||||
|
# TYPE vllm:request_params_max_tokens histogram
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_params_max_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 100352.0
|
||||||
|
# HELP vllm:request_params_max_tokens_created Histogram of the max_tokens request parameter.
|
||||||
|
# TYPE vllm:request_params_max_tokens_created gauge
|
||||||
|
vllm:request_params_max_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468895009e+09
|
||||||
|
# HELP vllm:time_to_first_token_seconds Histogram of time to first token in seconds.
|
||||||
|
# TYPE vllm:time_to_first_token_seconds histogram
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.001",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.005",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.02",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.04",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.06",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.08",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.25",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 14.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 342.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 385.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 391.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="160.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="640.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="2560.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:time_to_first_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 139.74462819099426
|
||||||
|
# HELP vllm:time_to_first_token_seconds_created Histogram of time to first token in seconds.
|
||||||
|
# TYPE vllm:time_to_first_token_seconds_created gauge
|
||||||
|
vllm:time_to_first_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779812946889535e+09
|
||||||
|
# HELP vllm:inter_token_latency_seconds Histogram of inter-token latency in seconds.
|
||||||
|
# TYPE vllm:inter_token_latency_seconds histogram
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 24975.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 93964.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96708.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96773.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96773.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96780.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 96903.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99743.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99743.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 99960.0
|
||||||
|
vllm:inter_token_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2025.8043659705436
|
||||||
|
# HELP vllm:inter_token_latency_seconds_created Histogram of inter-token latency in seconds.
|
||||||
|
# TYPE vllm:inter_token_latency_seconds_created gauge
|
||||||
|
vllm:inter_token_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468896053e+09
|
||||||
|
# HELP vllm:request_time_per_output_token_seconds Histogram of time_per_output_token_seconds per request.
|
||||||
|
# TYPE vllm:request_time_per_output_token_seconds histogram
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.01",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 33.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.025",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 276.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.05",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.075",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.1",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.15",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.2",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.4",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="0.75",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="7.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="80.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_time_per_output_token_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 7.94433084694331
|
||||||
|
# HELP vllm:request_time_per_output_token_seconds_created Histogram of time_per_output_token_seconds per request.
|
||||||
|
# TYPE vllm:request_time_per_output_token_seconds_created gauge
|
||||||
|
vllm:request_time_per_output_token_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468896456e+09
|
||||||
|
# HELP vllm:e2e_request_latency_seconds Histogram of e2e request latency in seconds.
|
||||||
|
# TYPE vllm:e2e_request_latency_seconds histogram
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 12.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 19.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 167.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:e2e_request_latency_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2165.43790102005
|
||||||
|
# HELP vllm:e2e_request_latency_seconds_created Histogram of e2e request latency in seconds.
|
||||||
|
# TYPE vllm:e2e_request_latency_seconds_created gauge
|
||||||
|
vllm:e2e_request_latency_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468908637e+09
|
||||||
|
# HELP vllm:request_queue_time_seconds Histogram of time spent in WAITING phase for request.
|
||||||
|
# TYPE vllm:request_queue_time_seconds histogram
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_queue_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0030762000242248178
|
||||||
|
# HELP vllm:request_queue_time_seconds_created Histogram of time spent in WAITING phase for request.
|
||||||
|
# TYPE vllm:request_queue_time_seconds_created gauge
|
||||||
|
vllm:request_queue_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468909323e+09
|
||||||
|
# HELP vllm:request_inference_time_seconds Histogram of time spent in RUNNING phase for request.
|
||||||
|
# TYPE vllm:request_inference_time_seconds histogram
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 3.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 13.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 20.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 173.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_inference_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2143.7371602561325
|
||||||
|
# HELP vllm:request_inference_time_seconds_created Histogram of time spent in RUNNING phase for request.
|
||||||
|
# TYPE vllm:request_inference_time_seconds_created gauge
|
||||||
|
vllm:request_inference_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468909864e+09
|
||||||
|
# HELP vllm:request_prefill_time_seconds Histogram of time spent in PREFILL phase for request.
|
||||||
|
# TYPE vllm:request_prefill_time_seconds histogram
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 308.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 376.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 117.9327942855889
|
||||||
|
# HELP vllm:request_prefill_time_seconds_created Histogram of time spent in PREFILL phase for request.
|
||||||
|
# TYPE vllm:request_prefill_time_seconds_created gauge
|
||||||
|
vllm:request_prefill_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468910336e+09
|
||||||
|
# HELP vllm:request_decode_time_seconds Histogram of time spent in DECODE phase for request.
|
||||||
|
# TYPE vllm:request_decode_time_seconds histogram
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="0.3",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="0.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="0.8",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="1.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 6.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 17.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="2.5",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 32.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 189.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="15.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="30.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="40.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="60.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="120.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="240.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="480.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="960.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="1920.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="7680.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_decode_time_seconds_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 2025.8043659705436
|
||||||
|
# HELP vllm:request_decode_time_seconds_created Histogram of time spent in DECODE phase for request.
|
||||||
|
# TYPE vllm:request_decode_time_seconds_created gauge
|
||||||
|
vllm:request_decode_time_seconds_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.779812946891072e+09
|
||||||
|
# HELP vllm:request_prefill_kv_computed_tokens Histogram of new KV tokens computed during prefill (excluding cached tokens).
|
||||||
|
# TYPE vllm:request_prefill_kv_computed_tokens histogram
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="500.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="1000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="2000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 0.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="5000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="10000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="20000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="50000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="100000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="200000.0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_bucket{engine="0",le="+Inf",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_count{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 392.0
|
||||||
|
vllm:request_prefill_kv_computed_tokens_sum{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.563446e+06
|
||||||
|
# HELP vllm:request_prefill_kv_computed_tokens_created Histogram of new KV tokens computed during prefill (excluding cached tokens).
|
||||||
|
# TYPE vllm:request_prefill_kv_computed_tokens_created gauge
|
||||||
|
vllm:request_prefill_kv_computed_tokens_created{engine="0",model_name="/home/admin/cpfs/wjh/models/Qwen/Qwen3-Coder-30B-A3B-Instruct"} 1.7798129468911197e+09
|
||||||
|
# HELP vllm:cache_config_info Information of the LLMEngine CacheConfig
|
||||||
|
# TYPE vllm:cache_config_info gauge
|
||||||
|
vllm:cache_config_info{_block_size_resolved="True",block_size="16",cache_dtype="auto",calculate_kv_scales="False",cpu_kvcache_space_bytes="None",enable_prefix_caching="True",engine="0",gpu_memory_utilization="0.9",is_attention_free="False",kv_cache_memory_bytes="None",kv_offloading_backend="native",kv_offloading_size="None",kv_sharing_fast_prefill="False",mamba_block_size="None",mamba_cache_dtype="auto",mamba_cache_mode="none",mamba_page_size_padded="None",mamba_ssm_cache_dtype="auto",num_cpu_blocks="None",num_gpu_blocks="17590",num_gpu_blocks_override="None",prefix_caching_hash_algo="sha256",sliding_window="None",user_specified_block_size="False"} 1.0
|
||||||
|
# HELP http_requests_total Total number of requests by method, status and handler.
|
||||||
|
# TYPE http_requests_total counter
|
||||||
|
http_requests_total{handler="/v1/models",method="GET",status="2xx"} 1.0
|
||||||
|
http_requests_total{handler="/v1/chat/completions",method="POST",status="2xx"} 392.0
|
||||||
|
# HELP http_requests_created Total number of requests by method, status and handler.
|
||||||
|
# TYPE http_requests_created gauge
|
||||||
|
http_requests_created{handler="/v1/models",method="GET",status="2xx"} 1.7798129484643826e+09
|
||||||
|
http_requests_created{handler="/v1/chat/completions",method="POST",status="2xx"} 1.779812957247958e+09
|
||||||
|
# HELP http_request_size_bytes Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_request_size_bytes summary
|
||||||
|
http_request_size_bytes_count{handler="/v1/models"} 1.0
|
||||||
|
http_request_size_bytes_sum{handler="/v1/models"} 0.0
|
||||||
|
http_request_size_bytes_count{handler="/v1/chat/completions"} 392.0
|
||||||
|
http_request_size_bytes_sum{handler="/v1/chat/completions"} 2.067408e+06
|
||||||
|
# HELP http_request_size_bytes_created Content length of incoming requests by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_request_size_bytes_created gauge
|
||||||
|
http_request_size_bytes_created{handler="/v1/models"} 1.779812948464408e+09
|
||||||
|
http_request_size_bytes_created{handler="/v1/chat/completions"} 1.7798129572479794e+09
|
||||||
|
# HELP http_response_size_bytes Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_response_size_bytes summary
|
||||||
|
http_response_size_bytes_count{handler="/v1/models"} 1.0
|
||||||
|
http_response_size_bytes_sum{handler="/v1/models"} 558.0
|
||||||
|
http_response_size_bytes_count{handler="/v1/chat/completions"} 392.0
|
||||||
|
http_response_size_bytes_sum{handler="/v1/chat/completions"} 0.0
|
||||||
|
# HELP http_response_size_bytes_created Content length of outgoing responses by handler. Only value of header is respected. Otherwise ignored. No percentile calculated.
|
||||||
|
# TYPE http_response_size_bytes_created gauge
|
||||||
|
http_response_size_bytes_created{handler="/v1/models"} 1.7798129484644337e+09
|
||||||
|
http_response_size_bytes_created{handler="/v1/chat/completions"} 1.7798129572480056e+09
|
||||||
|
# HELP http_request_duration_highr_seconds Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
|
||||||
|
# TYPE http_request_duration_highr_seconds histogram
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.01"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.025"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.05"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.075"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.1"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.25"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.5"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="0.75"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="1.0"} 1.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="1.5"} 4.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="2.0"} 13.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="2.5"} 20.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="3.0"} 39.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="3.5"} 64.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="4.0"} 98.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="4.5"} 138.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="5.0"} 168.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="7.5"} 316.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="10.0"} 393.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="30.0"} 393.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="60.0"} 393.0
|
||||||
|
http_request_duration_highr_seconds_bucket{le="+Inf"} 393.0
|
||||||
|
http_request_duration_highr_seconds_count 393.0
|
||||||
|
http_request_duration_highr_seconds_sum 2166.043502608838
|
||||||
|
# HELP http_request_duration_highr_seconds_created Latency with many buckets but no API specific labels. Made for more accurate percentile calculations.
|
||||||
|
# TYPE http_request_duration_highr_seconds_created gauge
|
||||||
|
http_request_duration_highr_seconds_created 1.7798129474325876e+09
|
||||||
|
# HELP http_request_duration_seconds Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
|
||||||
|
# TYPE http_request_duration_seconds histogram
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="0.1",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="0.5",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="1.0",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/models",le="+Inf",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_count{handler="/v1/models",method="GET"} 1.0
|
||||||
|
http_request_duration_seconds_sum{handler="/v1/models",method="GET"} 0.002513296960387379
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.1",method="POST"} 0.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="0.5",method="POST"} 0.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="1.0",method="POST"} 0.0
|
||||||
|
http_request_duration_seconds_bucket{handler="/v1/chat/completions",le="+Inf",method="POST"} 392.0
|
||||||
|
http_request_duration_seconds_count{handler="/v1/chat/completions",method="POST"} 392.0
|
||||||
|
http_request_duration_seconds_sum{handler="/v1/chat/completions",method="POST"} 2166.0409893118776
|
||||||
|
# HELP http_request_duration_seconds_created Latency with only few buckets by handler. Made to be only used if aggregation by handler is important.
|
||||||
|
# TYPE http_request_duration_seconds_created gauge
|
||||||
|
http_request_duration_seconds_created{handler="/v1/models",method="GET"} 1.779812948464476e+09
|
||||||
|
http_request_duration_seconds_created{handler="/v1/chat/completions",method="POST"} 1.7798129572480347e+09
|
||||||
@@ -0,0 +1,392 @@
|
|||||||
|
{"req_id": "f9f8f44fd4964565", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382084485315880, "t_first_token_ns": 382085751985740, "t_last_token_ns": 382092234447772, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "36bbb6c17a024be2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382084854634864, "t_first_token_ns": 382085756582157, "t_last_token_ns": 382092249407424, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "accbc730595b4e03", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382084846909405, "t_first_token_ns": 382085756468395, "t_last_token_ns": 382092249666342, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "95ac8f889eb69b61", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382086017811149, "t_first_token_ns": 382086278898625, "t_last_token_ns": 382092888697351, "prompt_tokens": 4026, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "70cae603829190e5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382086255135893, "t_first_token_ns": 382086524187375, "t_last_token_ns": 382092918848009, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "2769a82161526fa2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382086561686502, "t_first_token_ns": 382086831843459, "t_last_token_ns": 382093038356375, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "7a3ae284fd025e54", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382086706452936, "t_first_token_ns": 382087076121991, "t_last_token_ns": 382093067217516, "prompt_tokens": 3951, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8c7ebece6fc8ef96", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382087263620291, "t_first_token_ns": 382087535250568, "t_last_token_ns": 382093356817425, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "291192528bfcd0d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382087471206537, "t_first_token_ns": 382088039840023, "t_last_token_ns": 382093380530301, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "8a6e9ed8388bab88", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382087381525191, "t_first_token_ns": 382088040653266, "t_last_token_ns": 382093381150039, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "29aa5369f15ef8ed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382087850098477, "t_first_token_ns": 382088237260581, "t_last_token_ns": 382093391828376, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "7d168f7f34914df8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382088401317115, "t_first_token_ns": 382088675156747, "t_last_token_ns": 382093553597941, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "5949b30b5660db44", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382088530166550, "t_first_token_ns": 382088927487832, "t_last_token_ns": 382093570191025, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "dee2814e53c1133c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382089067684210, "t_first_token_ns": 382089347288505, "t_last_token_ns": 382093680939389, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "d3d6d687b1270d0c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382089360858174, "t_first_token_ns": 382089632276646, "t_last_token_ns": 382093709089784, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "0114c7c017f510dc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382091475713376, "t_first_token_ns": 382091750427712, "t_last_token_ns": 382095470175502, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "6b23ad60e8365056", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382091540474798, "t_first_token_ns": 382092011849654, "t_last_token_ns": 382095487909354, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "c147b43b07e42731", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382094105745222, "t_first_token_ns": 382094360416437, "t_last_token_ns": 382098653227539, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "d6e4d018b0cbaea9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382094460514148, "t_first_token_ns": 382094722942114, "t_last_token_ns": 382098876304081, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "6a3c0ae558f764fa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382094928261563, "t_first_token_ns": 382095189710499, "t_last_token_ns": 382099498627857, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "6f4afc5b3b7d8068", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382095035839055, "t_first_token_ns": 382095433479107, "t_last_token_ns": 382099519308502, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "e2a3bc5ca5dd8e06", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382095882568896, "t_first_token_ns": 382096149618438, "t_last_token_ns": 382100119691456, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "750b6e6f651a9619", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382096023573828, "t_first_token_ns": 382096389932985, "t_last_token_ns": 382100140145528, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "088884483a9971d8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382096303429046, "t_first_token_ns": 382096850723713, "t_last_token_ns": 382100156576034, "prompt_tokens": 4025, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "318a0f6dc8a5806d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382096320117360, "t_first_token_ns": 382096850558433, "t_last_token_ns": 382100156738283, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "60fa44dd296e868f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382096597293605, "t_first_token_ns": 382097098413040, "t_last_token_ns": 382100167935492, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "be5160126f3a14e8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382098881665428, "t_first_token_ns": 382099182823989, "t_last_token_ns": 382100932027749, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "31994709d9d43ecc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382101452253340, "t_first_token_ns": 382101705031877, "t_last_token_ns": 382103653661298, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "ebb3f3698c984adb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382102362695450, "t_first_token_ns": 382102617993097, "t_last_token_ns": 382104765107223, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "a0f7008353f8beb2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382102366075418, "t_first_token_ns": 382102852445112, "t_last_token_ns": 382104771161161, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "7eac0cde0fd34276", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382104245078173, "t_first_token_ns": 382104503092731, "t_last_token_ns": 382106080452930, "prompt_tokens": 4019, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "95601ae65eae9166", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382105609930451, "t_first_token_ns": 382105865820742, "t_last_token_ns": 382108479137977, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "189d08c7b9ae2624", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382106592854963, "t_first_token_ns": 382106846927268, "t_last_token_ns": 382111011314112, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "607beffbb7acca39", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382106725242270, "t_first_token_ns": 382107302770531, "t_last_token_ns": 382111031111692, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "93918e85acdf8b5d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382106795960784, "t_first_token_ns": 382107303167749, "t_last_token_ns": 382111031731486, "prompt_tokens": 4052, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "de6c910bd8be862f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382107155362079, "t_first_token_ns": 382107545253073, "t_last_token_ns": 382111049499791, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "776a9724799617aa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382108601027573, "t_first_token_ns": 382108863203989, "t_last_token_ns": 382112459442313, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "b6357510a726d24b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382109178368627, "t_first_token_ns": 382109445074511, "t_last_token_ns": 382113049635333, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "7814350e435d2264", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382110068747851, "t_first_token_ns": 382110335361810, "t_last_token_ns": 382113760511997, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "c354ffa98aea604a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382110246872779, "t_first_token_ns": 382110582645427, "t_last_token_ns": 382113776093718, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "b95654a80317948f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382110741663414, "t_first_token_ns": 382111010987856, "t_last_token_ns": 382114375439940, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "d3983c9d6e42e3df", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382111566724910, "t_first_token_ns": 382111832273757, "t_last_token_ns": 382114862593646, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "f12e1b2630a2dfcc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382112728721768, "t_first_token_ns": 382112997261704, "t_last_token_ns": 382116098452170, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "26aa2698c0cddf5a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382113826064659, "t_first_token_ns": 382114085296890, "t_last_token_ns": 382117051979504, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "1dd5dcf6889436bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382114003523637, "t_first_token_ns": 382114329516196, "t_last_token_ns": 382117067193867, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "55f20a37c1e89743", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382115381130284, "t_first_token_ns": 382115639648893, "t_last_token_ns": 382119382645818, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "e013af03a51488e2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382115544317102, "t_first_token_ns": 382115881855274, "t_last_token_ns": 382119401842605, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4fefe92b23621583", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382116624370456, "t_first_token_ns": 382116887620494, "t_last_token_ns": 382120316201161, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "5af9d220c4dd9318", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382117164782628, "t_first_token_ns": 382117420646491, "t_last_token_ns": 382121312403234, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "a3ff67ad251967cf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382117302283549, "t_first_token_ns": 382117663368691, "t_last_token_ns": 382121329907934, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "c9258f74cc827511", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382117895621904, "t_first_token_ns": 382118163339413, "t_last_token_ns": 382121605409252, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "140eb88bb615ba37", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382118382848752, "t_first_token_ns": 382118646417347, "t_last_token_ns": 382121814874557, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "d4a164b2ff3b3509", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382120507346979, "t_first_token_ns": 382120765805217, "t_last_token_ns": 382123161869622, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "9f1266f29a9e8b28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382120536346207, "t_first_token_ns": 382121002154595, "t_last_token_ns": 382123167727494, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "57c78bedb1e5433a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382120656527505, "t_first_token_ns": 382121237067439, "t_last_token_ns": 382123171579404, "prompt_tokens": 3957, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "08ff3938b9e3ae71", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382123385556601, "t_first_token_ns": 382123637959122, "t_last_token_ns": 382127170763520, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "fc976d1e97a41cca", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382124575913113, "t_first_token_ns": 382124831128749, "t_last_token_ns": 382129509596960, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "6c45ae3b99de1e62", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382124685304696, "t_first_token_ns": 382125071733865, "t_last_token_ns": 382129532983141, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "417d200da6fc7d82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382124859602337, "t_first_token_ns": 382125318069397, "t_last_token_ns": 382129555520868, "prompt_tokens": 4042, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "2b45e140974d73b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382125376483769, "t_first_token_ns": 382125633771501, "t_last_token_ns": 382129669787344, "prompt_tokens": 3948, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4809e923113cfa22", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382125490789770, "t_first_token_ns": 382125879482814, "t_last_token_ns": 382129689298424, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "b80243b2ea980e07", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382125786666385, "t_first_token_ns": 382126338815361, "t_last_token_ns": 382129706104856, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "1baef27c84916aa0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382125751949133, "t_first_token_ns": 382126339351486, "t_last_token_ns": 382129706277861, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "efd5183f824026cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382126616017958, "t_first_token_ns": 382126885654669, "t_last_token_ns": 382129879944142, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "df3338f95a55a940", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382126671637574, "t_first_token_ns": 382127130961013, "t_last_token_ns": 382129888643028, "prompt_tokens": 3921, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "973d64287adcd8a7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382130154531040, "t_first_token_ns": 382130400867231, "t_last_token_ns": 382131621211341, "prompt_tokens": 3934, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "19674684ced3de91", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382131844141889, "t_first_token_ns": 382132095400336, "t_last_token_ns": 382133641527953, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "e6fa68dd3de56239", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382132881184723, "t_first_token_ns": 382133139468625, "t_last_token_ns": 382135430175003, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "ec06f5beb70436d9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382133868534695, "t_first_token_ns": 382134124890189, "t_last_token_ns": 382137703558917, "prompt_tokens": 4040, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "034f1151aa2d2bdd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382134174927085, "t_first_token_ns": 382134431228329, "t_last_token_ns": 382137856822810, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "d156973f3cae174d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382134932527817, "t_first_token_ns": 382135193457977, "t_last_token_ns": 382139144533899, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "c2a0107acab59bd0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382135888587433, "t_first_token_ns": 382136144211615, "t_last_token_ns": 382140297080035, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "25faaab088637291", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382136326035620, "t_first_token_ns": 382136587916047, "t_last_token_ns": 382141063089526, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4a6ff2135636c100", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382136368689957, "t_first_token_ns": 382136833523524, "t_last_token_ns": 382141087519173, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "70f1b6f6bad6005a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382137026103851, "t_first_token_ns": 382137294678583, "t_last_token_ns": 382141333527505, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "485e723c2b84eb6b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382137289910024, "t_first_token_ns": 382137554990236, "t_last_token_ns": 382141362746117, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "6be120b83e225aa3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382137918445148, "t_first_token_ns": 382138190076505, "t_last_token_ns": 382141924574841, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "291a0df34911423c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382138282808798, "t_first_token_ns": 382138556098766, "t_last_token_ns": 382142015329705, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "9a771254d6418297", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382140315586420, "t_first_token_ns": 382140584545283, "t_last_token_ns": 382144385358364, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "3d63e1d817747a07", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382140466303138, "t_first_token_ns": 382140835388158, "t_last_token_ns": 382144405051264, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "a85143fddaf4a5d9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382141443044504, "t_first_token_ns": 382141698075190, "t_last_token_ns": 382145026911126, "prompt_tokens": 3953, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "ed998b60c04d0117", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382142243200480, "t_first_token_ns": 382142506026607, "t_last_token_ns": 382147980017354, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "10c050e2cad99a17", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382142786760294, "t_first_token_ns": 382143046239974, "t_last_token_ns": 382148808485648, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "03bc3ccba18fc900", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382142843582082, "t_first_token_ns": 382143510409309, "t_last_token_ns": 382148835792674, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "6ca800f53f978a58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382142965009435, "t_first_token_ns": 382143510700208, "t_last_token_ns": 382148836070146, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "0d6ebbadc6b6f44a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382145245720490, "t_first_token_ns": 382145510468100, "t_last_token_ns": 382152509489934, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "2225316b65c25005", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382145422737803, "t_first_token_ns": 382145758585647, "t_last_token_ns": 382152539934260, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "48f171c96419c86f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382145543044038, "t_first_token_ns": 382146005415056, "t_last_token_ns": 382152571890884, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "53e2a3f9153c46d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382145770431456, "t_first_token_ns": 382146240838887, "t_last_token_ns": 382152587726945, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8f000c5b2b8ac58e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382146284550460, "t_first_token_ns": 382146558571855, "t_last_token_ns": 382152703280461, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "601066832f30b428", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382146774467284, "t_first_token_ns": 382147047268204, "t_last_token_ns": 382153033798108, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "bf6ab969a046d65b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382146780852586, "t_first_token_ns": 382147499518858, "t_last_token_ns": 382153047446499, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "d417d380a4f40330", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382146777761158, "t_first_token_ns": 382147499719427, "t_last_token_ns": 382153047956971, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "d23fe5d3ae909682", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382147238846239, "t_first_token_ns": 382147968347685, "t_last_token_ns": 382153070669400, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "08d7416f17740d2e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382147375689078, "t_first_token_ns": 382147968700737, "t_last_token_ns": 382153070806834, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "f63c4d00aca7e8d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382148000174672, "t_first_token_ns": 382148273597339, "t_last_token_ns": 382153360298420, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "72e028e252e47e71", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382149068456835, "t_first_token_ns": 382149348554144, "t_last_token_ns": 382154443030601, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "90e18fcccb2043f7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382149136513746, "t_first_token_ns": 382149600864577, "t_last_token_ns": 382154466234174, "prompt_tokens": 3956, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "7e9fdd59228f0876", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382149606281138, "t_first_token_ns": 382149886196466, "t_last_token_ns": 382154505159382, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "245bf7e5865d03d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382149954192620, "t_first_token_ns": 382150233950827, "t_last_token_ns": 382154585202689, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "2033b1bb6c32c79a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382150996777505, "t_first_token_ns": 382151279633180, "t_last_token_ns": 382155011561233, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "0f0a89285f54e2c1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382153062097748, "t_first_token_ns": 382153328237737, "t_last_token_ns": 382159693572019, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "da6f35b293578e17", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382153422066955, "t_first_token_ns": 382153692956220, "t_last_token_ns": 382159892850728, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "6c160ab03b66d81b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382153608576562, "t_first_token_ns": 382153942415850, "t_last_token_ns": 382160157467343, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "acad4eb8b9a0e9f6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382155014185856, "t_first_token_ns": 382155276645095, "t_last_token_ns": 382163987065268, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "502a12aa8ba51234", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382155033067481, "t_first_token_ns": 382155512278400, "t_last_token_ns": 382164008874723, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "c0e0a7d3c8cf4b95", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382155300992731, "t_first_token_ns": 382155758956878, "t_last_token_ns": 382164052573925, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "541738310001dd1c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382155636772096, "t_first_token_ns": 382156002850078, "t_last_token_ns": 382164086851692, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "525463c05dc42958", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382155769330470, "t_first_token_ns": 382156238617491, "t_last_token_ns": 382164103208348, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "3926bdb966d6e395", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382156529924106, "t_first_token_ns": 382156795118967, "t_last_token_ns": 382164859875757, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "586a0d3540151470", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382157017897104, "t_first_token_ns": 382157281724702, "t_last_token_ns": 382165249594661, "prompt_tokens": 3929, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "37c5cdb56477a163", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382157209281472, "t_first_token_ns": 382157533807262, "t_last_token_ns": 382165509376109, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "5947be4f56ddbab5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382157565789420, "t_first_token_ns": 382157841926331, "t_last_token_ns": 382165859469454, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "2fe62cb82ee987a4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382158351514622, "t_first_token_ns": 382158623425098, "t_last_token_ns": 382167223583208, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "8c278419850c8b5e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382158614234491, "t_first_token_ns": 382158892966968, "t_last_token_ns": 382167480594409, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "021ff28eaacd14c5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382158662162337, "t_first_token_ns": 382159133444765, "t_last_token_ns": 382167497314962, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "faaf800e58c8f85f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382159074263310, "t_first_token_ns": 382159388624067, "t_last_token_ns": 382167535060392, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "744f358ad7f63e97", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382159219646231, "t_first_token_ns": 382159642125878, "t_last_token_ns": 382167563663637, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "927918347bcc40c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382159867767929, "t_first_token_ns": 382160155490922, "t_last_token_ns": 382167882727509, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "2fcef28ebf707351", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382160018644036, "t_first_token_ns": 382160622703153, "t_last_token_ns": 382168139630832, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "fe233f2c57e680fd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382160117624464, "t_first_token_ns": 382160623811460, "t_last_token_ns": 382168140619284, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "b95acf9eb3705d28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382161614049847, "t_first_token_ns": 382161911634388, "t_last_token_ns": 382170238228998, "prompt_tokens": 4011, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "96b58f314a838835", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382161882357875, "t_first_token_ns": 382162173383565, "t_last_token_ns": 382170273284320, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "08456882f8d362d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382162114090871, "t_first_token_ns": 382162432411293, "t_last_token_ns": 382170301380722, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "4c8bd15a715f7720", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382162631449889, "t_first_token_ns": 382162928978333, "t_last_token_ns": 382170490281703, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "6949ec56d00ce8b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382163420315977, "t_first_token_ns": 382163714525969, "t_last_token_ns": 382171352639512, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 22, "error": null}
|
||||||
|
{"req_id": "0fce2da54ec51dfd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382164162116760, "t_first_token_ns": 382164449957180, "t_last_token_ns": 382172239925040, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "4238f6957ae36e57", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382165230181175, "t_first_token_ns": 382165510156058, "t_last_token_ns": 382174169547343, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "943ff34258d40d28", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382165432201314, "t_first_token_ns": 382165764227907, "t_last_token_ns": 382174210119649, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "dfca62202759aa8a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382165803829344, "t_first_token_ns": 382166083320567, "t_last_token_ns": 382174292923771, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "4397f621098474f5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382166106375621, "t_first_token_ns": 382166400274028, "t_last_token_ns": 382174375244987, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "beebd08c7cc682ce", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382166591831129, "t_first_token_ns": 382166882526737, "t_last_token_ns": 382174647993158, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "7c168a9454ca5620", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382167161160421, "t_first_token_ns": 382167447364672, "t_last_token_ns": 382174942751314, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "affa6c57b9777216", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382167822972670, "t_first_token_ns": 382168102677870, "t_last_token_ns": 382175570182132, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "e5ffcb3c7f664dc7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382167870353217, "t_first_token_ns": 382168359531073, "t_last_token_ns": 382175599481749, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "a024c6b1441ebbd3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382168180794345, "t_first_token_ns": 382168615046947, "t_last_token_ns": 382175631200427, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "17af21e67df524a9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382169003040547, "t_first_token_ns": 382169290554765, "t_last_token_ns": 382176280489960, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "80ff547317c08b98", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382169028258376, "t_first_token_ns": 382169531758895, "t_last_token_ns": 382176294013489, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "70a5b87cfadecff6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382169943431963, "t_first_token_ns": 382170222748100, "t_last_token_ns": 382176877992402, "prompt_tokens": 3945, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "843c0165f72929a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382170717602609, "t_first_token_ns": 382170991417567, "t_last_token_ns": 382177388384436, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "73b4128afce8e243", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382171060540918, "t_first_token_ns": 382171339560239, "t_last_token_ns": 382177506575857, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "205e89d222fdda00", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382171458987421, "t_first_token_ns": 382171728840309, "t_last_token_ns": 382177666134634, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "7117082172557ca5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382171664441326, "t_first_token_ns": 382171984200978, "t_last_token_ns": 382177690516908, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "7cc19a30a453f5f4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382172556410858, "t_first_token_ns": 382172843986587, "t_last_token_ns": 382178155208995, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "8ba8086b8eec10d6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382172994104335, "t_first_token_ns": 382173271591898, "t_last_token_ns": 382178289834991, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "ec1a4c4a2ea7ed12", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382173221073726, "t_first_token_ns": 382173744591784, "t_last_token_ns": 382178309642689, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "6b0f100f2f9945ac", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382173199924494, "t_first_token_ns": 382173744158256, "t_last_token_ns": 382178309921433, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "768828307182d0ad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382173504248810, "t_first_token_ns": 382174004761140, "t_last_token_ns": 382178326065050, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "034e749c721e456f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382175197041765, "t_first_token_ns": 382175471363162, "t_last_token_ns": 382180696818257, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "73a8827fa8847e13", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382175949308654, "t_first_token_ns": 382176225537422, "t_last_token_ns": 382181123225967, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "bbcdf2009f998102", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382176605208379, "t_first_token_ns": 382176877581484, "t_last_token_ns": 382181469659040, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "9fa8ae1957c0f1d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382178416433437, "t_first_token_ns": 382178669969435, "t_last_token_ns": 382183430963715, "prompt_tokens": 3937, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "b11891ac95fc3b64", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382178438073962, "t_first_token_ns": 382178904163248, "t_last_token_ns": 382183442473126, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "59a9a00a0d3a1563", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382179053970222, "t_first_token_ns": 382179321413429, "t_last_token_ns": 382184133136282, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "d8bfc0767d724848", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382179193547844, "t_first_token_ns": 382179562446906, "t_last_token_ns": 382184385847973, "prompt_tokens": 3937, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "345d39ae3777d415", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382179531917382, "t_first_token_ns": 382179810488747, "t_last_token_ns": 382184406996473, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "9866cbc828a4fca8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382179667191362, "t_first_token_ns": 382180058895485, "t_last_token_ns": 382184428709692, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "af397ebf951c7719", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382179954066160, "t_first_token_ns": 382180308317944, "t_last_token_ns": 382184446591780, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "65c0b4265eb5dd19", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382181499593078, "t_first_token_ns": 382181767061473, "t_last_token_ns": 382186889749065, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "89dffa88c862c5b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382182361462319, "t_first_token_ns": 382182625455239, "t_last_token_ns": 382189019116200, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "dc476408b33fbf5a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382183610794544, "t_first_token_ns": 382183883955404, "t_last_token_ns": 382192272274136, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "319a82c9b4681316", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382183664074066, "t_first_token_ns": 382184133404454, "t_last_token_ns": 382192312585207, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "71d7c157bf1ea669", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382183903860832, "t_first_token_ns": 382184376240888, "t_last_token_ns": 382192334761008, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "2cc13ac9d11806a5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382184926866005, "t_first_token_ns": 382185201824699, "t_last_token_ns": 382193981317819, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "129148915a429e01", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382184955411982, "t_first_token_ns": 382185432923928, "t_last_token_ns": 382194002762124, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "df7587e4a3d50532", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382185096858110, "t_first_token_ns": 382185881088756, "t_last_token_ns": 382194019141847, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8b0e7810ec937f82", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382185154085449, "t_first_token_ns": 382185881181943, "t_last_token_ns": 382194019333715, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "fb520fc29874b954", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382186288931232, "t_first_token_ns": 382186557093785, "t_last_token_ns": 382195672501852, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "dd803dc1b0f0a741", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382186465630886, "t_first_token_ns": 382186809713643, "t_last_token_ns": 382195713983940, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "93327e2e45a9ea52", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382186834740379, "t_first_token_ns": 382187108739879, "t_last_token_ns": 382195799397624, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "f37f16b87565dc53", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382186845106124, "t_first_token_ns": 382187347272150, "t_last_token_ns": 382195815554352, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "3bcfa41060f2dd80", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382186926327941, "t_first_token_ns": 382187586073595, "t_last_token_ns": 382195831555122, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "5badec87742c1dc2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382187408177011, "t_first_token_ns": 382187840621135, "t_last_token_ns": 382195861307988, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "d1e7b052e89f38f0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382188453345563, "t_first_token_ns": 382188738586177, "t_last_token_ns": 382197289823823, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "36a41522c4af841a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382188733332937, "t_first_token_ns": 382189006922767, "t_last_token_ns": 382197344116266, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "56afd7a006446ce0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382188990746952, "t_first_token_ns": 382189273460023, "t_last_token_ns": 382197387052714, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "1ec26802903d0510", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382189020131877, "t_first_token_ns": 382189511203860, "t_last_token_ns": 382197400723372, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "16ab6647bde110e0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382189199081819, "t_first_token_ns": 382189965760901, "t_last_token_ns": 382197415670374, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "a642c8cc74bcd308", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382189090754451, "t_first_token_ns": 382189964656055, "t_last_token_ns": 382197415797903, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "ab5eaa452796e93b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382189354201545, "t_first_token_ns": 382190205438099, "t_last_token_ns": 382197428708232, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "57388896a658fda9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382189917526905, "t_first_token_ns": 382190450077996, "t_last_token_ns": 382197441930889, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "74e18421d9a9571b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382190215201021, "t_first_token_ns": 382190711418045, "t_last_token_ns": 382197469232700, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 22, "error": null}
|
||||||
|
{"req_id": "51627206c78da939", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382192752654125, "t_first_token_ns": 382193045592207, "t_last_token_ns": 382199829188993, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "ce0e4f081e4c436a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382192976393814, "t_first_token_ns": 382193310053865, "t_last_token_ns": 382199857315482, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "9826eaf60529d07a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382194270660231, "t_first_token_ns": 382194547882750, "t_last_token_ns": 382201701909232, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "f06c8516cb43f569", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382194335191820, "t_first_token_ns": 382194811176438, "t_last_token_ns": 382201730094284, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "a2fcaaf1ad76b0b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382194836282261, "t_first_token_ns": 382195116206158, "t_last_token_ns": 382201803203035, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "0edc9b51f130a612", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382195368740645, "t_first_token_ns": 382195653479024, "t_last_token_ns": 382202018525566, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "ab79c0a9dc372db9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382196083110703, "t_first_token_ns": 382196359652940, "t_last_token_ns": 382202449697183, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "284610eaafd0e42a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382196435574411, "t_first_token_ns": 382196722381601, "t_last_token_ns": 382202582104081, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "89cb2a334a9f6238", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382196931100841, "t_first_token_ns": 382197207709370, "t_last_token_ns": 382203010566186, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "7ea2d46c75d8fe9a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382197934695072, "t_first_token_ns": 382198207993298, "t_last_token_ns": 382204087960501, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "024f2a25db9d388f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382198044809734, "t_first_token_ns": 382198460255925, "t_last_token_ns": 382204115940149, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "4bff469af089d526", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382198551418465, "t_first_token_ns": 382198828331830, "t_last_token_ns": 382204470998169, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "700bf9b5ddd3a040", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382199143308149, "t_first_token_ns": 382199425761836, "t_last_token_ns": 382204776586852, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "56c8618490819588", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382200158437145, "t_first_token_ns": 382200429219426, "t_last_token_ns": 382206610744431, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "897a93d4897b76cf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382200278100137, "t_first_token_ns": 382200684269775, "t_last_token_ns": 382206638146108, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "9d6d324a99e130fb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382200636238899, "t_first_token_ns": 382201150640347, "t_last_token_ns": 382206666403827, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "fa28b6fdceb04b0a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382200653602709, "t_first_token_ns": 382201151055854, "t_last_token_ns": 382206666630685, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "2b3fcd354a2135a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382201378628999, "t_first_token_ns": 382201661105558, "t_last_token_ns": 382208056226905, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "e29b1c49a4e6562a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382202729634059, "t_first_token_ns": 382203010114269, "t_last_token_ns": 382209145628336, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "729dbd22ce5934ca", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382203021873189, "t_first_token_ns": 382203290339273, "t_last_token_ns": 382209201681417, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "150d24a23cc28808", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382204152434994, "t_first_token_ns": 382204428915616, "t_last_token_ns": 382210818961899, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "1247310989ae8f13", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382204868864561, "t_first_token_ns": 382205136220226, "t_last_token_ns": 382211428871526, "prompt_tokens": 3967, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "e40763fc59ed0799", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382204881170636, "t_first_token_ns": 382205374451179, "t_last_token_ns": 382211442251967, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "a0ed26ee4cdfed05", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382205305886748, "t_first_token_ns": 382205841941111, "t_last_token_ns": 382211470050233, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "7db7d42bc0b6e636", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382205337379117, "t_first_token_ns": 382205843033175, "t_last_token_ns": 382211470970408, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "c81fd9769ec09a5f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382205589339335, "t_first_token_ns": 382206093690937, "t_last_token_ns": 382211492058195, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "8a5e9c377ef0f53c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382206665758791, "t_first_token_ns": 382206941354350, "t_last_token_ns": 382214383505479, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "3bb8df688946d953", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382206755281174, "t_first_token_ns": 382207192472676, "t_last_token_ns": 382214420082876, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "448dfb7bcf90f1f3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382207311175673, "t_first_token_ns": 382207579731863, "t_last_token_ns": 382215073238782, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "33ee8dfa5d511f36", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382207350457148, "t_first_token_ns": 382207815485763, "t_last_token_ns": 382215091566276, "prompt_tokens": 3947, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "fe73928ec06f2ffd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382207428695980, "t_first_token_ns": 382208056417117, "t_last_token_ns": 382215106284349, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "b3b98aab260b94f5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382209456038643, "t_first_token_ns": 382209735842187, "t_last_token_ns": 382216670868033, "prompt_tokens": 3962, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "0616a2c3128fe7ad", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382209524353675, "t_first_token_ns": 382210204453851, "t_last_token_ns": 382216700436726, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "ea50a73400df1150", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382209653218229, "t_first_token_ns": 382210204820040, "t_last_token_ns": 382216700686594, "prompt_tokens": 4063, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "4608dc5a783e53a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382211530198960, "t_first_token_ns": 382211798959182, "t_last_token_ns": 382217995483399, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "d2629c04025d1ad2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382211777756263, "t_first_token_ns": 382212047899603, "t_last_token_ns": 382218022826439, "prompt_tokens": 3959, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "871a29b70510a5f2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382211942268855, "t_first_token_ns": 382212512638118, "t_last_token_ns": 382218050169237, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "78ec14117dc5c5e0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382212010734151, "t_first_token_ns": 382212513442145, "t_last_token_ns": 382218050665050, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "83b69b3a671f955d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382212473143542, "t_first_token_ns": 382212766009828, "t_last_token_ns": 382218072846481, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "7174a671812eb138", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382212513915268, "t_first_token_ns": 382213007341635, "t_last_token_ns": 382218116461094, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "51ff885b54684fc5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382212680827271, "t_first_token_ns": 382213246437014, "t_last_token_ns": 382218333747392, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "37f77dec01a5a091", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382212849598239, "t_first_token_ns": 382213486182778, "t_last_token_ns": 382218337218911, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "280e5ba3748db94d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382213761471408, "t_first_token_ns": 382214039964989, "t_last_token_ns": 382218557306081, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "e4a29ae517fc90cb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382213881609836, "t_first_token_ns": 382214300688822, "t_last_token_ns": 382218572178840, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "191f846cb9f66a8d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382214474097639, "t_first_token_ns": 382214753987221, "t_last_token_ns": 382218673528080, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "35f7d27c1cba45d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382214659257602, "t_first_token_ns": 382215015590573, "t_last_token_ns": 382218685133468, "prompt_tokens": 4008, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "7c00e83d6e1dd0e6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382218051462410, "t_first_token_ns": 382218333647381, "t_last_token_ns": 382219702300725, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "3d4d2bb9bff4449a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382221166680340, "t_first_token_ns": 382221418232551, "t_last_token_ns": 382222635150182, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "7272725ee8fd0540", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382222884018197, "t_first_token_ns": 382223136178690, "t_last_token_ns": 382224353706433, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "a2fea85d23f1bda3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382224420641376, "t_first_token_ns": 382224672337337, "t_last_token_ns": 382226325895727, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "011158111ca19afe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382224959521340, "t_first_token_ns": 382225213382835, "t_last_token_ns": 382226635465143, "prompt_tokens": 3987, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "8da37110844555c6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382226741248835, "t_first_token_ns": 382226992920493, "t_last_token_ns": 382229323314112, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "d8b99c661f516313", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382227423181356, "t_first_token_ns": 382227679988954, "t_last_token_ns": 382230776778108, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "a02c08c718952117", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382227539642691, "t_first_token_ns": 382227918674657, "t_last_token_ns": 382230795745437, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "85285854a309da38", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382228419136250, "t_first_token_ns": 382228682408216, "t_last_token_ns": 382231728912764, "prompt_tokens": 4001, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "7ddc31684732a450", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382229681891401, "t_first_token_ns": 382229944481851, "t_last_token_ns": 382234275268214, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "536791c815121f58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382229692195410, "t_first_token_ns": 382230180918053, "t_last_token_ns": 382234293959653, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "8576f22a0c427bbf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382229837600895, "t_first_token_ns": 382230416855636, "t_last_token_ns": 382234298990114, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "ca99a4a5056f40d2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382230860534607, "t_first_token_ns": 382231118860753, "t_last_token_ns": 382234807481914, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "5838d0fe27e1457f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382231835313652, "t_first_token_ns": 382232094063147, "t_last_token_ns": 382236260213053, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "816823128084eebd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382232384537271, "t_first_token_ns": 382232642912715, "t_last_token_ns": 382236632788270, "prompt_tokens": 3932, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "bb0921b5e49295ee", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382232472077972, "t_first_token_ns": 382232888080797, "t_last_token_ns": 382236651960314, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "346cf978d3834bc0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382232678229522, "t_first_token_ns": 382233348913731, "t_last_token_ns": 382236671555061, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "c886d982435860b7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382232858052076, "t_first_token_ns": 382233349466266, "t_last_token_ns": 382236671992368, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "fdbc4ac00b838337", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382235486795830, "t_first_token_ns": 382235755469859, "t_last_token_ns": 382239539887610, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "fa46f37dea905ba5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382235657639879, "t_first_token_ns": 382236001757374, "t_last_token_ns": 382239557882032, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "3e44df05b71a3966", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382235968214739, "t_first_token_ns": 382236251599184, "t_last_token_ns": 382239577233326, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "8ee00ef0ad273d5c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382236747690994, "t_first_token_ns": 382237010170939, "t_last_token_ns": 382239991166370, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "abfce48fa7fd10f9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382237473573203, "t_first_token_ns": 382237731677118, "t_last_token_ns": 382240710895573, "prompt_tokens": 3927, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "4f100560326929ab", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382238025183856, "t_first_token_ns": 382238287784436, "t_last_token_ns": 382240956308495, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "f83672c4afe7b491", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382238727193197, "t_first_token_ns": 382238989046663, "t_last_token_ns": 382241255073454, "prompt_tokens": 3975, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8683501cd7f36932", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382240062713183, "t_first_token_ns": 382240322846834, "t_last_token_ns": 382242542706069, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "3ce62a1f03d2a106", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382241698518116, "t_first_token_ns": 382241955883205, "t_last_token_ns": 382244507877645, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "34360b76a6f05a1c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382241796530824, "t_first_token_ns": 382242408126523, "t_last_token_ns": 382244524309960, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "0b7dbbfb7b402e80", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382241775725566, "t_first_token_ns": 382242408235306, "t_last_token_ns": 382244524593196, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "9ca5d6660027739d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382243429474904, "t_first_token_ns": 382243689797564, "t_last_token_ns": 382245973300327, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "1fe7cbb011f5427a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382245037514759, "t_first_token_ns": 382245294339964, "t_last_token_ns": 382250194843484, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "dffdc06a4b36683c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382245343924635, "t_first_token_ns": 382245602114914, "t_last_token_ns": 382250387049265, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "caced4db24115828", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382245684488912, "t_first_token_ns": 382245959670002, "t_last_token_ns": 382250595073071, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "9b16f710bda9505d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382246455155122, "t_first_token_ns": 382246713116464, "t_last_token_ns": 382251855458673, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "cb7c10754f5ce5fe", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382246820467361, "t_first_token_ns": 382247079292015, "t_last_token_ns": 382252064431004, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "88ffbcb6bc7f56b1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382246899219593, "t_first_token_ns": 382247324912941, "t_last_token_ns": 382252087727441, "prompt_tokens": 4007, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "78e331e87c45a189", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382247090656834, "t_first_token_ns": 382247563741168, "t_last_token_ns": 382252100025111, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "456ef7ed125277b0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382247332250810, "t_first_token_ns": 382247810341060, "t_last_token_ns": 382252119050670, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "89f55bed6ffb548a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382247890167221, "t_first_token_ns": 382248154181193, "t_last_token_ns": 382252447222493, "prompt_tokens": 3969, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "3d3ab1fae05f0d2a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382248016358736, "t_first_token_ns": 382248404905181, "t_last_token_ns": 382252466210954, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "335bb26db7a2003b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382248886461930, "t_first_token_ns": 382249158934231, "t_last_token_ns": 382252778461417, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "dfbdddceda574c5e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382251220254559, "t_first_token_ns": 382251492349707, "t_last_token_ns": 382254858193212, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "59fcfa5724d8e7b3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382251545160317, "t_first_token_ns": 382251817740199, "t_last_token_ns": 382254943373623, "prompt_tokens": 4019, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "cfe7772de6d17d6b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382252108590582, "t_first_token_ns": 382252376840772, "t_last_token_ns": 382255427792897, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "59bba69a8d667e3e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382252783358215, "t_first_token_ns": 382253044197227, "t_last_token_ns": 382255861156576, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "108739472aeef7cd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382253850525340, "t_first_token_ns": 382254113938495, "t_last_token_ns": 382257348412417, "prompt_tokens": 3964, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "7912548face9904f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382254512630187, "t_first_token_ns": 382254777050148, "t_last_token_ns": 382257680083230, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "9ad61f308977eb83", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382254998337939, "t_first_token_ns": 382255257111090, "t_last_token_ns": 382258124056760, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "3491e86c0fd93ddb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382255849278374, "t_first_token_ns": 382256113740914, "t_last_token_ns": 382258672159473, "prompt_tokens": 4013, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "48e9daddae7ddfde", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382255866071677, "t_first_token_ns": 382256343072316, "t_last_token_ns": 382258678062715, "prompt_tokens": 3921, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "30336ab3d8b85b24", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382257740405991, "t_first_token_ns": 382258003695552, "t_last_token_ns": 382259717870390, "prompt_tokens": 4024, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "9a3fd26f54c1be0a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382259267040309, "t_first_token_ns": 382259520113618, "t_last_token_ns": 382264610527785, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "6b4b03f7f400b478", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382260056629156, "t_first_token_ns": 382260314996180, "t_last_token_ns": 382266945327221, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "79c09b32b572e89b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382260207966744, "t_first_token_ns": 382260552535667, "t_last_token_ns": 382267201658830, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "96d046514314461d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382260863772430, "t_first_token_ns": 382261121517694, "t_last_token_ns": 382269616456818, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "185f1dbdea6d23c0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382260883864322, "t_first_token_ns": 382261358981122, "t_last_token_ns": 382269635133280, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "1d800c9aabc32617", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382261223021937, "t_first_token_ns": 382261605036781, "t_last_token_ns": 382269675252671, "prompt_tokens": 4020, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "6c8d620872c9187d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382261386618982, "t_first_token_ns": 382261850355307, "t_last_token_ns": 382269733330878, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "48598702f53ca2f3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382261936113708, "t_first_token_ns": 382262207866977, "t_last_token_ns": 382270137410340, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "f2db9e461e72ad78", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382262129279460, "t_first_token_ns": 382262457281167, "t_last_token_ns": 382270170093876, "prompt_tokens": 4022, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "f80c2c460ec65e32", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382262194739319, "t_first_token_ns": 382262694545867, "t_last_token_ns": 382270188383183, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "ab3000c06ac776bf", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382263247959500, "t_first_token_ns": 382263527369075, "t_last_token_ns": 382271199385444, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "0bf5905c1e8121bd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382263441769801, "t_first_token_ns": 382263778485898, "t_last_token_ns": 382271229307297, "prompt_tokens": 3966, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "137d70d1f46c03a1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382264000930903, "t_first_token_ns": 382264483440102, "t_last_token_ns": 382271736094277, "prompt_tokens": 3988, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "1098d56d526989cc", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382263989145268, "t_first_token_ns": 382264483691940, "t_last_token_ns": 382271736277227, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "cebd470b51f7fe4e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382264554664854, "t_first_token_ns": 382264829425692, "t_last_token_ns": 382271857184862, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "a151e7b5b68ad9ae", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382265593367647, "t_first_token_ns": 382265878694985, "t_last_token_ns": 382272887094000, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "4eca794d265710c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382266619269328, "t_first_token_ns": 382266891337738, "t_last_token_ns": 382273890793576, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "dff220b2b2e49d3a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382266885962771, "t_first_token_ns": 382267165156399, "t_last_token_ns": 382273933294801, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "82efcb8a8e11e6c8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382266978474898, "t_first_token_ns": 382267421971357, "t_last_token_ns": 382273961573828, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "2b2b27c2bbba748c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382267606770870, "t_first_token_ns": 382267891744564, "t_last_token_ns": 382274201139814, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "22232cfe893b9166", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382267851804875, "t_first_token_ns": 382268151719835, "t_last_token_ns": 382274222764040, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "139fa79bf7516bed", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382268077307186, "t_first_token_ns": 382268410764409, "t_last_token_ns": 382274244527518, "prompt_tokens": 3980, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "d725af190f423c10", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382268235288591, "t_first_token_ns": 382268669529210, "t_last_token_ns": 382274266811916, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "1d3e84737b4620da", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382268568836316, "t_first_token_ns": 382268931323446, "t_last_token_ns": 382274290482621, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 21, "error": null}
|
||||||
|
{"req_id": "2375944c4224b637", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382269333588639, "t_first_token_ns": 382269616315075, "t_last_token_ns": 382274517324964, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 22, "error": null}
|
||||||
|
{"req_id": "657154f7593762d1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382269676971204, "t_first_token_ns": 382269955772343, "t_last_token_ns": 382274579582384, "prompt_tokens": 4032, "completion_tokens": 256, "inflight_at_send": 20, "error": null}
|
||||||
|
{"req_id": "4e50ce56d0618fec", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382270370930435, "t_first_token_ns": 382270646155427, "t_last_token_ns": 382274840321258, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "cea59338a3b0a9b3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382271240179357, "t_first_token_ns": 382271517760638, "t_last_token_ns": 382275102943935, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "f4dd02509a77bddd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382272348998420, "t_first_token_ns": 382272630061624, "t_last_token_ns": 382275456225311, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "0e7beafea7584ff7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382272913324164, "t_first_token_ns": 382273185025477, "t_last_token_ns": 382275560411765, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "b97b84e328b625d3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382276006639328, "t_first_token_ns": 382276256530418, "t_last_token_ns": 382278666436229, "prompt_tokens": 3974, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "651186d1dd6c15a6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382276471774276, "t_first_token_ns": 382276726335909, "t_last_token_ns": 382279030224506, "prompt_tokens": 4033, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "b6350e463220e4e6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382276985792776, "t_first_token_ns": 382277240696393, "t_last_token_ns": 382279322364134, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "358015cbb66d10de", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382277702966911, "t_first_token_ns": 382277962916964, "t_last_token_ns": 382279646475958, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "50705bd15064c6d4", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382280071652148, "t_first_token_ns": 382280324700297, "t_last_token_ns": 382281925876641, "prompt_tokens": 4006, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "ecb80a17721044a3", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382280878938876, "t_first_token_ns": 382281138710755, "t_last_token_ns": 382282509067958, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "41abb79477b47f91", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382282537804991, "t_first_token_ns": 382282788944025, "t_last_token_ns": 382285759281017, "prompt_tokens": 3983, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "721ebf229a030a95", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382282654081516, "t_first_token_ns": 382283024887566, "t_last_token_ns": 382285774665841, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "3a340945700f2c58", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382283020915717, "t_first_token_ns": 382283281575175, "t_last_token_ns": 382285814957053, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "2170390234b5ae2d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382283831925259, "t_first_token_ns": 382284092819349, "t_last_token_ns": 382288185747467, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "076476fea8382967", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382284712516382, "t_first_token_ns": 382284976692902, "t_last_token_ns": 382289902776001, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "ff52490f95172d67", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382285894483115, "t_first_token_ns": 382286155432594, "t_last_token_ns": 382292769390123, "prompt_tokens": 3996, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "e924d3bcedd387fb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382286101690378, "t_first_token_ns": 382286397210325, "t_last_token_ns": 382292801303990, "prompt_tokens": 3970, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "b877892f91ecec34", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382286354904928, "t_first_token_ns": 382286639601881, "t_last_token_ns": 382292831600255, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "1caf880a2fbf12e5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382286710224367, "t_first_token_ns": 382286977767075, "t_last_token_ns": 382293006976919, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "9a1cf60f1b871ded", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382287042791479, "t_first_token_ns": 382287306643213, "t_last_token_ns": 382293163095604, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "a95165328c12522a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382287108257219, "t_first_token_ns": 382287553709196, "t_last_token_ns": 382293190316188, "prompt_tokens": 3986, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "db3cb451a123fd7b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382287438582330, "t_first_token_ns": 382287801748655, "t_last_token_ns": 382293217578256, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "ec19d55d7bc56117", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382288124599871, "t_first_token_ns": 382288403964586, "t_last_token_ns": 382294315926875, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "45f17bcf193b6eb1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382289022270243, "t_first_token_ns": 382289289950672, "t_last_token_ns": 382296722833350, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "0a1426ca17d726a2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382289197880272, "t_first_token_ns": 382289541891230, "t_last_token_ns": 382296758146324, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "85f92136d0577f5c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382289844640819, "t_first_token_ns": 382290119573263, "t_last_token_ns": 382297620747299, "prompt_tokens": 3982, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "c68c8ac2cefadeda", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382290772189791, "t_first_token_ns": 382291044190610, "t_last_token_ns": 382298884126891, "prompt_tokens": 3976, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "39ec2bfef2cc660c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382290896269825, "t_first_token_ns": 382291557937052, "t_last_token_ns": 382298918583030, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "5c5ae6e010163893", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382290885824379, "t_first_token_ns": 382291558128261, "t_last_token_ns": 382298919800304, "prompt_tokens": 3960, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "bc830bbd57c31dfd", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382291013313524, "t_first_token_ns": 382291747448050, "t_last_token_ns": 382298932740540, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "f8dfa76d3272924c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382291771801775, "t_first_token_ns": 382292046115735, "t_last_token_ns": 382299229923989, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "6a0616efb0ee2261", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382293403776801, "t_first_token_ns": 382293668924944, "t_last_token_ns": 382300919064688, "prompt_tokens": 3977, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "9be8cd07b75dc5e7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382293731199801, "t_first_token_ns": 382294006155407, "t_last_token_ns": 382301042218103, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "213c753a2902d374", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382293981769734, "t_first_token_ns": 382294258539915, "t_last_token_ns": 382301299021090, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "4e27c91846d4e17d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382294641425973, "t_first_token_ns": 382294925554912, "t_last_token_ns": 382301779950306, "prompt_tokens": 3995, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "1b02d1736bed2783", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382294895722126, "t_first_token_ns": 382295177886367, "t_last_token_ns": 382301807617569, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "044505f55ee8eb29", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382294923363657, "t_first_token_ns": 382295417962976, "t_last_token_ns": 382301822151120, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "9b6468d2f524b689", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382295112754799, "t_first_token_ns": 382295660622279, "t_last_token_ns": 382301837146717, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "0576ac2b072d4f41", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382295725139653, "t_first_token_ns": 382296004582188, "t_last_token_ns": 382301950439627, "prompt_tokens": 3999, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "1904f89e3ffa195e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382296134260191, "t_first_token_ns": 382296413472717, "t_last_token_ns": 382302134773902, "prompt_tokens": 4002, "completion_tokens": 256, "inflight_at_send": 17, "error": null}
|
||||||
|
{"req_id": "278330e7e052f152", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382296419310614, "t_first_token_ns": 382296708528618, "t_last_token_ns": 382302640835174, "prompt_tokens": 3973, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "783382169706baf2", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382296752445880, "t_first_token_ns": 382297031024489, "t_last_token_ns": 382303135339820, "prompt_tokens": 4017, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "fd3f7b073d883ae7", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382296851397497, "t_first_token_ns": 382297289250588, "t_last_token_ns": 382303160499824, "prompt_tokens": 3981, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "16236450faab7663", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382297824640610, "t_first_token_ns": 382298107311304, "t_last_token_ns": 382304045809829, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 18, "error": null}
|
||||||
|
{"req_id": "f702da4e623d7579", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382298137321560, "t_first_token_ns": 382298416539465, "t_last_token_ns": 382304112896204, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "d9555469adb82bd6", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382298919125293, "t_first_token_ns": 382299203410033, "t_last_token_ns": 382304513657278, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 19, "error": null}
|
||||||
|
{"req_id": "47bdac1c5663cca9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382299479247967, "t_first_token_ns": 382299752385750, "t_last_token_ns": 382304758513211, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "af1d9e8359f64baa", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382301015398911, "t_first_token_ns": 382301299446540, "t_last_token_ns": 382305833963679, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 16, "error": null}
|
||||||
|
{"req_id": "b27e960445b0e807", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382302077132232, "t_first_token_ns": 382302353779344, "t_last_token_ns": 382306867355390, "prompt_tokens": 3993, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "3877cb03f6ea4711", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382302172810824, "t_first_token_ns": 382302601941942, "t_last_token_ns": 382307117973132, "prompt_tokens": 3955, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "cee027f56ae5f140", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382302573962987, "t_first_token_ns": 382303068443742, "t_last_token_ns": 382307138965145, "prompt_tokens": 4009, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "336f49e040184b00", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382302582340992, "t_first_token_ns": 382303068546107, "t_last_token_ns": 382307139098054, "prompt_tokens": 4005, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "9cc7dafc187ac5a5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382303521299090, "t_first_token_ns": 382303790669946, "t_last_token_ns": 382307775877186, "prompt_tokens": 3994, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "8363103077032c09", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382303698193977, "t_first_token_ns": 382304047062605, "t_last_token_ns": 382307795962936, "prompt_tokens": 4018, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "288e93bd46cfc9ea", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382306201788219, "t_first_token_ns": 382306465215344, "t_last_token_ns": 382309936320353, "prompt_tokens": 3984, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "9e4a220b3cf1ed89", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382306372622843, "t_first_token_ns": 382306713100234, "t_last_token_ns": 382309956502330, "prompt_tokens": 4000, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "d553f356441ca287", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382306846192108, "t_first_token_ns": 382307117674979, "t_last_token_ns": 382310085891182, "prompt_tokens": 4027, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "5c8b375f370a610c", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382307297425735, "t_first_token_ns": 382307565490133, "t_last_token_ns": 382310246006686, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "c8b0fc76cafef753", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382308469469585, "t_first_token_ns": 382308734862861, "t_last_token_ns": 382312592045093, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "72e7e5c0e7f2b84f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382309072995230, "t_first_token_ns": 382309336914139, "t_last_token_ns": 382313268295474, "prompt_tokens": 3998, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "9e7ed7f0ec538c3f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382310492993009, "t_first_token_ns": 382310754760768, "t_last_token_ns": 382316883485729, "prompt_tokens": 3992, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "ff133c5d58ab656f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382310750583486, "t_first_token_ns": 382311013283677, "t_last_token_ns": 382316943272594, "prompt_tokens": 4023, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "0b005fe06300469f", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382310957832201, "t_first_token_ns": 382311478864113, "t_last_token_ns": 382316971147033, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "a729629242e95560", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382310819773538, "t_first_token_ns": 382311479055787, "t_last_token_ns": 382316971417710, "prompt_tokens": 3985, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "f9fef77febb3a101", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382310989684571, "t_first_token_ns": 382311707712423, "t_last_token_ns": 382316984995843, "prompt_tokens": 3991, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "3a56a9b9fbaf16ef", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382311809639853, "t_first_token_ns": 382312083743110, "t_last_token_ns": 382317398693541, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "a4b8f1dd0a833ac0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382312582579356, "t_first_token_ns": 382312855284846, "t_last_token_ns": 382318293869707, "prompt_tokens": 4015, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "bc1ce14acfb8f5b9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382313520095902, "t_first_token_ns": 382313785296209, "t_last_token_ns": 382319122033699, "prompt_tokens": 3971, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "550870cf62cf7dc8", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382314533243277, "t_first_token_ns": 382314803619472, "t_last_token_ns": 382320237138835, "prompt_tokens": 3989, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
|
{"req_id": "3bed9c08ecc0735e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382314667881087, "t_first_token_ns": 382315264386947, "t_last_token_ns": 382320261349546, "prompt_tokens": 4012, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "2affef463ea83fb5", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382314623370475, "t_first_token_ns": 382315264474829, "t_last_token_ns": 382320261630141, "prompt_tokens": 3963, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "aebd728fac5eb7c9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382315387126536, "t_first_token_ns": 382315882868801, "t_last_token_ns": 382320413305294, "prompt_tokens": 3978, "completion_tokens": 256, "inflight_at_send": 12, "error": null}
|
||||||
|
{"req_id": "80bee31a1295953e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382315388543059, "t_first_token_ns": 382315883264438, "t_last_token_ns": 382320413544809, "prompt_tokens": 3965, "completion_tokens": 256, "inflight_at_send": 13, "error": null}
|
||||||
|
{"req_id": "fc6cfd2270ab2581", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382315664880544, "t_first_token_ns": 382316134809886, "t_last_token_ns": 382320433041568, "prompt_tokens": 3972, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "f4a15b0e0da21586", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382316255760696, "t_first_token_ns": 382316529243729, "t_last_token_ns": 382320526288526, "prompt_tokens": 3916, "completion_tokens": 256, "inflight_at_send": 15, "error": null}
|
||||||
|
{"req_id": "1ff7c31e57e00bcb", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382316961175087, "t_first_token_ns": 382317236358633, "t_last_token_ns": 382320780111665, "prompt_tokens": 3961, "completion_tokens": 256, "inflight_at_send": 14, "error": null}
|
||||||
|
{"req_id": "c1fecd97b2361f3e", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382317599187050, "t_first_token_ns": 382317873144906, "t_last_token_ns": 382320954096140, "prompt_tokens": 4003, "completion_tokens": 256, "inflight_at_send": 11, "error": null}
|
||||||
|
{"req_id": "d167f8efb80215a9", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382319589439442, "t_first_token_ns": 382319862829834, "t_last_token_ns": 382321599275810, "prompt_tokens": 3979, "completion_tokens": 256, "inflight_at_send": 10, "error": null}
|
||||||
|
{"req_id": "c52bced31247e27d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382322039662170, "t_first_token_ns": 382322288052303, "t_last_token_ns": 382326895246382, "prompt_tokens": 3949, "completion_tokens": 256, "inflight_at_send": 1, "error": null}
|
||||||
|
{"req_id": "431a39693928917a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382322288301548, "t_first_token_ns": 382322546859249, "t_last_token_ns": 382326958466910, "prompt_tokens": 4004, "completion_tokens": 256, "inflight_at_send": 2, "error": null}
|
||||||
|
{"req_id": "bf8d7bba7096d325", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382322541678429, "t_first_token_ns": 382322799972050, "t_last_token_ns": 382327005049850, "prompt_tokens": 4014, "completion_tokens": 256, "inflight_at_send": 3, "error": null}
|
||||||
|
{"req_id": "abfb5bfd5d68dc8d", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382322557040682, "t_first_token_ns": 382323036117646, "t_last_token_ns": 382327014252080, "prompt_tokens": 4028, "completion_tokens": 256, "inflight_at_send": 4, "error": null}
|
||||||
|
{"req_id": "5142ba1e759d7807", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382322644006561, "t_first_token_ns": 382323272791904, "t_last_token_ns": 382327023929267, "prompt_tokens": 4016, "completion_tokens": 256, "inflight_at_send": 5, "error": null}
|
||||||
|
{"req_id": "911dfbd9a5da2f3a", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382323029542517, "t_first_token_ns": 382323518698810, "t_last_token_ns": 382327040019218, "prompt_tokens": 4010, "completion_tokens": 256, "inflight_at_send": 6, "error": null}
|
||||||
|
{"req_id": "d4ec1ed768d3128b", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382323714343260, "t_first_token_ns": 382323975638146, "t_last_token_ns": 382327226864239, "prompt_tokens": 3968, "completion_tokens": 256, "inflight_at_send": 7, "error": null}
|
||||||
|
{"req_id": "8ecca9914ec9afd1", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382323894640673, "t_first_token_ns": 382324222039019, "t_last_token_ns": 382327238167110, "prompt_tokens": 3997, "completion_tokens": 256, "inflight_at_send": 8, "error": null}
|
||||||
|
{"req_id": "94e98c79036fdaf0", "rate_target": 1.5, "input_tokens_target": 4096, "output_tokens_target": 256, "t_send_ns": 382323965866554, "t_first_token_ns": 382324463256352, "t_last_token_ns": 382327242314889, "prompt_tokens": 3990, "completion_tokens": 256, "inflight_at_send": 9, "error": null}
|
||||||
@@ -0,0 +1,8 @@
|
|||||||
|
{
|
||||||
|
"rate": 1.5,
|
||||||
|
"input_tokens": 4096,
|
||||||
|
"output_tokens": 256,
|
||||||
|
"duration_target_s": 240.0,
|
||||||
|
"duration_actual_s": 243.239731536014,
|
||||||
|
"n_requests": 392
|
||||||
|
}
|
||||||
@@ -0,0 +1,53 @@
|
|||||||
|
{
|
||||||
|
"unified": {
|
||||||
|
"n_total": 1214,
|
||||||
|
"n_ok": 1214,
|
||||||
|
"n_err": 0,
|
||||||
|
"ttft_mean_ms": 4018.2049553304346,
|
||||||
|
"ttft_p50_ms": 499.80880302609876,
|
||||||
|
"ttft_p90_ms": 11971.00561100524,
|
||||||
|
"ttft_p99_ms": 46695.4637899762,
|
||||||
|
"tpot_mean_ms": 15.317457336748381,
|
||||||
|
"tpot_p50_ms": 8.4233505983844,
|
||||||
|
"tpot_p90_ms": 19.624556955096267,
|
||||||
|
"tpot_p99_ms": 151.59626177765062,
|
||||||
|
"e2e_mean_ms": 8179.940037778339,
|
||||||
|
"e2e_p50_ms": 1942.4257799983025,
|
||||||
|
"e2e_p90_ms": 23474.75191502599,
|
||||||
|
"e2e_p99_ms": 73709.20522802044
|
||||||
|
},
|
||||||
|
"unified_kv_both": {
|
||||||
|
"n_total": 1214,
|
||||||
|
"n_ok": 1214,
|
||||||
|
"n_err": 0,
|
||||||
|
"ttft_mean_ms": 3551.512749613402,
|
||||||
|
"ttft_p50_ms": 500.6749929743819,
|
||||||
|
"ttft_p90_ms": 9743.529893981759,
|
||||||
|
"ttft_p99_ms": 42432.15363100171,
|
||||||
|
"tpot_mean_ms": 14.382922003727078,
|
||||||
|
"tpot_p50_ms": 8.348213152272539,
|
||||||
|
"tpot_p90_ms": 21.58885370458434,
|
||||||
|
"tpot_p99_ms": 127.73682388598733,
|
||||||
|
"e2e_mean_ms": 7966.7458915099105,
|
||||||
|
"e2e_p50_ms": 1995.0270210392773,
|
||||||
|
"e2e_p90_ms": 21254.210047016386,
|
||||||
|
"e2e_p99_ms": 76629.94067498948
|
||||||
|
},
|
||||||
|
"unified_kv_both_drfix": {
|
||||||
|
"n_total": 1214,
|
||||||
|
"n_ok": 1214,
|
||||||
|
"n_err": 0,
|
||||||
|
"ttft_mean_ms": 3103.38564077023,
|
||||||
|
"ttft_p50_ms": 484.8736240528524,
|
||||||
|
"ttft_p90_ms": 7583.885677042417,
|
||||||
|
"ttft_p99_ms": 41882.84209399717,
|
||||||
|
"tpot_mean_ms": 14.182905529343138,
|
||||||
|
"tpot_p50_ms": 8.092588612325601,
|
||||||
|
"tpot_p90_ms": 17.727813587879535,
|
||||||
|
"tpot_p99_ms": 112.25943991111738,
|
||||||
|
"e2e_mean_ms": 7183.769712352909,
|
||||||
|
"e2e_p50_ms": 1805.8691520127468,
|
||||||
|
"e2e_p90_ms": 17930.795097956434,
|
||||||
|
"e2e_p99_ms": 71957.86093402421
|
||||||
|
}
|
||||||
|
}
|
||||||
File diff suppressed because one or more lines are too long
File diff suppressed because it is too large
Load Diff
@@ -0,0 +1,50 @@
|
|||||||
|
{
|
||||||
|
"actual_output_tokens_stats": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 309.1507413509061,
|
||||||
|
"p50": 92.0,
|
||||||
|
"p90": 511.4000000000001,
|
||||||
|
"p99": 4181.989999999992
|
||||||
|
},
|
||||||
|
"cache_hit_request_count": 882,
|
||||||
|
"cached_tokens_stats": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 34370.767710049426,
|
||||||
|
"p50": 23264.0,
|
||||||
|
"p90": 92649.6,
|
||||||
|
"p99": 130699.5199999999
|
||||||
|
},
|
||||||
|
"error_count": 0,
|
||||||
|
"external_cache_hit_ratio": 0.0,
|
||||||
|
"external_cache_hits_tokens": 0,
|
||||||
|
"external_cache_queries_tokens": 0,
|
||||||
|
"latency_stats_s": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 8.179940037778339,
|
||||||
|
"p50": 1.9469290699926205,
|
||||||
|
"p90": 23.85709141241971,
|
||||||
|
"p99": 76.447050594399
|
||||||
|
},
|
||||||
|
"prefix_cache_hit_ratio": 0.0,
|
||||||
|
"prefix_cache_hits_tokens": 0,
|
||||||
|
"prefix_cache_queries_tokens": 0,
|
||||||
|
"request_count": 1214,
|
||||||
|
"success_count": 1214,
|
||||||
|
"total_cached_tokens": 41726112,
|
||||||
|
"total_input_tokens": 53335690,
|
||||||
|
"tpot_stats_s": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 0.01530483999133096,
|
||||||
|
"p50": 0.008421713519231216,
|
||||||
|
"p90": 0.019828541993577534,
|
||||||
|
"p99": 0.15287945013125698
|
||||||
|
},
|
||||||
|
"ttft_stats_s": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 4.018204955330434,
|
||||||
|
"p50": 0.5004960215010215,
|
||||||
|
"p90": 12.025405208714078,
|
||||||
|
"p99": 48.026077826277884
|
||||||
|
},
|
||||||
|
"wall_clock_s": 2915.8219948689803
|
||||||
|
}
|
||||||
@@ -0,0 +1,7 @@
|
|||||||
|
{
|
||||||
|
"policy": "unified",
|
||||||
|
"trace": "/home/admin/cpfs/wjh/agentic-kv/traces/w600_r0.0015_st30.jsonl",
|
||||||
|
"t_start_unix": 1779814508.4891112,
|
||||||
|
"t_end_unix": 1779817425.4834316,
|
||||||
|
"isolated": true
|
||||||
|
}
|
||||||
@@ -0,0 +1 @@
|
|||||||
|
[{"url":"http://127.0.0.1:8000","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2696},{"url":"http://127.0.0.1:8001","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1797},{"url":"http://127.0.0.1:8002","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2645},{"url":"http://127.0.0.1:8003","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1651},{"url":"http://127.0.0.1:8004","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2587},{"url":"http://127.0.0.1:8005","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1822},{"url":"http://127.0.0.1:8006","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2801},{"url":"http://127.0.0.1:8007","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1966}]
|
||||||
File diff suppressed because one or more lines are too long
File diff suppressed because it is too large
Load Diff
@@ -0,0 +1,50 @@
|
|||||||
|
{
|
||||||
|
"actual_output_tokens_stats": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 309.1507413509061,
|
||||||
|
"p50": 92.0,
|
||||||
|
"p90": 511.4000000000001,
|
||||||
|
"p99": 4181.989999999992
|
||||||
|
},
|
||||||
|
"cache_hit_request_count": 895,
|
||||||
|
"cached_tokens_stats": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 34921.15980230642,
|
||||||
|
"p50": 24000.0,
|
||||||
|
"p90": 92948.80000000002,
|
||||||
|
"p99": 130516.47999999975
|
||||||
|
},
|
||||||
|
"error_count": 0,
|
||||||
|
"external_cache_hit_ratio": 0.0,
|
||||||
|
"external_cache_hits_tokens": 0,
|
||||||
|
"external_cache_queries_tokens": 0,
|
||||||
|
"latency_stats_s": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 7.966745891509911,
|
||||||
|
"p50": 1.9950706280069426,
|
||||||
|
"p90": 21.26458439627313,
|
||||||
|
"p99": 77.17173248772794
|
||||||
|
},
|
||||||
|
"prefix_cache_hit_ratio": 0.0,
|
||||||
|
"prefix_cache_hits_tokens": 0,
|
||||||
|
"prefix_cache_queries_tokens": 0,
|
||||||
|
"request_count": 1214,
|
||||||
|
"success_count": 1214,
|
||||||
|
"total_cached_tokens": 42394288,
|
||||||
|
"total_input_tokens": 53335690,
|
||||||
|
"tpot_stats_s": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 0.014371074456771783,
|
||||||
|
"p50": 0.00834782877529764,
|
||||||
|
"p90": 0.02166754243794215,
|
||||||
|
"p99": 0.1342057830565418
|
||||||
|
},
|
||||||
|
"ttft_stats_s": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 3.551512749613402,
|
||||||
|
"p50": 0.502115077979397,
|
||||||
|
"p90": 9.801292790105803,
|
||||||
|
"p99": 42.64609946611393
|
||||||
|
},
|
||||||
|
"wall_clock_s": 2918.4564750069985
|
||||||
|
}
|
||||||
@@ -0,0 +1,7 @@
|
|||||||
|
{
|
||||||
|
"policy": "unified_kv_both",
|
||||||
|
"trace": "/home/admin/cpfs/wjh/agentic-kv/traces/w600_r0.0015_st30.jsonl",
|
||||||
|
"t_start_unix": 1779817593.1108272,
|
||||||
|
"t_end_unix": 1779820512.6874907,
|
||||||
|
"isolated": true
|
||||||
|
}
|
||||||
@@ -0,0 +1 @@
|
|||||||
|
[{"url":"http://127.0.0.1:8000","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1639},{"url":"http://127.0.0.1:8001","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1874},{"url":"http://127.0.0.1:8002","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2694},{"url":"http://127.0.0.1:8003","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2366},{"url":"http://127.0.0.1:8004","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2043},{"url":"http://127.0.0.1:8005","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1822},{"url":"http://127.0.0.1:8006","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1779},{"url":"http://127.0.0.1:8007","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2557}]
|
||||||
File diff suppressed because one or more lines are too long
File diff suppressed because it is too large
Load Diff
@@ -0,0 +1,50 @@
|
|||||||
|
{
|
||||||
|
"actual_output_tokens_stats": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 309.1507413509061,
|
||||||
|
"p50": 92.0,
|
||||||
|
"p90": 511.4000000000001,
|
||||||
|
"p99": 4181.989999999992
|
||||||
|
},
|
||||||
|
"cache_hit_request_count": 884,
|
||||||
|
"cached_tokens_stats": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 34984.83031301483,
|
||||||
|
"p50": 24352.0,
|
||||||
|
"p90": 93219.20000000001,
|
||||||
|
"p99": 130699.5199999999
|
||||||
|
},
|
||||||
|
"error_count": 0,
|
||||||
|
"external_cache_hit_ratio": 0.0,
|
||||||
|
"external_cache_hits_tokens": 0,
|
||||||
|
"external_cache_queries_tokens": 0,
|
||||||
|
"latency_stats_s": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 7.183769712352909,
|
||||||
|
"p50": 1.8075949079939164,
|
||||||
|
"p90": 18.172221782768624,
|
||||||
|
"p99": 72.3876089354086
|
||||||
|
},
|
||||||
|
"prefix_cache_hit_ratio": 0.0,
|
||||||
|
"prefix_cache_hits_tokens": 0,
|
||||||
|
"prefix_cache_queries_tokens": 0,
|
||||||
|
"request_count": 1214,
|
||||||
|
"success_count": 1214,
|
||||||
|
"total_cached_tokens": 42471584,
|
||||||
|
"total_input_tokens": 53335690,
|
||||||
|
"tpot_stats_s": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 0.014171222740603976,
|
||||||
|
"p50": 0.008089419168215166,
|
||||||
|
"p90": 0.017776012932020836,
|
||||||
|
"p99": 0.12457718443722908
|
||||||
|
},
|
||||||
|
"ttft_stats_s": {
|
||||||
|
"count": 1214.0,
|
||||||
|
"mean": 3.1033856407702305,
|
||||||
|
"p50": 0.4855456190125551,
|
||||||
|
"p90": 7.6192877857247385,
|
||||||
|
"p99": 41.97445172026403
|
||||||
|
},
|
||||||
|
"wall_clock_s": 2915.82019165199
|
||||||
|
}
|
||||||
@@ -0,0 +1,7 @@
|
|||||||
|
{
|
||||||
|
"policy": "unified_kv_both",
|
||||||
|
"trace": "/home/admin/cpfs/wjh/agentic-kv/traces/w600_r0.0015_st30.jsonl",
|
||||||
|
"t_start_unix": 1779820686.3177474,
|
||||||
|
"t_end_unix": 1779823603.2265427,
|
||||||
|
"isolated": true
|
||||||
|
}
|
||||||
@@ -0,0 +1 @@
|
|||||||
|
[{"url":"http://127.0.0.1:8000","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2292},{"url":"http://127.0.0.1:8001","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1741},{"url":"http://127.0.0.1:8002","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2520},{"url":"http://127.0.0.1:8003","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2325},{"url":"http://127.0.0.1:8004","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1773},{"url":"http://127.0.0.1:8005","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1828},{"url":"http://127.0.0.1:8006","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":1862},{"url":"http://127.0.0.1:8007","role":"combined","ongoing_tokens":0,"pending_prefill_tokens":0,"ongoing_decode_tokens":0,"num_requests":0,"active_p_offloads":0,"cached_blocks":2091}]
|
||||||
@@ -56,21 +56,25 @@ class ReqMetric:
|
|||||||
error: str | None = None
|
error: str | None = None
|
||||||
|
|
||||||
|
|
||||||
def make_random_prompt(target_tokens: int) -> str:
|
def make_random_prompt(target_tokens: int, rng: random.Random) -> str:
|
||||||
"""Same calibration as the bench_loop.py used elsewhere:
|
"""Same calibration as the bench_loop.py used elsewhere:
|
||||||
'Block N: <32-hex>' tokenizes to ~35 tokens on Qwen3-Coder."""
|
'Block N: <32-hex>' tokenizes to ~35 tokens on Qwen3-Coder.
|
||||||
|
|
||||||
|
Deterministic given `rng` — same RNG state produces the same prompt.
|
||||||
|
Used so two runs with the same --seed get bit-identical request streams.
|
||||||
|
"""
|
||||||
n_parts = max(1, target_tokens // 35)
|
n_parts = max(1, target_tokens // 35)
|
||||||
seed = uuid.uuid4().hex
|
# 32 random hex chars from rng (uuid.uuid4 would use os.urandom, unseedable)
|
||||||
|
seed = "".join(f"{rng.randrange(16):x}" for _ in range(32))
|
||||||
parts = []
|
parts = []
|
||||||
for i in range(n_parts):
|
for i in range(n_parts):
|
||||||
h = hashlib.md5(f"{seed}_{i}_{time.time_ns()}".encode()).hexdigest()
|
h = hashlib.md5(f"{seed}_{i}".encode()).hexdigest()
|
||||||
parts.append(f"Block {i}: {h}")
|
parts.append(f"Block {i}: {h}")
|
||||||
return " ".join(parts)
|
return " ".join(parts)
|
||||||
|
|
||||||
|
|
||||||
async def send_one(client, url, model, inp_tokens, out_tokens,
|
async def send_one(client, url, model, prompt, inp_tokens, out_tokens,
|
||||||
rate, inflight, inflight_cap, fh):
|
rate, inflight, inflight_cap, fh, rid):
|
||||||
rid = uuid.uuid4().hex[:16]
|
|
||||||
if inflight[0] >= inflight_cap:
|
if inflight[0] >= inflight_cap:
|
||||||
m = ReqMetric(req_id=rid, rate_target=rate,
|
m = ReqMetric(req_id=rid, rate_target=rate,
|
||||||
input_tokens_target=inp_tokens,
|
input_tokens_target=inp_tokens,
|
||||||
@@ -88,7 +92,6 @@ async def send_one(client, url, model, inp_tokens, out_tokens,
|
|||||||
t_send_ns=time.perf_counter_ns(),
|
t_send_ns=time.perf_counter_ns(),
|
||||||
inflight_at_send=inflight[0])
|
inflight_at_send=inflight[0])
|
||||||
try:
|
try:
|
||||||
prompt = make_random_prompt(inp_tokens)
|
|
||||||
payload = {
|
payload = {
|
||||||
"model": model,
|
"model": model,
|
||||||
"messages": [{"role": "user", "content": prompt}],
|
"messages": [{"role": "user", "content": prompt}],
|
||||||
@@ -141,25 +144,39 @@ async def main_async(args):
|
|||||||
inflight = [0]
|
inflight = [0]
|
||||||
pending: list[asyncio.Task] = []
|
pending: list[asyncio.Task] = []
|
||||||
interval_mean = 1.0 / args.rate
|
interval_mean = 1.0 / args.rate
|
||||||
rng = random.Random(int(time.time_ns()) & 0xFFFFFFFF)
|
|
||||||
|
|
||||||
print(f"[bench] rate={args.rate} shape=({args.input_tokens},{args.output_tokens}) "
|
# Shared seed across configs gives bit-identical arrival times AND prompt
|
||||||
f"duration={args.duration}s output={out_dir}")
|
# content. arrival_rng feeds expovariate, content_rng feeds make_random_prompt,
|
||||||
|
# rid_rng feeds the per-request id. All three derive from --seed so two runs
|
||||||
|
# with the same seed are bit-identical from the producer side; only server
|
||||||
|
# response timing differs (which is what we want to measure).
|
||||||
|
if args.seed is not None:
|
||||||
|
seed = args.seed
|
||||||
|
else:
|
||||||
|
seed = int(time.time_ns()) & 0xFFFFFFFF
|
||||||
|
print(f"[bench] seed={seed} rate={args.rate} shape=({args.input_tokens},"
|
||||||
|
f"{args.output_tokens}) duration={args.duration}s output={out_dir}")
|
||||||
|
arrival_rng = random.Random(seed)
|
||||||
|
content_rng = random.Random(seed ^ 0xC0FFEE)
|
||||||
|
rid_rng = random.Random(seed ^ 0xDEADBEEF)
|
||||||
|
|
||||||
fh = open(req_path, "a", buffering=1)
|
fh = open(req_path, "a", buffering=1)
|
||||||
t0 = time.perf_counter()
|
t0 = time.perf_counter()
|
||||||
last_print = t0
|
last_print = t0
|
||||||
|
|
||||||
async with httpx.AsyncClient(timeout=httpx.Timeout(600.0)) as client:
|
async with httpx.AsyncClient(timeout=httpx.Timeout(600.0)) as client:
|
||||||
# producer
|
# producer — prompts generated here in order so async scheduling can't
|
||||||
|
# reorder the content RNG draw across requests.
|
||||||
async def producer():
|
async def producer():
|
||||||
while time.perf_counter() - t0 < args.duration:
|
while time.perf_counter() - t0 < args.duration:
|
||||||
|
prompt = make_random_prompt(args.input_tokens, content_rng)
|
||||||
|
rid = f"{rid_rng.randrange(1 << 64):016x}"
|
||||||
pending.append(asyncio.create_task(
|
pending.append(asyncio.create_task(
|
||||||
send_one(client, args.url, args.model,
|
send_one(client, args.url, args.model, prompt,
|
||||||
args.input_tokens, args.output_tokens,
|
args.input_tokens, args.output_tokens,
|
||||||
args.rate, inflight, args.inflight_cap, fh)
|
args.rate, inflight, args.inflight_cap, fh, rid)
|
||||||
))
|
))
|
||||||
await asyncio.sleep(rng.expovariate(1.0 / interval_mean))
|
await asyncio.sleep(arrival_rng.expovariate(1.0 / interval_mean))
|
||||||
|
|
||||||
# heartbeat
|
# heartbeat
|
||||||
async def heartbeat():
|
async def heartbeat():
|
||||||
@@ -210,6 +227,10 @@ def main():
|
|||||||
ap.add_argument("--output-tokens", type=int, default=256)
|
ap.add_argument("--output-tokens", type=int, default=256)
|
||||||
ap.add_argument("--duration", type=float, default=480.0,
|
ap.add_argument("--duration", type=float, default=480.0,
|
||||||
help="Total run duration in seconds (default 8 min)")
|
help="Total run duration in seconds (default 8 min)")
|
||||||
|
ap.add_argument("--seed", type=int, default=None,
|
||||||
|
help="Master seed; same value across configs gives "
|
||||||
|
"bit-identical Poisson arrivals and prompt content. "
|
||||||
|
"Default: time-based (different each run).")
|
||||||
ap.add_argument("--inflight-cap", type=int, default=256)
|
ap.add_argument("--inflight-cap", type=int, default=256)
|
||||||
ap.add_argument("--output-dir", required=True)
|
ap.add_argument("--output-dir", required=True)
|
||||||
args = ap.parse_args()
|
args = ap.parse_args()
|
||||||
|
|||||||
@@ -25,6 +25,7 @@ PORT="${PORT:-8000}"
|
|||||||
GPU_ID="${GPU_ID:-0}"
|
GPU_ID="${GPU_ID:-0}"
|
||||||
MODEL_PATH="${MODEL_PATH:-$HOME/models/Qwen/Qwen3-Coder-30B-A3B-Instruct}"
|
MODEL_PATH="${MODEL_PATH:-$HOME/models/Qwen/Qwen3-Coder-30B-A3B-Instruct}"
|
||||||
CONFIGS="${CONFIGS:-plain mooncake_both mooncake_both_drfix}"
|
CONFIGS="${CONFIGS:-plain mooncake_both mooncake_both_drfix}"
|
||||||
|
SEED="${SEED:-12345}" # shared seed across configs → identical Poisson + content
|
||||||
|
|
||||||
DATE="$(date +%Y%m%d_%H%M)"
|
DATE="$(date +%Y%m%d_%H%M)"
|
||||||
RUN_ROOT="$HERE/results/${DATE}_drfix"
|
RUN_ROOT="$HERE/results/${DATE}_drfix"
|
||||||
@@ -33,7 +34,7 @@ mkdir -p "$RUN_ROOT"
|
|||||||
echo "=== Cache-size sweep + DR-fix A/B ==="
|
echo "=== Cache-size sweep + DR-fix A/B ==="
|
||||||
echo "Run dir : $RUN_ROOT"
|
echo "Run dir : $RUN_ROOT"
|
||||||
echo "Configs : $CONFIGS"
|
echo "Configs : $CONFIGS"
|
||||||
echo "Rate : $RATE Duration: ${DURATION}s"
|
echo "Rate : $RATE Duration: ${DURATION}s Seed: $SEED"
|
||||||
echo ""
|
echo ""
|
||||||
|
|
||||||
kill_all_vllm() {
|
kill_all_vllm() {
|
||||||
@@ -114,6 +115,7 @@ for cfg in $CONFIGS; do
|
|||||||
--url "http://127.0.0.1:$PORT/v1/chat/completions" \
|
--url "http://127.0.0.1:$PORT/v1/chat/completions" \
|
||||||
--model "$MODEL_PATH" \
|
--model "$MODEL_PATH" \
|
||||||
--rate "$RATE" --duration "$DURATION" \
|
--rate "$RATE" --duration "$DURATION" \
|
||||||
|
--seed "$SEED" \
|
||||||
--output-dir "$cfg_dir" 2>&1 | tail -8
|
--output-dir "$cfg_dir" 2>&1 | tail -8
|
||||||
|
|
||||||
curl -s "http://127.0.0.1:$PORT/metrics" > "$cfg_dir/metrics_final.txt" 2>&1 || true
|
curl -s "http://127.0.0.1:$PORT/metrics" > "$cfg_dir/metrics_final.txt" 2>&1 || true
|
||||||
|
|||||||
86
microbench/connector_tax/cache_sweep/run_trace_replay_drfix.sh
Executable file
86
microbench/connector_tax/cache_sweep/run_trace_replay_drfix.sh
Executable file
@@ -0,0 +1,86 @@
|
|||||||
|
#!/usr/bin/env bash
|
||||||
|
# Re-run the elastic_migration_v2 trace replay (8x TP1 + unified policy)
|
||||||
|
# with the DR-fix applied, to test whether the +45% TTFT p90 gap closes.
|
||||||
|
#
|
||||||
|
# Three policies in sequence on the SAME trace and SAME machine:
|
||||||
|
# unified — plain (no connector) control
|
||||||
|
# unified_kv_both — Mooncake kv_both, DR sync ON baseline
|
||||||
|
# unified_kv_both_drfix — Mooncake kv_both, DR sync OFF fix under test
|
||||||
|
#
|
||||||
|
# The DR-fix patch is applied at the start and reverted at the end.
|
||||||
|
# When env VLLM_MOONCAKE_DISABLE_DIRECT_READ_SYNC=1 is set, the patched
|
||||||
|
# build_connector_meta skips the O(|cache|) hash sync.
|
||||||
|
|
||||||
|
set -uo pipefail
|
||||||
|
|
||||||
|
PROJ_DIR="${PROJ_DIR:-/home/admin/cpfs/wjh/agentic-kv}"
|
||||||
|
TRACE="${TRACE:-$PROJ_DIR/traces/w600_r0.0015_st30.jsonl}"
|
||||||
|
DATE="$(date +%Y%m%d_%H%M)"
|
||||||
|
OUTROOT="${OUTROOT:-$PROJ_DIR/outputs/trace_replay_drfix_${DATE}}"
|
||||||
|
PYTHON="$PROJ_DIR/.venv/bin/python"
|
||||||
|
DR_FIX_SCRIPT="$PROJ_DIR/microbench/connector_tax/cache_sweep/apply_direct_read_fix.py"
|
||||||
|
VLLM_ROOT="${VLLM_ROOT:-$PROJ_DIR/.venv/lib/python3.12/site-packages/vllm}"
|
||||||
|
|
||||||
|
mkdir -p "$OUTROOT"
|
||||||
|
echo "=== Trace-replay DR-fix A/B/C ==="
|
||||||
|
echo "Trace : $TRACE"
|
||||||
|
echo "Out : $OUTROOT"
|
||||||
|
echo "Policies in order: unified → unified_kv_both → unified_kv_both_drfix"
|
||||||
|
echo ""
|
||||||
|
|
||||||
|
cleanup_all() {
|
||||||
|
pkill -9 -f cache_aware_proxy 2>/dev/null || true
|
||||||
|
pkill -9 -f "vllm serve" 2>/dev/null || true
|
||||||
|
pkill -9 -f "EngineCore" 2>/dev/null || true
|
||||||
|
sleep 5
|
||||||
|
"$PYTHON" "$DR_FIX_SCRIPT" --revert --vllm-root "$VLLM_ROOT" 2>/dev/null || true
|
||||||
|
}
|
||||||
|
trap cleanup_all EXIT
|
||||||
|
cleanup_all
|
||||||
|
|
||||||
|
echo "[stage 0] applying CT_DR_FIX patch (skip flag still respects env var)"
|
||||||
|
"$PYTHON" "$DR_FIX_SCRIPT" --apply --vllm-root "$VLLM_ROOT"
|
||||||
|
|
||||||
|
run_policy() {
|
||||||
|
local policy="$1"
|
||||||
|
local rundir="$2"
|
||||||
|
local skip_dr="$3"
|
||||||
|
echo ""
|
||||||
|
echo "====== $policy → $(basename $rundir) ; VLLM_MOONCAKE_DISABLE_DIRECT_READ_SYNC=$skip_dr ======"
|
||||||
|
mkdir -p "$rundir"
|
||||||
|
# Pass-through to the existing isolated runner; it spawns 8 vLLMs and
|
||||||
|
# the replayer. The env var is inherited by all vLLM child processes.
|
||||||
|
if [ "$skip_dr" = "1" ]; then
|
||||||
|
export VLLM_MOONCAKE_DISABLE_DIRECT_READ_SYNC=1
|
||||||
|
else
|
||||||
|
unset VLLM_MOONCAKE_DISABLE_DIRECT_READ_SYNC
|
||||||
|
fi
|
||||||
|
bash "$PROJ_DIR/scripts/b3_isolated_policy.sh" "$policy" "$TRACE" "$rundir" \
|
||||||
|
2>&1 | tee "$rundir/orchestrator.log" | tail -30
|
||||||
|
rc="${PIPESTATUS[0]}"
|
||||||
|
if [ "$rc" != "0" ]; then
|
||||||
|
echo "[FAIL] policy $policy rc=$rc"
|
||||||
|
fi
|
||||||
|
# Hard cleanup between policies (b3_isolated_policy.sh's trap should
|
||||||
|
# have already done it but belt-and-braces)
|
||||||
|
pkill -9 -f cache_aware_proxy 2>/dev/null || true
|
||||||
|
pkill -9 -f "vllm serve" 2>/dev/null || true
|
||||||
|
pkill -9 -f "EngineCore" 2>/dev/null || true
|
||||||
|
sleep 10
|
||||||
|
return 0
|
||||||
|
}
|
||||||
|
|
||||||
|
run_policy "unified" "$OUTROOT/unified" "0"
|
||||||
|
run_policy "unified_kv_both" "$OUTROOT/unified_kv_both" "0"
|
||||||
|
# Same policy + launcher; the env-gated skip is what makes this differ.
|
||||||
|
run_policy "unified_kv_both" "$OUTROOT/unified_kv_both_drfix" "1"
|
||||||
|
|
||||||
|
echo ""
|
||||||
|
echo "[stage Z] reverting CT_DR_FIX patch"
|
||||||
|
"$PYTHON" "$DR_FIX_SCRIPT" --revert --vllm-root "$VLLM_ROOT"
|
||||||
|
|
||||||
|
echo ""
|
||||||
|
echo "Done. Artifacts: $OUTROOT"
|
||||||
|
echo " unified : $OUTROOT/unified/metrics.jsonl"
|
||||||
|
echo " unified_kv_both : $OUTROOT/unified_kv_both/metrics.jsonl"
|
||||||
|
echo " unified_kv_both_drfix: $OUTROOT/unified_kv_both_drfix/metrics.jsonl"
|
||||||
Reference in New Issue
Block a user