Runtime Properties
Each row corresponds to one benchmark profile in scripts/benchmark/profiles/. The columns come from the runtime_props field on the evidence record, which the props-capture extractor populates at run time. All fields are optional; profiles without capture data show — in place of each value and sort last.
- Extractor — which capture script populated this row.
- Base model — the model identifier as reported by the server (not the profile alias).
- Model quant — quantization format (e.g. int4, Q3_K_XL, 8-bit).
- KV quant — KV-cache quantization applied by the runtime, if any.
- Draft model / mode — speculative decoding draft model and accept strategy, when enabled.
- Max ctx — maximum context window in tokens as reported by the server.
- MTP — whether multi-token prediction is enabled server-side.
- Temp — temperature from sampling_defaults (the server-side default, which may differ from a per-request override).
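
The handling of optional fields described above can be sketched as follows. This is an illustrative sketch only, not the actual report generator: the key names inside runtime_props (extractor, base_model, and so on), the temperature key inside sampling_defaults, and the helper names render_row and sort_records are all assumptions made for the example. Only runtime_props and sampling_defaults themselves are named in this document.

```python
from typing import Any, Dict, List, Optional

PLACEHOLDER = "—"  # shown for any field that was not captured

# Hypothetical key names, one per column between Provider and Temp.
COLUMNS = [
    "extractor", "base_model", "model_quant", "kv_quant",
    "draft_model", "draft_mode", "max_ctx", "mtp",
]


def render_row(profile: str, provider: str,
               props: Optional[Dict[str, Any]]) -> List[str]:
    """Build the 11 table cells for one profile, filling gaps with the placeholder."""
    props = props or {}
    cells = [profile, provider]
    for key in COLUMNS:
        value = props.get(key)
        cells.append(PLACEHOLDER if value is None else str(value))
    # Temp is read from the nested sampling_defaults mapping (assumed layout).
    temp = (props.get("sampling_defaults") or {}).get("temperature")
    cells.append(PLACEHOLDER if temp is None else str(temp))
    return cells


def sort_records(records: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Order records so that profiles without capture data come last."""
    # False sorts before True, so records with runtime_props come first;
    # ties are broken by profile name.
    return sorted(records, key=lambda r: (not r.get("runtime_props"), r["profile"]))
```

Treating every field as independently optional means a partially captured profile still renders a full row, with the placeholder only in the cells that are missing.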
| Profile | Provider | Extractor | Base model | Model quant | KV quant | Draft model | Draft mode | Max ctx | MTP | Temp |
|---|---|---|---|---|---|---|---|---|---|---|