devstral-small-24b

Serving configurations for devstral-small-24b from the ServingCard registry.

Parameters: N/A
Architecture: N/A
License: N/A
Observations: 1

Benchmark Observations

Source: zenprocess (verified)

Throughput: 53.6 tok/s
Latency: 74 ms TTFT
Quality: 0.293
GPU: nvidia-gb10

vllm>=0.8.0, bf16, baseline
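
To put the throughput and TTFT figures in context, here is a minimal sketch that estimates how long a response of a given length might take. It assumes the reported 53.6 tok/s is single-stream decode throughput (roughly 18.7 ms per token) and that total time is approximately TTFT plus tokens divided by throughput. The page does not state how either number was measured, and the function name `estimate_total_seconds` is hypothetical.

```python
# Rough response-time estimate from the reported benchmark numbers.
# Assumption: 53.6 tok/s is single-stream decode throughput; the page
# does not say how it was measured, so treat the output as a ballpark.

TTFT_SECONDS = 0.074      # reported: 74 ms time to first token
TOKENS_PER_SECOND = 53.6  # reported: 53.6 tok/s


def estimate_total_seconds(n_tokens: int,
                           ttft: float = TTFT_SECONDS,
                           tok_per_s: float = TOKENS_PER_SECOND) -> float:
    """Approximate wall-clock seconds to generate n_tokens output tokens."""
    if n_tokens < 0:
        raise ValueError("n_tokens must be non-negative")
    # Simple model: fixed startup delay plus steady-state decoding.
    return ttft + n_tokens / tok_per_s


if __name__ == "__main__":
    for n in (128, 512, 2048):
        print(f"{n} tokens: ~{estimate_total_seconds(n):.2f} s")
```

Under this model TTFT matters mainly for short replies; for long generations the decode rate dominates.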

