deepseek-coder-v2-lite
deepseek-coder-v2-lite serving configurations from the ServingCard registry.
ParametersN/A
ArchitectureN/A
LicenseN/A
Observations1
Benchmark Observations
zenprocessverified
Source Throughput
58.1 tok/s
Latency
3850ms TTFT
Quality
0.286
GPU
nvidia-gb10
vllm>=0.8.0fp8fp8-baseline
Fork This Config
Create your own optimized configuration based on these community-verified settings.
Open ConfiguratorDiscussion
Debate benchmarks, share quirks, report what works and what doesn't on your hardware.