qwen3-coder
qwen3-coder serving configurations from the ServingCard registry.
Parameters80B (MoE, 12/48 full attention layers)
ArchitectureQwen3NextForCausalLM
LicenseApache-2.0
Configs2 variants
Observations2
Benchmark Observations
Fork This Config
Create your own optimized configuration based on these community-verified settings.
Open ConfiguratorDiscussion
Debate benchmarks, share quirks, report what works and what doesn't on your hardware.
View on HuggingFace →