qwen3.5-27b-awq
qwen3.5-27b-awq serving configurations from the ServingCard registry.
Note: This is a dense general-purpose model. PawBench scores measure coding agent performance — this model may excel at different tasks.
Configs1 variant
Observations1
Benchmark Observations
mitkoxclaim
Source Throughput
27.9 tok/s
Latency
10267ms TTFT
Quality
0.693
GPU
nvidia-gb10
vllm>=0.18.0rc1awqturboquant-3.5-triton-native
Fork This Config
Create your own optimized configuration based on these community-verified settings.
Open ConfiguratorDiscussion
Debate benchmarks, share quirks, report what works and what doesn't on your hardware.