ServingCard
Registry
Compare
Learn
Configure
Hardware Configuration
GPU Model
NVIDIA GB10 (128GB)
NVIDIA RTX 4090 (24GB)
NVIDIA A100 (80GB)
NVIDIA H100 (80GB)
NVIDIA RTX 3090 (24GB)
Quantization
FP8
NVFP4
fp16 (Half Precision)
AWQ (4-bit)
GPTQ (4-bit)
INT8 (8-bit)
Batch Size
32
64
128
256
512
View YAML