Llama 3 2 1b Instruct Q8 0 Gguf
No PawBench serving benchmark yet
This model hasn't been benchmarked with PawBench on any hardware. Be the first to contribute!
How to contribute
- 1.
pip install servingcard - 2.
servingcard benchmark --model llama-3-2-1b-instruct-q8-0-gguf --hardware your-gpu - 3. Submit a PR to the registry
Discussion
Know something about serving this model? Share your experience.