Find the best Gemma configuration for your hardware. Search by GPU, CPU, or RAM to see which setups run, how fast they are, and what output quality to expect.

| Backend | Best For | GPU Support | Notes |
|---|---|---|---|
| Ollama | Most users, GPU setups | CUDA, Metal, ROCm | Easiest setup, automatic model management |
| llama.cpp | Flexible quantization | CUDA, Metal, Vulkan | More quant options, manual model files |
| gemma.cpp | CPU-first setups | None (CPU only, for now) | Google-native; currently supports Gemma 2/3 only |
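
As a rough illustration of how the table can be read, the sketch below encodes its "GPU Support" column as a small lookup and returns the backends that list a given GPU API. The names `BACKEND_GPU_SUPPORT`, `CPU_FIRST_BACKEND`, and `candidate_backends` are hypothetical helpers invented for this example, not part of Ollama, llama.cpp, or gemma.cpp, and the data is copied from the table rather than queried from the tools themselves.

```python
# Hypothetical lookup built only from the table above; not an official API.
BACKEND_GPU_SUPPORT = {
    "Ollama": ("CUDA", "Metal", "ROCm"),
    "llama.cpp": ("CUDA", "Metal", "Vulkan"),
    "gemma.cpp": (),  # the table lists no GPU support for now
}

# The table's "CPU-first setups" row.
CPU_FIRST_BACKEND = "gemma.cpp"


def candidate_backends(gpu_api=None):
    """Return the backends whose table row lists the given GPU API.

    Pass None for a machine without a usable GPU; the function then
    returns the table's CPU-first pick. Matching is case-insensitive.
    """
    if gpu_api is None:
        return [CPU_FIRST_BACKEND]
    wanted = gpu_api.strip().lower()
    return [
        name
        for name, apis in BACKEND_GPU_SUPPORT.items()
        if wanted in {api.lower() for api in apis}
    ]


if __name__ == "__main__":
    for api in ("CUDA", "ROCm", "Vulkan", None):
        print(api, "->", candidate_backends(api))
```

An empty result means the table lists no backend for that GPU API. It only reflects the rows above, so treat it as a starting point rather than a compatibility guarantee.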