Local LLM VRAM Calculator

Check if your GPU can run a local LLM. Select model, quantisation, and GPU — get instant VRAM estimates.


Context length: 512 · 4K · 32K · 128K
It fits!

Comfortable fit — room for other applications

Model weights          4.8 GB
KV cache (4,096 ctx)   0.6 GB
Runtime overhead       0.5 GB
Total VRAM needed      5.9 GB
GPU VRAM available     16 GB
Usage: 37% of 16 GB

Formula

VRAM ≈ (params × bits_per_param / 8)
     + KV_cache(ctx_len, params)
     + overhead(~0.5-1.0 GB)
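The formula above can be sketched in a few lines of Python. The parameter values in the example call are illustrative assumptions, not taken from the calculator: an 8B-parameter model at roughly 4.8 bits per weight (about a Q4_K_M quantisation), with Llama-style KV-cache dimensions (32 layers, 8 KV heads, head dimension 128, fp16 cache).

```python
def estimate_vram_gb(params_b: float, bits_per_param: float,
                     ctx_len: int, n_layers: int, n_kv_heads: int,
                     head_dim: int, kv_bytes: int = 2,
                     overhead_gb: float = 0.5) -> float:
    """Approximate VRAM needed, in GB, for a quantised LLM."""
    # Weights: params × bits_per_param / 8 bytes
    weights_gb = params_b * 1e9 * bits_per_param / 8 / 1e9
    # KV cache: 2 tensors (K and V) per layer,
    # each ctx_len × n_kv_heads × head_dim × kv_bytes
    kv_gb = 2 * n_layers * n_kv_heads * head_dim * ctx_len * kv_bytes / 1e9
    return weights_gb + kv_gb + overhead_gb

# Assumed example: 8B model, ~4.8 bits/param, 4,096-token context
total = estimate_vram_gb(params_b=8, bits_per_param=4.8, ctx_len=4096,
                         n_layers=32, n_kv_heads=8, head_dim=128)
print(f"{total:.1f} GB")  # ≈ 5.8 GB
```

With these assumptions the estimate lands close to the 5.9 GB shown above; real runtimes vary slightly depending on quantisation format and allocator overhead, which is why the overhead term is given as a 0.5-1.0 GB range.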

Want a self-hosted AI stack?

Our Local AI Stack includes Ollama, FastAPI wrapper, model management, and production deployment configs.

View Product →