Hardware Guide

What can you run on your Mac? Here's what fits at each memory tier.

🖥️

Our Test Machine

All benchmarks run on this hardware
  • Model: Mac Studio M2 Ultra
  • Memory: 512GB unified
  • Chip: M2 Ultra (24-core CPU, 76-core GPU)
  • OS: macOS 15.4

What Fits Your Mac?

32GB Unified Memory

Entry Level

MacBook Air M3, MacBook Pro base configs. Good for smaller models.

Recommended Models

  • Qwen2.5 Coder 14B Q8 — 16GB, best coding option at this tier
  • Gemma 4 31B Q4 — 20GB, squeezes in with some headroom
  • Llama 3.1 8B — Lightweight, fast, general purpose

64GB Unified Memory

Sweet Spot

Upgraded MacBook Pro, Mac mini M4 Pro. The productivity sweet spot.

Recommended Models

  • Llama 3.3 70B Q4 — 42GB, excellent all-rounder
  • Qwen3 Coder Next Q5 — 58GB, top coding model
  • DeepSeek Coder V2 — Strong reasoning + code

128GB Unified Memory

Pro Tier

Mac Studio M2 Max, maxed-out MacBook Pro. Run most models comfortably.

Recommended Models

  • Llama 3.3 70B Q6 — Higher quality quant
  • Mixtral 8x22B — MoE architecture, fast inference
  • Multiple 30B models — Run 2-3 models concurrently

192GB–512GB Unified Memory

Ultra Tier

Mac Studio M2 Ultra / M4 Ultra. Run anything, including massive MoE models.

What Opens Up

  • MiniMax M2.5 — 171GB, 456B MoE, API-quality local
  • DeepSeek V3 — Massive reasoning model
  • 3+ models hot — Route between specialized models
  • Full precision 70B — No quantization needed

Performance Tips

🎯 Leave Headroom

Don't devote 100% of your RAM to the model. Leave 10-20% free for the system and for context (the KV cache grows with context length). If you have 64GB, target models under about 50GB loaded.
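That headroom rule is simple enough to express as a one-line calculation (a sketch; `headroom` is an illustrative parameter name, and 20% is just the upper end of the guideline above):

```python
def max_model_size_gb(total_ram_gb, headroom=0.20):
    """Largest loaded model size that still leaves `headroom`
    (a fraction of total RAM) free for the OS and KV cache."""
    return total_ram_gb * (1.0 - headroom)

print(max_model_size_gb(64))  # 51.2 -- matches the ~50GB target above
print(max_model_size_gb(32))  # 25.6
```

With a long context window you may want to push `headroom` higher than 20%, since the KV cache competes with the model weights for the same unified memory.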

⚡ GPU Layers

In llama.cpp, pass -ngl 99 (short for --n-gpu-layers), or your runtime's equivalent, to offload all layers to the GPU. Apple Silicon's unified memory makes this seamless.

📊 Quantization Trade-offs

Q4 saves memory but loses quality; Q6 and Q8 stay much closer to full precision. For coding, higher quants often matter more than they do for chat.
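A rough rule of thumb makes the trade-off concrete: file size ≈ parameter count × bits per weight ÷ 8. The bits-per-weight figures below are approximate averages for common GGUF quants (assumptions on my part; real files vary with tensor mix and format overhead):

```python
# Approximate average bits per weight for common GGUF quant levels.
# These are illustrative estimates, not exact format specs.
BITS_PER_WEIGHT = {"Q4": 4.5, "Q5": 5.5, "Q6": 6.6, "Q8": 8.5, "F16": 16.0}

def approx_size_gb(params_billions, quant):
    """Estimated weight size in GB for a dense model at a given quant."""
    return params_billions * BITS_PER_WEIGHT[quant] / 8

for q in ("Q4", "Q6", "F16"):
    print(f"70B at {q}: ~{approx_size_gb(70, q):.0f} GB")
```

This lines up with the tiers above: a 70B model is roughly 39GB at Q4 (fits in 64GB with headroom), about 58GB at Q6 (wants 128GB), and 140GB at F16 (Ultra territory).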

🔄 Hot Swapping

With enough RAM, keep multiple models loaded and route between them. No cold start penalty — instant switching.

Buying Advice

If you're buying a Mac for local LLMs, memory is the most important spec. You can't upgrade it later.

  • Casual use: 32GB works, but you'll hit limits quickly
  • Serious work: 64GB minimum, 128GB recommended
  • Production/multi-model: 192GB+ (Mac Studio territory)
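The tiers above boil down to a simple lookup (names and thresholds are just the ones from this guide):

```python
def tier(ram_gb):
    """Map unified memory size to the usage tiers in this guide."""
    if ram_gb >= 192:
        return "production / multi-model"
    if ram_gb >= 128:
        return "pro"
    if ram_gb >= 64:
        return "serious work"
    if ram_gb >= 32:
        return "casual use"
    return "below the tiers covered here"

print(tier(64))  # serious work
```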

The difference between a frustrating experience and a great one is often just RAM.