Q&A

Can I run AI offline, without internet?

Short answer

Yes. Tools like Ollama, llama.cpp and LM Studio run open-weight models (Llama 4, Mistral, Gemma 3, DeepSeek-V4) on your laptop or local server with zero internet required. Performance on a modern MacBook is genuinely good for chat-style work. You give up the best model quality (Claude Opus and GPT-5 don't run offline) and a lot of convenience. Worth it for strict data-residency needs, otherwise unnecessary in 2026.

Short answer

Yes. Tools like Ollama, llama.cpp and LM Studio run open-weight models locally with zero internet. On a modern MacBook (M3 + 16GB RAM) you get useful chat-quality AI for free, forever. You give up access to the best models (Claude Opus, GPT-5) and a lot of convenience. Worth it for strict data-residency needs. For most Australian SMBs, cloud Claude/ChatGPT is still the better trade-off.

The 60-second offline AI setup

If you want to try this in the next ten minutes:

  1. Go to ollama.com and download for your OS (Mac, Windows, Linux).
  2. Install (one-click).
  3. Open your terminal.
  4. Run: ollama run llama4:8b
  5. Wait ~5 minutes for the model to download (about 5GB).
  6. Type a message. Hit enter. You’re chatting with AI offline.

Total cost: $0. Total time: 10 minutes including the download.

What’s actually useful offline in 2026

The open-weight model landscape:

ModelSizeHardware neededRoughly comparable to
Llama 4 8B5GBAny 16GB-RAM laptopGPT-4 (2024 era)
Llama 4 70B40GBM-series Pro / 24GB+ VRAMGPT-4o
Mistral Large 280GBHigh-end workstationGPT-4o
DeepSeek-V440-200GB dependingHigh-end workstationGPT-5 mini
Gemma 3 8B5GBAny 16GB-RAM laptopGPT-4 (2024 era)

Free Claude.ai and free ChatGPT both run circles around these for general use. The point of offline isn’t “best quality”; it’s “good enough quality, with no data leaving your machine”.

When offline genuinely makes sense

Strong fit:

  • Defence + intelligence contractors where data classification rules forbid cloud
  • Healthcare practices doing transcription where audio is too sensitive
  • Legal firms with confidential matter detail
  • Anyone in remote Australia with poor internet
  • Cost-sensitive heavy users (a self-hosted setup pays for itself in 6-12 months at heavy use)

Weak fit:

  • Most Australian SMBs (cloud Claude/ChatGPT is faster, better, simpler)
  • Solo operators (the setup overhead isn’t worth it)
  • Anyone who can use anonymisation as their privacy layer
  • Anyone who values the latest model improvements (cloud gets them weekly; you’d be re-downloading models monthly)

The hidden costs

Offline AI isn’t free; it’s “no marginal cost per query”. The hidden costs:

  • Setup time: 2-10 hours depending on hardware + how technical you are
  • Hardware: free if you use an existing laptop; $4-15k AUD for a dedicated workstation
  • Model maintenance: monthly model updates as the open-weight space moves
  • Quality gap: you’ll feel it on hard tasks
  • No image generation: Stable Diffusion / Flux work locally but require separate setup

The hybrid pattern that works

Most regulated-industry clients we work with land on:

  • Cloud Claude Pro ($30 AUD/month) for general work where data isn’t sensitive
  • Local Ollama for the 10-20% of work where data must stay on-machine
  • Azure OpenAI Service / AWS Bedrock when data must stay in Australian region but doesn’t need to be on-laptop

Three tiers, applied per task. Different from “we use one tool for everything”.

Self-hosting AI for a whole team

If you’re running offline AI for 5+ team members, you’re past the laptop tier. Options:

  • Dedicated workstation running Ollama as a server (others connect via HTTP API). $5-15k AUD hardware.
  • On-premise Hetzner / OVH dedicated server with a GPU. €100-500/month depending on spec.
  • AWS Bedrock with provisioned throughput (not technically self-hosted, but private). $$$/month.

For most teams, this is over-investment. Cloud AI with the no-training privacy guarantee is enough.

See also

Want this built for your business?

Book a free 30-minute AI audit. We'll map your business and show you exactly which systems we'd build first. No pitch deck, no scoping fee.

Book my free AI audit