AI Automation & Self-Hosting Specialist
ChatGPT Plus costs $20/month. Claude Pro is $20/month. Perplexity Pro is $20/month. If you are using more than one, the bills stack fast — and you still hit rate limits, share your data with third parties, and cannot customise how the AI behaves. Self-hosting changes all of that. For roughly the same cost as one subscription, you can run your own AI models 24/7 on a private server, with no rate limits, no data leaving your control, and full freedom to customise. Here is exactly how to do it in 2026.
What You Need
The requirements are simpler than most people expect:
- A VPS with 8GB+ RAM: This is the minimum for running a useful 7B language model. Contabo’s VPS plans offer 8GB RAM from around $7/month — the best RAM-per-dollar on the market. For running multiple models or a full agent stack, the Cloud VPS 60 is the recommended tier.
- Ubuntu 22.04 LTS: The most compatible OS for AI tooling. Select it when provisioning.
- NVMe SSD storage: Model weights are large (2–8GB per model). NVMe makes loading fast; spinning disk is painful.
- Node.js 18+: Required for OpenClaw.
That is it. No GPU required for CPU inference on 7B models. No special hardware. Just a standard VPS.
Step 1 — Provision Your VPS
- Go to Contabo VPS plans and pick an 8GB or 16GB RAM plan.
- Select Ubuntu 22.04 as your OS during setup.
- Choose your data centre region (pick one geographically close to you for lower latency).
- Complete checkout — your server is usually provisioned within minutes.
- You will receive SSH credentials by email. Connect:
ssh root@YOUR_SERVER_IP - First thing:
apt update && apt upgrade -y
Step 2 — Install Ollama
Ollama is the easiest way to run local LLMs. It handles model downloads, quantisation, and serving via a local REST API.
# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh
# Ollama runs as a systemd service automatically
# Pull a model (Llama 3.2 3B is a good starting point)
ollama pull llama3.2
# Run it interactively
ollama run llama3.2
# Or use the REST API (runs on port 11434)
curl http://localhost:11434/api/generate -d '{
"model": "llama3.2",
"prompt": "Explain neural networks in one paragraph"
}'
Ollama keeps the model loaded in memory between requests, so subsequent queries are fast. The systemd service means it restarts automatically if the server reboots.
To pull a larger model for better quality responses:
# Mistral 7B (needs 8GB RAM)
ollama pull mistral
# Llama 3 8B (needs 8GB RAM, higher quality)
ollama pull llama3
Step 3 — Install OpenClaw and Connect to Ollama
OpenClaw is a self-hosted AI agent platform that connects to your Ollama instance (or any OpenAI-compatible API) and gives you a full agent with memory, skills, cron jobs, and multi-channel messaging.
# Install Node.js 22 (if not already installed)
curl -fsSL https://deb.nodesource.com/setup_22.x | bash -
apt install -y nodejs
# Install OpenClaw globally
npm install -g openclaw
# Run setup wizard
openclaw setup
During setup, when asked for your AI provider, select “OpenAI-compatible” and enter http://localhost:11434/v1 as the base URL. This points OpenClaw directly at your Ollama instance. Enter any string as the API key (Ollama does not require one locally).
To connect a Telegram bot (optional but highly recommended):
- Message @BotFather on Telegram and create a new bot with
/newbot - Copy the bot token
- In OpenClaw config, add the Telegram plugin with your token
- Start OpenClaw:
openclaw start
You now have a private AI assistant on Telegram backed by your own local model. No data leaves your server.
Step 4 — Keep It Running 24/7
Both Ollama and OpenClaw need to survive server reboots and crashes.
Ollama is already managed by systemd (the install script sets this up). Verify: systemctl status ollama
For OpenClaw, use PM2:
# Install PM2
npm install -g pm2
# Start OpenClaw with PM2
pm2 start "openclaw start" --name openclaw
# Save PM2 process list and enable on boot
pm2 save
pm2 startup
Security hardening (do this before anything else):
# Install fail2ban (protects SSH from brute force)
apt install fail2ban -y
# Disable password auth, use SSH keys only
# Edit /etc/ssh/sshd_config:
# PasswordAuthentication no
# PermitRootLogin prohibit-password
systemctl restart sshd
What Models Run Well on a Budget VPS
| Model | RAM Required | Runs on $7/mo VPS? | Quality |
|---|---|---|---|
| Llama 3.2 3B | 4GB | ✓ Yes | Good for simple tasks |
| Llama 3.2 8B | 8GB | ✓ Yes | Very good general use |
| Mistral 7B Q4 | 8GB | ✓ Yes | Very good, fast |
| Llama 3 8B full | 16GB | Needs upgrade | Excellent |
| Llama 3 70B Q4 | 40GB+ | ✗ No | Near GPT-4 quality |
| Gemma 2 9B | 10GB | Borderline | Very good |
For the Cloud VPS 60, you have enough RAM to run Llama 3 8B at full precision alongside a vector database and OpenClaw simultaneously — a complete self-hosted AI stack.
Is It Worth It?
Yes, if you are: a developer who wants a private AI API, someone building automation pipelines, privacy-conscious and uncomfortable with data going to third parties, or running multiple AI-powered services where cloud API costs would add up.
Probably not, if you are: a casual user who just wants quick answers and does not mind using ChatGPT, someone who needs GPT-4-class quality (you need cloud or GPU for that), or a non-technical user who does not want to manage a Linux server.
The setup takes about 30 minutes. After that, it runs itself. The OpenClaw guides on this site cover everything from initial setup to advanced automation workflows — including how to use cron jobs, heartbeat monitoring, and skill extensions to build a genuinely powerful self-hosted AI assistant.
Get started with Contabo Cloud VPS 60 — the sweet spot for AI self-hosting in 2026.
What to Read Next
- Best Open Source AI Voice Cloning Tools in 2026: Top Picks Compared
- Mastering OpenClaw: A Practical Guide to AI-Powered Automation
- How to Summarize Meetings and Long Documents with AI in 2026: A Step-by-Step Guide
- Claude Code Quality Update 2026: Latest HackerNews Reports, Benchmarks, and Performance Fixes
- Browse all AI Stack Digest articles
Bookmark aistackdigest.com for daily AI tools, reviews, and workflow guides.
This article was produced with the assistance of AI tools and reviewed by the AIStackDigest editorial team.