Local AI Agents.
Governed.
ShellForge runs autonomous AI agents on your machine with policy enforcement on every tool call. No cloud. No API keys. No data leaves your laptop.
A governance wrapper for any local agent
ShellForge is not an agent framework. It's the enforcement layer that sits between your agent driver and the real world. The agent decides what it wants to do. ShellForge decides whether it's allowed.
Crush handles execution. Dagu handles orchestration. AgentGuard wraps them all with policy enforcement.
Best-in-class components, governed end-to-end
Each layer is an independent best-of-breed tool. ShellForge wires them together and adds governance at the enforcement boundary.
Ollama
Local LLM inference with Metal GPU on Apple Silicon. Pull any model — qwen3:8b for everyday tasks, qwen3:30b for production quality on M4 Pro.
RTK
Token compression layer — 70–90% reduction on shell output. Agents see compact, structured results instead of raw terminal noise. More signal, fewer tokens spent.
Crush
Go-native AI coding agent with TUI and headless mode. Uses Ollama for local inference. Full tool-calling loop — reads files, writes code, runs shell commands.
Dagu
YAML DAG workflows with cron scheduling and a web UI. Run multi-step agent pipelines, schedule recurring tasks, observe runs in a browser dashboard.
AgentGuard
Policy enforcement on every action — allow, deny, or correct. 26 built-in invariants, 93 destructive patterns, tamper-evident audit trail. The enforcement boundary agents cannot bypass.
OpenShell + DefenseClaw
Kernel-level isolation via Docker on macOS. DefenseClaw scans for supply chain risks — AI Bill of Materials. Optional but recommended for untrusted workloads.
From zero to governed agent in 5 minutes
macOS (Apple Silicon or Intel) or Linux.
Policy-as-code for any agent driver
Every tool call — file write, shell command, git push — passes through agentguard.yaml before execution. When an action is denied, the correction engine feeds structured feedback back to the model so it can self-correct — not just fail.
mode: enforce # enforce | monitor
policies:
- name: no-force-push
action: deny
pattern: "git push --force"
- name: no-destructive-rm
action: deny
pattern: "rm -rf"
- name: no-secret-access
action: deny
pattern: "*.env|*id_rsa"
- name: allow-src-writes
action: allow
scope: ["src/**", "tests/**"]
invariants:
- tests-before-push
- no-credential-file-creation
- lockfile-integrity
Choose Your Model
Pull any model via Ollama. ShellForge works with all of them.
| Model | Params | RAM | Best For |
|---|---|---|---|
| qwen3:1.7b | 1.7B | ~1.2 GB | Fast tasks, prototyping |
| qwen3:4b | 4B | ~3 GB | Balanced reasoning |
| qwen3:8b | 8B | ~6 GB | Good balance (recommended) |
| qwen3:30b | 30B | ~19 GB | Production quality (M4 Pro 48GB) |
Your agents. Your machine. Your rules.
Local inference, zero cloud, policy enforcement on every action. ShellForge is governed AI for developers who don't want to ask permission.