AgentOS runs MLX, llama.cpp, LM Studio, and EXO clusters — your code, your models, your machine. No subscriptions. No cloud round-trips. Just an agent loop wired to the models you already trust.
Point AgentOS at a folder. It picks up your code,
MEMORY.md, and DECISIONS.md so
every conversation starts grounded — not from scratch.
Battle-tested recipes ship with the app — from "scaffold a
SwiftUI macOS app with a working .xcodeproj"
to "diagnose a Core Data threading violation." Attach them
per conversation, and the library is refreshed every release
so it tracks frontier model capability — never goes stale.
~/LibraryAuto-attach by intent (skills surfaced automatically when the conversation needs them) ships in a future release — paired with a toggle so users on M1 / 16 GB Macs can opt out, since expanding context every turn slows load times on older hardware.
Most local models look great in benchmarks and fall over the first time you ask them to call a tool. We curate the ones that work — Qwen 3.6, Gemma 4, gpt-oss, MiniMax — and ship them as one-click downloads sized for your Mac's RAM.
llama.cpp ships bundled and auto-starts. LM Studio plugs in with one click. EXO lets you fan inference across multiple Macs on your network. The Local API Server exposes everything as OpenAI-compatible so Xcode 16 Intelligence and any other client just work.
AgentOS is a $99 founders-price license while the first 200 spots last — then it goes to $199. No tiers, no subscriptions, no usage caps.
We'll email you when the founders-price gates open. No newsletter, no marketing blasts — just the launch ping.