Agents
12 Architecture Patterns from the Claude Code Leak -- Ranked by Payoff for Local AI
Claude Code's leaked source reveals 12 engineering patterns that power a $2.5B product. Ranked by how much each one improves your local AI agent setup.
epsiclaw: OpenClaw Stripped to 515 Lines of Python (The Karpathy Treatment)
epsiclaw is a minimal, readable reimplementation of OpenClaw in 515 lines of Python with 6 files and one dependency. Inspired by Karpathy's approach to autoresearch. Here's what it does and why it matters.
OpenClaw Model Combinations: What to Pair for Each Task
Stop running one model for everything in OpenClaw. Pair Qwen 2.5 Coder 32B for autocomplete, Qwen 3.5 27B for planning, and Qwen3-Coder-Next for agentic coding. Combos by VRAM tier.
Prompt Debt: When Your System Prompt Becomes Unmaintainable Spaghetti
Your system prompt started at 200 words. Six months later it's 3,000 words of contradictory instructions and panic patches. Here's how prompt debt accumulates, what it costs, and how to pay it down.
Model Routing for Local AI — Stop Using One Model for Everything
You're running one model for every task. That wastes VRAM, burns electricity, and gives worse results. Model routing sends each task to the right model at the right cost. Here's how to set it up.
Intent Engineering for Local AI Agents: A Practical Guide
Stop telling your agent to 'be helpful.' Start encoding specific goals, decision boundaries, and value hierarchies it can actually act on. Starter template included.
Agent Trust Decay: Why Long-Running AI Agents Get Worse Over Time
AI agents degrade after days of autonomous operation. Context pollution, memory bloat, and intent drift compound silently. A trust budget framework for knowing when to intervene.
Building AI Agents with Local LLMs: A Practical Guide
Build AI agents with local LLMs using Ollama and Python. Model requirements, VRAM budgets, framework comparison, working code example, and security warnings.
Best Local Alternatives to Claude Code in 2026
Aider, Continue.dev, Cline, OpenCode, Void, and Tabby compared. Which open-source coding tools work best with local models on your own GPU?
The Web Is Forking: What the Agentic Web Means for Local AI Builders
Coinbase, Stripe, Cloudflare, Google, OpenAI, and Visa are building a parallel web for AI agents. Money, search, content, execution — all redesigned for software clients. What local AI builders should do now.
LocalAgent: A Local-First Agent Runtime That Actually Cares About Safety
Rust CLI for AI agents with deny-by-default permissions, approval workflows, and deterministic replay. Works with LM Studio, Ollama, and llama.cpp.
Function Calling with Local LLMs: Tools, Agents, and Structured Output
Function calling with local LLMs using Ollama and llama.cpp. Qwen 2.5 7B matches GPT-4 accuracy for tool selection. Working code and agentic loop patterns.