Posts

All the articles I've posted.

LHC v0.2: A Benchmark for Long-Horizon Agent Coherence (and the Methodology That Got It Honest)
Published:May 10, 2026 at 08:00 PM
I just published LHC v0.2, an open benchmark for long-horizon coherence in 8B-class agent models, plus a deterministic parser baseline that puts a useful floor on what fine-tuning is worth for structured-state tasks. This post explains what they're for, how to use them, and the methodology arc that produced them across five rounds of external review.
We're Mistaking the Bootstrap Phase for the Future of AI Agents
Published:Apr 1, 2026 at 08:00 AM
The self-hosted AI agent movement is real and important. But we are confusing a bootstrap phase with a destination architecture. The long-term future of agents will be defined by platforms that make them reliable, governable, and operationally boring.
Execution Is Cheap. Judgment Isn't: AI Agents and the Collapse of the CTO/CPO Divide
Published:Feb 21, 2026 at 10:00 AM
When execution becomes abundant through AI agents, judgment becomes the bottleneck. The traditional separation between technical and product leadership breaks down, creating space for the CPTO role.
The OWASP Top 10 for AI Agents - Security in the Age of Autonomy
Published:Feb 9, 2026 at 08:00 AM
OWASP just published their first Top 10 for Agentic Applications. Here's what every agent builder needs to know about the new attack surfaces, why traditional security fails, and how the orchestration layer becomes the new security boundary.
From OpenClaw's Chaos to OpenAI's Frontier: The Agent Infrastructure Reckoning
Published:Feb 5, 2026 at 10:00 AM
OpenClaw went from viral sensation to security nightmare. Today, OpenAI launched Frontier. Two sides of the same story: autonomous agents are going mainstream, and the infrastructure isn't ready.

Posts

LHC v0.2: A Benchmark for Long-Horizon Agent Coherence (and the Methodology That Got It Honest)

We're Mistaking the Bootstrap Phase for the Future of AI Agents

Execution Is Cheap. Judgment Isn't: AI Agents and the Collapse of the CTO/CPO Divide

The OWASP Top 10 for AI Agents - Security in the Age of Autonomy

From OpenClaw's Chaos to OpenAI's Frontier: The Agent Infrastructure Reckoning

Explore by Tags