Posts
All the articles I've posted.
LHC v0.2: A Benchmark for Long-Horizon Agent Coherence (and the Methodology That Got It Honest)
Published: at 08:00 PMI just published LHC v0.2, an open benchmark for long-horizon coherence in 8B-class agent models, plus a deterministic parser baseline that puts a useful floor on what fine-tuning is worth for structured-state tasks. This post explains what they're for, how to use them, and the methodology arc that produced them across five rounds of external review.
We're Mistaking the Bootstrap Phase for the Future of AI Agents
Published: at 08:00 AMThe self-hosted AI agent movement is real and important. But we are confusing a bootstrap phase with a destination architecture. The long-term future of agents will be defined by platforms that make them reliable, governable, and operationally boring.
Execution Is Cheap. Judgment Isn't: AI Agents and the Collapse of the CTO/CPO Divide
Published: at 10:00 AMWhen execution becomes abundant through AI agents, judgment becomes the bottleneck. The traditional separation between technical and product leadership breaks down, creating space for the CPTO role.
The OWASP Top 10 for AI Agents - Security in the Age of Autonomy
Published: at 08:00 AMOWASP just published their first Top 10 for Agentic Applications. Here's what every agent builder needs to know about the new attack surfaces, why traditional security fails, and how the orchestration layer becomes the new security boundary.
From OpenClaw's Chaos to OpenAI's Frontier: The Agent Infrastructure Reckoning
Published: at 10:00 AMOpenClaw went from viral sensation to security nightmare. Today, OpenAI launched Frontier. Two sides of the same story: autonomous agents are going mainstream, and the infrastructure isn't ready.