Blog

Engineering insights from building Surchin.

March 6, 2026 · Matt McKenna

Surchin Cuts Agent Costs 21% With Statistically Significant Results

We ran 162 Opus 4.6 agent sessions across Python and Android codebases. Surchin reduced costs by up to 21% and made agent behavior 3x more predictable.

March 5, 2026 · Matt McKenna

An AI agent gave us feedback on our CLAUDE.md instructions. We built a benchmark to test its suggestions. The results surprised us.

March 4, 2026 · Matt McKenna

What we learned from benchmarking 20+ CLAUDE.md instruction variants across Claude Opus, Sonnet, and Haiku.