The Cloud Codex

// featured

the-26-who-never-rolled-back-an-agent-arent-winning.mdx featured

The 26% Who Never Rolled Back an Agent Aren't Winning

New survey data shows AI agent rollback rates rise, not fall, as safety guardrails mature — because better monitoring catches failures other teams miss entirely.

michael tuszynski · Jul 28, 2026 · 6 min read

// series

$ ls ./series --all →

7-part series

Building an AI Assessment System

A field build of AI-graded assessment — grading written answers, testing the judge, red-teaming it, and measuring what actually moved the needle for students. Read in order.

▶ start at part 1 · latest: part 7

// start here

$ grep -l signal ./essays

// most recent

$ ls -t ./essays

[ 01 ] Jul 23, 2026 · 5 min read

AI Use Is a 50/50 Coin Flip. Coding Already Tipped 61% Autonomous.

#ai-agents #software-development #ai-automation

[ 02 ] Jul 21, 2026 · 5 min read

The Meter Is Always Running

#ai-economics #enterprise-ai #cloud-strategy

[ 03 ] Jul 20, 2026 · 5 min read

Your AI Coding Pilot Cost $80K and Shipped 9% Faster. Here's the Line Item Finance Missed.

#ai-engineering #engineering-leadership #developer-productivity

[ 04 ] Jul 16, 2026 · 7 min read

The Cloud Codex — field notes on building with AI agents

The Bubble Popper and the Payoff Are the Same Thing

Torvalds' Best Review Trick Just Stopped Working

Attention Is the New Bottleneck. Engineer It Like One.

Where Your $20K in Tokens Actually Goes

Try It: A Working Assessment-First Course

Instrument Like a Learning Scientist

The Biggest Effect Size Was the Boring Feature

The Grader Was Right and the Students Quit Anyway

Students Are Adversaries: Red-Teaming an LLM Grader