Michael Rothrock
Thinking out loud about AI agent reliability, verification design, and what the data actually says.
Why AI failures propagate — and why the fix is checkpoints, not better models