Discussion about this post

User's avatar
Devesh's avatar

The 80% problem maps directly to what we've seen in production. The gap isn't model capability — it's the compound cost of verification.

That last 20% isn't linear. Each incremental % requires exponentially more human oversight, edge case handling, and rollback infrastructure. The economics flip somewhere around 75-85% depending on domain complexity.

Most teams underestimate this until they've built it twice.

Esborogardius Antoniopolus's avatar

We should not take the opinion of Andrej too seriously. He is an AI researcher, and most AI researchers are generally at most passable software engineers, if not outright bad ones.

It is the same old age problem of scientists code.

27 more comments...

No posts

Ready for more?