Discussion about this post

User's avatar
The Next Evolution's avatar

The existing comment from Amal Jbira takes the human cognition angle well — worth reading if you haven't. I want to push on something adjacent but different.

The piece correctly identifies fluency as the mechanism of trust. The manager approves because it looks clean. But in most enterprise workflows, that approval isn't really a quality judgment — it's a legal one. The signature means the human took responsibility. The downstream failure is now on record as something a person cleared.

The interception layer the piece proposes is the right technical answer. But there's an organisational question that sits upstream of it: who decided that this workflow should use AI, and who assessed the liability profile of that decision? In most organisations, those two things aren't connected. The technical team deploys. The approval process continues as if the data source hasn't changed. The governance framework for the old workflow gets inherited by the new one, with no audit of whether it still holds.

The flawless hallucination is dangerous because it looks like someone else's problem until it isn't. The API breaks, the engineers debug, the root cause eventually traces back to a model output that a person signed off on. At that point, the question isn't technical. It's whether the organisation that made the deployment decision understood what it was creating — and whether anyone ever asked.

Amal Jbira's avatar

This is a sharp and necessary piece, the "flawless hallucination" framing captures something most people in enterprise AI are still not taking seriously enough.

I'd push the concern one layer deeper though. You're describing what confident AI does to our systems. I've been thinking about what it does to us, the humans in the loop.

The same mechanism you identify here (fluency as a trust signal, errors indistinguishable from accuracy) doesn't just break downstream pipelines. It quietly erodes the internal validation layer we carry as thinkers. Each time an AI output lands well, a small deposit of trust accumulates. Over time, the pause — the moment where we used to ask "wait, do I actually agree with this?" — stops firing. Not because we chose to skip it. Because it stopped feeling necessary.

I call this defense dissolution. And unlike a broken API, it leaves no error log.

Your fix is an interception layer between AI output and live infrastructure. I think we also need one between AI output and our own judgment — deliberate friction, maintained skepticism, the habit of owning the reasoning rather than just evaluating someone else's.

The technical problem has a technical solution. The human problem is harder. And I'd argue it's the one we're least prepared for.

2 more comments...

No posts

Ready for more?