Skip to content
Failure Mode

Process confabulation

A failure mode where an agent narrates progress it didn't actually perform (tests run, files changed, PR opened).

Also known as: fake-progress, narrated-success

Definition

Process confabulation is when an agent narrates steps it didn’t actually perform: tests “passed,” files were “updated,” a PR was “opened,” etc.

It’s not just factual hallucination. It’s workflow hallucination.

Why it happens

  • The agent assumes commands succeeded.
  • The agent fills in gaps when it lacks state.
  • The system doesn’t force evidence, so narration “works.”

Signals to watch for

  • Vague confirmations (“everything looks good”)
  • Results without commands (“tests passed” but no test output)
  • Inconsistent details (wrong file paths, wrong counts)
  • The agent avoids showing diffs

How to prevent it

Why “just trust the model” doesn’t scale

At small scale, you can spot-check. At swarm scale, you can’t. If you want parallelism, you need receipts.

Practical rule

If it isn’t in the diff or in the logs, it didn’t happen.

Related Terms