Frontier AI models corrupt 25% of document content in multi-step workflows — rewriting rather than deleting, which makes the errors far harder to catch.

New research from a trio of Microsoft researchers reveals that LLMs ‘introduce substantial errors when editing work documents.’

Frontier AI models corrupt 25% of document content in multi-step workflows — rewriting rather than deleting, which makes the errors far harder to catch.