The dangerous part of web extraction is not the error.

The dangerous part is a clean JSON response that looks correct and is not.

If an AI agent uses that output, the mistake does not stay inside a scraper. It moves into a report, a lead list, a price alert, a CRM update, or an automated decision.

So before you trust any web extraction API in an agent workflow, test whether it fails honestly.

Here is the checklist I use.