Status: Current best available technique as of 2026. Treat as standard practice, not a workaround.

The Problem

Three naive approaches fail on modern sites:

view-source / static fetch: returns the server's HTML before any JavaScript runs, so JS-rendered tables show only empty <tbody> tags.

Screenshot + OCR: slow, pixel-dependent, and brittle. OCR misreads compound on numeric data, where a single wrong digit changes the value.
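
To make the static-fetch failure concrete, here is a minimal sketch using only the Python standard library. The HTML string is a made-up stand-in for what a server might return for a JS-rendered page (the table id, column names, and script path are assumptions for illustration, not taken from any real site). It counts the rows a static parser would actually see inside <tbody>.

```python
from html.parser import HTMLParser

# Hypothetical server response for a JS-rendered page: the table shell is
# present, but the rows are filled in later by a script in the browser.
STATIC_HTML = """
<table id="prices">
  <thead><tr><th>Item</th><th>Price</th></tr></thead>
  <tbody></tbody>
</table>
<script src="/app.js"></script>
"""


class BodyRowCounter(HTMLParser):
    """Counts <tr> elements that appear inside a <tbody>."""

    def __init__(self):
        super().__init__()
        self.in_tbody = False
        self.rows = 0

    def handle_starttag(self, tag, attrs):
        if tag == "tbody":
            self.in_tbody = True
        elif tag == "tr" and self.in_tbody:
            self.rows += 1

    def handle_endtag(self, tag):
        if tag == "tbody":
            self.in_tbody = False


def count_body_rows(html: str) -> int:
    """Return the number of data rows visible to a static parser."""
    parser = BodyRowCounter()
    parser.feed(html)
    parser.close()
    return parser.rows


print(count_body_rows(STATIC_HTML))  # 0: the header row is in <thead>, the body is empty
```

A row count of zero on a page that visibly shows data in a browser is a quick signal that the table is rendered client-side and that a static fetch will not be enough.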