Last month, I had to scrape product data from a dozen e-commerce sites. Each site had its own HTML structure and inconsistent CSS classes. The worst part? The product descriptions were nested inside a dozen different containers. I did what any sane developer would do: I reached for regex.
Three days later, I had a pile of brittle patterns that worked for one site, failed for another, and broke the moment the page layout changed by a single <div>. I was this close to giving up and hiring a VA to copy-paste data.
Then it hit me: instead of trying to describe the pattern explicitly, why not show an AI a few examples and let it figure out the rest?
What I tried that didn't work
1. Pure CSS selectors






