Why My CSS Selectors Kept Breaking (and How LLMs Fixed It)

Every developer who has scraped the web knows the pain of brittle parsers.

I was building a small side project to aggregate job listings from a handful of startup pages. Nothing fancy — just grab title, company, location, and description. The sites were all different, but they had one thing in common: they changed their markup every few weeks, and my carefully crafted CSS selectors would snap.

At first I thought I could outsmart them. Use more generic selectors? XPath? Regex? No. Each change meant hours of debugging. I needed a different approach.

The Breaking Point

Last month, one of the target sites rolled out a redesign. My scraper returned zero listings. The HTML was completely reorganized. I spent an afternoon updating selectors, only to realize the next site in my list was also due for a refresh. I was fighting entropy.

Why My CSS Selectors Kept Breaking (and How LLMs Fixed It)

Related reading

I Gave Up on CSS Selectors: Using LLMs for Web Scraping

I stopped fighting broken parsers — here's how I use LLMs to extract web data…

I Built a Free API That Scrapes Any Website Using Plain English - No CSS…

Why I Gave Up on Perfect Selectors and Asked GPT to Extract My Data

I Tried AI-Powered Web Scraping So My Selectors Could Finally Rest

I spent 3 days scraping a site until I tried LLMs for data extraction