How I use LLMs for structured classification without getting garbage output


Monday, June 15, 2026


I've been using LLMs to classify GitHub pull requests into changelog categories. The goal: automatically decide whether a PR is a feature, a bugfix, a breaking change, or internal noise.

It took several iterations to get consistent output. Here's what actually worked.

The problem with direct classification

The naive approach:

Classify this PR: feature / bugfix / breaking / internal.
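
To make the naive approach concrete, here is a minimal sketch of what it looks like in code, assuming the model's reply comes back as free text. The helper names (build_naive_prompt, parse_label) and the sample replies are hypothetical illustrations, not part of my actual pipeline and not recorded model output. No model is called here; the point is only that a strict parser has nothing to hold on to when the reply is not exactly one of the four labels.

```python
from typing import Optional

# The four changelog categories named in the prompt above.
CATEGORIES = ("feature", "bugfix", "breaking", "internal")


def build_naive_prompt(pr_title: str, pr_body: str) -> str:
    """Prepend the one-line instruction to the PR text (hypothetical helper)."""
    return (
        "Classify this PR: feature / bugfix / breaking / internal.\n\n"
        f"Title: {pr_title}\n\n{pr_body}"
    )


def parse_label(reply: str) -> Optional[str]:
    """Accept a reply only if it is exactly one known category.

    Anything else (a sentence, two labels, a synonym) returns None,
    so the caller can see the output was not usable as-is.
    """
    cleaned = reply.strip().strip(".").lower()
    return cleaned if cleaned in CATEGORIES else None


if __name__ == "__main__":
    print(build_naive_prompt("Fix crash on empty config", "Guards against None."))
    # Made-up replies, for illustration only.
    for reply in ["bugfix", "Bugfix.", "This looks like a bug fix to me."]:
        print(repr(reply), "->", parse_label(reply))
```

With this sketch, a bare label parses cleanly, while a full-sentence reply is rejected outright because the prompt does nothing to constrain the shape of the answer.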
