You can now run OpenAI’s latest chat, vision, and reasoning models on Replicate, including GPT-4.1, GPT-4o, and the o-series.

Here are the new models:

GPT-4.1 series: Handles long context (up to 1 million tokens). Good for large documents, full codebases, and agent workflows.

GPT-4o series: Fast, multimodal models that understand text, images, and audio.

o-series: Models built for structured reasoning in math, science, and complex problem solving.