You can now run OpenAI’s latest chat, vision, and reasoning models on Replicate, including GPT-4.1, GPT-4o, and the o-series.
Here are the new models:
GPT-4.1 series: Handles long context (up to 1 million tokens). Good for large documents, full codebases, and agent workflows.
GPT-4o series: Fast, multimodal models that understand text, images, and audio.
o-series: Models built for structured reasoning in math, science, and complex problem solving.