Microsoft's new superintelligence team has unveiled its first product: MAI-Image-2, an image generator that is rolling out across Microsoft's own products and will eventually be available via API.

Microsoft's superintelligence team, led by Mustafa Suleyman, has released MAI-Image-2, an AI model that turns text prompts into images. The model currently ranks third on the Arena.ai leaderboard for text-to-image generators, trailing OpenAI's GPT-Image-1.5 and Google's Nano Banana 2 by a significant margin.

According to Microsoft, MAI-Image-2 produces especially realistic photos with natural lighting and accurate skin tones, while also handling detailed and surreal scenes. The company says it built the model alongside photographers, designers, and visual artists.

Microsoft says MAI-Image-2 generates photorealistic images with natural lighting and fine detail. The examples shown include a portrait with shadow play, a macro shot of an iris, and a glacier cave scene. | Image: Microsoft

The model also does well with more practical tasks, like reliably rendering text in images for posters, infographics, or diagrams.