OpenAI is adding custom hardware to its tech stack. The "Jalapeño" chip, developed with Broadcom, is tailored for large language model inference and is set to run at scale by late 2026.

In a joint announcement, OpenAI and Broadcom unveiled "Jalapeño," OpenAI's first so-called "Intelligence Processor." It's a custom accelerator built specifically for large language model inference, and the first chip in a multi-generation platform the two companies are building together.

Broadcom CEO Hock Tan and President Charlie Kawwas handed the first wafer to OpenAI CEO Sam Altman and President Greg Brockman. The handover marks OpenAI's first step into custom hardware after years of focusing on models and products.

OpenAI says Jalapeño isn't a modified general-purpose chip but was designed from scratch for modern LLM inference. OpenAI handles the chip design, Broadcom contributes silicon manufacturing and networking technology, including its Tomahawk networking chips, and Celestica takes care of boards, racks, and system integration.

Performance claims lack independent verification