OpenAI Jalapeño: Its First Custom AI Chip Built for LLM Inference In Partnership with Broadcom

Mirror Review

June 25, 2026

OpenAI has officially unveiled OpenAI Jalapeño, its first custom AI chip designed from scratch to run future large language models. Developed in partnership with Broadcom, this new intelligence processor marks the company’s expansion from software development into physical hardware infrastructure. The application-specific integrated circuit (ASIC) focuses entirely on AI inference, which is the process of serving live AI responses to users on platforms like ChatGPT and Codex. OpenAI plans to begin initial deployments of the chip within global data centers by the end of 2026.

By controlling both the AI models and the physical silicon underneath them, OpenAI aims to make advanced artificial intelligence faster, more reliable, and significantly more affordable for everyday users.

What is the OpenAI Jalapeño Chip?

The OpenAI Jalapeño chip is a highly specialized hardware accelerator built specifically for large language model (LLM) inference.

Unlike general-purpose graphics processors that handle a wide variety of computing tasks, this custom silicon focuses solely on running trained models efficiently.

When a user submits a prompt to ChatGPT or instructs an AI agent to execute code, the inference engine processes that specific request in real time.

Key Specifications and Technical Design of the Jalapeño Chip

Chip Type: Application-Specific Integrated Circuit (ASIC) tailored for LLM workloads.
Primary Partner: Broadcom handled the silicon implementation, while Celestica assisted with board and rack system integration.
Networking Base: Utilizes Broadcom’s Tomahawk networking silicon to support massive multi-chip arrays.
Target Load: Lab engineering samples are already running frontier workloads, including experimental testing on models like GPT-5.3-Codex-Spark.
Efficiency Focus: The internal architecture minimizes data movement between memory and computing cores to maximize actual hardware utilization.

Instead of adapting an older architecture designed for graphics or general AI training, engineers designed this platform as a completely blank slate. This design approach targets the specific memory bottlenecks and communication patterns that occur when serving massive modern language networks.

Why OpenAI is Building its First Homegrown AI Chip

The decisions shaping OpenAI’s first homegrown AI chip stem directly from a global shortage of advanced computing power.

Since kick-starting the generative AI boom, the company has operated as one of the largest buyers of Nvidia’s expensive graphics processing units. However, the explosive demand for automated intelligence has made relying on a single hardware vendor a major operational risk.

By developing OpenAI’s first custom AI chip, the organization establishes a self-sustaining cycle of efficiency.

Lowering the electrical and financial cost of running data centers means the company can offer faster response times without charging premium prices.

This control over the hardware infrastructure protects the company from market supply shocks and allows it to tailor servers to its exact algorithmic needs.

The Nine-Month Development Cycle

The creation of OpenAI’s first AI Processor set a remarkably fast timeline for the semiconductor industry. The team progressed from an initial blank-slate concept to a completed manufacturing tape-out in just nine months. Industry experts typically expect custom high-performance silicon development to take multiple years of design and verification.

The Nine-Month Development Timeline Includes:

Phase 1: Architecture Definition (Based on LLM kernel requirements)

Phase 2: AI-Assisted Chip Design (OpenAI models automated optimization)

Phase 3: Silicon Implementation (Broadcom manufacturing preparation)

Phase 4: Hardware Tape-out & Lab Delivery (Physical engineering samples)

A major factor in this rapid schedule was the use of artificial intelligence itself. Engineers utilized existing OpenAI models to automate, check, and improve parts of the physical design process. This creates an interesting feedback loop where current models help design the exact physical infrastructure required to run the next generation of software.

Performance Expectations and Industry Impact

While OpenAI plans to release an exhaustive technical report in the coming months, early lab benchmarks show significant promise.

The AI Chip Jalapeño manages energy consumption efficiently, yielding a performance-per-watt metric that stands substantially above current industry benchmarks.

“Jalapeño was designed from the ground up for LLM inference using detailed insights from our close collaboration with OpenAI researchers,” said Richard Ho, leader of OpenAI’s hardware program. “We optimized the architecture around the kernels, memory movement, networking, and serving patterns that matter most for frontier AI models.”

Broadcom CEO Hock Tan noted that the new chip matches the general capabilities of top-tier hardware like Nvidia’s Blackwell series and Google’s Tensor Processing Units (TPUs) during specialized inference tasks.

OpenAI joins other major technology companies, such as Microsoft, Amazon, and Meta, which have also deployed custom silicon inside their private server farms to manage soaring operational expenses.

Deployment Timeline and Future Roadmap

The arrival of the physical silicon sample marks only the first phase of a broader multi-generation compute initiative. The organizations are preparing a massive physical footprint for these chips over the next several years.

OpenAI Jalapeño Rollout Schedule (2026–2028)

Late 2026: Small-scale prototype deployment begins within selected data centers managed by infrastructure partners like Microsoft.
2027: Production lines will ramp up significantly to distribute racks of custom processors globally.
First Half of 2028: Full-tilt manufacturing and deployment will expand to achieve a gigawatt-scale infrastructure footprint.

This multi-year roadmap is designed to support the shift from simple conversational chatbots to persistent, autonomous AI agents that run continuously in the background.

These advanced agentic systems require massive amounts of continuous energy and processing power, making specialized hardware efficiency vital.

End Note

The introduction of OpenAI Jalapeño is a major transition for the generative AI market. By controlling everything from the user interface down to the physical silicon transistors, the company aims to make intelligence as accessible and affordable as electricity.

While it will take a few years for the AI Chip Jalapeño to populate global data centers at full scale, this partnership with Broadcom proves that the future of artificial intelligence relies as much on hardware innovation as it does on smart software algorithms

Maria Isabel Rodrigues