Crypto

OpenAI Reduces Nvidia Dependency While NVDA Stock Climbs: What's Behind the Paradox?

OpenAI Reduces Nvidia Dependency While NVDA Stock Climbs: What's Behind the Paradox?

Nvidia finds itself in an unusual position: two of the most significant threats to its dominance emerged in quick succession, yet the company's stock continues to rise. First, China demonstrated it could train a powerful AI model without a single Nvidia chip. Now, OpenAI — one of Nvidia's most important customers — has dramatically reduced how many of those chips it actually needs. So why aren't investors panicking?

OpenAI has been quietly chipping away at its Nvidia dependency on two distinct fronts. The first is software optimization. According to reporting by The Information, OpenAI engineers developed new inference optimization techniques that slashed inference costs by more than half. While the company hasn't disclosed the technical specifics, the implications are clear: fewer Nvidia GPUs are required to handle the same volume of ChatGPT traffic. This opens the door for OpenAI to either cut user-facing prices or significantly expand usage limits without adding hardware.

The second front is custom silicon. On June 24, OpenAI and semiconductor giant Broadcom jointly announced a chip called Jalapeño — OpenAI's first purpose-built AI processor. Designed from scratch in just nine months, the chip is optimized specifically for large language model workloads, including ChatGPT, Codex, API services, and future agentic applications. Early benchmarks reportedly show substantially better performance per watt compared to current leading chips. Deployment is planned at gigawatt scale by the end of 2026, with Microsoft serving as the primary partner.

OpenAI is far from alone in this shift. Google has operated its own tensor processing units since 2016, and Amazon has since developed its own custom AI accelerators. Research firm TrendForce projects that ASIC-based systems will account for 27.8% of AI server shipments in 2026 — the highest share since 2023 — with custom chip growth set to outpace Nvidia GPU adoption for the first time. Broadcom and Marvell have emerged as key players in this custom silicon buildout.

China's trajectory adds another layer of complexity. Meituan recently trained its 1.6 trillion parameter LongCat-2.0 model entirely on domestic Chinese chips, bypassing Nvidia hardware entirely — a milestone accelerated by U.S. export restrictions.

Despite all of this, Nvidia stock rose nearly 2% on June 30, pushing the company's valuation close to $4.8 trillion. The reason lies in the numbers. Nvidia's most recent quarterly results revealed data-center revenue surging 75% year-over-year to a record $62.3 billion in a single quarter. Demand continues to outpace supply, and the company is still selling every chip it can manufacture.

Crucially, the current competitive pressure is concentrated at the inference layer, not training. Nvidia maintains an iron grip on model training, largely thanks to its CUDA software ecosystem, which developers have built on since 2006. Custom chips, while improving, rarely offer the same flexibility or developer familiarity.

Nvidia is also actively defending its inference turf. At its GTC conference, the company revealed that its upcoming Rubin platform is expected to reduce inference cost per token by up to ten times compared to its current Blackwell architecture. Historically, cheaper inference tends to increase overall usage and total compute demand — a dynamic that could offset volume losses from efficiency gains.

Some investors have rotated into rival chip stocks, betting that the inference shift will compound over time. But Nvidia issued its latest guidance without factoring in any China revenue and still projects record demand. The central question isn't whether competition is growing — it clearly is. The real test is whether Nvidia's biggest customers can scale their alternatives faster than the overall AI market continues to expand.

Read Also