Monday, June 22, 2026

Google is spending billions to show its TPU chips into an actual challenger to Nvidia

TL;DR: Google is pushing its in-house AI chips rather more aggressively, turning years of tensor processing unit growth right into a direct problem to Nvidia’s maintain on the AI {hardware} market. For years, the corporate constructed its chips largely to deal with its inside workloads. These tensor processing items, or TPUs, sat behind merchandise like search and speech recognition, dealing with among the firm’s heavier AI workloads. Now, Google is making an attempt to show that in-house benefit right into a enterprise that may stand as much as Nvidia.

One clear instance of that shift is in western New York at an AI data-center cluster known as Lake Mariner, on Lake Ontario’s southern shore close to Niagara Falls. Alphabet’s Google has offered a $3.2 billion monetary assure for the undertaking, whose builders plan to lease computing energy from 1000’s of Google’s chips to Anthropic, in accordance with individuals acquainted with the matter who spoke to The Wall Avenue Journal.

The essential playbook is much like Nvidia’s: assist data-center financing after which profit when these websites purchase your chips.

That form of financing has turn out to be extra essential as the marketplace for AI compute has tightened. Over the previous yr, the AI race has turn out to be much less about fashions and extra about sheer entry to computing energy. “You’ve gotten all these very well-capitalized corporations who’re huge believers that this market round compute goes to have great worth,” stated Nazar Khan, co-founder and chief know-how officer of TeraWulf, which is growing Lake Mariner with FluidStack, a Google-backed cloud supplier. “They need to be within the recreation, they do not need to be left behind,” Khan informed The WSJ.

The story behind Google’s push goes again to 2013. Jeff Dean, now chief scientist at Google’s DeepMind lab, recalled engaged on speech recognition techniques constructed on the neural-network strategies that later developed into in the present day’s giant language fashions. “I stated, ‘OK, if we need to have this speech mannequin that we roll out to 100 million customers, they usually use it a couple of minutes a day, that will require doubling the variety of computer systems Google had,'” he stated. “We have to construct specialised {hardware}.” That conclusion helped spur Google’s TPU program, which has since produced a number of generations of the chips.

Google saved these chips to itself at first, then began providing them by Google Cloud as demand for AI computing exploded. That step helped drive progress within the cloud enterprise and set the stage for extra direct competitors with Nvidia. Analysis agency SemiAnalysis requested in a November notice whether or not the discharge of Google’s seventh-generation TPU – which Anthropic makes use of to coach its fashions – marked “the top of Nvidia’s dominance.”

The corporate’s newest strikes recommend it’s keen to check that query. Google not too long ago struck a $5 billion cope with Blackstone to create a brand new cloud-services enterprise designed to compete with Nvidia-aligned suppliers akin to CoreWeave and Nebius. It has additionally determined to promote chips on to prospects relatively than solely by its cloud and has rolled out its first TPU designed particularly for inference.

Mark Lohmeyer, vice chairman of AI and computing infrastructure for Google Cloud, stated the brand new inference chip and enhancements in how TPUs work throughout completely different techniques have generated new curiosity in utilizing them. “We’re seeing a set of consumers which may not have thought of it prior to now,” he stated.

Citadel Securities, a longtime Google Cloud consumer, not too long ago started utilizing TPUs for a few of its analysis software program. Josh Woods, the agency’s chief know-how officer, stated the corporate can run key workloads at 30% decrease price and as much as 4 occasions sooner with TPUs.

Nvidia, for its half, isn’t treating TPUs as an existential menace. The corporate nonetheless controls an estimated greater than 90% of the AI chip market, helped by its CUDA software program stack and a {hardware} ecosystem that many AI labs already depend on.

In an April look on podcaster Dwarkesh Patel’s present, CEO Jensen Huang stated Nvidia has a a lot wider attain than any customized chip or ASIC. “I’d love to listen to them exhibit the associated fee benefit of TPUs,” he stated. “It is mindless in my thoughts.”

Some cloud suppliers fear they’re locked into Nvidia’s full stack, involved that shifting spend elsewhere may price them entry to its most coveted chips. Adam Fisher, a companion at Bessemer Enterprise Companions, stated some so-called neo-clouds worry ending up in what insiders half-jokingly name “Jensen jail.” “Not all of the Nvidia neo-clouds would say it this manner – some would say Nvidia offers them what they want – however there are others which can be dying for one thing else, however they can not get it from one other provider,” he stated.

Google is making an attempt to counter that inertia with cash and focus. The corporate has stated it plans to lift $85 billion in fairness, largely to assist AI infrastructure. It’s backing one other Anthropic-related undertaking known as River Bend, a $7 billion growth close to Baton Rouge, La., and is offering $1.4 billion in ensures for an AI computing lease in Colorado Metropolis, Texas.

Inside Google, the TPU enterprise has taken on the next profile underneath Amin Vahdat, who in December grew to become chief technologist in command of the corporate’s AI infrastructure build-out. His portfolio now spans chip design, provide and deployment, and he experiences each to Google Cloud chief Thomas Kurian and Alphabet CEO Sundar Pichai. Individuals who have labored with him describe Vahdat as demanding however quietly aggressive, and say he’s pushing for regular efficiency features and clearer business objectives for Google’s silicon efforts.

Vahdat says he isn’t getting down to knock Nvidia off its pedestal. Google nonetheless runs Nvidia GPUs in its information facilities, and he describes the connection as each cooperative and aggressive. “For me and for us, it isn’t zero-sum,” he stated. “There’s a lot demand on the market.”

With AI workloads rising sooner than anyone provider can deal with, that could be the opening Google wants. If Google retains enhancing its chips, strains up long-term prospects akin to Anthropic and Citadel, and makes use of its steadiness sheet to assist construct data-center capability, TPUs may turn out to be greater than an inside device. They could possibly be a real second choice in a market that has largely belonged to Nvidia.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles