AI & Machine Learning
Business Insider3 days ago
0

Why AI token prices are about to plummet

AI

AI token prices are expected to plummet due to new, more efficient hardware like Nvidia's Blackwell GPUs, which drastically reduce costs per token.

Why AI token prices are about to plummet

Intelligence Insights

Context + impact, normalized for TechCulture.

The Big Picture
A wave of new technology, particularly Nvidia's Blackwell GPUs, is set to make AI tokens much cheaper and more abundant. Blackwell systems generate 65 times more tokens per second than previous Hopper systems and are 35 times cheaper per million tokens, dropping costs from $4.20 to $0.12. This efficiency gain is already leading to price cuts by AI model providers, with OpenAI's Sam Altman acknowledging the issue. A token spending index from Silicon Data shows a decline from 2.06 to 1.75, indicating falling prices. As Blackwell systems scale up in the second half of 2026, token prices are likely to plummet further, potentially reducing concerns about token costs or spurring even greater usage.
Why It Matters
The impending drop in AI token prices, driven by Nvidia's Blackwell systems, could democratize AI usage by making it drastically cheaper. This shift may spur a surge in AI applications and user adoption, but also risks overconsumption of tokens, potentially straining infrastructure and energy resources.

Deepen your understanding

Use our AI to break down complex signals.

Select an AI action to generate more depth.

Nvidia CEO Jensen Huang shows off the company's Blackwell system on stage.
Nvidia CEO Jensen Huang shows off the company
Nvidia CEO Jensen Huang shows off the company's Blackwell system on stage.

Ann Wang/REUTERS

  • There's been a lot of hand-wringing lately about AI token costs.
  • There could be relief coming later this year.
  • A wave of new technology may cause token prices to plummet. It may already be starting to happen.

I had lunch with the CEO of an AI infrastructure company recently. I can't tell you their name, but they said something that really caught my attention: There will be a crop of new AI models later this year that will be a lot better and more efficient.

This will likely make AI tokens more abundant and radically cheaper. (Tokens are the basic units models use to process information, and the standard way AI use is measured and priced).

Hand-wringing about tokenmaxxing could die down. Or, users could go on another bender and burn even more tokens with abandon.

Either way, the price of tokens is probably about to plummet. This is why we already see some AI model providers slashing prices, and other players talking about doing so.

OpenAI CEO Sam Altman recently said AI costs had become a huge issue, adding that the startup will have "a lot of ways we can help people get more value for less spend."

This trend may already be showing up in the data. A closely watched token spending index run by Silicon Data peaked at around 2.06 in late May and fell to 1.75 as of June 10.

Carmen Li, the CEO of Silicon Data, told me this could mean token prices are dropping across many AI models.

Blackwell finally emerges

The main force driving token prices lower is a new wave of technology that's sweeping through AI data centers.

Nvidia's Blackwell GPUs are being installed in huge volumes right now. By the second half of this year, these systems, which are really supercomputers rather than chips, will be operating at scale, helping AI labs train new models and run them more efficiently.

These systems took a while to install properly, partly because they needed to be water-cooled and required other gnarly new data center setups. But the payoff could be huge.

50 x more, 35 x cheaper

SemiAnalysis, a respected AI research firm, compared Nvidia's top Blackwell system, the GB 300 NVL72, to Nvidia's previous system, called the Hopper HGX 200.

With the older system, each GPU generated 90 tokens per second, while the new Blackwell system generated 6,000. That's 65 times more.

These systems consume massive amounts of electricity, and the newer Blackwell offerings use even more. So SemiAnalysis also looked at how many tokens each system generated per megawatt. On this measure, Hopper churned out 54,000 tokens per second, while Blackwell generated 2.8 million. 50 times more.

Electricity prices are rising, due to all these energy-sipping AI data centers. So these days, GPU systems are assessed based on how much it costs to generate one million tokens.

SemiAnalysis tested this, too, and found that the older Hopper system cost $4.20 for every million tokens. The Blackwell system cost 12 cents. That's 35 times cheaper.

Again, new AI models will be increasingly trained and run on these new Blackwell systems as 2026 progresses. This is very likely to produce a massive increase in the number of cheaply-generated tokens.

This is why AI model providers will probably slash token prices: Because they can.

Sign up for BI's Tech Memo newsletter here. Reach out to me via email at abarr@businessinsider.com.

Read the original article on Business Insider
Hardware Big Tech AI Policy

Intelligence Exchange

0

Log in to participate in the exchange.

Sign In

Syncing Discussions...

Finding Related Intelligence...
Why AI token prices are about to plummet | TechCulture