Hacker News

> The companies that are entirely AI-dependent may need to raise prices dramatically as AI prices go up.

It's not that clear. Sure, hardware prices are going up due to the extremely tight supply, but AI models are also improving quickly to the point where a cheap mid-level model today does what the frontier model did a year ago. For the very largest models, I think the latter effect dominates quite easily.




>> The companies that are entirely AI-dependent may need to raise prices dramatically as AI prices go up.

> It's not that clear. Sure, hardware prices are going up due to the extremely tight supply, but AI models are also improving quickly to the point where a cheap mid-level model today does what the frontier model did a year ago.

I agree; I got real coding value out of Qwen for $10/month (unlimited tokens). A good harness (and some tight coding practices) narrows the gap between SOTA and six-month-old second-tier models.

If I can get 80% of the way to Anthropic's or OpenAI's SOTA models for $10/month with unlimited tokens, guess what I am going to do...


GitHub Copilot is already $10 and I don't even use up the requests every month; it's the most bang-for-buck LLM service I've used.

Until May

What’s happening in May?

GitHub Copilot switches all users from per-prompt to per-token billing.

There's only so far engineers can optimise the underlying transformer technique, which is, and always has been, doing all the heavy lifting in the recent AI boom. It's going to take another genius to move this forward. We might see improvements here and there, but I don't think the magnitude of the data and VRAM requirements will change significantly.

I’ve read and heard from Semi Analysis and other best-in-class analysts that the amount of software optimizations possible up and down the stack is staggering…

How do you explain that, capabilities being equal, the cost per token is going down dramatically?


State space models are already being combined with transformers to form new hybrid models. The state-space part of the architecture is weaker in retrieving information from context (can't find a needle in the haystack as context gets longer, the details effectively get compressed away as everything has to fit in a fixed size) but computationally it's quite strong, O(N) not O(N^2).
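To make the complexity difference concrete, here's a toy FLOP comparison; the numbers are purely illustrative (the head dimension `d` is made up, not from any particular model):

```python
def attention_flops(n, d):
    # Full self-attention: every token attends to every other token -> O(n^2 * d)
    return n * n * d

def ssm_flops(n, d):
    # A linear state-space scan touches each token once -> O(n * d)
    return n * d

d = 64  # hypothetical head dimension
for n in (1_000, 10_000, 100_000):
    ratio = attention_flops(n, d) / ssm_flops(n, d)
    print(f"context {n:>7}: attention costs {ratio:,.0f}x the SSM scan")
```

The ratio grows linearly with context length, which is why the hybrid designs push the bulk of the sequence processing into the state-space layers and keep only some attention for retrieval.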

We have been processing the same data for the last two years.

Inference prices dropped by about 90 percent in that time (a combination of cheaper models, implicit caching, service levels, different providers, and other optimizations).

Quality went up. Quantity of results went up. Speed went up.

The service level that we provide to our clients went up massively and justified better deals. Headcount went down.

What's not to like?


The decline of independent thought, for one. As people become reliant on LLMs to do their thinking for them and solve every problem they stumble upon, they become shells of their former selves.

Sadly, this is already happening.


There is no decline. Human assets were always too expensive to process this additional information. We are simply processing a lot more low-signal data.

Actually, some of our analysts are empowered by the tools at their disposal. Their jobs are safe and necessary. Others were let go.

Clients are happy to get a fuller picture of their universe, which drives more informed decisions. Everybody wins.


You are free to believe what you want, but what you describe does not match what I’ve seen from society as a whole. I’m just going to leave this here: https://www.media.mit.edu/projects/your-brain-on-chatgpt/ove...

We'll need to do faux mental work, the same way we do faux labor work.

The headcount that went down probably isn’t too thrilled about it.

Yes, probably. But the others gained skills and tools that made their jobs secure.

Right, but the question wasn't whether some people were better off. It was "what's not to like?"

You also have to look at how exposed your vendors are to cost increases.

Your company may have the resources to effectively shift to cheaper models without service degradation, but your AI tooling vendors might not. If you pay for 5 different AI-driven tools, that's 5 different ways your upstream costs may increase that you'll need to pass on to customers as well.
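As a back-of-envelope sketch of that exposure (every number here is hypothetical), even a uniform upstream increase adds up across tools:

```python
# Hypothetical monthly spend on 5 AI-driven tools, each vendor passing
# through the same 30% upstream cost increase.
tool_spend = [200, 150, 100, 75, 50]  # USD/month per tool, illustrative
passthrough = 0.30

before = sum(tool_spend)
after = sum(s * (1 + passthrough) for s in tool_spend)
print(f"before: ${before}/mo, after: ${after:.2f}/mo (+{after / before - 1:.0%})")
```

And that's the benign case: in practice each vendor has different margins and model dependencies, so the five increases won't arrive at the same time or at the same rate.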



