Inference will be cheapest when run in a shared cloud environment, simply due to the LLM roofline: decoding is memory-bandwidth-bound, so batching many users' requests amortizes the cost of streaming the weights. Thus, most B2B use cases are likely to be datacenter-based, like AWS today.
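A minimal back-of-envelope sketch of that roofline argument, with purely illustrative numbers (a hypothetical 70B fp16 model and roughly H100-class memory bandwidth, ignoring KV-cache traffic, compute limits, and interconnect):

    # Illustrative only: why batching matters when decode is memory-bandwidth-bound.
    PARAMS = 70e9          # assumed 70B-parameter model
    BYTES_PER_PARAM = 2    # fp16 weights
    HBM_BW = 3.35e12       # bytes/s, roughly one H100's HBM bandwidth

    def tokens_per_second(batch_size: int) -> float:
        """Each decode step must stream all weights from HBM once, no matter
        how many sequences share that step, so larger batches amortize the
        same weight traffic over more generated tokens."""
        weight_bytes_per_step = PARAMS * BYTES_PER_PARAM
        steps_per_second = HBM_BW / weight_bytes_per_step
        return steps_per_second * batch_size

    for b in (1, 8, 64):
        print(f"batch={b:3d}: ~{tokens_per_second(b):,.0f} tokens/s aggregate")

A single user on their own box runs at roughly the batch=1 line; a shared datacenter can keep the batch full, which is where the cost advantage in the comment above comes from.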
Of course, CERN is still going to use its FPGAs, hyper-optimized for its specific trigger model at the LHC, and Apple is going to use a specialized low-power ASIC running a quantized model for "Hey Siri", but I meant the majority use case.
I do not buy this premise. I think it will end up being cheaper to simply run the LLMs directly on the user's device.
I think there are plenty of competitors in the "LLMs with open weights" space to essentially make the models a commodity. All that is left is the compute cost, and there is no way someone will run a datacenter more cheaply than "the computer that I already have running on my desk".
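For a rough sense of what that comparison hinges on, here is an illustrative-only sketch; every number in it (GPU power draw, local throughput, electricity price, cloud API price) is an assumption, not a measurement:

    # Illustrative only: marginal cost per token on already-owned hardware vs. a shared cloud.
    LOCAL_POWER_W = 350              # assumed desktop GPU draw while generating
    LOCAL_TOKENS_PER_S = 30          # assumed local throughput for a modest model
    ELECTRICITY_USD_PER_KWH = 0.15   # assumed residential rate
    CLOUD_USD_PER_1M_TOKENS = 0.50   # assumed shared-cloud API price

    def local_cost_per_million_tokens() -> float:
        """Marginal cost if the desk machine is already paid for: electricity only."""
        seconds = 1e6 / LOCAL_TOKENS_PER_S
        kwh = LOCAL_POWER_W / 1000 * seconds / 3600
        return kwh * ELECTRICITY_USD_PER_KWH

    print(f"local (electricity only): ${local_cost_per_million_tokens():.2f} per 1M tokens")
    print(f"cloud (assumed API price): ${CLOUD_USD_PER_1M_TOKENS:.2f} per 1M tokens")

The sketch only shows that the outcome is sensitive to those assumptions, not that either side wins in general.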
I make your point every time this comes up[1], but it's absolutely surprising how few business people, most of whom have some credibility in the form of qualifications or experience, actually recognise a value chain when they see it.