More

thatwasunusual · 2026-02-04T03:17:07 1770175027

> There's a lot that hosted services with extra features can give you.

I totally agree with that, but in my experience 99% of "application developers" don't need all these features. Of those you listed, I only see "backups" as a requirement. Everything else is just - what I said - features for when your application is successful and you want something streamlined.

thatwasunusual · 2026-01-31T14:42:42 1769870562

> Give me virtual machines, reliable block storage, file storage and object storage, networking, dns, managed kubernetes, [...]

But managers wants to _buy_ these services, not be directly responsible for them. That's where the problem lies, as I see it.

ben_w · 2026-01-31T14:55:08 1769871308

If those managers currently sold on The Cloud, can instead be sold on how much money they'd save not being on The Cloud, then corporate can do what it does best and change policy hard enough to give the staff whiplash.

I don't know what managers have been reading/hearing, but for the last decade or so as a developer what I've mostly been hearing is that the only people who actually benefit from Big Data architectures are FAANG, that it's much cheaper to run on a single small self-hosted system that's done right, that the complexity of managing the cloud is even higher than a local solution.

This matches my own experience of what people needed to serve millions of users 20 years ago. If you can't handle a chat system or a simple sales system with 100k-1M customers on a server made out of one single modern mobile phone, you're either just not trying hard enough or have too many layers of abstraction between business logic and bare metal. Even for something a bit more challenging than that, you should still be thinking thousands of users on a phone and 10k-100k on a single device that's actually meant to work as a server.

Zigurd · 2026-01-31T16:04:37 1769875477

> If those managers currently sold on The Cloud, can instead be sold on how much money they'd save not being on The Cloud...

This is more than a theory, it's a trend that is already underway. The cloud remains supremely capital efficient for startups, but pricing has crept up and some customers are falling off the other side of the table.

MagicMoonlight · 2026-02-01T15:05:07 1769958307

You might save 100k in server fees, but now you have to hire three full time people to manage your own servers. And you won’t get the redundancy or the security of having the experts do it across three data centres for you.

lelanthran · 2026-01-31T15:15:39 1769872539

> But managers wants to _buy_ these services, not be directly responsible for them. That's where the problem lies, as I see it.

Why won't they be able to buy them from EU providers?

ExoticPearTree · 2026-01-31T14:48:48 1769870928

They don't want to necessarily buy it, but they want to hedge their options from "my $guy can do everything" to "on which cloud platform can I find a competent operator tomorrow".

Sayrus · 2026-01-31T14:48:09 1769870889

Marketplace offers can go a long way to fill these void in official managed services.

thatwasunusual · 2026-01-17T02:21:52 1768616512

> the pricing seems phenomenal

I'm in Norway, and I wonder if I see different prices than people from elsewhere in the world? Here it says $1.7K, and I can get the LG UltraFine 6K 32" for $2K, with the benefit of being bought from a Norwegian retailer (think guarantees and shopping security).

To be clear; I have never tried either of these monitors, so I can't tell if either is any good. :D

moepstar · 2026-01-17T08:54:26 1768640066

Germany, also seeing $1699 on there...

thatwasunusual · 2025-12-29T03:17:14 1766978234

> I am writing this comment from a 2019 i9.

Same here, but...

> I have to charge it from the right hand ports. I think that is dumb, but it did solve the issue.

I _had to_ do this for a while (around 2023, I think, not that it matters), but I no longer have to. I don't know what has changed, unfortunately; I haven't reinstalled anything, and I can't say I have uninstalled anything either. It's really weird...

thatwasunusual · 2025-12-28T05:31:03 1766899863

Nice. It would be nice to have an option to create a per month print as well.

thatwasunusual · 2025-12-25T08:27:10 1766651230

>> FYI, HTTP headers are case insensitive

> Since when are they case sensitive?

[...]

thomascountz · 2025-12-25T09:04:51 1766653491

Perhaps the OG comment was misread or confusion was caused by a typo and/or edit.

When I originally read it hours ago, I also read it as "...HTTP headers are case sensitive," (emphasis mine).

That said, there is one caveat regarding case sensitivity for headers encoded for HTTP/2.

thatwasunusual · 2025-12-13T01:32:32 1765589552

Can someone do an ELI5, and why this is important?

wmf · 2025-12-13T01:59:16 1765591156

It's faster and lower latency than standard Thunderbolt networking. Low latency makes AI clusters faster.

thatwasunusual · 2025-12-11T16:28:12 1765470492

No (human) developer would _add_ tests. ^/s

thatwasunusual · 2025-12-09T23:51:41 1765324301

> We are getting to the point that its not unreasonable to think that "Generate an SVG of a pelican riding a bicycle" could be included in some training data.

I may be stupid, but _why_ is this prompt used as a benchmark? I mean, pelicans _can't_ ride a bicycle, so why is it important for "AI" to show that they can (at least visually)?

The "wine glass problem"[0] - and probably others - seems to me to be a lot more relevant...?

[0] https://medium.com/@joe.richardson.iii/the-curious-case-of-t...

simonw · 2025-12-10T00:15:45 1765325745

The fact that pelicans can't ride bicycles is pretty much the point of the benchmark! Asking an LLM to draw something that's physically impossible means it can't just "get it right" - seeing how different models (especially at different sizes) handle the problem is surprisingly interesting.

Honestly though, the benchmark was originally meant to be a stupid joke.

I only started taking it slightly more seriously about six months ago, when I noticed that the quality of the pelican drawings really did correspond quite closely to how generally good the underlying models were.

If a model draws a really good picture of a pelican riding a bicycle there's a solid chance it will be great at all sorts of other things. I wish I could explain why that was!

If you start here and scroll through and look at the progression of pelican on bicycle images it's honestly spooky how well they match the vibes of the models they represent: https://simonwillison.net/2025/Jun/6/six-months-in-llms/#ai-...

So ever since then I've continue to get models to draw pelicans. I certainly wouldn't suggest anyone take serious decisions on model usage based on my stupid benchmark, but it's a fun first-day initial impression thing and it appears to be a useful signal for which models are worth diving into in more detail.

thatwasunusual · 2025-12-10T02:19:32 1765333172

> If a model draws a really good picture of a pelican riding a bicycle there's a solid chance it will be great at all sorts of other things.

Why?

If I hired a worker that was really good at drawing pelicans riding a bike, it wouldn't tell me anything about his/her other qualities?!

suspended_state · 2025-12-10T07:22:03 1765351323

Your comment is funny, but please note: it's not drawing a pelican riding a bike, it's describing in SVG a pelican riding a bike. Your candidate would at least displays some knowledge of the SVG specs.

simonw · 2025-12-10T03:10:16 1765336216

I wish I knew why. I didn't think it would be a useful indicator of model skills at all when I started doing it, but over time the pattern has held that performance on pelican riding a bicycle is a good indicator of performance on other tasks.

vikramkr · 2025-12-10T04:43:30 1765341810

The difference is that the worker you hire would be a human being and not a large matrix multiplication that had parameters optimized by a a gradient descent process and embeds concepts in a higher dimensional vector space that results in all sorts of weird things like subliminal learning (https://alignment.anthropic.com/2025/subliminal-learning/).

It's not a human intelligence - it's a totally different thing, so why would the same test that you use to evaluate human abilities apply here?

Also more directly the "all sorts of other things" we want llms to be good at often involve writing code/spatial reasoning/world understanding which creating an svg of a pelican riding a bicycle very very directly evaluates so it's not even that surprising?

falcor84 · 2025-12-10T10:22:21 1765362141

For better or worse, a lot of job interviews actually do use contrived questions like this, such as the infamous "how many golf balls can you fit in a 747?"

theshrike79 · 2025-12-10T11:48:57 1765367337

What if the employee can draw a bike and a pelican, but not a pelican on a bike?

jtbaker · 2025-12-10T02:53:30 1765335210

a posteriori knowledge. the pelican isn't the point, it's just amusing. the point is that Simon has seen a correlation between this skill and and the model's general capabilities.

theshrike79 · 2025-12-10T11:48:06 1765367286

It's just a variant of the wine glass - something that doesn't exist in the source material as-is. I have a few of my own I don't share publicly.

Basically in my niche I _know_ there are no original pictures of specific situations and my prompts test whether the LLM is "creative" enough to combine multiple sources into one that matches my prompt.

I think of if like this: there are three things I want in the picture (more actually, but for the example assume 3). All three are really far from each other in relevance, in the very corner of an equilateral triangle (in the vector space of the LLM's "brain"). What I'm asking it to do is in the middle of all three things.

Every model so far tends to veer towards one or two of the points more than others because it can't figure out how to combine them all into one properly.

wisty · 2025-12-10T00:18:55 1765325935

It's not nessessarily the best benchmark, it's a popular one, probably because it's funny.

Yes it's like the wine glass thing.

Also it's kind of got depth. Does it draw the pelican and the bicycle? Can the penguin reach the peddles? How?

I can imagine a really good AI finding a funny or creative or realistic way for the penguin to reach the peddles.

An slightly worse AI will do an OK job, maybe just making the bike small or the legs too long.

An OK AI will draw a penguin on top of a bicycle and just call it a day.

It's not as binary as the wine glass example.

thatwasunusual · 2025-12-10T02:16:36 1765332996

> It's not nessessarily the best benchmark, it's a popular one, probably because it's funny.

> Yes it's like the wine glass thing.

No, it's not!

That's part of my point; the wine glass scenario is a _realistic_ scenario. The pelican riding a bike is not. It's a _huge_ difference. Why should we measure intelligence (...) in regards to something that is realistic and something that is unrealistic?

I just don't get it.

Fnoord · 2025-12-10T05:50:48 1765345848

> the wine glass scenario is a _realistic_ scenario

It is unrealistic because if you go to a restaurant, you don't get served a glass like that. It is frowned upon (alcohol is a drug, after all) and impractical (wine stains are annoying) to fill a glass of wine as such.

A pelican riding a bike, on the other hand, is realistic in a scenario because of TV for children. Example from 1950's animation/comic involving a pelican [1].

[1] https://en.wikipedia.org/wiki/The_Adventures_of_Paddy_the_Pe...

mzl · 2025-12-10T12:45:44 1765370744

A better reason why wine glasses are not filled like that is that wine glasses are designed to capture the aroma of the wine.

Since people look at a glass of wine and judge how much "value" they got based partly on how much wine it looks like, many bars and restaurants choose bad wine-glasses (for the purpose of enjoying wine) that are smalle and thus can be fulled more.

vikramkr · 2025-12-10T04:44:56 1765341896

If the thing we're measuring is a the ability to write code, visually reason, and handle extrapolating to out of sample prompts, then why shouldn't we evaluate it by asking it to write code to generate a strange image that it wouldn't have seen in its training data?

thatwasunusual · 2025-12-05T08:16:02 1764922562

> after 2 days both the cable internet and cell towers went down, so even 5G would not have helped.

I discovered the same thing the hard way myself recently (in Norway); turns out that cell towers only has enough battery for ~24-36 hours (if you're lucky).

However, someone messing with the fibre to my house is a bigger possibility than power outage, so I'll probably end up with this 5G product. :)