Interesting what he reports, that newer models are worse at geolocation. Sorry if I'm getting paranoid, but I wonder if that's a deliberately nerfed capability.
That was the biggest surprise to me -- they are really, appreciably worse! Why?
One thing that comes to mind is that AI labs are increasingly specializing models for coding and, to a lesser degree, white-collar work in general (writing summaries, reports, etc.), and maybe that comes at the cost of other, unrelated capabilities.