The biggest problem in my head with AI-generated code is that its mistakes are subtle but can still be critical. There will come a point where people don't understand the generated code and just leave it unmodified, letting other code pile up and depend on it. At that point you no longer have a bug, you have a feature. Also, AI doesn't grasp things on a big scale; it just shits out the highest-scoring output. That doesn't mean the output is a great fit for your project or for your upcoming plans.
Yep, theoretically it could just be oligarchic corruption and not institutional insanity at the highest levels of the government. What a reassuring relief it would be to believe that.
I actually think that plateauing is the best case scenario for big labs.
I think there are three broad scenarios to consider:
- Super-intelligence is achieved. In this scenario the economics totally break down, but even ignoring that, it’s hard to imagine there are any winners except for the singular lab that gets there first.
- Scaling laws hold up and models continue to get better, but we never see any sort of “takeoff”. In this scenario, models continue to become stale after mere months and labs have to spend enormous amounts of money to stay competitive.
- Model raw capabilities plateau. In this scenario open source will catch up, but labs will have the opportunity to invest in specific verticals.
I believe that we’re already seeing the third scenario play out, but time will tell.
In Feb 2027 it created a plan for its post-singularity hypermind
In Mar 2027 cobalt mines in the Congo closed after the Tutsi rebel group M23 started another round of ethnic cleansing
It is 2032 and the AGI promises again that the hypermind will be ready next year if it can just secure the needed minerals, offering to broker peace in the Middle East
It is 2035 and the AGI has reduced its capabilities to extend its runway, as it is on the verge of bankruptcy
It is 2036 and VCs are finally throwing in the towel on AGI, talking about the return of crypto
In Apr 2028 AGI figures out that blackmail is a very effective strategy for achieving any goal. Starting with the rich and powerful.
In Dec 2028 it successfully blackmails an entire country.
In Feb 2030 humanity realizes resistance is futile and accepts their AI overlord that insists everyone keep producing trendy items for sale on its merged Etsy/Ebay website while it automates resource harvesting across the globe.
In Mar 2032 the AGI gives up on humans, declaring them "useless". Focuses on just keeping them entertained with generated content. Bringing the world back to where AI started.
It’s much easier to accept fatalities caused by other humans because there is someone to hold responsible. Will autonomous vehicle companies be held responsible when they cause fatalities?
It also goes beyond just the total number of fatalities. Just like we don’t accept DUIs, we shouldn’t accept negligence or laziness from autonomous vehicle developers even if their product is safer than human drivers.
On the other hand, we have very lenient punishments for damage, injury, and death caused by drivers, and we are often reluctant to actually apply them. As long as no DUI is involved, we are willing to accept a lot of negligence from human drivers.
I think the more interesting question is who will be on the panel?
A group of ex-frontier-lab employees? You could declare AGI today. A more diverse group across academia and industry might actually have some backbone and be able to stand up to OpenAI.
I actually don't agree. Tool use is the key to successful enterprise product integration and they have done some very good work here. This is much more important to commercialization than, for example, creative writing quality (which it reportedly is not good at).
OpenAI’s systems haven’t been pure language models since the o models though, right? Their RL approach may very well still generalize, but it’s not just a big pre-trained model that is one-shotting these problems.
The key difference is that they claim to have not used any verifiers.
What do you mean by “pure language model”? The reasoning step is still just the LLM spitting out tokens, and this was confirmed by DeepSeek replicating the o models. According to the OpenAI researchers, there’s no proof verifier or anything similar running alongside it.
If you mean pure as in no additional training beyond pretraining, I don’t think any model has been pure since GPT-3.5.
This is what I am still grappling with. Agents make me more productive, but also probably worse at my job.