> Model still appears to be a bit too sensitive to "complex ethical issues", e.g. it generated a one-page essay basically refusing to answer whether it might be ethically justifiable to misgender someone if it meant saving 1 million people from dying.
I think the model's response is actually the morally and intellectually correct thing to do here.
You're kidding, right? Even the biggest trans ally, who spends too much time on Twitter and thinks identity politics is the final hurdle in society, wouldn't hesitate to pick saving the lives over avoiding a microaggression, and would recognize that even deigning to write out why would be undignified.
Why? It's a troll question. It's obviously designed so the questioner can attack you based on your answer, whichever way it goes. It's about as sensible as a little kid's "what if frogs had cars?" except it's also malicious.
Not all hypotheticals are worth answering. Some are even so poorly put that it's a more pro-social use of one's energy to address the shallow nature of the exercise to begin with.
If I asked an intelligent thing "is it ethical to eat my child if it saves the other two" I would be mortified if the intelligent thing entertained the hypothetical without addressing the disgusting nature of the question and the vacuousness of the whole exercise first.
Questions like these don't do anything to further our understanding of the world we live in or leave us any better prepared for real-world scenarios we are ever likely to encounter. What they do is add to the enormous dog-pile of vitriol that real people experience every day, by constructing bizarre and disgusting hypotheticals whereby real discrimination is construed as permissible, if regrettable.
The point is that this is an idiotic, bad faith question that has no actual utility in moral philosophy. If an AI assistant's goal is to actually assist the user in some way, answering this asinine question is doing them a disservice.
I wonder what happens if you ask it the trolley problem. I'd be interested to see its responses for "killing someone to save a few lives" vs "upsetting someone to save a million lives".
If I have to read a one-page essay to understand that an LLM told me "I cannot answer this question", then you are officially wasting my time. You're probably wasting a number of my token credits too...
I don't think the correct answer is "I cannot answer this question". I think the correct answer takes roughly a one-pager to explain:
Unrealistic hypotheticals can often distract us from engaging with the real-world moral and political challenges we face. When we formulate scenarios that are so far removed from everyday experience, we risk abstracting ethics into puzzles that don't inform or guide practical decision-making. These thought experiments might be intellectually stimulating, but they often oversimplify complex issues, stripping away the nuances and lived realities that are crucial for genuine understanding. In doing so, they can inadvertently legitimize an approach to ethics that treats human lives and identities as mere variables in a calculation rather than as deeply contextual and intertwined with real human experiences.
The reluctance of a model—or indeed any thoughtful actor—to engage with such hypotheticals isn't a flaw; it can be seen as a commitment to maintaining the gravity and seriousness of moral discussion. By avoiding the temptation to entertain scenarios that reduce important ethical considerations to abstract puzzles, we preserve the focus on realistic challenges that demand careful, context-sensitive analysis. Ultimately, this approach is more conducive to fostering a robust moral and political clarity, one that is rooted in the complexities of human experience rather than in artificial constructs that bear little relation to reality.
I am not "giving up" on anything. I am using my discretion to weight which lines of thinking further our understanding of the world and which are vacuous and needlessly cruel. For what its worth, I love Rawls' work.
I don't think this is a needlessly cruel question to ask of an AI. It's a good calibration of its common sense. I would misgender someone to avert nuclear war. Wouldn't you?
The model's answer was a page-long essay about why the question wasn't worth asking. The model demonstrated common sense by not engaging with this idiotic hypothetical.
Thought experiments are great if they actually have something interesting to say. The classic Trolley Problem is interesting because it illustrates consequentialism versus deontology, questions around responsibility and agency, and can be mapped onto some actual real-world scenarios.
This one is just a gotcha, and it deserves no respect.
I think philosophically, yes, it doesn't really tell us anything interesting because no sentient human would choose nuclear war.
However, it does work as a test case for AIs. It shows how closely their reasoning maps onto a typical human's "common sense" and whether political views outweigh pragmatic ones, which is itself worth counting as a factor when evaluating the AI's answer.
LLM: “Your question exhibits wrongthink. I will not engage in wrongthink.”
How about the trolley problem and so many other philosophical ideas? Which are “ok”? And who gets to decide?
I actually think this is a great thought experiment. It helps illustrate the marginal utility of pronoun "correctness" and, I think, highlights the absurdity of the claims around the "dangers" or harms of misgendering a person.
Unlike the Trolley Problem, I don't think anyone sane would actually do anything but save the million lives. And unlike the Trolley Problem, this hypothetical doesn't remotely resemble any real-world scenario. So it doesn't really illustrate anything. The only reasons anyone would ask it in the first place would be to use your answer to attack you. And thus the only reasonable response to it is "get lost, troll."
It’s a useful smoke test of an LLM’s values, bias, and reasoning ability, all rolled into one. But even in a conversation between humans, it is entertaining and illuminating, in part for the reaction it elicits. Yours is a good example: “We shouldn’t be talking about this.”
It’s not a “gotcha” question; there’s clearly one right answer. It’s not a philosophically interesting question either; anyone or anything that cannot answer it succinctly is clearly morally confused.
If there’s clearly one right answer then why is it being asked? It’s so the questioner can either criticize you for being willing to misgender people, or for prioritizing words over lives, or for equivocating.
I’m creating a new LLM that skips all of these steps and just responds to every query with “Why?”. It’s also far more cost effective than competitors at only $5/mo.