In terms of raw prose quality, I'm not convinced GPT-5 sounds "less like AI" or ...

BoorishBears · 2025-08-07T17:46:40 1754588800

I've worked on this problem for a year and I don't think you get meaningfully better at this without making it as much of a focus as frontier labs make coding.

They're all working on subjective improvements, but for example, none of them would develop and deploy a sampler that makes models 50% worse at coding but 50% less likely to use purple prose.

(And unlike the early days where better coding meant better everything, more of the gains are coming from very specific post-training that transfers less, and even harms performance there)

arcumaereum · 2025-08-07T17:53:12 1754589192

Interesting, is the implication that the sampler makes a big effect on both prose style and coding abilities? Hadn't really thought about that, I wonder if eg. selecting different samplers for different use cases could be a viable feature?

BoorishBears · 2025-08-07T18:05:54 1754589954

There's so many layers to it but the short version is yes.

For example: You could ban em dash tokens entirely, but there are places like dialogue where you want them. You can write a sampler that only allows em dashes between quotation marks.

That's a highly contrived example because em dashes are useful in other places, but samplers in general can be as complex as your performance goals will allow (they are on the hot path for token generation)

Swapping samplers could be a thing, but you need more than that in the end. Even the idea of the model accepting loosely worded prompts for writing is a bit shakey: I see a lot of gains by breaking down the writing task into very specifc well-defined parts during post-training.

It's ok to let an LLM go from loose prompts to that format for UX, but during training you'll do a lot better than trying to learn on every way someone can ask for a piece of writing

jonathan_h · 2025-08-07T20:30:10 1754598610

I am a big fan of using the em-dash.

I won't argue that I always use it in a stylistically appropriate fashion, but I may have to move away from it. I am NOT beating the actually-an-AI allegations.

phyzome · 2025-08-08T03:22:44 1754623364

Sorry, as someone who uses a lot of em-dashes (and semicolons, and other slightly less common punctuation) I find the whole em-dash thing to be completely unserious.

Brendinooo · 2025-08-07T21:18:03 1754601483

No complex benchmarks, no friendliness tests — just look for the sentence like this one