Loeffelmann's comments

> locally coherent but structurally incoherent

Perfectly summarizes what I hate about AI code. The diff looks fine, but if you take a step back it's an absolute mess. I mean, have you looked at the Claude Code or Openclaw codebases? That's the result of full-on vibe coding: a bloated, unmaintainable mess that no one understands.


If you ever work with LLMs, you know that they quite frequently give up.

Sometimes it's a

    // TODO: implement logic
or a

"this feature would require extensive logic and changes to the existing codebase".

Sometimes they just declare their work done, ignoring failing tests and builds.

You can nudge them to keep going, but when they behave like this I often feel they are at the limit of what they can achieve.


If I tell it to implement something, it will sometimes declare its work done before it actually is. But if I give Claude Code a verifiable goal, like making the unit tests pass, it will work tirelessly until that goal is achieved. I don't always like the solution, but the tenacity everyone is talking about is there.


> but the tenacity everyone is talking about is there

I always double-check that it didn't simply exclude the failing test.

The last time this happened, I discovered it later in the process. When I pointed it out to the LLM, it responded that it had acknowledged the fact that it ignores the test in CLAUDE.md, and that this is justified because [...]. In other words, "known issue, fuck off".


Tools in a loop, people, tools in a loop.

If you don't give the agent the tools to deterministically test what it did, you're just vibe coding in its worst form.


tenacity == while loop
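That "while loop" is easy to sketch: wire the model to a deterministic verifier and retry with the failure fed back in until it passes or the budget runs out. Everything below is a stand-in — `ask_llm` is a stub that fakes a model call, and `run_tests` is a toy verifier, not a real test runner:

```python
# Minimal sketch of "tenacity == while loop": retry an LLM against a
# deterministic verifier. ask_llm and run_tests are hypothetical stubs.

def run_tests(code):
    """Toy verifier: the generated code must define a passing fizzbuzz."""
    try:
        scope = {}
        exec(code, scope)
        assert scope["fizzbuzz"](15) == "FizzBuzz"
        return True, ""
    except Exception as e:
        return False, f"{type(e).__name__}: {e}"

def ask_llm(prompt):
    # Stand-in for a real model call; a real agent would send `prompt`
    # (including the last failure message) to an LLM.
    return "def fizzbuzz(n):\n    return 'FizzBuzz' if n % 15 == 0 else str(n)"

def agent_loop(task, max_attempts=5):
    feedback = ""
    for attempt in range(1, max_attempts + 1):
        code = ask_llm(f"{task}\nPrevious failure: {feedback}")
        ok, feedback = run_tests(code)
        if ok:
            return code, attempt
    raise RuntimeError("budget exhausted without passing tests")

code, attempts = agent_loop("write fizzbuzz")
print(attempts)  # 1 with this stub; a real model may need several passes
```

The key property is that the stopping condition is the verifier's verdict, not the model's own claim that it's done.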


> If you ever work with LLMs you know that they quite frequently give up.

If you try to single-shot something, perhaps. But with multiple shots, or an agent swarm where one agent tells another to try again, it'll keep going until it has a working solution.


Yeah, exactly. This is a scope problem; actual input/output size is always limited. I am 100% sure CC etc. are using multiple LLM calls for each response, even though from the response streaming it looks like just one.


Nope, not for me, unless I tell it to.

Context matters, for an LLM just like for a person. When I write code I add TODOs too, because we can't context-switch to every side problem we notice.

But you can keep the agent fixated on the task AND have it create these TODOs; ultimately it is your responsibility to find them and fix them (with another agent).


Using LLMs to clean those up is part of the workflow that you're responsible for (... for now). If you're hoping to get ideal results in a single inference, forget it.


An AI version of ls and fzf bringing your file system to the AI age


> you can always do C-x C-e in bash/zsh (M-v in Fish).

Thanks I didn't know!


Someone apparently figured it out. The first system message has to include:

"You are Claude Code, Anthropic's official CLI for Claude."

https://github.com/link-assistant/agent/pull/63
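If that's right, the gate is presumably just a prefix match on the system prompt. A sketch of what a request body shaped like the Anthropic Messages API might look like — the model name is a placeholder and the server-side check shown is my assumption, not confirmed behavior:

```python
# Sketch: a Messages-API-shaped payload whose system prompt leads with the
# magic string. passes_gate models a *hypothetical* server-side check.
MAGIC = "You are Claude Code, Anthropic's official CLI for Claude."

def build_payload(user_prompt, extra_system=""):
    return {
        "model": "claude-model-placeholder",  # placeholder, not a real model id
        "system": f"{MAGIC}\n{extra_system}".strip(),
        "messages": [{"role": "user", "content": user_prompt}],
    }

def passes_gate(payload):
    # Assumed gating logic: only traffic that identifies as Claude Code.
    return payload.get("system", "").startswith(MAGIC)

print(passes_gate(build_payload("list files")))        # True
print(passes_gate({"system": "You are a helpful bot"}))  # False
```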


Lol, a formatting error in a changelog breaking the entire thing.


You can use subscriptions.

I like it but I am not too deep into the whole agentic coding business.


Why do all these AI-generated READMEs have a directory-structure section? It's so redundant, because I could just run tree.
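The section is pure derived data: a few lines of Python reproduce what tree prints, so there's no reason to hand-maintain it in a README (a minimal sketch; the demo layout below is made up):

```python
# Sketch: regenerate a README "directory structure" section from the
# filesystem itself instead of hand-maintaining it.
import os
import tempfile

def tree(root, prefix=""):
    """Return tree(1)-style lines for everything under root."""
    lines = []
    entries = sorted(os.listdir(root))
    for i, name in enumerate(entries):
        last = i == len(entries) - 1
        lines.append(f"{prefix}{'└── ' if last else '├── '}{name}")
        path = os.path.join(root, name)
        if os.path.isdir(path):
            lines.extend(tree(path, prefix + ("    " if last else "│   ")))
    return lines

# Demo on a throwaway project layout:
root = tempfile.mkdtemp()
os.makedirs(os.path.join(root, "src"))
open(os.path.join(root, "README.md"), "w").close()
open(os.path.join(root, "src", "main.py"), "w").close()
listing = tree(root)
print("\n".join(listing))
```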


It makes me so exhausted trying to read them... my brain can tell immediately when there's so much redundant information that it just starts shutting itself off.


Comments? Also, reading it into an agent so the agent doesn't have to tool-call/bash out.


It's mainly about showing how low the odds actually are. I think everyone understands they are low but it's ridiculous how low exactly.


This looks like one of those things that completely break apart if you want to do anything custom or out of line with what the framework intends, causing way more headaches down the line than if you had just done it yourself from the start.


I understand that gut feeling. I've worked with many such systems. iommi is not like that though, because we HATE systems that have a nice two-line demo but then immediately fall apart.

We consider any failure to scale up customization a high priority bug.

How do we handle this in practice? Nice defaults, easy deep customization with zero boilerplate, AND escape hatches of various forms. So if you need to just render your own template for an entire table row or form input field or whatever, you can do that. Always.


Actually, how it behaves with special cases was one of the initial requirements when it was built. A design goal has always been that there should be escape hatches. For example, almost all settings can be a callback if the value is not known up front.
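That value-or-callable pattern is simple to sketch. This is a generic illustration of the idea, not iommi's actual internals — `evaluate`, `Column`, and the parameter names are all made up here:

```python
# Generic sketch of the "setting can be a callback" escape hatch: a setting
# is either a plain value or a callable evaluated late, once the surrounding
# context (user, row, request, ...) is finally known.

def evaluate(setting, **context):
    return setting(**context) if callable(setting) else setting

class Column:
    def __init__(self, display_name=None, visible=True):
        self.display_name = display_name
        self.visible = visible

    def render_header(self, **context):
        if not evaluate(self.visible, **context):
            return ""
        return evaluate(self.display_name, **context)

# A static setting and a late-bound one side by side:
static = Column(display_name="Name")
dynamic = Column(
    display_name=lambda user, **_: f"Name ({user})",
    visible=lambda user, **_: user != "anonymous",
)
print(static.render_header(user="admin"))       # Name
print(dynamic.render_header(user="admin"))      # Name (admin)
print(dynamic.render_header(user="anonymous"))  # (empty string)
```

The point is that callers who only need the default never see the machinery, while anything that varies per request can become a lambda without a new API surface.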


From my experience with similar things built around Rails (ActiveAdmin and others), being based in a dynamic language helps and allows it to accommodate a lot of customizations.


It can. But it doesn't necessarily mean that. Or maybe it means you CAN work around it, but it's cumbersome/bad to do so. Imo the Django Admin is like that: lots of ad-hoc and random customization options and lots of missing options, and it's a pain to override etc.


Agreed. When you use a web framework you are implicitly trusting the ability of its creators to ship something reusable and extensible. When you use something like this, you now have two groups of creators to trust to be not only good at coding but also good at making opinionated decisions. Or, better said, you shift your trust from the framework --after all, it's battle-tested-- to a new group.

If you're a seasoned Django developer, I guess you can take a look at the source code and judge whether it suits your needs. And if it does, there is a lot of development speed to be gained.


As one of the developers of iommi I can confidently say that seasoned Django developers will be very confused by the iommi code base heh. We do things quite differently, for good and ill. I wrote about this here: https://kodare.net/2024/09/30/why-iommi-is-weird.html

