Hacker News | sameers's comments

Looking at other people's responses made me think of Lord of War and War Dogs, both (real-life) accounts of people in the business of selling arms.

That was very illuminating! Do you think you'll try experimenting with some sort of adversarial "agent" setup, where the code isn't released until it passes security review by itself, for each model you are comparing?

Why would you not have (vibe coded) tests that prevent this from happening?

Honestly? I didn't even know tests existed. I'm 17 and the AI never mentioned them, so how was I supposed to tell the AI to write tests? That's why it broke.

But once I started building a dedicated backend, I learned about tests, coverage, webhooks, idempotency, and all that. So now I'm trying to find out: has anyone else faced this, or am I the only one?


Okay, I get it - to be clear, I wasn't being dismissive, I was genuinely curious. And fascinated at the different ways people use AI, especially down the generations :)

I think what you are facing is a sense of how much the tools are just that - tools, and no substitute at all for what an experienced developer would try to plan for. So yes, I am sure everyone who's using AI tools for the first time has made this "newbie" mistake.

BUT I have also heard that the tools come with "modes" (or agents, or skills, or whatever you want to call them) where you can have it act _like_ an architect, that points out the things you ought to ALSO do. Like write tests, ensure idempotency etc.
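To make "ensure idempotency" concrete: a common trick for webhook-style backends is to record each event ID the first time it arrives and skip replays. Here's a minimal sketch in Python; all names (`handle_webhook`, `processed_ids`, the event shape) are made up for illustration, and a real backend would back the set with a database table:

```python
# Minimal sketch of an idempotent webhook handler (hypothetical names).
# The idea: remember every event ID you have seen, and treat a repeated
# delivery of the same ID as a no-op instead of doing the work twice.

processed_ids = set()  # in production: a unique-keyed database table

def handle_webhook(event):
    event_id = event["id"]
    if event_id in processed_ids:
        # The sender retried; we already did the work for this event.
        return "duplicate: skipped"
    processed_ids.add(event_id)
    # ... do the actual work here (update records, send email, etc.) ...
    return "processed"

# A tiny test: delivering the same event twice must only process it once.
assert handle_webhook({"id": "evt_1"}) == "processed"
assert handle_webhook({"id": "evt_1"}) == "duplicate: skipped"
```

That two-line test at the bottom is exactly the kind of check an "architect mode" would push you to write before the workflow breaks in production.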

I am curious what your experience would be if you went through some cycles of having the AI simply review its work and suggest improvements. Clearly you've already learned some things that you ought to add - I think documenting the progress you make as you discover these things would be immensely helpful to others like you who are new to this!

Good luck :)


Thanks! Yeah, I also made that newbie mistake. You heard correctly about modes, but even when you tell it to review the code, it still hallucinates, which is clearly visible in the websites it builds. Even after multiple careful prompts like "Act like a senior engineer with experience in...", they just write and erase code but never fix the issues. They miss the critical flaws, and that's where things start getting complicated.

Like when I was building an automated workflow, it broke on the same step every time, and even Copilot Pro was unable to fix it. These are the little things that AI often misses, even with the PERFECT PROMPT.

What works is that I have to debug it with another AI, then tell the original one: "this is the issue at line X with this error Y."

And appreciate the advice. Thank you!


Depends on what you mean by "business," I suppose, but There Will Be Blood is my favorite, about the "business" of exploiting oil in southern CA, though it's much more fictionalized than your other choices :)

It feels a bit alarmist as an article - the obvious next step here is for the FOSS community to start adding AI security reviews to their development cycles. But there'll be a bit of a Y2K-style gold rush before that, as everyone panics and white-hat companies spring up touting their gen-AI credentials.

Cool - I liked the idea of being a celebrity, just because you are walking around with twins!

I wonder how much of that difference is because Qwen is being downloaded a lot more in China.

The report doesn't even count downloads from ModelScope.cn, the Chinese HuggingFace competitor.

Qwen, according to the article, is also fast overtaking DeepSeek.

Do you know the source for this quote? I would love to read more, if there is more.



Thanks!

That's not the title of the article, which is "Baby Shoggoth Is Listening." I suppose that title would have made sense to the majority of this crowd, but I thought I'd make it something more accessible.


If their money making methods are ethical then isn't this a better strategy, to leave the decisions to others rather than impose your values on them while alive? Also presumably Buffett and his cohorts are better than others at growing their money, so in the vein of the EA argument, it would be best to leave the money untouched while it is being actively managed by the donor, then hand out the windfall after they are dead.

For this argument to work, you have to stop at the donor themselves - you can't keep extending it ad infinitum to their descendants or inheritors. But in the case of the pre-committed amounts, like Gates and Buffett, that isn't the case.



