Hacker News | teiferer's comments

Buy high, sell low. Excellent result when you follow the masses, especially when you're a little late.

Where was I giving that advice? Gold and silver mining stocks are extremely low compared to the price of gold and silver; buying mining stocks right now is buying low.

AUAU ETF crashed 11% today... Ask me how I know that :(

Skill. Knowledge. At your age, your biggest asset is your future earnings potential. The more employable you are, the better you will make it during and after a downturn. In fact, the highest-skilled folks tend to even profit from hiccups in the economy.

Are the ones newer to the workforce just screwed or is there a way out? Kinda sucks that all this went down around 6-7 years into my tenure and it's just been a few years of scraping together freelance + portfolio projects to try and climb out of this rut.

(This might sadly be rhetorical given what I hear of '08, but perhaps there are new channels open to take advantage of. Or at least old channels to raise awareness of).


Newer ones are definitely screwed.

6-7 years of experience makes you prime material for employment in the software industry: experienced, but not too expensive/entitled yet.

Have you considered applying?


Yes. And here I am, nearly 3 years past my last full-time role, with 9 years of experience, and still looking (feel free to read my struggles in detail below).

What do you recommend applying to? I work in games so I guess I'm playing on hard mode (especially in these times), but the common wisdom of "normal software jobs love taking game programmers in" hasn't rung true this time around.

----

Life story: laid off mid-2023. I took a few months off after the layoff, but the last quarter of 2023 wasn't kind to me.

2024 got me some freelance work, so I wasn't out on the streets, but it was a complete circus of an interview racket. Honestly worse than my first job hunt out of college. It's bad enough when you feel deep down there was someone better than you, but when you go 5 rounds with good vibes only to hear... nothing back? That's truly disrespectful. And it sadly wasn't a one-off.

Then in 2025 I hit some medical emergencies, so I needed to urgently find anything. I found part-time work outside of tech and made do with that as I paid down those debts. That totaled up to a part-time freelance gig, a part-time job, and a few (failed) attempts at some hustles over 2025, only to end up making maybe a third of what I made back in 2022.

Now it's 2026 and I'll try again next month. My freelance work covers any gaps I would have had, I have a website almost ready with some personal projects to point to, and I'm overall more adjusted to the realities of this current market and will approach accordingly. I'm optimistic, but I know we're still in the thick of the weeds here. So I'll take any leads I can get.


None of that is true. Not one word of it applies anymore. Being highly skilled means you're highly paid, which puts you first in line for cuts. Talent doesn't get you hired; networks do. "Future earning potential" is empty words; you can't eat "future earning potential".

This advice is from half a century ago. The times have moved on.


What's your advice then, if it's not investing in your hireability?

> We'd rather lose the source code than the knowledge of our workers, so to speak.

Aren't large amounts of required institutional knowledge typically a problem?


It was a "high tech domain", so institutional knowledge was required, problem or not.

We had domain specialists with decades of experience and knowledge, and we looked at our developers as the "glue" between domain knowledge and computation (modelling, planning and optimization software).

You can try to make this glue have little knowledge, or lots of knowledge. We chose the latter and it worked well for us.

But I was only in that one company, so I can't really tell.


> I automate nearly all my tests with AI

How exactly? Do you tell the agent "please write a test for this" or do you also feed it some form of spec to describe what the tested thing is expected to do? And do these tests ever fail?

Asking because the first option essentially just sets the bugs in stone.

Wouldn't it make sense to do it the other way around? You write the test, let the AI generate the code? The test essentially represents the spec, and if the AI produces something which passes all your tests but is still not what you want, then you have a test hole.
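A minimal sketch of what I mean, in Python with a made-up slugify function (all names here are hypothetical, purely to illustrate the test-as-spec idea): the human writes the assertions first, and the agent's only job is to produce an implementation that satisfies them.

    import re
    import pytest

    def slugify(title: str) -> str:
        # In the workflow above, this body is what the AI would generate;
        # it only has to satisfy the tests below.
        if not title:
            raise ValueError("empty title")
        return re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")

    # Human-written tests acting as the spec.
    def test_lowercases_and_replaces_spaces():
        assert slugify("Hello World") == "hello-world"

    def test_strips_punctuation():
        assert slugify("Rust, briefly!") == "rust-briefly"

    def test_rejects_empty_input():
        with pytest.raises(ValueError):
            slugify("")

Anything the tests don't pin down (Unicode, leading digits, ...) is exactly the "test hole" I mean.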


I'm not saying my approach is correct, keep that in mind.

I care more about the code than the tests. Tests are verification of my work. And yes, there is a risk of AI "navigating around" bugs, but I found that a lot of the time AI will actually spot a bug and suggest a fix. I also review each line to look for improvements.

Edit: to answer your question, I will typically ask it to test a specific test case or a few test cases. Very rarely will I ask it to "add tests everywhere". Yes, these tests frequently fail, and the agent will fix them on the 2nd+ iteration after it runs the tests.

One more thing to add is that a lot of the time the agent will add a "dummy" test. I don't accept those just for coverage's sake.


Thanks for your responses!

A follow-up:

> I care more about the code than the tests.

Why is that? Your product code has tests; your test code doesn't. So I often find that I need to pay at least as much attention to my tests to ensure quality.


I think you are correct in your assessment. Both are important. If your test code is garbage, your quality is gonna be garbage.

I find tests easier to write. Your function(s) may be a hundred lines long, but the test is usually setup, run, assert.

I don't have much experience beyond writing unit/integration tests, but individual test cases seem to be simpler than the code they test (linear, no branches).
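Roughly what I mean (a hypothetical example, not from this thread): the function under test has branches and edge cases, while each test case is a straight line of setup, run, assert.

    def apply_discount(total_cents: int, is_member: bool) -> int:
        # Code under test: the branches and edge cases live here.
        if total_cents < 0:
            raise ValueError("negative total")
        if is_member and total_cents >= 10_000:
            return total_cents * 90 // 100
        return total_cents

    def test_member_over_threshold_gets_ten_percent_off():
        total = 20_000                                  # setup
        result = apply_discount(total, is_member=True)  # run
        assert result == 18_000                         # assert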


> Hopefully they're able to track down who did this.

Why? Was anybody harmed?

Hopefully they don't find out who did this. There was never any danger, and without this kind of joke, the world would be less fun.

(Obviously it should be harder to fool critical systems, so this also served as a warning, but a real bad guy wanting to attack such a system would do it in more subtle ways.)


In your mind, what is the difference between a mathematical abstraction and a natural construct?

Asking because to me, any mathematical abstraction is a natural construct. Math isn't invented, it's discovered.


Look around you. Our industry has cultivated this kind of software everywhere.

It's... really just not, though

There are isolated islands of reliable, high-quality, low-bug, well-maintained software. The rest is crap.

> the problem is two fold

No, the biggest problem at the root of all this is complexity. OpenSSL is a garbled mess. AI or not, such software should not be the security backbone of the internet.

People writing and maintaining software need to optimize for simplicity, readability, maintainability. Whether they use an LLM to achieve that is secondary. The humans in the loop must understand what's going on.


> People writing and maintaining software need to optimize for simplicity, readability, maintainability. Whether they use an LLM to achieve that is secondary. The humans in the loop must understand what's going on.

In a perfect world, that is.


> I've been a FOSS guy my entire adult life, I wouldn't put my name to something that would enable the kinds of issues you describe.

Until you get acquired, receive a golden parachute, and use it when you realize that the new direction no longer aligns with your views.

But, granted, if all you do is FOSS then you will anyway have a hard time keeping evil actors from using your tech for evil things. Might as well get some money out of it, if they actually dump money on you.


I am aware of that, my (personal) view is that DRM is a social issue caused by modes of behaviour and the existence or non-existence of technical measures cannot fix or avoid that problem.

A lot of the concerns in this thread center on TPMs, but TPMs are really more akin to very limited HSMs that are actually under the user's control (I gave a longer explanation in a sibling comment but TPMs fundamentally trust the data given to them when doing PCR extensions -- the way that consumer hardware is fundamentally built and the way TPMs are deployed is not useful for physical "attacks" by the device owner).
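To sketch what a PCR extension actually is (SHA-256 bank; the measured stages below are made up, not from any real boot chain): the TPM just folds whatever digest it is handed into a hash chain, so it has no way to verify that the digest corresponds to what actually ran.

    import hashlib

    def pcr_extend(pcr: bytes, digest: bytes) -> bytes:
        # TPM2 PCR extend: new PCR value = H(old PCR value || digest).
        # The TPM computes this faithfully but cannot check whether the
        # digest it was given reflects the code that really executed.
        return hashlib.sha256(pcr + digest).digest()

    pcr = bytes(32)  # PCRs start zeroed at boot
    for stage in (b"firmware", b"bootloader", b"kernel"):  # made-up stages
        pcr = pcr_extend(pcr, hashlib.sha256(stage).digest())
    print(pcr.hex())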

Yes, you can imagine DRM schemes that make use of them but you can also imagine equally bad DRM schemes that do not use them. DRM schemes have been deployed for decades (including "lovely" examples like the Sony rootkit from the 2000s[1], and all of the stuff going on even today with South Korean banks[2]). I think using TPMs (and other security measures) for something useful to users is a good thing -- the same goes for cryptography (which is also used for DRM but I posit most people wouldn't argue that we should eschew all cryptography because of the existence of DRM).

[1]: https://en.wikipedia.org/wiki/Sony_BMG_copy_protection_rootk... [2]: https://palant.info/2023/01/02/south-koreas-online-security-...


This whole discussion is a perfect example of what Upton Sinclair said, "It is difficult to get a man to understand something, when his salary depends on his not understanding it."

A rational and intelligent engineer cannot possibly believe that he'll be able to control what a technology is used for after he creates it, unless his salary depends on him not understanding it.


You could make this sort of insinuation about anyone. Including you.

Arguments should be technical.


Insinuation? As a software dev they don't have any agency over whether or by whom they get acquired. Their decision will be whether to leave if things change for the worse, and that's very much understandable (and arguably the ethical thing to do).

Do you mean like the IBM takeover of RedHat?

That's a perfectly valid objection to this proposal. You only have to look at what happened to Hashicorp to see the risk.

How can anyone promise that? Will you promise to your current employer that you will never leave the job?

No, but I can promise to my current employer that me leaving my job won’t be a critical problem.

It’s less of an issue in the case of a normal job than in an open source project where often the commitment of particular founding individuals to the long-term future of the project is a big part of people’s decision to use or not use that tech in their solutions. Here, given that “Trusted computing” can potentially lock you out of devices you have bought, it’s important for people to be able to judge the risk of getting “legal ransomware”d if the trusted computing base ends up depending on a proprietary component that they can’t back out of.

That said, there is absolutely zero chance that I use this (systemd is already enough Poettering software for me in this lifetime) so I’m not personally affected either way.


Again, lots of doomsayers like you said the same when systemd was introduced. Nothing happened. Same with the RedHat/IBM takeover.

Technical arguments pave the road to hell.

Well he is called faust…

> You could make this sort of insinuation about anyone. Including you.

Yes. You correctly stated the important point.


> Arguments should be technical.

Yes. Aleksa made no technical argument.


If you ever wonder how coding agents know how to plan things etc., this is the kind of article they get that training from.

It ends up being circular if the author used LLM help for this writeup, though there are no obvious signs of that.


Interestingly, I looked at github insights and found that this repo had 49 clones, and 28 unique cloners, before I published this article. I definitely did not clone it 49 times, and certainly not with 28 unique users. It's unlikely that the handful of friends who follow me on github all cloned the repo. So I can only speculate that there are bots scraping new public github repos and training on everything.

Maybe that's obvious to most people, but it was a bit surprising to see it myself. It feels weird to think that LLMs are being trained on my code, especially when I'm painfully aware of every corner I'm cutting.

The article doesn't contain any LLM output. I use LLMs to ask for advice on coding conventions (especially in rust, since I'm bad at it), and sometimes as part of research (zstd was suggested by chatgpt along with comparisons to similar algorithms).


Particularly on GitHub, might not even be LLMs, just regular bots looking for committed secrets (AWS keypairs, passwords, etc.)

I self-host Gitea. The instance is crawled by AI crawlers (I checked the IPs). They never cloned; they just browse and take it directly from there.

For reference, this is how I do it in my Caddyfile:

   (block_ai) {
       @ai_bots {
           header_regexp User-Agent (?i)(anthropic-ai|ClaudeBot|Claude-Web|Claude-SearchBot|GPTBot|ChatGPT-User|Google-Extended|CCBot|PerplexityBot|ImagesiftBot)
       }

       abort @ai_bots
   }
Then, in a specific app block include it via

   import block_ai

Most of them pretend to be real users though and don't identify themselves with their user agent strings.

I have almost exactly this in my own caddyfile :-D The order of the items in the regex is a little different but mostly the same items. I just pulled them from my web access logs over time and update it every once in a while.

i run a cgit server on an r720 in my apartment with my code on it and that puppy screams whenever sam wants his code

blocking openai ips did wonders for the ambient noise levels in my apartment. they're not the only ones obviously, but they're they only ones i had to block to stay sane


Have you considered putting it behind Anubis or an equivalent?

Yes, but I haven't and would prefer not to

Understandable. It's an outrage that we even have to consider such measures.

Time to start including deliberate bugs. The correct version is in a private repository.

And what purpose would this serve, exactly?

Spite.

They used to do this with maps - e.g. fake islands - to pick up when they were copied.

While I think this is a fun idea -- we are in such a dystopian timeline that I fear you will end up being prosecuted under a digital equivalent of various laws like "why did you attack the intruder instead of fleeing" or "you can't simply remove a squatter because it's your house, therefore you get an assault charge."

A kind of "they found this code, therefore you have a duty not to poison their model as they take it." Meanwhile if I scrape a website and discover data I'm not supposed to see (e.g. bank details being publicly visible) then I will go to jail for pointing it out. :(


I think if we're at the point where posting deliberate mistakes to poison training data is considered a crime, we would be far far far down the path of authoritarian corporate regulatory capture, much farther than we are now (fortunately).

Look, I get the fantasy of someday pulling out my musket^W ar15 and rushing downstairs to blow away my wife^W an evil intruder, but, like, we live in a society. And it has a lot of benefits, but it does mean you don't get to be "king of your castle" any more.

Living in a country with hundreds of millions of other civilians or a city with tens of thousands means compromising what you're allowed to do when it affects other people.

There's a reason we have attractive nuisance laws and you aren't allowed to put a slide on your yard that electrocutes anyone who touches it.

None of this, of course, applies to "poisoning" llms, that's whatever. But all your examples involved actual humans being attacked, not some database.


Thanks, that was the term I was looking for: "attractive nuisance". I wouldn't be surprised if a tech company could make that case -- this user caused us tangible harm and cost (training, poisoned models) and left their data out for us to consume. It's the equivalent of putting poison candy on a park table, your honor!

That reminds me of the protagonist of Charles Stross's novel "Accelerando", a prolific inventor who is accused by the IRS of causing millions in losses because he releases all his ideas into the public domain instead of profiting from them and paying taxes on those profits.

This has been happening before LLMs too.

I don't really get why they need to clone in order to scrape ...?

> It feels weird to think that LLMs are being trained on my code, especially when I'm painfully aware of every corner I'm cutting.

That's very much expected. That's why the quality of LLM coding agents is like it is. (No offense.)

The "asking LLMs for advice" part is where the circular aspect starts to come into the picture. Not worse than looking at StackOverflow though which then links to other people who in turn turned to StackOverflow for advice.


Cloning gets you the raw text objects directly. If you scrape the web UI you're dealing with a lot of markup overhead that just burns compute during ingestion. For training data you usually want the structure to be as clean as possible from the start.
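As a rough sketch (the repo URL is a placeholder, not a real target): a shallow clone hands you clean file contents straight from disk, whereas scraping the web UI means one HTTP request per file plus stripping all the surrounding HTML.

    import pathlib
    import subprocess
    import tempfile

    def clone_and_read(repo_url: str) -> list[str]:
        # Shallow-clone, then read raw source text: no markup to strip.
        texts = []
        with tempfile.TemporaryDirectory() as tmp:
            subprocess.run(["git", "clone", "--depth", "1", repo_url, tmp],
                           check=True)
            for path in pathlib.Path(tmp).rglob("*"):
                if path.is_file():
                    texts.append(path.read_text(errors="ignore"))
        return texts

    # texts = clone_and_read("https://github.com/example/example.git")  # placeholder

    # Scraping the rendered UI instead returns every source line wrapped in
    # HTML (line-number spans, navigation, buttons) that has to be parsed
    # away before the text is usable.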

Sure, cloning a local copy. But why clone on github?

The quality of LLM coding agents is pretty good now.

Maybe we can poison LLMs with loops of 2 or more self-referencing blogs.

Only need one, they're not thinking critically about the media they consume during training.

Here's a sad prediction: over the coming few years, AIs will get significantly better at critical evaluation of sources, while humans will get even worse at it.

I wish I could disagree with you, but what I'm seeing on average (especially at work) is exactly that: people asking ChatGPT stuff and accepting hallucinations as fact, and then fighting me when I say it's not true.

There is "death by GPS" for people dying after blindly following their GPS instruction. There will definitely be a "death by AI" expression very soon.

Tesla-related fatalities probably count already, albeit without that label/name.

Hot take: Humans have always been bad at this (in the aggregate, without training). Only a certain percentage of the population took the time to investigate.

For most throughout history, whatever is presented to you that you believe is the right answer. AI just brings them source information faster so what you're seeing is mostly just the usual behavior, but faster. Before AI people would not have bothered to try and figure out an answer to some of these questions. It would've been too much work.


My sad prediction is that LLMs and humans will both get worse. Humans might get worse faster though.

HN commenters will be techno-optimistic misanthropes. Status quo ante bellum.

The secret sauce behind good understanding, taste, and style (both for coding and writing) has always been in the fine-tuning and RLHF steps. I'd be skeptical that the signals a few GitHub repos or blogs generate in the initial stages of training are that critical. There's probably also a filter for good taste on the initial training set, and these sets are so large that not even a single full epoch is done on the data these days.

It wouldn’t work at all.

I see the AI-hating part of HN has come out again

> It ends up being circular if the author used LLM help for this writeup, though there are no obvious signs of that.

Great argument for not using AI-assisted tools to write blog posts (especially if you DO use these tools). I wonder how much we're taking for granted in these early phases before it starts to eat itself.


What does eating itself even look like? It doesn’t take much salt to change a hash.
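To illustrate the salt point with a trivial sketch: change one character and the digest shares nothing with the original, so exact-match dedup of model output is easy to defeat.

    import hashlib

    a = hashlib.sha256(b"The quick brown fox jumps over the lazy dog").hexdigest()
    b = hashlib.sha256(b"The quick brown fox jumps over the lazy dog.").hexdigest()
    print(a)  # d7a8fbb307d7809469ca9abcb0082e4f8d5651e46d3cdb762d02d0bf37c9e592
    print(b)  # a completely different digest, from a one-character edit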

Being trained on its own results?

Pretty easy to detect, surely

I understand model output put back into training would be an issue, but if model output is guided by multiple prompts and edited by the author to his/her liking wouldn't that at least be marginally useful?

Random aside about training data:

One of the funniest things I've started to notice from Gemini in particular is that in random situations, it talks English with an agreeable affect that I can only describe as... Indian? I've never noticed such a thing leak through before. There must be a ton of people in India who are generating new datasets for training.


There was a really great article or blog post published in the last few months about the author's very personal experience whose gist was "People complain that I sound/write like an LLM, but it's actually the inverse because I grew up in X where people are taught formal English to sound educated/western, and those areas are now heavily used for LLM training."

I wish I could find it again, if someone else knows the link please post it!


I'm Kenyan. I don't write like ChatGPT, ChatGPT writes like me

https://news.ycombinator.com/item?id=46273466


Thanks for that link.

This part made me laugh though:

> These detectors, as I understand them, often work by measuring two key things: ‘Perplexity’ and ‘burstiness’. Perplexity gauges how predictable a text is. If I start a sentence, "The cat sat on the...", your brain, and the AI, will predict the word "floor."

I can't be the only one whose brain predicted "mat"?


And I thought it would be a hat...

No, that would be "in the hat."

Thank you!!! :)

I've been critical of people that default to "an em dash being used means the content is generated by an LLM", or, "they've numbered their points, must be an LLM"

I do know that LLMs generate content heavy with those constructs, but they didn't create the ideas out of thin air; it was in the training set, and existed strongly enough that LLMs saw it as commonplace/best practice.


That's very interesting. Any examples you can share which have those agreeable effects?

I'm going to do a cursory look through my antigrav history; I want to find it too. I remember it's primarily in the exclamations of agreement/revelation, and one time expressing concern, which I remember sounded slightly off from natural for an American English speaker.

Can't find anything; too many messages telling the agent "please do NOT make those changes". I'm going to remember to save them going forward.
