I have a different perspective. The Trifecta is a *bad* model because it makes p...

winternewt · 2025-11-26T11:02:35 1764154955

You're not explaining why the trifecta doesn't solve the problem. What attack vector remains?

TeMPOraL · 2025-11-26T12:41:40 1764160900

None, but your product becomes about as useful and functional as a rock.

kccqzy · 2025-11-26T14:05:13 1764165913

This is what reasonable people disagree on. My employer provides several AI coding tools, none of which can communicate with the external internet. It completely removes the exfiltration risk. And people find these tools very useful.

TeMPOraL · 2025-11-26T17:40:30 1764178830

Are you sure? Do they make use of e.g. internal documentation? Or CLI tools? Plenty of ways to have Internet access just one step removed. This would've been flagged by the trifecta thinking.

kccqzy · 2025-11-26T18:16:25 1764180985

Yes. Internal documentation stored locally in Markdown format alongside code. CLI tools run in a sandbox, which restricts general internet access and also prevents direct production access.

gizzlon · 2025-11-26T19:47:28 1764186448

Can it _never_ _ever_ create a script or a html file and get the user to open it?

kccqzy · 2025-11-28T03:04:01 1764299041

That’s different. Now you are asking the user to do an action.

TeMPOraL · 2025-11-28T08:47:13 1764319633

The user could also be another program, or another AI agent.

Thorrez · 2025-11-26T13:37:01 1764164221

>There is no probable, verifiable solution here, not any more than when talking about human employees, contractors, friends.

Well when talking about employees etc, one model to protect against malicious employees is to require every sensitive action (code check in, log access, prod modification) to require approval from a 2nd person. That same model can be used for agents. However, agents, known to be naive, might not be a good approver. So having a human approve everything the agent does could be a good solution.