More

tempestn · 2026-06-09T22:29:56 1781044196

> If Claude gives me poor or incorrect advice while I’m working on an AI component, I have no way of knowing whether the model was confused, whether my problem is unsolvable, or if some invisible policy restriction quietly kicked in.

You should be able to know if your problem was solvable by using your own expertise and judgement, no? If you're relying on LLMs as a substitute for those, I wouldn't expect great results.

notrealyme123 · 2026-06-09T23:05:58 1781046358

You come up with a hypothesis -> you let fable implement it -> fable sabotages your experiment -> you get evidence that hypothesis is not true.

It's that simple.

hedora · 2026-06-09T23:13:03 1781046783

Or, worse:

- It says your safety hypothesis is true, you incorrectly ship, killing lots of people.

- It proposes dangerous experiments.

hedora · 2026-06-09T23:10:53 1781046653

No; once the LLM switches to this new saboteur mode, it’ll be very hard to detect.

Sabotage is an asymmetric weapon. The ratio of damage to effort is nearly unbounded, and any decent saboteur knows that the key trick is to make your output indistinguishable from incompetence.

They’re building state of the art offensive capabilities into a public model, then expecting to maintain control over when it decides to attack its human users.

The premise is laughable, and we’ve all seen how this movie ends.

HarHarVeryFunny · 2026-06-10T01:40:07 1781055607

Great way for Anthropic to build trust with the military

tempestn · 2026-06-09T20:30:39 1781037039

I honestly feel like that's a counter-example. With AI he'd be tweeting some other nonsense. It's not like anyone saw the image and thought he actually was orange Jesus.

FabCH · 2026-06-09T21:04:27 1781039067

Multiple people have the same response, I randomly selected this one for follow up:

Yes, but two things were lost:

1) the need for skill or an accomplice. He _couldn’t_ tweet that image in 2016, not without first asking someone to photoshop it. And that need to engage in human to human communication is something truly fundamental that was changed and lost.

2) Any ambiguity or misunderstanding. Yes bad textual tweets exist for a long time in politics. But there IS something about images that is more powerful than text. The text „I’m Jesus Christ and god sent me to heal the sick“ would probably make the news, but a lot of people would go: „is he quoting the bible? What’s going on?“, not so much with Jesus Picture.

tempestn · 2026-06-10T00:25:05 1781051105

Isn't that kind of good then? It's probably valuable for the public to know that this is how he sees himself.

tempestn · 2026-05-27T21:30:37 1779917437

And this is mostly thanks to the oil sands. Will that always be the economic engine it is today?

tempestn · 2026-05-27T18:50:23 1779907823

I'd hoped this would be a post about Giro Galoppo, which is a fantastic little board game if you have kids (and can find a copy). It's easy enough for small children to learn, but the mix of strategy and luck is such that an adult can put real effort into winning, but will still probably only win 2/3 to 3/4 of games, not every time. And it's quick to play. Great game.

tempestn · 2026-05-26T03:52:04 1779767524

I expect the more significant concern would be OpenClaw opening your front door for someone else.

phreeza · 2026-05-26T04:17:40 1779769060

I think it's just a joke/reference to HAL in 2001: A Space Odyssey

tempestn · 2026-05-08T20:13:18 1778271198

But don't we know that some of the vulnerabilities being discovered predate ai coding?

awesome_dude · 2026-05-08T20:45:17 1778273117

Certainly, and some discoveries have been attributed to AI (I was reading that mozilla firefox were praising mythos recently)

But that's not accounting for all of the discoveries, not at all.

I've also seen the npm people talking about the surge in AI code overwhelming the ability to properly review what's being distributed, and a large number of vulnerabilities being attributed to that

tempestn · 2026-04-25T23:48:42 1777160922

This is a reference to RJK Jr's pronouncement that Trump has a "different way of calculating percentages". Seems apt to me in this context.

Terr_ · 2026-04-26T00:01:32 1777161692

Very much another "Emperor's New Clothes" situation.

If the pathology was entirely within his own privately-owned company that'd be one thing, but Americans are going to continue to get hurt because of it.

tempestn · 2026-04-13T23:34:54 1776123294

How is it an anomaly? Global stocks go up (or at least have positive total return) over time on average because companies produce value. Ultimately it's true that that money has to come from somewhere, so you can say printing it is monetary policy, but the reason it can be done without runaway inflation is the tangible value produced by the firms.

tempestn · 2026-04-11T21:04:45 1775941485

Do you think carrying a gun gives the child soldier power to change his circumstances? It's a problem of organization and collective action, not lack of arms.

tempestn · 2026-04-11T21:01:07 1775941267

If they wanted it to persist, why would they give up a portion of their income to end it? Are you suggesting the 40% is just more honest than the 60?