Hacker News | alphabettsy's comments

That’s unfortunate. I’m not interested in using Musk-associated products any more than I have to.

I’m trying to understand how this is useful information on its own?

Maybe I missed it, but it doesn’t tell you if it’s more successful for less overall cost?

I can easily make Sonnet 4.6 cost way more than any Opus model, because while it’s cheaper per prompt, it might take 10x more rounds to solve a problem (or never solve it).
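To make that concrete, here's a rough sketch of the arithmetic. Every number in it (per-round cost, rounds needed, success rate) is made up for illustration and is not real pricing or benchmark data; the function name is hypothetical too.

```python
def cost_per_solved_task(cost_per_round, rounds_per_attempt, success_rate):
    """Expected spend to end up with one solved task.

    cost_per_round:     average cost of one prompt/response round (assumed)
    rounds_per_attempt: rounds the model typically burns per attempt (assumed)
    success_rate:       fraction of attempts that actually solve the problem
    """
    if not 0 <= success_rate <= 1:
        raise ValueError("success_rate must be between 0 and 1")
    if success_rate == 0:
        # The "or never" case: no amount of spend produces a solution.
        return float("inf")
    return cost_per_round * rounds_per_attempt / success_rate


# Hypothetical numbers only: a cheaper-per-round model that needs 10x the
# rounds and fails more often, versus a pricier model that gets there quickly.
cheap_model = cost_per_solved_task(cost_per_round=0.10, rounds_per_attempt=30, success_rate=0.5)
pricey_model = cost_per_solved_task(cost_per_round=0.50, rounds_per_attempt=3, success_rate=0.9)

print(f"cheaper-per-round model: ${cheap_model:.2f} per solved task")
print(f"pricier-per-round model: ${pricey_model:.2f} per solved task")
```

With those invented inputs the "cheap" model ends up costing more per solved task, which is why a per-prompt price on its own doesn't say much.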


Everything in AI moves super quickly, including the hivemind. Anthropic was the darling a few weeks ago after the confrontation with the DoD, but now we hate them because they raised their prices a little. Join us!

The docs suggest not using max effort in most cases to avoid overthinking :shrug:

They've jumped the shark. I truly can't comprehend why all of these changes were necessary. They had a literal money printing machine that actually got real shit done, really well. Now it's a gamble every time and I am pulling back hard from the Anthropic ecosystem.

it's clearly all in your head. 4.6 is just as capable as it used to be. literally no one on the internet has managed to post credible and real evidence of a nerf

this is just another trendy conspiracy theory that people reinforce because of selection/recency bias. you hear "nerf", your brain overindexes on the next time Claude does poorly. it is the same phenomenon when you notice a new vocabulary word all the time.


It seems clear that it was a money spending machine, not a money printing machine.

That’s been there for a while.

IMO it doesn’t handily beat it.

It’s typically equivalent, sometimes better, sometimes behind. Better at following a well-defined plan, less good at concept exploration and planning imo.

At 1m context it’s basically the same price.


Prompt cache expired?

Perhaps, but I also don't see how a prompt cache miss or expiry should result in Claude stating affirmatively that it did something that it did not.

People keep saying this, but I’m not sure I buy it.

I was using both Codex and Claude Code heavily on some projects this weekend.

In one project Codex was screwing everything up and in another one absolutely killing it. I’ve seen the same from Claude.

In the bad Codex example it had the wrong idea and kept trying to figure out how to accomplish the same thing no matter how many times I attempted to correct it. Undoing the recent changes where it went down the wrong path was the only way to get things back on track.

I wonder if context poisoning is a bigger problem than people realize.


This is my experience too, and I always find these posts confusing. I consider myself a very heavy user, 4-6 hrs a day, and I never hit limits. I have hit them on the $20 plan, but not with Max.

Remind me who was president in 2020..


Fair, that one wasn't a good example for this.

Though IMHO it's not just a Biden problem, it's an "everybody in power" problem. They just can't seem to resist (ab)using their power to shape the conversation and censor their opponents. It's also not new, it's been happening for hundreds of years at least. But it did get a lot more brazen under Biden IMHO with Twitter/Facebook etc and admin officials telling private companies what to censor (err, "moderate").


I believe they make satellite components, not consumer hardware, in the US.


The linked BBC article above says the Starlink terminals are made in Texas.

