Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I hope this will fix up the poor quality that we're seeing on Claude Opus 4.6

But degrading a model right before a new release is not the way to go.

 help



I wish someone would elaborate on what they were doing and observed since Jan on opus 4.6. I’ve been using it with 1m context on max thinking since it was released - as a software engineer to write most of my code, code reviews + research and explain unfamiliar code - and haven’t notice a degradation. I’ve seen this mentioned a lot though.

I have seen that codex -latest highest effort - will find some important edge cases that opus 4.6 overlooked when I ask both of them to review my PRs.


I don't use it for coding, but I do use it for real world tasks like general assistant.

I did notice multiple times context rot even in pretty short convos, it trying to overachie and do everything before even asking for my input and forgetting basic instructions (For example I have to "always default to military slang" in my prompt, and it's been forgetting it often, even though it worked fine before)




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: