Not discussing Mythos here, but Opus. Opus to me has been significantly better a...

muyuu · 2026-04-08T00:37:11 1775608631

When did you last compare them? Codex right now is considerably better in my experience. Can't speak for Gemini.

gck1 · 2026-04-08T01:22:13 1775611333

Tried Gemini 2 weeks ago to see where it's at, with gemini-cli.

Failed to use tools, failed to follow instructions, and then went into deranged loop mode.

Essentially, it's where it was 1.5 years ago, when I last tried it.

It's honestly unbelievable how Google managed to fail so miserably at this.

4b11b4 · 2026-04-08T01:42:13 1775612533

Their harness might be behind

gck1 · 2026-04-08T02:25:15 1775615115

I think the failures I observed with Gemini are unrelated to the harness, because the same failures happened with third-party harnesses too.

unsupp0rted · 2026-04-08T11:19:44 1775647184

It’s great on AI Studio. Harness issues, I agree.

Kailhus · 2026-04-11T16:10:42 1775923842

I have not experienced any issues with Gemini 3.1 Pro.

sandos · 2026-04-08T07:30:52 1775633452

Agree, I never actually had great success with Opus. I think it's the failures that are annoying. It's probably better than codex when it's "good", but it fails in annoying ways that I think codex very seldom does.

StingyJelly · 2026-04-08T11:09:15 1775646555

I wouldn't call codex considerably better. It may depend on the specific codebase and your expectations, but codex produces more "abstraction for the sake of abstraction" even on simple tasks, while opus in my experience usually chooses the right level of abstraction for the given task.

otabdeveloper4 · 2026-04-08T11:13:17 1775646797

A secret art known to the cognoscenti as "benchmark gaming".