tbh ~1-3% PPL hit from Q4_K_M stopped being the bottleneck a while ago. the bottleneck is the 48 hours of guessing llama.cpp flags and chat template bugs before the ecosystem catches up. you are doing unpaid QA.
Just wait a week for model bugs to be worked out. This is well-known advice and a common practice within r/localllama. The flags are not hard at all if you're using llama.cpp regularly. If you're new to the ecosystem, that's closer to a one-time effort with irregular updates than it is to something you have to re-learn for every model.
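(for the curious, the "flag guessing" usually comes down to a handful of knobs; a rough sketch below using the llama-cpp-python bindings as a stand-in for the llama-server CLI, with made-up model paths and values, not a recommendation)

```python
# minimal sketch of the usual knobs: quant file, context size, GPU offload, chat template.
# the chat_format line is exactly the part that tends to break on brand-new models.
from llama_cpp import Llama

llm = Llama(
    model_path="some-model-Q4_K_M.gguf",  # placeholder path to a Q4_K_M quant
    n_ctx=8192,                            # context window
    n_gpu_layers=-1,                       # offload all layers to GPU if it fits
    chat_format="chatml",                  # wrong/missing template = garbage output
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "hello"}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```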
Yeah, at my last job there was a single outdated external wiki server left sitting in DO for exactly those kinds of reasons, while everything that was current or internal had already moved (if not twice). If it hadn't become such a security risk it would never have been migrated.
The problem is that a lot of this glue is proprietary by design at the various cloud services. I realize there are open source and alternative abstractions for a lot of the same services, but there’s still quite a bit of glue if you’re on AWS, for example, and looking to move to bare metal.
But maybe I’m just thinking in terms of agents’ current capabilities; fast forward a couple of years and even removing these abstractions or migrating off them could be very low friction.
But you can run most of the glue on your own dedicated instances.
I run k8s on a bunch of dedicated servers that are super cheap, and I have all the bells and whistles - just tell your coding agent to do it. You can literally spec out the kind of setup you would never build by hand, and it works brilliantly.
Postgres running on dedicated hardware, replicated and with WAL backups - easy, just tell codebuff (my harness of choice) to do it. Then any number of firewalls, load balancers, bastion servers, etc. If you can imagine it, codebuff will implement it.
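(once it's up, checking that the replication and WAL archiving actually work is scriptable too; a minimal sketch with psycopg, assuming a superuser DSN and placeholder hostnames)

```python
# sanity-check the setup described above: streaming replicas + WAL archiving health
import psycopg

with psycopg.connect("host=primary.internal dbname=postgres user=postgres") as conn:
    with conn.cursor() as cur:
        # replicas currently streaming from the primary
        cur.execute("SELECT client_addr, state, sync_state FROM pg_stat_replication;")
        for addr, state, sync_state in cur.fetchall():
            print(f"replica {addr}: {state} ({sync_state})")

        # WAL archiving counters (archive_command failures show up as failed_count)
        cur.execute("SELECT archived_count, failed_count, last_archived_wal FROM pg_stat_archiver;")
        archived, failed, last_wal = cur.fetchone()
        print(f"WAL archiving: {archived} archived, {failed} failed, last={last_wal}")
```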
IMO it doesn't flatten design into one thing. it splits it. cheap obvious work at scale, and a way smaller premium tier for real authorship. the middle is what actually gets crushed.
caveman stops being a style tool and starts being self-defense. once the prompt comes in up to 1.35x fatter, they've basically moved visibility and control entirely into their black box.
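(back-of-the-envelope with made-up numbers, just to show why that margin matters at a provider's scale)

```python
# hypothetical figures: what a 1.35x fatter prompt does to input-token spend
price_per_mtok = 3.00         # assumed $ per 1M input tokens
prompt_tokens = 20_000        # assumed prompt size before inflation
requests_per_day = 1_000_000  # assumed traffic

base = prompt_tokens / 1e6 * price_per_mtok * requests_per_day
fat = base * 1.35
print(f"daily input spend: ${base:,.0f} -> ${fat:,.0f} (+${fat - base:,.0f})")
```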
I keep saying that even if there's no malfeasance today, the incentive structure being set up (the model itself determines token use, and token use determines the provider's revenue) will absolutely overcome any safeguards or good intentions given long enough.
This might be true, but right now everybody is like "please let me spend more by making you think longer." The datacenter incentives from Anthropic this month are "please don't melt our GPUs anymore" though.
splitting the codebase and leaving 'cal.diy' for hobbyists is pretty much the classic open-core path. the community phase is over and they need to protect their enterprise revenue.
blaming AI scanners is just really convenient PR cover for a normal license change.
yeah, you forget the desktop app isn't actually local. claude code feels local right up until the api starts coughing up 500s. same thing, just in a terminal instead of a window.