More

brokensegue · 2026-06-22T04:30:45 1782102645

there are models between 2-grams and 600m param models that would be good options. i don't expect a 2-gram to do very well here. also i'm not sure why this model isn't a fine choice if it solves their problem

throwa356262 · 2026-06-22T05:35:51 1782106551

What would you suggest instead?

stephantul · 2026-06-22T08:33:42 1782117222

A non-autoregressive transformer trained with a classification objective.

all2 · 2026-06-22T16:42:49 1782146569

These are absurdly effective for this kind of task. Training is fast and straight forward. Packaging for deployment as ONNX is pretty simple as well.

dev-experiments · 2026-06-22T19:51:48 1782157908

As a follow up to the original article, I added a new experiment using Logistic Regression and the results are very good. It actually improves on the accuracy by a few points.

More details here: https://www.teachmecoolstuff.com/viewarticle/using-logistic-...

brokensegue · 2026-06-20T20:34:27 1781987667

you should link your data to wikidata which will get you free connection back to crunchbase and other sources e.g. https://www.wikidata.org/wiki/Q97041185

You could even back some of the data from there

brokensegue · 2026-06-17T14:01:57 1781704917

EU tried to ban tracking cookies. Anyone could've seen the result coming (every site now disrupts your reading). The defeatist attitude would've been right there.

I don't see how games won't just charge you a $0.01 "subscription" that lasts 5 years or various other such sidesteps. it'll just make everything more annoying.

brokensegue · 2026-06-16T14:26:31 1781619991

anyone have a free photo for wikipedia?

brokensegue · 2026-06-12T02:06:51 1781230011

Using mtp takes away some of the magic for me

extraduder_ire · 2026-06-12T03:14:18 1781234058

That was my reaction too. Most of what I wanted to know from the video is how it would emulate a filesystem that does that.

brokensegue · 2026-06-11T17:39:27 1781199567

i assumed it's for influencers who want to make a video of the new city or something

brokensegue · 2026-06-04T03:48:56 1780544936

Amusingly you just conflated the pico (a dev board) with its chip (rp2040)

brokensegue · 2026-05-30T17:37:19 1780162639

Then maybe caveat your posts with that

brokensegue · 2026-05-28T14:23:00 1779978180

yeah i really don't like the corpus of statements and it makes me doubt lenz. consider

> “Artificial intelligence will cause widespread job loss among software engineers.”

https://lenz.io/c/ai-software-engineers-job-loss-impact-05e4...

this is a statement about the future. who knows? dataset also includes

> Robots will not replace human teachers in schools in the near future.

or

> Papua New Guinea has very few female members of parliament.

what counts as very few?

> “Taurine supplementation supports mood and emotional health in humans.”

why is this labeled as misleading? i'm not even sure when I'm supposed to use the misleading label

> Anaximander was the first scientist in recorded history.

this is a judgement call as the term scientist didn't exist.

the claims that feel actually solidly answerable seem to have much better LLM performance

kostaj · 2026-05-28T15:21:19 1779981679

Agree that some of the claims are forward-looking. The messiness of the real-world and real-user fact checks. No ground-truth verdicts are provided or used in the study though. It only measures the level of agreement between the selected models, not which one is right on which claim. I.e. none of the claims is actually labelled.

brokensegue · 2026-05-28T15:58:58 1779983938

were you involved in making the study? your bio says you work for them so you should probably indicate that in your comments.

lack of agreement when there is no singular correct answer (or any answer at all) isn't a useful metric

I ran into a lot of these kinds of issues when working on the Citation Needed WMF project (and related extensions). Truth is so often very nuanced.

simonw · 2026-05-28T16:16:38 1779984998

They introduced themselves as the study author here: https://news.ycombinator.com/item?id=48307887#48307899

brokensegue · 2026-05-28T16:25:58 1779985558

ah. I missed that.

brokensegue · 2026-05-21T17:06:49 1779383209

Most jeeps never go off-road