Hacker News | bytesandbits's comments

Handy with Parakeet is pretty awesome, by the way!

Agree. Slept on.

Wish they would do an iOS version, but the creator already kind of dismissed it.


I just don't have the bandwidth to run another project; maintaining Handy is hard enough on its own, especially for free!

I didn't just dismiss it for no reason, I am a human! I have needs, and I can't just sleeplessly stay in front of the computer putting out code. If I had more time I would, but alas.

Someone could easily vibe code an iOS version in a few hours. I could do the same but I do not have time to support it.


Thank you for your work, I highly appreciate it!

Thank you!!

Fine, I will port it myself. Real-time, sub-100 ms latency. Here:

https://testflight.apple.com/join/myNP5XvU


Unlimited free Parakeet on iOS: VoiceInk

https://apps.apple.com/us/app/voiceink-ai-dictation/id675143...

(I was searching the same as you before I found this last month)


I like Handy a lot, so clean.

Here are the original NASA photos at high resolution, without unnecessary ads:

https://www.nasa.gov/gallery/journey-to-the-moon/


why is a mail client needed in an onboard space computer at all?

To send email.

By the time somebody remembers to read those mails, the mission will already be over and forgotten. LOL

I wonder if we could FOIA that inbox.

We constantly underestimate the power of inference scaffolding. I have seen it in all domains: coding, ASR, ARC-AGI benchmarks, you name it. Scaffolding can do a lot! And post-training too. I am confident our currently pre-trained models can beat this benchmark at over 80% with the right post-training and scaffolding. That being said, I don't think ARC-AGI proves much. It is not a useful task in the wild; it is just a game, a strange and confusing one. For me this is a pointless pseudo-academic exercise: good to have, but it by no means measures the intelligence, let alone the utility, of a model.


That's unsurprising given that a lot of our own abilities as humans come from painstakingly acquired practices, methodologies, and tools (like pencil and paper, note taking, let alone algebra, formal methods, and electromechanical aids). We call this "education", but it works in a way that is more similar to agentic harnesses than to pretraining or fine-tuning. This is reflected in the fundamentally different ways in which children and adults learn new skills.


Scaffolding is all you need. I am absolutely certain about that. It's about finding good ways to approximate, at inference time, the reward function used during post-training. A general enough reward that can score candidates well will inevitably improve the abilities of LLMs when put inside scaffolds.


What exactly does scaffolding mean in this context? Genuine question.


I'm gonna guess it means "whatever we still need humans to figure out to spoon feed the models"

Anything that doesn't touch the model parameters once the model has been compiled. For example, in streaming ASR with an encoder-decoder, you can get gains in accuracy just by improving the encoder-decoder orchestration and ratio, the frequency of forward passes, or by dynamically adjusting the length of rolling windows (if using full attention). Prompting is part of this too, including few-shot examples. So is the decoding strategy (top-k, nucleus, speculative decoding, greedy, or anything else), as is applying signal processing or any other kind of processing to the input before it goes into the model, or to the output. There are a lot of things you can do.
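A minimal sketch of one such trick, the dynamically adjusted rolling window: here `transcribe` is a stub standing in for any ASR forward pass, and the window grows or shrinks depending on whether consecutive hypotheses agree with what was already emitted. All names and thresholds are illustrative assumptions, not a real system.

```python
def transcribe(chunk):
    """Stub 'model': one token per sample. Swap in a real ASR call here."""
    return ["tok%d" % c for c in chunk]

def stream(samples, hop=2, min_win=2, max_win=6):
    """Rolling-window decoding whose window length adapts at inference time."""
    win, emitted = max_win, []
    for end in range(hop, len(samples) + 1, hop):
        window = samples[max(0, end - win):end]
        hyp = transcribe(window)
        # Context tokens = everything in the hypothesis except the new hop.
        ctx = hyp[:-hop] if len(hyp) > hop else []
        # If the context agrees with what we already emitted, shrink the
        # window (cheaper forward passes); otherwise grow it for stability.
        agree = (emitted[-len(ctx):] == ctx) if ctx else True
        win = max(min_win, win - 1) if agree else min(max_win, win + 1)
        emitted.extend(hyp[-hop:])  # emit only the newly covered tokens
    return emitted
```

None of the model parameters are touched; only the orchestration around the forward pass changes.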


Also think about the program-synthesis approach proposed by Poetiq.ai: Python programs are generated and evaluated against previous examples, then in-context learning is done programmatically via prompt concatenation. If you can score the working and non-working examples online, you have a very strong reward signal.
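A hedged toy version of that generate-and-score loop (the candidate "programs" here are hand-written lambdas rather than LLM output, and the online reward is plain accuracy on held-out examples; everything named below is illustrative):

```python
def score(program, examples):
    """Reward signal: fraction of (input, output) pairs the candidate gets right."""
    return sum(program(x) == y for x, y in examples) / len(examples)

def synthesize(candidates, examples, threshold=1.0):
    """Return the first candidate clearing the score bar, plus the scored
    history that could be concatenated back into the prompt for ICL."""
    history = []
    for prog in candidates:
        s = score(prog, examples)
        history.append((prog, s))
        if s >= threshold:
            return prog, history
    return max(history, key=lambda t: t[1])[0], history

# Toy task: recover f(x) = 2x + 1 from examples.
examples = [(1, 3), (2, 5), (3, 7)]
candidates = [lambda x: x + 2, lambda x: 2 * x, lambda x: 2 * x + 1]
best, history = synthesize(candidates, examples)
```

In the real setting the candidate list would be streamed from a model and the scored history fed back as in-context examples, but the reward plumbing is the same shape.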


Apple



4x faster PREFILL, not decode. Decode is bandwidth-bound; prefill is FLOPs-bound.
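A back-of-envelope sketch of that distinction. All hardware and model numbers below are assumed for illustration, not measurements of any particular chip:

```python
# Assumed model and hardware figures (illustrative only).
params = 8e9                  # 8B-parameter model
bytes_per_param = 2           # fp16 weights
flops_per_token = 2 * params  # ~2 FLOPs per parameter per token

peak_flops = 100e12           # 100 TFLOPS fp16 compute (assumed)
peak_bw = 1e12                # 1 TB/s memory bandwidth (assumed)

# One full weight read from memory:
t_memory = params * bytes_per_param / peak_bw

# Prefill: thousands of tokens share each weight read -> compute dominates.
prefill_tokens = 2048
t_compute_prefill = prefill_tokens * flops_per_token / peak_flops
prefill_bound = "flops" if t_compute_prefill > t_memory else "bandwidth"

# Decode: one token per weight read -> memory traffic dominates.
t_compute_decode = flops_per_token / peak_flops
decode_bound = "flops" if t_compute_decode > t_memory else "bandwidth"
```

With these numbers, compute time dwarfs the weight read during prefill, while a single decode step finishes its math long before the weights have streamed in.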


Do you run two eSIMs when traveling, and if so, how is stability / battery life?


Always two SIMs/eSIMs running simultaneously. Compared to the previous non-Apple modem, it's night and day battery-wise.

Didn't notice any issues with connection speed/stability.


incredible work


sensei karpathy has done it again


Parakeet v3 has a much better RTFx than Moonshine; it's not just about parameter counts. It runs faster.

https://huggingface.co/spaces/hf-audio/open_asr_leaderboard


That was my experience when I tried Moonshine against Parakeet v3 via Handy. Moonshine was noticeably slower on my 2018-era Intel i7 PC, and didn't seem as accurate either. I'm glad it exists, and I like the smaller size on disk (and presumably RAM too). But for my purposes with Handy I think I need the extra speed and accuracy Parakeet v3 is giving me.


It is about the parameter counts if what you care about is edge devices with limited RAM. Beyond a certain size your model just doesn't fit; it doesn't matter how good it is, you still can't run it.


I am not sure what "edge" device you want to run this on, but you can compress Parakeet to under 500 MB of RAM / disk with dynamic quants and on-the-fly dequantization (GGUF or CoreML centroid-palettization style), and retain essentially all the accuracy.

And just to be clear, 500 MB is enough even for a Raspberry Pi. Then your problem is not memory, it's FLOPS. It might run in real time on an RPi 5, since it has around 50 GFLOPS of FP32, i.e. 100 GFLOPS of FP16, so about 20-50x less than a modern iPhone. I don't think it will quite keep real time, TBF, but it will be close.

Regardless, with such a quantization strategy this model runs at a 10x+ real-time factor even on 6-year-old iPhones (which you can acquire for under $200), and offline at a reasonable speed, essentially anywhere.

You get the best of both worlds: the accuracy of a Whisper-style transformer at the speed and footprint of a small model.
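To sanity-check the footprint claim with back-of-envelope arithmetic, assuming roughly 0.6B parameters for Parakeet v3 and a 10% overhead for quantization scales/centroids (both figures are assumptions):

```python
params = 0.6e9                 # assumed parameter count for Parakeet v3

fp16_mb = params * 2 / 1e6     # fp16 baseline: 2 bytes per parameter
q4_mb = params * 0.5 / 1e6     # 4-bit weights: 0.5 bytes per parameter
overhead_mb = 0.1 * q4_mb      # scales/centroids etc. (assumed 10%)
total_mb = q4_mb + overhead_mb
```

That puts the fp16 model around 1.2 GB and the 4-bit version comfortably under the 500 MB figure quoted above.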


