Yep, it's wild how little emphasis there is on control and replicability in these posts.
These models are already useful for a myriad of use cases. It's really not that important whether a model can 1-shot a particular problem or draw a cuter pelican on a bike. Past a certain degree of quality, process and reliability matter far more for anything other than complete hands-off usage, which isn't something you're really going to do in business.
The fact that my tool may be gone tomorrow (and this has actually happened before), with no guarantee of a proper substitute... that's a lot more of a concern than an extra point on some benchmark.
what is the logic behind the idea that anything that can be homegrown should be legal? it makes enforcement harder, but it doesn't make any potential damages any better or worse
it's easy to just look at the upside when something doesn't hurt you and you simply gain an extra choice, but knowing that it can and does wreck the lives of many, it feels painful for me to vote either for or against
typically those dense models are too slow on Strix Halo to be practical, expect 5-7 tps
you can get an idea by looking at other dense benchmarks here: https://strixhalo.zurkowski.net/experiments - i'd expect this model to be tested here soon, i don't think i will personally bother
similar numbers here - slightly higher PP, and slightly better peak speed and retention with q8_0 KV cache quants too. llama-bench results below, cba to format for hn:
https://pastebin.com/raw/zgJeqRbv
GTR 9 Pro, "performance" profile in BIOS, GTT instead of GART, Fedora 44
If I did a proper benchmark, I think the numbers would match what you got. Minimax M2.7 is also surprisingly not that slow, and in some ways faster, as it seems to get things right with less thinking output (around 140 t/s prefill and 23 t/s generation).
The problem with M2.7 is that it's full GQA, meaning quadratic attention. It does start fast, but by 64k tokens deep, the version I'm running (Unsloth's UD IQ2_XXS) drops 95% in pp512, from 261.3 t/s at 0 context depth to 13.1 t/s. The q8_0 KV cache does help, still hitting 57.4 t/s at 64k depth vs 258.3 t/s at 0 depth. TG retention is better, but still approaches single digits by 64k depth even with the q8_0 KV cache.
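For anyone who wants to sanity-check those percentages, here's a tiny sketch (my own helper, not part of llama-bench) that computes retention and drop from the throughput figures quoted above:

```python
# Hypothetical helper for the llama-bench numbers quoted above.
# retention = fraction of 0-depth throughput kept at a given context depth.

def retention(t_at_zero_depth, t_at_depth):
    return t_at_depth / t_at_zero_depth

# pp512, default KV cache: 261.3 t/s at depth 0 -> 13.1 t/s at 64k
drop = 1 - retention(261.3, 13.1)
print(f"pp512 drop at 64k: {drop:.0%}")        # ~95%

# pp512, q8_0 KV cache: 258.3 t/s at depth 0 -> 57.4 t/s at 64k
kept = retention(258.3, 57.4)
print(f"q8_0 retention at 64k: {kept:.0%}")    # ~22%
```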
That said, it was my favorite model when I valued output quality above all else, at least up until the new Qwen 3.6 27B, which I'm currently playing with.
I suspect I will like Qwen 3.6 122B A10B a LOT, maybe even better than M2.7.
I'm not sure what you're referring to by "that", but I think you're right that it's $10B to not purchase or $60B to purchase: effectively posting $10B for an option with a $50B strike price.
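The arithmetic behind that framing, as a quick sketch (figures from the thread; the option structure is my reading, not stated in any deal terms):

```python
# The $10B is paid either way, like an option premium; exercising
# (going through with the purchase) costs the $50B strike on top.

premium = 10  # $B, sunk cost whether or not you purchase
strike = 50   # $B, additional cost to complete the purchase

cost_if_walk_away = premium           # $10B total
cost_if_purchase = premium + strike   # $60B total
print(cost_if_walk_away, cost_if_purchase)  # 10 60
```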