No, in general mlx versions are always faster, ice tested most of them. | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		phaedrix on April 21, 2025 \| parent \| context \| favorite \| on: Gemma 3 QAT Models: Bringing AI to Consumer GPUs No, in general mlx versions are always faster, ice tested most of them.

85392_school on April 21, 2025 [–]

What TPS difference are you getting?

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact