Ollama is a user-friendly front end for LLM inference. It is powered by llama.cpp (or a fork of it), which is more power-user oriented and requires command-line wrangling. GGML is the tensor math library behind llama.cpp, and GGUF is the associated file format used for storing LLM weights.
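
To make the layering concrete, here's a minimal Python sketch of talking to Ollama's local REST API (this assumes Ollama is running on its default port 11434; the model name is just a placeholder for whatever you've pulled):

    import requests

    # Ollama exposes a local HTTP API on port 11434 by default.
    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={
            "model": "llama3",  # placeholder; any model pulled with `ollama pull`
            "prompt": "Explain GGUF in one sentence.",
            "stream": False,    # return one JSON object instead of a token stream
        },
        timeout=120,
    )
    resp.raise_for_status()
    print(resp.json()["response"])

Under the hood, the weights are stored as GGUF and handed to the bundled llama.cpp-based runtime.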


I've found llama.cpp (as I understand it, Ollama now uses its own fork of it) to work much better in practice: it's faster and much more flexible.
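
For comparison, here's a rough sketch of driving llama.cpp directly through the llama-cpp-python bindings, where you control the GGUF path and sampling/context parameters yourself (path and parameter values below are purely illustrative):

    from llama_cpp import Llama

    # Load a local GGUF file directly; path is made up for illustration.
    llm = Llama(
        model_path="./models/mistral-7b-instruct.Q4_K_M.gguf",
        n_ctx=4096,        # context window size
        n_gpu_layers=-1,   # offload all layers to the GPU if one is available
    )

    out = llm(
        "Explain GGUF in one sentence.",
        max_tokens=128,
        temperature=0.2,
    )
    print(out["choices"][0]["text"])

This is the kind of knob-level control the comment above is referring to; Ollama trades it away for convenience.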



