releases.shpreview
Ollama/Ollama/v0.23.1

v0.23.1

Gemma 4 MTP (Multi-token Processing) for the MLX runner

Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks.

ollama run gemma4:31b-coding-mtp-bf16

What's Changed

Full Changelog: https://github.com/ollama/ollama/compare/v0.23.0...v0.23.1

Fetched May 26, 2026