Ollama has released version 0.23.1, introducing Gemma 4 Multi-Token Prediction (MTP) speculative decoding for its MLX runner. This update, primarily benefiting macOS users with Apple Silicon, can prov...
Ollama v0.23.0 was released today, introducing official support for the Claude Desktop application. This integration allows users to run both Claude Cowork and Claude Code directly within the Claude D...
Ollama has released v0.22.1-rc1, bringing notable enhancements for local AI model execution. Key updates include the addition of NVIDIA TensorRT Model Optimizer import support, which allows for levera...
Ollama released version 0.21.1 recently, introducing the Kimi CLI for local AI model execution. Users can now launch the Kimi CLI via Ollama, which "excels at long horizon agentic execution tasks thro...
Ollama released version 0.20.6 today, focusing on enhancements for local AI deployments. The update notably improves Gemma 4 tool calling capabilities, incorporating Google's latest post-launch fixes ...
Ollama v0.21.0 was released today, introducing the Hermes Agent for enhanced local AI workflows. This integration allows users to deploy Nous Research's self-improving AI agent via the 'ollama launch ...
Ollama, a framework designed for running large language models locally, recently released version 0.20.7. This update brings key improvements for Linux users, especially those utilizing AMD GPUs for t...
Ollama recently published the v0.20.8-rc0 release candidate, introducing several updates with a key focus on improved hardware support. For Linux users, this version notably updates its AMD ROCm i...
Ollama v0.20.5 was released on April 9, 2026, introducing significant new features for local AI model interaction and performance. The update includes the OpenClaw channel setup, which allows users to...
Ollama recently released version 0.20.0, which introduces support for the Gemma 4 series of large language models. This includes "Effective 2B (E2B)", "Effective 4B (E4B)", "26B (Mixture of Experts mo...
Ollama, the local AI framework, released version 0.19.0 today, March 30, 2026. This update introduces a new web search plugin for its 'ollama launch pi' command, enabling direct web search capabilitie...
Ollama released version 0.18.3 yesterday, introducing direct integration with Microsoft Visual Studio Code through GitHub Copilot. This update allows developers to select and utilize any local or clou...
Ollama has released version 0.18.2 today, introducing performance improvements and fixes for its integration with OpenClaw. A key update in this release is the enhanced speed of local AI code generati...
Ollama v0.18.1 was released, introducing new web search and web fetch plugins specifically for OpenClaw. This update allows Ollama's language models, whether local or cloud-based, to search the web fo...
Ollama version 0.18.0 was released today, bringing significant performance improvements and new integration capabilities for AI model usage. The update enhances the speed of OpenClaw and Ollama's clou...
Ollama v0.17.5 was released today, introducing support for the Qwen 3.5 small model series, now available in 0.8B, 2B, 4B, and 9B parameter sizes. This update addresses several critical issues, includ...
Ollama released version 0.16.0 yesterday, enhancing its local large language model ecosystem with new models and a simplified command-line interface. The update introduces GLM-5, a 744 billion paramet...
Ollama 0.15.0 was released today, introducing the new "ollama launch" command. This command allows users to directly integrate Ollama's models with various coding tools, including Claude Code...
Ollama released version 0.14.3 three days ago, introducing several new large language models and text-to-image capabilities. Key additions include Z-Image Turbo, a 6 billion parameter text-to-image mo...
Ollama released version 0.14.2 today, January 19, 2026, a minor update following the experimental image generation models in 0.14.1. The highlight of this release is the introduction of "TranslateGemm...
Ollama v0.14.1 was released today, January 18, 2026, bringing experimental support for image generation models. This new functionality is available for Linux systems configured with CUDA-compatible GP...
Ollama has recently released version 0.13.2-rc2, bringing several updates focused on AI model performance and GPU compatibility. A significant change is the default enablement of flash attention for v...
Ollama recently released version 0.13.1-rc0, introducing several updates for local large language model development. This release addresses specific issues with CUDA VRAM discovery, which should lead ...