🐧 PenguinPulse

Linux Graphics & Gaming News

Articles tagged: local ai

← Back to all articles

Ollama v0.22.1-rc1 Adds NVIDIA TensorRT Import, Enhanced Model Batching

Ollama has released v0.22.1-rc1, bringing notable enhancements for local AI model execution. Key updates include the addition of NVIDIA TensorRT Model Optimizer import support, which allows for levera...

Read more →

Ollama v0.21.1 Adds Kimi CLI, Enhances MLX Runner Performance and Features

Ollama released version 0.21.1 recently, introducing the Kimi CLI for local AI model execution. Users can now launch the Kimi CLI via Ollama, which "excels at long horizon agentic execution tasks thro...

Read more →

Ollama v0.20.6 Delivers Gemma 4 Tool Calling Updates, Parallel Streaming Fixes

Ollama released version 0.20.6 today, focusing on enhancements for local AI deployments. The update notably improves Gemma 4 tool calling capabilities, incorporating Google's latest post-launch fixes ...

Read more →

Ollama v0.21.0 Integrates Nous Research's Hermes Agent for Adaptive AI Workflows

Ollama v0.21.0 was released today, introducing the Hermes Agent for enhanced local AI workflows. This integration allows users to deploy Nous Research's self-improving AI agent via the 'ollama launch ...

Read more →

Ollama v0.20.7 Updates ROCm to 7.2.1 for Linux, Addresses Gemma Model Quality

Ollama, a framework designed for running large language models locally, recently released version 0.20.7. This update brings key improvements for Linux users, especially those utilizing AMD GPUs for t...

Read more →

Ollama v0.20.5 Debuts OpenClaw Multi-Channel Integration, Gemma 4 Flash Attention

Ollama v0.20.5 was released on April 9, 2026, introducing significant new features for local AI model interaction and performance. The update includes the OpenClaw channel setup, which allows users to...

Read more →

Ollama v0.20.0 Introduces Gemma 4 Models; v0.20.4-rc Improves MLX & Flash Attention

Ollama recently released version 0.20.0, which introduces support for the Gemma 4 series of large language models. This includes "Effective 2B (E2B)", "Effective 4B (E4B)", "26B (Mixture of Experts mo...

Read more →

Ollama v0.19.0 Enhances Local AI with Web Search Plugin, KV Cache Efficiency

Ollama, the local AI framework, released version 0.19.0 today, March 30, 2026. This update introduces a new web search plugin for its 'ollama launch pi' command, enabling direct web search capabilitie...

Read more →