llm - PenguinPulse

Articles tagged: llm

Ollama v0.30.5 Fixes Gemma 4:12B Floating Point Exception Crash

June 05, 2026

Ollama, the framework for running large language models locally, recently released version 0.30.5. This update primarily addresses a "gemma4:12b floating point exception crash" that affected the newly...

Ollama v0.23.1 Accelerates Gemma 4 MLX Inference with MTP Speculative Decoding

May 06, 2026

Ollama has released version 0.23.1, introducing Gemma 4 Multi-token Processing (MTP) speculative decoding for its MLX runner. This update, primarily benefiting macOS users with Apple Silicon, can prov...

Ollama 0.23.0 Adds Claude Desktop Integration, Server-Driven Model Recommendations

May 03, 2026

Ollama v0.23.0 was released today, introducing official support for the Claude Desktop application. This integration allows users to run both Claude Cowork and Claude Code directly within the Claude D...

Collabora Integrates BitNet Ternary LLM Inference into ExecuTorch via Vulkan

April 22, 2026

Collabora announced on April 17, 2026, the integration of BitNet-style ternary Large Language Model (LLM) inference into ExecuTorch. This implementation leverages ExecuTorch's Vulkan backend to enable...

SDL Implements Policy Forbidding AI/LLM Generated Code Contributions

April 16, 2026

Today, the Simple DirectMedia Layer (SDL) project, a widely used cross-platform development library for games and a core component of the Steam Runtime, implemented a new policy. This policy explicitl...

GreenBoost Linux Module Augments NVIDIA vRAM with System RAM/NVMe for LLMs

March 15, 2026

An independently developed open-source Linux kernel module called GreenBoost has been reported today. It aims to augment the dedicated video memory (vRAM) on NVIDIA discrete GPUs by utilizing both sys...

Ollama v0.16.0 Introduces New GLM-5, MiniMax-M2.5 LLMs and CLI Tools

February 13, 2026

Ollama released version 0.16.0 yesterday, enhancing its local large language model ecosystem with new models and a simplified command-line interface. The update introduces GLM-5, a 744 billion paramet...

Ollama 0.14.3 Adds Z-Image Turbo Text-to-Image and GLM-4.7-Flash LLM

January 23, 2026

Ollama released version 0.14.3 three days ago, introducing several new large language models and text-to-image capabilities. Key additions include Z-Image Turbo, a 6 billion parameter text-to-image mo...

Ollama v0.13.1-rc0 Improves CUDA VRAM Discovery, Adds Cogito-v2.1 Tool Calling

December 01, 2025

Ollama recently released version 0.13.1-rc0, introducing several updates for local large language model development. This release addresses specific issues with CUDA VRAM discovery, which should lead ...