🐧 PenguinPulse

Linux Graphics & Gaming News

Articles tagged: llm

← Back to all articles

Ollama v0.30.5 Fixes Gemma 4:12B Floating Point Exception Crash

Ollama, the framework for running large language models locally, recently released version 0.30.5. This update primarily addresses a "gemma4:12b floating point exception crash" that affected the newly...

Read more →

Ollama v0.23.1 Accelerates Gemma 4 MLX Inference with MTP Speculative Decoding

Ollama has released version 0.23.1, introducing Gemma 4 Multi-token Processing (MTP) speculative decoding for its MLX runner. This update, primarily benefiting macOS users with Apple Silicon, can prov...

Read more →

Ollama 0.23.0 Adds Claude Desktop Integration, Server-Driven Model Recommendations

Ollama v0.23.0 was released today, introducing official support for the Claude Desktop application. This integration allows users to run both Claude Cowork and Claude Code directly within the Claude D...

Read more →

Collabora Integrates BitNet Ternary LLM Inference into ExecuTorch via Vulkan

Collabora announced on April 17, 2026, the integration of BitNet-style ternary Large Language Model (LLM) inference into ExecuTorch. This implementation leverages ExecuTorch's Vulkan backend to enable...

Read more →

SDL Implements Policy Forbidding AI/LLM Generated Code Contributions

Today, the Simple DirectMedia Layer (SDL) project, a widely used cross-platform development library for games and a core component of the Steam Runtime, implemented a new policy. This policy explicitl...

Read more →

GreenBoost Linux Module Augments NVIDIA vRAM with System RAM/NVMe for LLMs

An independently developed open-source Linux kernel module called GreenBoost has been reported today. It aims to augment the dedicated video memory (vRAM) on NVIDIA discrete GPUs by utilizing both sys...

Read more →

Ollama v0.16.0 Introduces New GLM-5, MiniMax-M2.5 LLMs and CLI Tools

Ollama released version 0.16.0 yesterday, enhancing its local large language model ecosystem with new models and a simplified command-line interface. The update introduces GLM-5, a 744 billion paramet...

Read more →

Ollama 0.14.3 Adds Z-Image Turbo Text-to-Image and GLM-4.7-Flash LLM

Ollama released version 0.14.3 three days ago, introducing several new large language models and text-to-image capabilities. Key additions include Z-Image Turbo, a 6 billion parameter text-to-image mo...

Read more →

Ollama v0.13.1-rc0 Improves CUDA VRAM Discovery, Adds Cogito-v2.1 Tool Calling

Ollama recently released version 0.13.1-rc0, introducing several updates for local large language model development. This release addresses specific issues with CUDA VRAM discovery, which should lead ...

Read more →