Multimodal

May 15, 2026

MiniCPM-V-4.6 Packs Private Visual AI Into Phones

By vramkickedin

MiniCPM-V-4.6 is a new open-source multimodal model that brings image and video understanding directly to smartphones and small computers. It answers questions about photos and video clips without a cloud […]

May 12, 2026

Qwen3.5-9B-DeepSeek-V4-Flash-GGUF Brings Deep Reasoning Home

By vramkickedin

Qwen3.5-9B-DeepSeek-V4-Flash-GGUF is a compressed language model that packs DeepSeek-V4’s advanced reasoning into a 9-billion-parameter package for local use. It converts the full model into the GGUF format, so it […]

May 12, 2026

Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF

By vramkickedin

The Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF package delivers an uncensored, performance-enhanced version of Qwen’s latest 27B model in highly accurate compressed formats. This release strips away the original model’s refusal behavior, cutting the refusal […]

May 12, 2026

Google Turbocharges Gemma 4 With Gemma-4-26B-A4B-it-assistant

By vramkickedin

Google just dropped a new tool that makes its open-source AI models run much faster. Gemma-4-26B-A4B-it-assistant is a lightweight draft model that predicts tokens ahead of the main model, […]

May 12, 2026

Google Drops Gemma-4-31B-It-Assistant To Triple Local AI Speed

By vramkickedin

The Gemma-4-31B-It-Assistant is a lightweight draft model built to speed up text generation when paired with Google’s full Gemma 4 31B instruction-tuned model. It uses a technique called speculative decoding […]

May 10, 2026

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4 Opens Local Multimodal AI

By vramkickedin

NVIDIA has released Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4, an open multimodal AI model that simultaneously processes video, audio, images, and text. The 31-billion-parameter system uses a hybrid Mamba2-Transformer design that activates only about 3 […]

May 7, 2026

Mistral AI Introduces Mistral-Medium-3.5-128B As One Unified Tool

By vramkickedin

Mistral-Medium-3.5-128B is a dense flagship model designed to handle complex reasoning, coding, and instruction-following tasks. It serves as a unified replacement for several previous models released by the company. The […]

April 30, 2026

Nvidia Unleashes Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 Locally

By vramkickedin

Nvidia recently released Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16, an open multimodal AI system that processes video, audio, images, and text in a single workflow. Users can run it locally to summarize lengthy meetings, transcribe […]

April 30, 2026

Oceanflowlab Brings OmniVTG-7B to Pinpoint Exact Video Moments

By vramkickedin

OmniVTG-7B is an open-source model that pinpoints exact video segments using simple text prompts. Rather than tagging entire clips, it scans long footage and marks precise start and end times […]

About multimodal releases

Latest multimodal models