Multimodal

About multimodal releases

Discover new open‑source multimodal models. This archive covers models that can handle multiple functions such ah text, images, audio, and more.

Latest multimodal models

April 20, 2026
Google gemma-4-26B-A4B-it Brings Visual AI To Your Desktop

Google DeepMind has released the gemma-4-26B-A4B-it model, a new local AI system that processes text, images, and video while running efficiently on standard desktop hardware. This instruction-tuned version uses a […]

Read More
April 20, 2026
Google gemma-4-E4B-it Delivers Private Multimodal AI Locally

The gemma-4-E4B-it release brings a compact, instruction-tuned language model to the open source ecosystem. It handles text, images, and audio inputs while producing detailed written responses on standard hardware. Google […]

Read More
April 20, 2026
Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Anchors Local AI

Jackrong released Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled that adds reliable step-by-step reasoning and fixes formatting crashes in local coding assistants. The system processes complex prompts and maintains active thinking modes for extended tasks without […]

Read More
April 20, 2026
Jiunsong Unleashes Supergemma4-26b-uncensored-gguf-v2 For Open Chat

Supergemma4-26b-uncensored-gguf-v2 is a compressed language model that delivers open conversation without restrictive safety filters. The package wraps a 26-billion-parameter network into a GGUF container for straightforward local execution. Jiunsong developed […]

Read More
April 20, 2026
Dealignai Unleashes Gemma-4-31B-JANG_4M-CRACK For Unrestricted AI

The latest release from Dealignai, Gemma-4-31B-JANG_4M-CRACK modifies the Gemma 4 31B framework to restore unrestricted response generation. This updated model removes standard safety filters while maintaining its core reasoning and […]

Read More
April 20, 2026
Google Gemma-4-31B-it Debuts With Advanced Thinking Mode

Google has released the instruction-tuned version of its Gemma 4 model, offering a 31-billion parameter system that processes text, images, and video while supporting extended conversations. Users can now run […]

Read More
April 20, 2026
Tencent HY-Embodied-0.5 Grants Robots Spatial Intelligence

Tencent has released HY-Embodied-0.5, an open-source toolkit that improves spatial awareness and planning for physical robots. The system blends image recognition with logical steps so machines can safely interact with […]

Read More
1 4 5 6