News | Local AI News

June 8, 2026

Flux_ID_Adjuster_V2 Banishes The Waxy Skin Look From AI Portraits

By vramkickedin

Flux_ID_Adjuster_V2 is a new ComfyUI node that improves identity consistency and realism in Flux.2 Klein 9B images. It tackles the common waxy skin problem by letting users control how reference […]

June 8, 2026

NVIDIA Drops Cosmos3-Super-Text2Image for Pro Image Crafting

By vramkickedin

Cosmos3-Super-Text2Image is a new open-source image generation model from NVIDIA that creates high-fidelity pictures directly from written prompts. It is a text-to-image variant of the broader Cosmos3 platform, which is […]

June 8, 2026

Cosmos3-Super-Image2Video Animates Stills with a Single Prompt

By vramkickedin

Cosmos3-Super-Image2Video is a new open-source AI model that converts a single image into a short video clip guided by a text description. Released by NVIDIA, the 64-billion-parameter tool along with […]

June 8, 2026

ByteDance Bernini Crafts Videos With Words, Not Pixel Paintbrushes

By vramkickedin

ByteDance has released Bernini, an open-source framework that unifies video generation and editing through a semantic planning approach. Instead of controlling pixels directly, the system uses a multimodal large language […]

June 6, 2026

Flux-2-Klein-9B-Schematic-Lora Morphs CV Tasks Into Image Edits

By vramkickedin

The Flux-2-Klein-9B-Schematic-Lora release offers a set of six LoRA adapters that reframe common computer vision tasks as simple image-editing jobs. Each adapter produces a schematic RGB output—like a depth map […]

June 6, 2026

Cosmos3-Nano Conjures Video, Audio, and Robot Commands from Any Input

By vramkickedin

NVIDIA has released Cosmos3-Nano, a 16-billion-parameter omnimodal model that turns text, images, video, audio, or action data into dynamic video with synced sound, reasoning text, or robot movement commands. The […]

June 6, 2026

Bonsai-Image-Ternary-4B-Gemlite-2bit Shrinks A 4B Image Model Into A Tiny 1.21 GB For Fast Local Art

By vramkickedin

The new Bonsai-Image-Ternary-4B-Gemlite-2bit model compresses a 4-billion-parameter text-to-image diffusion transformer into just 1.21 GB. It uses ternary weights—each limited to -1, 0, or +1 with shared scaling—to shrink the model […]
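
For readers curious how ternary weights with a shared scale translate into a small file, here is a rough sketch. It is not the model's actual code: the absmean scaling rule, the 2-bit packing layout, and every function name below are assumptions made for illustration. The last line is plain arithmetic: 4 billion weights at 2 bits each come to about 1.0 GB, so the reported 1.21 GB presumably also covers scales and any layers left at higher precision (a guess, not something the release states here).

```python
def ternarize(weights):
    """Map floats to codes in {-1, 0, +1} plus one shared scale.

    Assumption: the scale is the mean absolute value of the group.
    """
    scale = sum(abs(w) for w in weights) / len(weights) or 1.0
    codes = [max(-1, min(1, round(w / scale))) for w in weights]
    return codes, scale


def pack_2bit(codes):
    """Pack ternary codes four to a byte, storing code + 1 (0, 1, or 2) in 2 bits."""
    out = bytearray((len(codes) + 3) // 4)
    for i, c in enumerate(codes):
        out[i // 4] |= (c + 1) << (2 * (i % 4))
    return bytes(out)


def unpack_2bit(packed, n):
    """Recover the first n ternary codes from the packed bytes."""
    return [((packed[i // 4] >> (2 * (i % 4))) & 0b11) - 1 for i in range(n)]


weights = [0.9, -0.05, -1.2, 0.4, 0.0, -0.7]
codes, scale = ternarize(weights)
packed = pack_2bit(codes)
restored = unpack_2bit(packed, len(codes))

# Dequantized approximation of the original weights.
approx = [c * scale for c in restored]

# Back-of-envelope size for 4 billion weights at 2 bits each, in GB.
packed_gb = 4e9 * 2 / 8 / 1e9
```

In this toy run the six weights fit in two bytes, and each restored weight is either zero or plus or minus the shared scale.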

June 6, 2026

Comfyui-Anima-IPadapter Clones Character Looks Without Training

By vramkickedin

The Comfyui-Anima-IPadapter custom node brings IP-Adapter support to the Anima DiT model inside ComfyUI. It lets users inject reference image features directly into the generation process using decoupled cross-attention. This […]

June 6, 2026

LongLive-RAG Rewires Video Makers To Remember Past Frames

By vramkickedin

LongLive-RAG is an open-source framework that turns long video generation into a retrieval problem. An autoregressive generator can look back over its own output and pull in the most relevant […]