March 2026

This month has been packed with releases, dominated by a massive expansion of tools for ComfyUI users alongside powerful new language models and creative suites.

ComfyUI Gets Major Upgrades

Visual and Motion Control

Creators using ComfyUI now have extensive control over their outputs. ComfyUI-Dynamic-Sigmas and ComfyUI-OpenPose-Studio bring visual tuning and pose editing directly to the interface. For animation, ComfyUI-Yedp-Action-Director and ComfyUI-Yedp-Mocap introduce 3D viewports and motion capture capabilities. New nodes like ComfyUI-wan-i2v-control and ComfyUI-Wan-TimeToMove offer precise region and motion guidance for video generation, while ComfyUI_CameraAngleSelector provides an interactive way to choose camera angles.

Workflow Optimization and Management

Managing models is easier with ComfyUI-advanced-model-manager and ComfyUI-Template-Model-Downloader, which automate file organization. Efficiency gets a boost from ComfyUI-CacheDiT and ComfyUI-meancache-z, speeding up generation times. Other handy utilities include ComfyUI-ParallelAnything for multi-GPU processing, ComfyUI Dynamic VRAM for memory optimization, and ComfyUI-IMGNR-Utils for workflow streamlining. Prompt management is covered by ComfyUI-Prompt-Stash and ComfyUI-WildPromptor.

Specialized Generators and Editors

Specific models received dedicated tooling this month. ComfyUI-Flux2Klein-Enhancer and FLUX.2 Klein LoRA Loader refine Flux generation. Video workflows benefit from ComfyUI-PowerLTXLoraLoaderExtra and ComfyUI-ZImageTurboProgressiveLockedUpscale. Image creation is bolstered by Comfy_HunyuanImage3, ComfyUI-ZImagePowerNodes, and the integrated drawing studio ComfyUI-Comfysketch. ComfyUI-Olm-SplineMask allows for precise masking, and ComfyUI-Qwen3-ASR adds speech recognition capabilities.

Language Models for Every Task

High-Performance and Reasoning Models

NVIDIA released gpt-oss-puzzle-88B for efficient H100 deployment, while Nemotron Cascade 2 30B targets local inference. For reasoning, Chuck Norris LLM and GRM2-3b tackle logic and math step-by-step. LongCat-Flash-Prover handles formal mathematics, and MRS-Core provides a reasoning engine for agents. PlaiTO focuses on structured thinking for humanities.

Coding and Multimodal Capabilities

Developers can utilize Qwen3-Coder-Next for coding tasks or run a Character-level GPT transformer to train models from scratch. Multimodal options include MiniCPM-o-4.5 for real-time vision and speech, and Ming-flash-omni-2.0 for unified processing. Qwen3.5-122B-A10B-Uncensored offers uncensored responses, and Nanbeige4.1-3B covers reasoning and code in a compact size. Regency Aghast 27b provides a unique persona-based experience.

Creative Tools for Audio and Video

Video and Image Generation

daVinci-MagiHuman creates audio-video content from text, and SANA-Video generates high-quality 2K videos. Omni-Video 2 combines editing and generation, while ID-LoRA LTX2.3 creates talking-head videos. SAMA-14B allows for instruction-guided video editing. Image tools include Mugen for anime styles, ArcFlow for fast generation, and FreeFuse for combining subjects. Z-Image-Distilled and Z-Image-SDNQ-uint4-svd-r32 offer compressed or faster generation options. Fal Qwen-Image-Edit provides precise camera angle control. SDDj integrates generation into Aseprite. Bytecut Director organizes production workflows, and AI Video Clipper LoRA helps prepare training data.

Audio and Speech Synthesis

Music generation sees a huge upgrade with ACE-Step 1.5, which generates full songs locally in seconds. MOSS-TTS Family and MioTTS Inference offer high-fidelity speech options. PrismAudio generates audio from video using planning, and Speech Swift brings speech tools to Apple Silicon. Voice Clone Studio and WAVe-1B-Multimodal-NL provide comprehensive audio editing and quality checking tools.

Developer Utilities and Datasets

Hardware and infrastructure tools are essential this month. UniInfer checks model compatibility before download, while Strix Halo AI Stack turns AMD machines into AI servers. Lemonade offers a unified local server, and Lora Pilot bundles training tools into Docker. AI Toolkit now supports LTX 2.3 training. Kreuzberg v4.5.0 extracts text from documents, and GLM-OCR reads complex files. For datasets, CaptionFoundry and ImageTagger assist with annotation, alongside SyntheticGen for remote sensing data. Google Code Archive and The Michael Hafftka Catalog preserve code and art history, while WorldVQA tests visual knowledge. SD Webui Style Organizer and OmniPromptStyle CheatSheet help users manage prompts and styles. The llama.cpp MCP Client was also updated for tool use.

Close up of a slate colored puzzle piece floating in a gradient space

NVIDIA Unlocks Speed with New gpt-oss-puzzle-88B Model

31 March 2026

NVIDIA has released gpt-oss-puzzle-88B, a large language model built for efficient deployment on H100-class hardware. The model uses a Mixture-of-Experts architecture with 88 billion parameters and is designed to handle […]

Closeup of a translucent hand flipping over a stack of translucent paper.

SDDj supercharges Aseprite with offline AI animation

30 March 2026

SDDj is a local image generation and animation extension for Aseprite that combines Stable Diffusion with AnimateDiff. It runs entirely on your computer, generating images and animations directly within the […]

Translucent tools floating and falling in a digital space

Vavo Debuts LoRA Pilot for Hassle-Free AI Model Training

30 March 2026

Lora Pilot is an all-in-one Docker workspace that bundles Stable Diffusion LoRA training tools into a single container. It combines dataset preparation, model management, training, and inference workflows so users […]

Flaming text that reads Uncensored HauhauCS Aggressive on a black matte piece of paper

Qwen3.5 122B A10B Uncensored HauhauCS Aggressive Defies Limits

30 March 2026

Qwen3.5-122B-A10B-Uncensored-HauhauCS-Aggressive is a large language model modified to remove all refusal responses while keeping its original capabilities intact. The release achieves zero refusals across 465 tested prompts without any degradation […]

Digital green cat made up of numbers and light in a stylized digital outline

Meituan Debuts LongCat-Flash-Prover for Formal Math Proofs

30 March 2026

LongCat-Flash-Prover is a 560-billion-parameter open-source model designed to handle formal mathematical reasoning. It uses a Mixture-of-Experts architecture to perform tasks like writing formal proofs, translating informal math problems into formal […]

A blue digital floating sub woofer speaker with dark blue wavy particle background

PrismAudio Transforms Video into Realistic Soundtracks

30 March 2026

PrismAudio is a new framework that generates audio from video using reinforcement learning with Chain-of-Thought (CoT) planning. Developed by the FunAudioLLM team, it breaks down the complex task of video-to-audio […]

Digital translucent floating green pair of lips

Yuriyvnv Refines Dutch Speech Data With WAVe Update

26 March 2026

WAVe-1B-Multimodal-NL is a 1 billion parameter model that checks the quality of synthetic speech at the word level. It examines how well spoken audio matches its written transcript, catching errors […]

An old 1990s computer with large text on the screen that reads Chuck Norris LLM

Chuck Norris LLM Flexes Reasoning Muscles

26 March 2026

Chuck Norris LLM is a 32-billion parameter language model fine-tuned from Qwen3 with chain-of-thought reasoning capabilities. The model tackles math, logic, and coding tasks while showing its work step-by-step, making […]

A close up of a woman speaking in to a microphone in digital mesh decoration

Speech Swift Delivers Voice AI for Apple Silicon

26 March 2026

Speech Swift is a comprehensive AI speech toolkit designed specifically for Apple Silicon devices. It allows users to run powerful speech models locally, including tools for speech recognition, text-to-speech synthesis, […]

Close up shot of a digital video editing interface

SAMA-14B Masters Video Editing While Preserving Motion

26 March 2026

SAMA-14B is a new open-source AI model designed for instruction-guided video editing. It allows users to modify videos using text instructions while keeping the original motion and temporal details intact. […]

A group of mannequins where one is green and waving to the camera

Seamless model browsing with ComfyUI-advanced-model-manager

26 March 2026

ComfyUI-advanced-model-manager is a custom node that brings model browsing and downloading directly into ComfyUI. Users can search across hundreds of HuggingFace repositories, download files to the correct folders, and manage […]

Large slate colored metalic tag with the words ImageTagger engraved.

ImageTagger Debuts to Clean Machine Learning Datasets

26 March 2026

ImageTagger is a desktop annotation tool designed for managing image and text pairs, specifically built for machine learning dataset curation workflows. The application provides a streamlined interface for teams and […]

A blue translucent video camera

NVIDIA SANA-Video Accelerates 2K AI Video Creation

26 March 2026

SANA-Video is a new diffusion model designed to create high-quality videos from text prompts. It can generate content up to 2K resolution with minute-long duration while maintaining strong alignment between […]

Graphical speech bubbles over rolling bokeh hills

OpenMOSS MOSS-TTS Speech Studio for home GPUs

22 March 2026

MOSS-TTS Family is an open-source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high-fidelity audio generation across complex real-world scenarios, including long-form […]

A dark blue deck of cards representing the ComfyUI-WildPromptor ComfyUI node

1038lab Unveils ComfyUI-WildPromptor for Easy Prompts

21 March 2026

ComfyUI-WildPromptor is a custom node extension for ComfyUI that streamlines prompt creation and management through a visual dropdown interface. Instead of memorizing wildcard names, users can browse and select keywords […]

Text on drawing paper with a pencil reads ComfyUI-Comfysketch

Draw Inside ComfyUI with ComfyUI-Comfysketch Node

21 March 2026

ComfyUI-Comfysketch is a new custom node that integrates a comprehensive drawing studio directly into the ComfyUI interface. It allows users to create and edit sketches with layers and multiple brush […]

A close up of an artists pallet with the text engraved Comfy_HunyuanImage3

EricRollei Brings To Comfy_HunyuanImage3 Hunyuan Image 3.0

21 March 2026

Comfy_HunyuanImage3 is a set of ComfyUI custom nodes for a collection of quantized versions of the HunyuanImage-3.0 image model. The integration provides professional tools for text-to-image generation, image editing, and […]

Italic orange text for Arcflow on a rolling digital hill

ArcFlow Generates AI Images in Just Two Steps

20 March 2026

ArcFlow is a new framework that generates images from text prompts in just two processing steps. It achieves this by using curved mathematical paths instead of straight shortcuts, which better […]

Omni-video 2 text design with circuit decoration

Fudan-FUXI Unveils Omni-Video 2 AI Tool

20 March 2026

Omni-Video 2 is a unified video editing and generation framework that combines a text-to-video diffusion model with vision-language understanding. The system can generate videos from text descriptions and edit existing […]

Green minimal audio icon wallpaper

FranckyB Updates Voice Clone Studio App

18 March 2026

Voice Clone Studio is a modular Gradio-based web application that handles voice cloning, voice design, multi-speaker conversations, voice conversion, and sound effects generation. The tool consolidates multiple AI audio engines […]

Colorful muted cards ranging from small to large representing upscaling.

New ComfyUI-ZImageTurboProgressiveLockedUpscale

18 March 2026

ComfyUI-ZImageTurboProgressiveLockedUpscale is a new custom node for ComfyUI that handles progressive image upscaling through multiple stages. The node takes a different approach than traditional methods by using sigma slicing and […]

Text logo for ComfyUI-meancache-z on a-digital translucent node panel

Speed Up Z-Image with ComfyUI-meancache-z by Facok

17 March 2026

ComfyUI-meancache-z is a new custom node that accelerates inference for Z-Image Flow Matching models without requiring any model fine-tuning. The tool, similar to the Z-Image loras, achieves speedups between 1.4x […]

MRS Core embossed on a core engine sphere

MRS-core A Reasoning Engine for AI Agents

17 March 2026

MRS-Core is a deterministic reasoning engine built for large language models and autonomous agents. It provides a modular foundation constructed from a small set of reusable operators that execute in […]

A large translucent floating brain graphic

MoonshotAI WorldVQA Tests AI Memory

17 March 2026

WorldVQA is a new benchmark designed to test how well AI models can identify and name visual objects from memory. Created by MoonshotAI, it measures factual visual knowledge rather than […]

Graphical purple interconnecting nodes

ImagineerNL Releases ComfyUI-IMGNR-Utils Nodes

17 March 2026

ComfyUI-IMGNR-Utils is a quality-of-life node pack that streamlines workflows in ComfyUI. It reduces unnecessary clicking and helps keep your workspace organized by addressing common annoyances users face when building AI […]

A screenshot of ComfyUI_cameraangleselector by NickPittas

NickPittas 3D ComfyUI_CameraAngleSelector Node

17 March 2026

ComfyUI_CameraAngleSelector is a custom node for ComfyUI that provides an interactive 3D interface for selecting camera angles. It allows users to visually choose from 96 different camera angle combinations rather […]

A large group of translucent cameras capturing multiple angles

New 96 Angles Qwen-Image-Edit-2511-Multiple-Angles-LoRA

17 March 2026

Fal has released Qwen-Image-Edit-2511-Multiple-Angles-LoRA, a new tool designed to give users precise control over camera angles during the image editing process. This Low-Rank Adaptation (LoRA) allows for the selection of […]

Sleek audio icon design hovering over ripples

ACE-Step 1.5 ComfyUI Generates Songs Locally

17 March 2026

ACE-Step 1.5 ComfyUI brings commercial-grade music generation to local machines. This open-source audio model now runs natively in ComfyUI and can create full songs in under 10 seconds using standard […]

A skeleton in mocap gear holding a webcam

ComfyUI-Yedp-Mocap mocap that Saves VRAM

16 March 2026

ComfyUI-Yedp-Mocap is a new custom node suite for ComfyUI that performs motion capture directly in your web browser. It handles the detection of poses, hands, and faces by utilizing the […]

shapes of chevron speed lines

ComfyUI-CacheDiT Speed Boosts DiT models

16 March 2026

ComfyUI-CacheDiT is a new custom node that accelerates Diffusion Transformer (DiT) models in ComfyUI. It delivers 1.4 up to 2 times faster generation speeds through intelligent caching, with no manual […]

FreeFuse embossed text graphic

FreeFuse LoRA framework for AI Art

15 March 2026

FreeFuse is a new framework that allows users to combine multiple specific subjects into a single AI-generated image without retraining models. It uses a method called Adaptive Token-Level Routing to […]

Alibaba-Pai orange logo on slate tech background graphic

Alibaba-pai Z-Image-Fun-Lora-Distill for Fast Images

15 March 2026

Z-Image-Fun-Lora-Distill is a new LoRA adapter that speeds up image generation for the Z-Image model. It reduces the number of inference steps required while also handling CFG internally, making the […]

Ace-Step 1.5 digital art with audio waves behind the logo

ACE-Step Pumps It Up With Ace-Step 1.5

15 March 2026

ACE-Step 1.5 is a new open-source music generation model that brings commercial-grade audio creation to consumer hardware. It generates full songs in under 10 seconds on an RTX 3090 while […]

Graphical depiction of ai generated garbled javascript.

Nyuuzyou Preserves Google Code Archive

15 March 2026

The Google Code Archive is a massive dataset that preserves source code from the defunct Google Code hosting service. It contains over 65 million files gathered from nearly 500,000 repositories, […]

An embossed logo of the zai team

Z.ai Team Gets Efficient with GLM-OCR

15 March 2026

GLM-OCR is a new open-source model designed to read and understand complex documents. It uses a compact architecture to pull text, formulas, and tables from images and PDFs. The tool […]

PlaiTO LLM brain node design wallpaper graphic

Alibidaran debuts PlaiTO for reasoning

15 March 2026

PlaiTO is a reasoning-focused language model built on LLaMA 3.1 (8B) that emphasizes structured thinking over basic text generation. The model targets humanities and social sciences, specifically handling abstract concepts […]

Green hill in a angled grid design that reads Qwen3-Coder-Next

Unsloth Provides GGUFs for Qwen3-Coder-Next

15 March 2026

Qwen3-Coder-Next is an open-weight language model built specifically for coding agents and local development workflows. The model uses a mixture-of-experts (MoE) architecture with 80 billion total parameters, but cleverly only […]

Sheet metal design of Z-Image-SDNQ-uint4-svd-r32

Quantization for Z-Image-SDNQ-uint4-svd-r32

12 March 2026

Z-Image-SDNQ-uint4-svd-r32 is a compressed version of the Tongyi-MAI/Z-Image text-to-image model that uses 4-bit quantization to significantly reduce file size. The model generates images from text prompts while maintaining most of […]

A digital pixelated lemon on a pixelated grass surface

Lemonade-sdk adds image support to lemonade

12 March 2026

Lemonade is an open-source local AI server that lets users run LLMs, and speech tools directly on their own hardware. The project provides a unified API that combines text, audio […]

Snapshot of the ComfyUI-Qwen3-ASR custom node for Qwen3-ASR and ComfyUI

DarioFT Releases ComfyUI-Qwen3-ASR for Qwen3-ASR

2 March 2026

ComfyUI-Qwen3-ASR is a new custom node pack that brings automatic speech recognition to ComfyUI. It transcribes audio files into text across 52 different languages and dialects, making it a useful […]

Screenshot of martin-rizzo's comfyui nodes ComfyUI-ZImagePowerNodes

More Z-Image nodes ComfyUI-ZImagePowerNodes

2 March 2026

ComfyUI-ZImagePowerNodes is a new collection of custom nodes designed specifically for the Z-Image Turbo model in ComfyUI. The package centers around the ZSampler Turbo, a specialized sampler that produces high-quality […]

Screenshot of the ComfyUI-Wan-TimeToMove custom nodes

GiusTex Unveils ComfyUI-Wan-TimeToMove Node

2 March 2026

ComfyUI-Wan-TimeToMove is a new custom node package that brings Time-to-Move motion control to ComfyUI. It allows users to guide video generation with specific motion signals, giving creators control over how […]

Screenshot of whatsthisaithing's free captioning tool CaptionFoundry

CaptionFoundry Free Captioning Tool

1 March 2026

CaptionFoundry is a free desktop application that helps users prepare image datasets for AI model training. It uses local vision AI models to automatically generate captions for images, eliminating the […]