MusiCue is an open-source tool that converts songs into typed, timeline-based cues for driving animation and show-control software. Developer cedarconnor built it to break audio into separated stems, beats, drum […]
News
The open-source project comfyui-node-canvas introduces a local GUI app called ComfyUI Node Builder that helps people create custom nodes for ComfyUI without manually writing all the repetitive code. It provides […]
OmniNFT is a set of LoRA adapters that fine-tune the open-source LTX Video model to produce better-aligned audio and video. Using reinforcement learning, it guides the generation process so that […]
DramaBox is a text-to-speech system that turns scene descriptions and dialogue into expressive speech, complete with laughs, sighs, and pauses. It can clone a speaker’s timbre from just a 10-second […]
ComfyUI-PlagueKind-Nodes is a custom node for ComfyUI that unifies image and mask resizing in a single step. It offers multiple scaling modes, preserves aspect ratios, and ensures masks stay perfectly […]
ComfyUI-DramaBox is a new custom node pack that brings ResembleAI’s expressive text-to-speech system directly into ComfyUI workflows. It turns text prompts into spoken audio using the LTX-2.3 audio diffusion model, […]
Anima-TrainFlow is a simple, single-page desktop tool for training LoRA adapters on the Anima 2B image generation model. It puts every setting you need right in front of you, skipping […]
A new quantized release of DeepSeek V4 Flash, called Deepseek-V4-GGUF, shrinks the massive AI model so it can run on high-end consumer hardware. It’s a set of GGUF-format files […]
Emo is a new mixture-of-experts language model designed so groups of experts naturally specialize in specific topics during training, rather than requiring human labeling. The main release from the Allen […]