Matrix-Game-3.0 is an open-source interactive world model that generates real-time video at 720p resolution and 40 frames per second. It uses a memory-augmented architecture to maintain consistency over long video […]
News
Qwen-3.5-Abliterated-Comfyui-nvfp4 is a collection of quantized language models designed to function as AI assistants directly within ComfyUI. Developer Winnougan created these models to enable multimodal tasks like image analysis and […]
Z-Image-SAM-ControlNet is a new control model designed to transform segmented images into photorealistic pictures. It functions as a ControlNet for the Tongyi-MAI/Z-Image base model, allowing users to guide image generation […]
LongCat-AudioDiT is a new text-to-speech model that generates high-fidelity audio from text inputs. It operates directly in the waveform latent space rather than relying on intermediate acoustic representations like […]
ComfyUI-FBnodes is a collection of custom nodes for ComfyUI that streamlines video workflows and adds utility functions for AI content generation. The extension provides tools for video encoding with codec […]
World Model Bench is a new benchmark that tests whether AI world models can actually think about a scene rather than just generate smooth video. It measures cognitive intelligence through […]
PixelSmile is a new diffusion LoRA framework designed for fine-grained facial expression editing. It allows users to modify specific facial expressions in images with precise control over intensity levels, addressing […]
ComfyUI-YOLOE26 is a custom node pack that segments objects in images using text prompts. Users can type simple descriptions like "person," "car," or "red apple" to isolate objects without needing […]
Nemotron3-Nano-4B-Uncensored-HauhauCS-Aggressive is an uncensored version of NVIDIA's Nemotron-3 Nano 4B model. It removes built-in refusals and censorship mechanisms while preserving the original model's capabilities and personality. Developed by HauhauCS, who […]