SAMA-14B is a new open-source AI model designed for instruction-guided video editing. It allows users to modify videos using text instructions while keeping the original motion and temporal details intact. […]
News
A new ComfyUI custom node called FLUX.2 Klein LoRA Loader brings architecture-aware loading to the FLUX.2 Klein 9B model. The tool automatically converts diffusers-format LoRAs to native FLUX format while […]
ComfyUI-advanced-model-manager is a custom node that brings model browsing and downloading directly into ComfyUI. Users can search across hundreds of HuggingFace repositories, download files to the correct folders, and manage […]
ImageTagger is a desktop annotation tool designed for managing image and text pairs, specifically built for machine learning dataset curation workflows. The application provides a streamlined interface for teams and […]
The Michael Hafftka Catalog RaisonnĂ© is a new open dataset containing approximately 3,800 artworks by a single artist spanning five decades. The collection covers work from the 1970s through 2025 […]
SANA-Video is a new diffusion model designed to create high-quality videos from text prompts. It can generate content up to 2K resolution with minute-long duration while maintaining strong alignment between […]
Nanbeige4.1-3B is a compact 3-billion parameter language model designed to handle reasoning, code generation, and agentic tasks in one package. The model performs multi-step problem solving while maintaining alignment with […]
MOSS-TTS Family is an open-source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high-fidelity audio generation across complex real-world scenarios, including long-form […]
MioTTS Inference is a text-to-speech system that uses large language models to generate natural-sounding speech. The project offers multiple model sizes ranging from 0.1B to 2.6B parameters, allowing users to […]