Diff-forge is a new open-source tool that automates the tedious work of preparing video datasets for diffusion model fine-tuning. It runs entirely on your own machine, providing a visual browser-based […]
News
Caption-Creator is a fast, portable GUI tool that runs entirely on your local machine to generate high‑quality image captions and tags. It helps users build custom datasets for AI model […]
Tritant has released Ace-Step-1.5-Api-server-UI, a visual interface that turns the ACE-Step 1.5 music generation model into a full-featured local studio. The tool wraps the model’s API server in a single […]
Walkyrie-1.3B-v1.0 is a new text-to-image model that turns written prompts into 1024×1024 pixel images. It was rebuilt from an existing video-generation model after its language-understanding component was trimmed down to […]
UltraReal_FineTune_Anima is an experimental full model fine-tune of the Anima_preview1 image generator, aimed at delivering more realistic photo-style outputs. It produces strikingly varied visuals, from analog film grain to clean […]
NVIDIA has released Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4, an open multimodal AI model that simultaneously processes video, audio, images, and text. The 31-billion-parameter system uses a hybrid Mamba2-Transformer design that activates only about 3 […]
LTX2.3-10Eros is a new image-to-video model merge that turns a single still image into a short motion clip. Unlike standard weight blending, it combines layers from different training steps to […]
The new release Hy-MT1.5-1.8B-1.25bit is a heavily compressed translation model designed to run entirely on your phone, with no internet connection needed. It shrinks a powerful 1.8-billion-parameter system down […]
IBM has released Granite-4.1-30b, a 30‑billion‑parameter instruct model that brings upgraded tool calling and long‑context abilities to the open‑source community. It can summarize text, answer questions, write code, and […]