Combining specialized AI adapters into one system now works best when harmful layers are removed first. The ENMP-LoRAMerging project scans multiple adaptation files, identifies components that hurt overall accuracy, and […]
News
Yovecent has released UDM-GRPO, an open-source framework that combines uniform discrete diffusion with reinforcement learning for text-to-image generation. The system stabilizes training and improves output quality by treating the fully […]
Tencent researchers recently published MegaStyle, a system designed to automate the creation of visual style libraries. The pipeline translates text descriptions into images that share matching artistic qualities while keeping […]
Tstars-VTON is an open evaluation dataset designed to test virtual try-on models under realistic shopping conditions. It contains 1,780 image pairs covering layered clothing, footwear, and accessories across dozens of […]
SmartPhotoCrafter is an open-source framework that edits photographs without requiring manual prompts. The system automatically spots visual flaws, plans specific improvements, and applies corrections in a single continuous workflow. Researchers […]
AnyRecon turns scattered photographs into complete three-dimensional scenes using a video-based artificial intelligence system. The framework processes inputs in any order without needing precise spacing between camera angles. OpenImagingLab built […]
TS-Attn introduces a new attention method that improves how AI models handle videos with multiple sequential actions. The system rearranges how the model focuses on time-based data, allowing complex prompts […]
Patch-forcing changes how artificial intelligence generates images by applying different noise removal speeds to separate sections of a picture. Easier areas process quickly while complex sections receive additional refinement, making […]
DynamicRad accelerates long video generation by applying smart sparse attention to existing AI diffusion models. This open framework cuts processing time substantially while keeping visual quality consistent across full-length clips. […]