TurboOCR operates as a dedicated recognition server that converts scanned pages and screenshots into digital text at high speed. The software handles printed material and handwritten notes using specialized graphics […]
News
RvR introduces a new approach to fixing generated images by completely redrawing them instead of attempting minor edits. Researchers at Tsinghua University and Tencent Hunyuan developed this system to solve […]
OmniVTG-7B is an open-source model that pinpoints exact video segments using simple text prompts. Rather than tagging entire clips, it scans long footage and marks precise start and end times […]
Z-Anime delivers a fully trained anime image generation model that operates independently from lightweight add-on patches. The system produces detailed illustrations using everyday descriptive sentences instead of strict keyword lists. […]
Danbooru-Dataset-Filter provides a fast graphical interface for sorting and organizing large image collections used in machine learning projects. The software processes millions of files in seconds, allowing users to build […]
Apple researchers released Ml-videoflextok, an open source video tool that converts footage into flexible sequences instead of fixed grids. This approach stores broad motion first, then adds sharper visual details […]
Generative Refinement Networks, or GRN, offers a new method for creating digital images and video. The approach replaces standard diffusion techniques with progressive visual refinement and adaptive computing. Built by […]
DisCa is a new acceleration framework designed to speed up AI video generation models. It reduces processing time by 11.8 times without sacrificing visual clarity. Tencent researchers who also made […]
ControlFoley transforms video clips into synchronized soundtracks by combining visual scenes, written descriptions, and existing audio samples into a single generation system. This new framework produces matching sound effects and […]