RvR By LeapLabTHU Reimagines Image Fixes Through Total Redraws

RvR introduces a new approach to fixing generated images by completely redrawing them instead of attempting minor edits. Researchers at Tsinghua University and Tencent Hunyuan developed this system to solve common alignment issues in text-to-image generation.
Model Size: 29.2GB & VRAM GPU: requirements vary
How regeneration improves local image editing
- Shifts from pixel-by-pixel patching to full conditional regeneration.
- Uses semantic tokens from the original image to maintain context.
- Expands the modification space for deeper visual corrections.
- Improves alignment scores across multiple standard benchmarks.
- Includes a local web interface for direct interaction.
- Provides ready-to-use training scripts for custom datasets.
Content creators managing large batch workflows may find this approach useful when standard editing tools leave visible artifacts behind privacy restrictions. Researchers can also test the framework locally to evaluate how semantic preservation affects final output quality without relying on cloud services.
Technical considerations from the research team
The development process focused on removing strict pixel preservation rules that often trapped older models in low-quality loops. Testing on public datasets showed measurable gains across three major evaluation suites.
"Instead of relying on intermediate editing instructions and enforcing pixel-level consistency, our method directly regenerates images conditioned on the target prompt and semantic representations of the initial image, thereby enlarging the effective modification space,"
noted the team in a GitHub repository. Download the official checkpoint from Hugging Face, review the complete methodology in the arXiv paper, or access the full source code on GitHub to begin local testing.