Tencent Unveils HunyuanImage 3.0 Instruct AI

Image created by the HunyuanImage 3.0 Instruct model

Tencent has released HunyuanImage 3.0 Instruct, a native multimodal model for text-to-image and image-to-image generation. The model launched on January 26, 2026.

Key Architectural Features

HunyuanImage 3.0 Instruct uses a unified autoregressive framework rather than a traditional diffusion-based architecture. Its key features include:

  • Largest open-source image generation Mixture of Experts (MoE) model
  • 64 experts with 80 billion total parameters
  • 13 billion parameters activated per token
  • A stated focus on semantic accuracy and visual quality
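
To illustrate what "13 billion parameters activated per token" means in a Mixture of Experts model, here is a minimal sketch of top-k expert routing in plain Python. It is not Tencent's implementation. The number of experts (64) comes from the figures above; the top-k value, the toy experts, and the function names `route` and `moe_forward` are assumptions made for illustration only.

```python
import math
import random

NUM_EXPERTS = 64  # from the article
TOP_K = 8         # assumption: the article does not say how many experts fire per token

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def route(router_logits, k=TOP_K):
    """Pick the k highest-scoring experts and renormalize their gate weights."""
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return {i: probs[i] / norm for i in top}

def moe_forward(x, experts, router_logits, k=TOP_K):
    """Run only the selected experts and mix their outputs by gate weight."""
    gates = route(router_logits, k)
    return sum(w * experts[i](x) for i, w in gates.items())

# Toy experts (hypothetical): expert i simply scales its input by (i + 1).
experts = [lambda x, s=i + 1: s * x for i in range(NUM_EXPERTS)]

rng = random.Random(0)
logits = [rng.gauss(0, 1) for _ in range(NUM_EXPERTS)]  # stand-in router scores
gates = route(logits)
y = moe_forward(2.0, experts, logits)

# Share of the model's weights used per token, from the article's figures.
active_fraction = 13 / 80
print(f"{len(gates)} of {NUM_EXPERTS} experts used; "
      f"{active_fraction:.2%} of parameters active per token")
```

The point of the design is that total capacity (80 billion parameters) and per-token compute (13 billion parameters, roughly 16% of the total) are decoupled: the router selects a small subset of experts for each token, and the rest stay idle.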

Advanced Training and Optimization Techniques

The model is trained in multiple stages, followed by post-training optimization and reinforcement learning. The pipeline includes:

  • Instruction tuning specifically for text-to-image generation
  • Supervised Fine-Tuning (SFT)
  • Direct Preference Optimization (DPO)
  • MixGRPO for aesthetic refinement
  • Novel Reward Distribution Alignment (ReDA)
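
Of the techniques listed above, Direct Preference Optimization has a compact, well-known loss function, sketched below in plain Python. This follows the standard DPO formulation for a single preference pair; the article does not describe how Tencent adapts it to image generation, so the function name `dpo_loss`, the `beta` value, and the log-probabilities in the example are illustrative assumptions.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    loss = -log(sigmoid(beta * margin)), where margin measures how much more
    the policy prefers the chosen sample over the rejected one, relative to
    a frozen reference model.
    """
    margin = ((policy_chosen_logp - ref_chosen_logp)
              - (policy_rejected_logp - ref_rejected_logp))
    z = beta * margin
    # -log(sigmoid(z)) == log(1 + exp(-z)), written in a numerically stable form.
    return max(-z, 0.0) + math.log1p(math.exp(-abs(z)))

# Made-up log-probabilities, for illustration only.
neutral = dpo_loss(-10.0, -10.0, -10.0, -10.0)   # policy matches the reference
improved = dpo_loss(-8.0, -12.0, -10.0, -10.0)   # policy now favors the chosen sample
print(neutral, improved)
```

When the policy matches the reference, the loss equals log 2. It falls as the policy assigns relatively more probability to the preferred output, which is how preference data steers the model without training a separate reward model.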

Performance and Capabilities

The model supports the following features:

  • Text-to-Image generation
  • Image-to-Image editing
  • Prompt self-rewriting
  • Chain-of-Thought (CoT) reasoning
  • Multi-image fusion
  • High-resolution image generation

Learn More About HunyuanImage 3.0 Instruct

Researchers and developers can access the model through:

Hugging Face: HunyuanImage 3.0 Instruct model
GitHub: HunyuanImage 3.0 repository
Project Paper: HunyuanImage 3.0 technical report