Hunyuan Image

Hunyuan Image 3.0 is a large-scale multimodal autoregressive image generation model from Tencent with 80 billion total parameters across 64 mixture-of-experts experts, trained on 5 billion image-text pairs. Unlike diffusion-based pipelines, it models text and image tokens in a unified framework — enabling superior world-knowledge reasoning, prompt adherence on complex thousand-word descriptions, and cinematic-quality output that prioritizes aesthetic polish and art-directed compositions.

Features

Serverless API

Hunyuan Image 3.0 is available via sciforium' serverless API. Use the REST API or OpenAI-compatible client libraries to generate images.

Docs

Instruction-Following

Industry-leading instruction adherence powered by an integrated MLLM, producing images that accurately match complex multi-part text descriptions.

Docs
MiniMax  M2.5
Kimi K2.5
GLM 5
DeepSeek V3.2
gpt-oss-120b
gpt-oss-20b
Qwen3 Instruct
Qwen3 Thinking
Qwen3 Coder
Qwen3.5
Qwen3 VL Instruct
Qwen3 ASR
Qwen-Image
Qwen-Image-Edit
Flux2
Stable Diffusion 3.5
Hunyuan Image
Z-Image
Wan2.2-I2V
Wan2.2-T2V
Hunyuan Image
Z-Image