Hunyuan Image
Hunyuan Image 3.0 is a large-scale multimodal autoregressive image generation model from Tencent. It has 80 billion total parameters spread across 64 experts in a mixture-of-experts (MoE) architecture and was trained on 5 billion image-text pairs. Unlike diffusion-based pipelines, it models text and image tokens in a unified framework, which supports world-knowledge reasoning, adherence to complex prompts running to a thousand words, and cinematic-quality output with polished, art-directed compositions.
Features
Serverless API
Hunyuan Image 3.0 is available via sciforium's serverless API. Use the REST API or OpenAI-compatible client libraries to generate images.
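As a rough illustration of what a REST call might look like, the sketch below builds a request in the shape of the OpenAI Images API (`POST /images/generations` with `model`, `prompt`, `n`, and `size` fields), since the API is described as OpenAI-compatible. The base URL, model identifier, and default image size are placeholders assumed for this example, not values taken from the provider's documentation; check the official docs for the real ones.

```python
import json
import urllib.request

# Hypothetical placeholders: replace with values from the provider's docs.
BASE_URL = "https://api.example.com/v1"  # assumed, not the real endpoint
MODEL_ID = "hunyuan-image-3.0"           # assumed model identifier


def build_image_request(prompt, api_key, size="1024x1024", n=1):
    """Build (without sending) a POST request in the OpenAI Images API shape."""
    payload = {"model": MODEL_ID, "prompt": prompt, "n": n, "size": size}
    return urllib.request.Request(
        url=f"{BASE_URL}/images/generations",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def generate_image(prompt, api_key):
    """Send the request and return the decoded JSON response body."""
    request = build_image_request(prompt, api_key)
    with urllib.request.urlopen(request, timeout=120) as response:
        return json.load(response)
```

Calling `generate_image("A lighthouse at dusk, 35mm film look", api_key)` with a valid key would send the request. Keeping request construction separate from sending makes the payload easy to inspect before any network call is made.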
Instruction-Following
Industry-leading instruction adherence powered by an integrated multimodal large language model (MLLM), producing images that accurately match complex, multi-part text descriptions.