Sciforium

Hunyuan Image

Hunyuan Image 3.0 is a large-scale multimodal autoregressive image generation model from Tencent with 80 billion total parameters across 64 mixture-of-experts experts, trained on 5 billion image-text pairs. Unlike diffusion-based pipelines, it models text and image tokens in a unified framework — enabling superior world-knowledge reasoning, prompt adherence on complex thousand-word descriptions, and cinematic-quality output that prioritizes aesthetic polish and art-directed compositions.

Features

Serverless API

Hunyuan Image 3.0 is available via sciforium' serverless API. Use the REST API or OpenAI-compatible client libraries to generate images.

Docs

Instruction-Following

Industry-leading instruction adherence powered by an integrated MLLM, producing images that accurately match complex multi-part text descriptions.

Docs

Metadata

State

Ready

Type

Image

Creator

Tencent

Hugging Face

HunyuanImage-3.0

Specification

Model Weights

BF16

Activation

BF16

KV Cache

Supported Functionality

Fine-tuning

Contact Sales

Serverless

Supported

Context Length

N/A

Embeddings

Input Modality

Text

Output Modality

Image

MiniMax M2.5

Kimi K2.5

GLM 5

DeepSeek V3.2

gpt-oss-120b

gpt-oss-20b

Qwen3 Instruct

Qwen3 Thinking

Qwen3 Coder

Qwen3.5

Qwen3 VL Instruct

Qwen3 ASR

Qwen-Image

Qwen-Image-Edit

Flux2

Stable Diffusion 3.5

Hunyuan Image

Z-Image

Wan2.2-I2V

Wan2.2-T2V

Hunyuan Image

Z-Image