Qwen3.5

Qwen3.5 is a multimodal mixture-of-experts model with 397 billion total parameters and 17 billion active parameters, built on a hybrid Gated Delta Network architecture that enables a 1 million token context window with near-linear compute scaling. The flagship of the Qwen3.5 family, it delivers state-of-the-art performance across knowledge, reasoning, coding, vision-language understanding, and agentic tasks — with native multimodal support and reasoning mode built in.

Features

Serverless API

Qwen3.5 is available via sciforium' serverless API, where you pay per token. There are several ways to call the sciforium API, including sciforium' Python client, the REST API, or OpenAI's Python client.

Docs

Agentic Capabilities

Qwen3.5 delivers state-of-the-art performance across knowledge, reasoning, coding, vision-language understanding, and agentic tasks with native multimodal support and reasoning mode built in.

Docs
MiniMax  M2.5
Kimi K2.5
GLM 5
DeepSeek V3.2
gpt-oss-120b
gpt-oss-20b
Qwen3 Instruct
Qwen3 Thinking
Qwen3 Coder
Qwen3.5
Qwen3 VL Instruct
Qwen3 ASR
Qwen-Image
Qwen-Image-Edit
Flux2
Stable Diffusion 3.5
Hunyuan Image
Z-Image
Wan2.2-I2V
Wan2.2-T2V
Hunyuan Image
Z-Image