Sciforium

Kimi K2.5

Kimi K2.5 is a native multimodal mixture-of-experts model with 1 trillion total parameters and 32 billion active parameters, built through continual pretraining on approximately 15 trillion mixed visual and text tokens atop Kimi-K2-Base. It delivers state-of-the-art performance in visual coding, agentic tool use, and multi-agent orchestration — supporting both instant and thinking modes, agent swarm coordination of up to 100 sub-agents, and seamless integration of vision and language across conversational and agentic workflows.

Features

Serverless API

Kimi K 2.5 is available via sciforium' serverless API, where you pay per token. There are several ways to call the sciforium API, including sciforium' Python client, the REST API, or OpenAI's Python client.

Docs

Agentic Capabilities

Built for complex multi-step tasks with native agentic architecture. Excels at research synthesis, code generation, and iterative problem solving.

Docs

Metadata

State

Ready

Type

LLM

Creator

Moonshot AI

Hugging Face

Kimi-K2.5

Specification

Model Weights

BF16

Activation

BF16

KV Cache

BF16

Supported Functionality

Fine-tuning

Contact Sales

Serverless

Supported

Context Length

262K

Embeddings

Input Modality

Text/Image

Output Modality

Text

MiniMax M2.5

Kimi K2.5

GLM 5

DeepSeek V3.2

gpt-oss-120b

gpt-oss-20b

Qwen3 Instruct

Qwen3 Thinking

Qwen3 Coder

Qwen3.5

Qwen3 VL Instruct

Qwen3 ASR

Qwen-Image

Qwen-Image-Edit

Flux2

Stable Diffusion 3.5

Hunyuan Image

Z-Image

Wan2.2-I2V

Wan2.2-T2V

Hunyuan Image

Z-Image