Kimi K2.5
by Moonshot · chat
Kimi K2.5 is a 256K-context multimodal model supporting text/image/video input, reasoning mode, tool calling, JSON and structured output, and automatic context caching.
Pricing
Input: $0.57/M tokens · Output: $2.375/M tokens
Capabilities
Vision, Function Calling, JSON Mode, Streaming, Reasoning
Context: 262K tokens
Max output: 66K tokens
Routes: 4/4 healthy
Performance
TTFT: 2607ms · Latency: 7916ms · Throughput: 18.6 tok/s