Grok 4.1 Fast Non-Reasoning

by xAI · chat

Low-latency, non-reasoning variant of Grok 4.1 Fast with 2M context window. Delivers fast responses without extended thinking while maintaining frontier-level tool-calling and agentic capabilities.

Pricing

Input: $0.19/M tokens · Output: $0.475/M tokens

Capabilities

Vision, Function Calling, JSON Mode, Streaming

Context: 2000K tokens

Max output: 30K tokens

Routes: 1/1 healthy

Performance

TTFT: 2333ms · Latency: 3133ms · Throughput: 36.6 tok/s