xAI Releases grok-voice-think-fast-1.0 — 67.3% on τ-voice Bench

xAI on April 25, 2026 released grok-voice-think-fast-1.0, a new flagship voice model that scored 67.3% on the τ-voice benchmark. That puts it 23.5 points ahead of Google's Gemini 3.1 Flash Live (43.8%), 29 points ahead of xAI's own previous Grok Voice Fast 1.0 (38.3%) and 32 points ahead of OpenAI's GPT Realtime 1.5 (35.3%). The model is already in production powering Starlink's phone sales and customer support, where xAI says it autonomously resolves 70% of inquiries with no human in the loop.

What Happened

xAI announced the model in a blog post on April 25, positioning it as the company's first voice model designed for "complex, ambiguous, multi-step workflows" rather than simple chatbots. The headline claim is a top-of-leaderboard result on τ-voice Bench, a benchmark that evaluates full-duplex voice agents under realistic conditions including background noise, regional accents, mid-sentence interruptions and natural turn-taking.

The release follows xAI's April 18 launch of standalone Grok speech-to-text and text-to-speech APIs, and rounds out a voice stack that the company is now pitching directly at OpenAI's Realtime API customers. Pricing remains the same as xAI's existing Grok Voice Agent API: $0.05 per minute of audio (about $3/hour), with tool invocations billed separately at $5 per 1,000 calls. The model is available immediately on the xAI Console for testing and via the Grok Voice Agent API for production traffic.
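To make the pricing concrete, here is a minimal cost sketch in Python. The two rates are the ones quoted above; the function name `estimate_cost` and the monthly volumes are hypothetical, chosen only to illustrate the arithmetic, and are not part of any xAI SDK.

```python
from decimal import Decimal

# Rates quoted in the article; everything else here is a hypothetical illustration.
AUDIO_RATE_PER_MINUTE = Decimal("0.05")    # USD per minute of audio
TOOL_RATE_PER_CALL = Decimal("5") / 1000   # USD 5 per 1,000 tool invocations


def estimate_cost(audio_minutes: int, tool_calls: int) -> Decimal:
    """Return the estimated USD cost for a volume of audio minutes and tool calls."""
    return audio_minutes * AUDIO_RATE_PER_MINUTE + tool_calls * TOOL_RATE_PER_CALL


# One hour of audio with no tool calls matches the "about $3/hour" figure.
print(f"${estimate_cost(audio_minutes=60, tool_calls=0):.2f}")  # $3.00

# Hypothetical month: 10,000 minutes of calls and 25,000 tool invocations.
print(f"${estimate_cost(audio_minutes=10_000, tool_calls=25_000):.2f}")  # $625.00
```

In this made-up month the tool invocations add $125 on top of $500 of audio, so a tool-heavy agent should budget for both line items rather than the per-minute rate alone.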

Figure: The xAI Console (developer tools and model selection screen), where developers can test and deploy grok-voice-think-fast-1.0 against their own tool definitions.

Key Details

τ-voice Bench score: 67.3% — first place on the leaderboard, ahead of Gemini 3.1 Flash Live (43.8%), Grok Voice Fast 1.0 (38.3%) and GPT Realtime 1.5 (35.3%).
Background reasoning — the model "thinks" while it talks, using inference time to plan multi-step actions without adding to perceived latency. xAI says response latency stays in the same envelope as its non-reasoning Grok Voice Fast model.
Structured-data extraction — designed to capture email addresses, street addresses, phone numbers, account numbers and full names accurately even with strong accents, fast speech and disfluencies.
Starlink in production — a single grok-voice-think-fast-1.0 agent calls 28 distinct tools across Starlink's phone-sales and support workflows, delivering a reported 20% sales conversion rate and 70% autonomous-resolution rate on inbound support calls.
Pricing — flat $0.05 per minute of audio, plus $5 per 1,000 tool invocations. No separate per-token charge for the voice model itself.
Availability — live on the xAI Console and Grok Voice Agent API as of April 25, 2026; same SDKs as the previous Grok Voice models, so existing integrations only require swapping the model name.
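The size of the lead differs by competitor, so it helps to compute each margin directly from the reported scores. The short Python sketch below does only that; the helper name `leads_over` is hypothetical, and the scores are xAI's reported figures rather than independent measurements.

```python
# τ-voice Bench scores (percent) as reported in the article.
SCORES = {
    "grok-voice-think-fast-1.0": 67.3,
    "Gemini 3.1 Flash Live": 43.8,
    "Grok Voice Fast 1.0": 38.3,
    "GPT Realtime 1.5": 35.3,
}


def leads_over(leader: str, scores: dict[str, float]) -> dict[str, float]:
    """Return the leader's margin, in percentage points, over every other model."""
    return {
        name: round(scores[leader] - score, 1)
        for name, score in scores.items()
        if name != leader
    }


margins = leads_over("grok-voice-think-fast-1.0", SCORES)
for name, margin in margins.items():
    print(f"{name}: +{margin} points")
# Gemini 3.1 Flash Live: +23.5 points
# Grok Voice Fast 1.0: +29.0 points
# GPT Realtime 1.5: +32.0 points
```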

What Developers and Users Are Saying

Reaction across Hacker News, Reddit's r/MachineLearning and developer threads on X has been split between cautious enthusiasm and skepticism. Developers building on OpenAI's Realtime API and Google's Gemini Live have flagged the latency claims and the $0.05/minute price as the two numbers that, if they hold under load, make grok-voice-think-fast-1.0 immediately competitive — particularly for telephony and call-center deployments where token-based pricing has been hard to model. The Times of AI summarised the early developer mood as "if these numbers survive production, OpenAI Realtime customers will start looking elsewhere."

The main pushback is that the headline 67.3% τ-voice number comes from xAI's own evaluation harness, and there are no independent reproductions yet. Several Hacker News commenters noted that voice benchmarks are notoriously sensitive to audio preprocessing and turn-taking heuristics, and called for a third-party evaluation before treating the gap over Gemini and GPT Realtime as settled. The Starlink case study has also drawn questions: a 70% autonomous-resolution rate is striking, but xAI has not published the volume mix or how "resolution" is measured.

What This Means for Developers

For teams already shipping voice agents, this is a meaningful new option rather than an industry reset. The competitive picture now: grok-voice-think-fast-1.0 ($0.05/min, leads τ-voice), Gemini 3.1 Flash Live (cheap, multilingual, roughly two-thirds of the leading benchmark score), GPT Realtime 1.5 (deepest tooling ecosystem, lowest τ-voice score of the four), and Grok Voice Fast 1.0 (same xAI stack, suited to simpler workflows). The path of least resistance is to A/B the new model against your current provider on a real workflow rather than a synthetic benchmark. xAI has kept the API surface compatible with its earlier voice models, so for teams already on xAI the swap is essentially a model-name change.
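One way to set up such an A/B test is deterministic, hash-based assignment, so that a given call always lands in the same arm. The Python sketch below is an assumption about how a team might do this, not anything xAI prescribes: the arm labels, the `assign_arm` helper and the `call_id` format are all hypothetical, and the code only picks an arm without calling any provider API.

```python
import hashlib

# Hypothetical arm labels: the incumbent provider and the new model.
ARMS = ("current-provider", "grok-voice-think-fast-1.0")


def assign_arm(call_id: str, treatment_share: float = 0.5) -> str:
    """Map a call ID to an arm; the same ID always gets the same arm."""
    digest = hashlib.sha256(call_id.encode("utf-8")).digest()
    # Turn the first 8 bytes of the hash into a number in [0, 1).
    bucket = int.from_bytes(digest[:8], "big") / 2**64
    return ARMS[1] if bucket < treatment_share else ARMS[0]


# Route roughly 20% of calls to the new model while the test ramps up.
for call_id in ("call-0001", "call-0002", "call-0003"):
    print(call_id, "->", assign_arm(call_id, treatment_share=0.2))
```

Hashing the call ID rather than drawing a random number keeps retries and reconnects in the same arm, which makes per-arm metrics such as resolution rate easier to compare.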

Engineers building tool-heavy voice agents (booking, support triage, structured-data capture from phone calls) are the most obvious beneficiaries: the background-reasoning approach is specifically designed for workflows where the agent needs to plan multi-tool calls without long pauses. Teams that only need simple TTS or transcription should keep using xAI's separate STT/TTS APIs from the April 18 release — the new model is overkill for one-shot use cases.

What's Next

xAI says wider rollout, additional language coverage and on-prem deployment options are on the near-term roadmap, and that the model will continue to ship inside Starlink's phone systems as the public reference deployment. Independent τ-voice numbers from Artificial Analysis and academic groups are expected within a few weeks. Developers can read the full announcement on the xAI blog and try the model in the xAI developer console.

Sources

xAI — Grok Voice Think Fast 1.0 announcement — primary source, benchmark numbers and Starlink stats.
MarkTechPost — independent coverage of the τ-voice leaderboard.
TestingCatalog — developer-focused walkthrough including the xAI Console screenshot.
Times of AI — community reaction and analysis.
xAI Models & Pricing — current Grok API model list and per-minute voice pricing.
MarkTechPost — Grok STT/TTS APIs (April 18) — context on the prior week's voice-stack release.
