Grok 4.3, xAI’s newest frontier release, hit broad API availability this week with three changes that meaningfully shift the model-pricing landscape: a 40% cut on input pricing (down to $1.25 per million tokens), expansion to a 1-million-token context window, and the first native video-input support in the Grok line. Add in standalone Speech APIs (low-latency STT and TTS with diarization, timestamps, and multilingual support), category-leading scores on specialized legal and corporate-finance benchmarks, and downloadable artifacts (PDFs, spreadsheets, PowerPoint) generated directly from chat — and the practical case for evaluating Grok 4.3 for production workloads is the strongest it’s been since the model line launched. This is the developer breakdown: what’s new, why it matters, and how to evaluate Grok 4.3 against the rest of the field today.
What’s actually new
Three layered changes define the Grok 4.3 release. First, aggressive pricing. Input tokens drop from $2.10/M to $1.25/M (about 40% cut); output tokens drop from $4.00/M to $2.50/M (about 38% cut). For the same workload that cost $100/day on Grok 4.2, Grok 4.3 lands around $60. The cuts close the cost gap with the Chinese open-weights coding models that have been pressuring frontier-model pricing throughout 2026. xAI’s positioning is explicit: “frontier capability at challenger pricing.”
Second, 1-million-token context window. Up from the 256K context of Grok 4.2. The 1M window matches Gemini 3.1 Pro and exceeds GPT-5.5’s 200K window. For workloads that benefit from large context — entire codebases, multi-document research, long-running agentic conversations — Grok 4.3 now competes on equal footing with Gemini and ahead of OpenAI’s flagship.
Third, native video input. Where Grok 4.2 handled images natively but required transcoding video to frame-sequences for analysis, Grok 4.3 ingests video directly with audio track included. The use cases this opens up — meeting analysis, content moderation, video-based instruction following, multimodal RAG over recorded content — were technically possible but operationally cumbersome before. Native handling makes them practical.
Beyond the headline three, the release adds several capabilities worth knowing. Downloadable artifact generation: Grok 4.3 can produce ready-to-share PDFs, fully-formatted spreadsheets (xlsx with formulas, conditional formatting, charts), and PowerPoint decks directly from conversation. Independent evaluators report the output quality is genuinely usable rather than the boilerplate output earlier “AI generates documents” features produced.
Specialized benchmark wins: Vals AI’s independent evaluation puts Grok 4.3 at #1 on CaseLaw v2 (79.3% accuracy) — a benchmark covering legal reasoning across US case law — and #1 on CorpFin, covering corporate finance reasoning. The frontier-model rankings on general-purpose benchmarks (MMLU, GPQA, MATH) are roughly comparable across GPT-5.5, Claude Opus 4.7, Gemini 3.1 Pro, and Grok 4.3; the differentiation is now in specialized domains.
Speech APIs: separate from the chat completions API, xAI now ships standalone Grok Speech-to-Text and Text-to-Speech endpoints. Speech-to-Text handles 90+ languages with diarization (speaker tagging), timestamps, and real-time streaming. Text-to-Speech offers expressive voices with rapid voice cloning from short audio samples. Pricing is competitive with ElevenLabs and OpenAI‘s voice products, with category-leading latency.
Improved agentic performance: tool-call accuracy, JSON mode reliability, and long-horizon task completion rates all show double-digit improvements over Grok 4.2 in xAI’s reported benchmarks. The agentic gains matter most for production deployments running multi-step workflows; quality on these dimensions translates directly into reliability.
Why it matters
- The frontier-model price floor moves down. Grok 4.3 at $1.25/$2.50 is meaningfully cheaper than GPT-5.5 ($10/$30) or Claude Opus 4.7 ($15/$75) for input and output. For workloads where Grok 4.3’s capability is sufficient, the cost savings are 80%+. Expect OpenAI and Anthropic to respond with their own price cuts within the next quarter.
- Native video input changes the multimodal application landscape. Frontier models that handle video natively were limited to Gemini and now Grok 4.3. For applications built around video understanding — content moderation, education, sports analytics, security review — the practical options just expanded.
- The 1M context window competes directly with Gemini 3.1 Pro. Long-context workloads no longer have just one frontier option; they now have two. Comparison shopping based on quality, latency, and cost just got more interesting.
- Specialized benchmark leadership signals domain depth. CaseLaw v2 and CorpFin aren’t general benchmarks — they reflect deep work on specific verticals. Legal and financial-services teams that previously defaulted to GPT-5.5 or Claude Opus now have a credible alternative tuned to their domains.
- Speech APIs at competitive pricing pressure ElevenLabs and OpenAI Voice. The standalone Speech APIs aren’t a side product — they’re a serious entry into the voice category. Production voice deployments now have a third credible vendor option.
- Document artifact generation closes a UX gap. The “AI helped me write something but I still need to format it” friction point disappears for workflows that produce reports, analyses, or presentations. Time savings compound across knowledge-work use cases.
How to use it today
Grok 4.3 is available through xAI’s API directly and through the Grok consumer apps (web, iOS, Android, X integration). Production deployments use the API; consumer evaluation works through the apps.
- API quick start. Get an API key from console.x.ai. The API is OpenAI-compatible — existing code that calls OpenAI’s chat completions API works unchanged after a base URL and key swap.