Tested 7 d. · Revised Apr. 26
Tool review · Detailed verdict
xAI

Grok 4.20 Heavy

« Fast, plugged into X in real time, with leading Heavy agents. Weak spots: unclear GDPR compliance, average French writing. »

Overall score
weighted average · 5 axes
§ Presentation video · Grok 4.20 Heavy
YouTube · official
xAI · July 2025 · official demo of Grok 4 + 4 Heavy · official source
Price / month: 30 € (Pro plan)
Cost / 1M tokens: 3 € (output rate)
Context: 256k (max tokens)
GDPR: 5.5/10 (to verify)
Hosting: 🇺🇸 US (by default)
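As a rough illustration of what the output rate above means in practice, the sketch below converts a number of generated tokens into euros. It assumes the 3 € per 1M tokens figure applies linearly to output tokens only. The function name `output_cost_eur` and the sample volumes are hypothetical, and input-token pricing is left out because this review does not state it.

```python
# Assumption: the 3 € / 1M tokens output rate quoted in this review, applied linearly.
OUTPUT_RATE_EUR_PER_MILLION = 3.0


def output_cost_eur(output_tokens: int, rate: float = OUTPUT_RATE_EUR_PER_MILLION) -> float:
    """Estimated cost in euros for a given number of generated (output) tokens."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return output_tokens / 1_000_000 * rate


if __name__ == "__main__":
    # Hypothetical monthly volumes, for illustration only.
    for tokens in (50_000, 250_000, 1_000_000):
        print(f"{tokens:>9} output tokens ≈ {output_cost_eur(tokens):.2f} €")
```

This only estimates per-token usage; whether the 30 € Pro plan and the per-token rate can be compared directly depends on how the two are billed, which the figures above do not say.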
§3.1 Strength profile
Radar: Write 7.6/10, Code 8.4/10, Reason 8.7/10, GDPR 5.5/10, Speed 9/10
Scale 0 to 10 · RadarOnAI internal method
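The overall score is described as a weighted average over five axes, but the weights themselves are not given here. The sketch below shows how such an average could be computed from the radar values. The equal weights are an assumption made purely for illustration, and `weighted_average` is a hypothetical helper, not RadarOnAI's actual method.

```python
# Axis scores taken from the radar values in this review (scale 0 to 10).
SCORES = {"Write": 7.6, "Code": 8.4, "Reason": 8.7, "GDPR": 5.5, "Speed": 9.0}

# Assumption: equal weights. The real weights are not stated in the review.
EQUAL_WEIGHTS = {axis: 1.0 for axis in SCORES}


def weighted_average(scores: dict[str, float], weights: dict[str, float]) -> float:
    """Return sum(score * weight) / sum(weight) over the axes in `scores`."""
    total_weight = sum(weights[axis] for axis in scores)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(scores[axis] * weights[axis] for axis in scores) / total_weight


if __name__ == "__main__":
    print(round(weighted_average(SCORES, EQUAL_WEIGHTS), 2))
```

With equal weights this comes out at about 7.84 out of 10. The published overall score may differ, since the site's own weighting is unknown.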
§3.2 Strong points
  • 01 16-agent Heavy for long reasoning
  • 02 Real-time X data (live monitoring)
  • 03 Native multimodal text/image/video/audio
§3.3 Weak points
  • 01 US hosting, no EU option
  • 02 French writing less refined than Claude
  • 03 Unpredictable moderation
Best for

Real-time monitoring, feed analysis, fast comms.

Not for

Polished editorial writing, public sector, sensitive missions.

§04 Benchmarks · Grok 4.20 Heavy

Against the rest, cold.

Benchmark (what it measures) · Score · Rank
GPQA Diamond (scientific reasoning) · 79.5% · #3
SWE-bench Verified (real-world bug fixing) · 64% · #4
MMLU-Pro (general knowledge) · 82.5% · #4
HumanEval+ (code generation) · 87% · #5
FR-Check 2026 (internal language test) · 71/100 · #7
Overall score evolution · Q1 '25 → Q3 '26
Evolution: 40 → 83 (+43 pts in 7 quarters)
§ Editor's view · try it?

If your long-form text matters more than your images, Grok 4.20 Heavy remains the subscription that pays back its 30 € monthly fee fastest. We use it daily in the newsroom.

Affiliate link ↗ — RadarOnAI earns a commission if you subscribe, at no extra cost to you. Our verdicts never depend on commissions.
Try Grok
No commitment · cancel in one click