OpenAI researchers accused xAI of publishing misleading Grok 3 benchmarks. The truth is a little more nuanced.
We thoroughly tested the Grok 3 model and came away surprised by its capabilities, since it outperforms o3-mini, ...
Did xAI manipulate Grok-3’s benchmarks? Explore the controversy, strengths, and weaknesses of this AI model in our in-depth ...