OpenAI researchers accused xAI about publishing misleading Grok 3 benchmarks. The truth is a little more nuanced.
We thoroughly tested the Grok 3 model and came away surprised by its capabilities as it is a model that outperforms o3-mini, ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results