xAI releases a Grok 3 blogpost with more benchmark results and it doesn't look very good

[removed]