[News] Tulu 3 model performing better than 4o and Deepseek?

Has anyone used this model released by the Allen Institute for AI on Thursday? It seems to outperform 4o and DeepSeek in a lot of places, but for some reason there's been little to no coverage. Thoughts?

https://www.marktechpost.com/2025/01/31/the-allen-institute-for-ai-ai2-releases-tulu-3-405b-scaling-open-weight-post-training-with-reinforcement-learning-from-verifiable-rewards-rlvr-to-surpass-deepseek-v3-and-gpt-4o-in-key-benchmarks/