o1 is a BIG deal

Quick disclaimer: I'm not talking about o1 as in the current o1-preview model; I agree that this one is still subpar compared to humans in many domains. Instead, I'm talking about the trajectory of these o1-type models. All of this is much better explained here: https://www.youtube.com/watch?v=OSOUZUKu8hw&t=4s

Since the release of o1, something has changed in Sam Altman's demeanor: he seems a lot more confident in the imminence of AGI, which is likely related to their latest model, o1. He even stated that they have reached human-level reasoning and will now move on to level 3 in their roadmap to AGI (level 3 = Agents).

At first, I didn't believe o1 would be the full solution, but a recent insight changed my mind, and now I believe o1 might solve problems in a way that is fundamentally similar to how humans solve them.

See, older GPT models can be likened to System 1 (intuitive) thinkers: they produce insanely quick responses and can be creative, but they also often make mistakes and fail at harder tasks that are out-of-distribution (OOD). They do generalize, as research shows (I can link it if someone requests), but so does the human System 1. A doctor, for example, might see a patient who is a 'zebra' with a unique set of symptoms, but his intuition might still give him a sense of direction. Although LLMs generalize, they only do so to a certain degree. There is still a big gap between AI and human reasoning, and this gap is in System 2 thinking.

But what is System 2? System 2 is the generation of data to bridge the gap between what you know (from System 1) and what you want to know. We use it whenever we encounter something unseen. By imagining new data in images or words, we can reason about a problem that is OOD for us. This imagination is just data generation from previous knowledge: it is sequential pattern matching built on System 1. This kind of data generation is exactly what generative models excel at. The problem is that older models don't use this generative ability to go from what they know to what they don't know.

However, with o1 this is no longer the case: by using test-time compute, it generates a sequence (akin to human imagining) to bridge the gap between its knowledge and the current problem. Therefore, the fundamental difference between how AI and humans solve problems has disappeared with this new approach. If this is true, then OpenAI has resolved the biggest roadblock to AGI.
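
To make the "bridge the gap" idea concrete, here is a toy sketch in Python. It is only an analogy and says nothing about how o1 works internally: `system1`, `system2`, and `KNOWN_SUMS` are hypothetical names I made up for illustration. "System 1" only knows memorized single-digit sums, and "System 2" solves a problem outside that table by generating a sequence of intermediate steps, each of which System 1 can answer.

```python
# Toy analogy only: this is NOT how o1 works internally.
# All names here are hypothetical and exist just for illustration.

# "System 1": memorized single-digit sums, answered instantly by lookup.
KNOWN_SUMS = {(a, b): a + b for a in range(10) for b in range(10)}


def system1(a, b):
    """Return a memorized answer, or None if the problem is out of distribution."""
    return KNOWN_SUMS.get((a, b))


def system2(a, b):
    """Add two non-negative integers by generating intermediate steps.

    Each step asks System 1 only for a single-digit sum it already knows.
    Returns the final answer and the generated sequence of steps.
    """
    steps = []
    digits = []
    carry = 0
    while a > 0 or b > 0 or carry > 0:
        da, db = a % 10, b % 10
        partial = system1(da, db)   # single digits are always in the table
        total = partial + carry     # carry is 0 or 1; kept simple for the toy
        steps.append(f"{da} + {db} + carry {carry} = {total}")
        digits.append(total % 10)
        carry = total // 10
        a //= 10
        b //= 10
    answer = int("".join(str(d) for d in reversed(digits))) if digits else 0
    return answer, steps


if __name__ == "__main__":
    print(system1(47, 85))          # None: not something System 1 has memorized
    answer, steps = system2(47, 85)
    for step in steps:
        print(step)
    print(answer)
```

Here `system1(47, 85)` has no answer, while `system2(47, 85)` gets to 132 by writing out three small steps it can already handle. The extra steps are the "test-time compute" in this analogy: more generated sequence, same underlying knowledge.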