AI and the Humbling of Human Intellect
I often test new LLM models with the following question — Given the following conditions, how many ways can Professor Y assign six different books to four different students? The most expensive book must be assigned to student X. Each student must receive at least one book. The first model which can solve this problem is OpenAI’s o1 model. The second one is Google’s Gemini 2.0 Flash Thinking model. Then more of such thinking models emerged, such as DeepSeek’s R1, Anthropic’s Claude 3.7 (with extended thinking mode), all of which can solve this problem flawlessly. ...