AI and the Humbling of Human Intellect

I often test new LLM models with the following question — Given the following conditions, how many ways can Professor Y assign six different books to four different students? The most expensive book must be assigned to student X. Each student must receive at least one book. The first model which can solve this question is OpenAI’s o1 model. The second one is Google’s Gemini 2.0 Flash Thinking model. Then more of such thinking models emerged, including DeepSeek’s R1 model and Anthropic’s Claude 3.7 (with extended thinking mode), all of which can solve this problem flawlessly. ...

March 26, 2025 · 3 min · Xing Shi Cai