A new test of AI capabilities consists of puzzles that humans are able to solve without too much trouble, but which all leading AI models struggle with. To improve and pass the test, AI companies will need to balance problem-solving abilities with cost.
Ai companies will just train on these specific puzzles. Then they will claim their AI is AGI and the quality of the models will be the exact same or worse than before. They’ll just have one checkmark more in their marketing.