Researchers conducted an experiment by integrating AI models with Super Mario Bros. to reveal their real-time abilities.
Why Super Mario Bros. is Crucial for AI Testing
Games have long served as a playground for AI testing. Super Mario Bros. demands quick decision-making and adaptability to unforeseen situations, making it a unique tool for testing AI.
AI Model Performance in the Mushroom Kingdom
Hao AI Lab tested AI models in Super Mario using GamingAgent. Anthropic's Claude 3.7 showed strong performance, while even well-known models like Google's Gemini 1.5 Pro faced challenges.
The Reasoning Paradox: Challenges for 'Thinking' Models in Gaming
Reasoning-driven models like GPT-4o excel in most tasks but not in real-time, making them slow for games demanding impulsivity.
Super Mario Bros. helps expose advanced capabilities of AI. Despite the gaming scope, it stimulates real application development requiring instant reaction and strategy.