NYU Researcher Explains Why AI Models Still Struggle to Play Video Games
By
Matthew S. Smith
Baker's choice. Dense with flavour, light on filler.
Summary
Julian Togelius, director of NYU's Game Innovation Lab and co-founder of Modl.ai, discusses a recent paper exploring why LLMs and AI models struggle with playing video games despite their rapid improvements in coding. He argues that coding is a "well-behaved game" with clear rules, while video games require general intelligence, adaptability, and real-time decision-making that current AI lacks. The article uses this gap to highlight the broader limitations of AI in 2026, challenging the perception that AI is close to human-level general intelligence.
Key quotes
· 2 pulledIt's not just LLMs that are bad at this. We do not have general game AI.
There's a widespread perception that because we can build AI that...
You might also wanna read

The Limitations of Generative AI for Creating Video Game Worlds
The article examines the limitations of generative AI in creating compelling video game worlds, contrasting it with traditional procedural g

Neuroscience Challenges AI Optimism: Are Large Language Models a Path to True Intelligence?
The article examines the ambitious claims by tech leaders like Mark Zuckerberg, Dario Amodei, and Sam Altman about achieving superintelligen

AI Labs Compete for Video Game Data to Train World Models and Agents
The article discusses the growing interest in AI world models and agents that can interact with the real world. It highlights how Medal, a v

Generative AI's Growing Role in Video Game Development in 2025: Industry Adoption and Developer Pushback
The article examines the growing adoption of generative AI in the video game industry in 2025, highlighting how major game studios and CEOs

Google DeepMind's SIMA 2 AI Agent Learns to Play Video Games Using Gemini AI
Google DeepMind has developed SIMA 2, an advanced AI agent that learns to play video games like No Man's Sky, Valheim, and Goat Simulator 3.
ARC Prize benchmark reveals AI systems score under 1% on spatial reasoning puzzles while humans achieve 100%
The article discusses the ARC Prize Foundation's May 2026 benchmark results showing that while humans scored 100% on a game-like AI test, th
theconversation.com·7h ago