LLMs, Gotta Think ’Em All: When Pokémon Battles Become a Serious AI Benchmark
Game AI usually has a familiar job: lose convincingly. Not too quickly, because that feels insulting. Not too brutally, because that feels like homework wearing a boss-battle costume. Good game AI sits in the narrow emotional band between “I can beat this” and “I need to think.” The old solutions were scripted behavior, heuristics, difficulty sliders, and reinforcement learning trained until the agent stopped embarrassing itself. The newer temptation is simpler: give the game state to an LLM and ask it to play. ...