Science ❯ Computer Science ❯ AI Research
Educational Psychology Addiction in AI Performance Evaluation
The rules-driven simulator spotlights persistent weaknesses in long-horizon planning, including memory.