Ma, Xiaojian

Inspecting Generalization of Reinforced Learners: The HALMA Benchmark

2020

Ma, Xiaojian
Advisor(s): Zhu, Song-Chun

Abstract

Humans learn compositional and causal abstraction, i.e., knowledge, in response to the structure of naturalistic tasks. When presented with a problem-solving task involving some objects, toddlers would first interact with these objects to reckon what they are and what can be done with them. Leveraging these concepts, they could understand the internal structure of this task, without seeing all of the problem instances. Remarkably, they further build cognitively executable strategies to rapidly solve novel problems. To empower a learning agent with similar capability, we argue there shall be three levels of generalization in how an agent represents its knowledge: perceptual, conceptual, and algorithmic. In this work, we devise the very first systematic benchmark that offers joint evaluation covering all three levels. This benchmark is centered around a novel task domain, HALMA, for visual concept development and rapid problem-solving. We conduct extensive experiments on reinforcement learning agents with various inductive biases and carefully report their proficiency and weakness.

Main Content

For improved accessibility of PDF content, download the file to your device.

UCLA

Inspecting Generalization of Reinforced Learners: The HALMA Benchmark