Skip to main content
eScholarship
Open Access Publications from the University of California

Towards a Metacognitive Reinforcement Learning Approach for Planning in Adaptive Learning Systems

Creative Commons 'BY' version 4.0 license
Abstract

Learners face metacognitive challenges in planning efficient allocation of limited study time and cognitive resources. Our work draws cognitive science research showing how humans use reinforcement learning to adaptively develop metareasoning heuristics that balance deliberation and exploitation in learning sequence planning. We model this framework computationally by formulating adaptive content sequencing as a Markov Decision Process with meta-level states, actions, and rewards. A neural meta-policy module governs deliberation on building new personalized learning plans versus the reuse of prior recommendations through simulated user interactions. Testing using 100 simulated agents exhibiting the evolution of knowledge, interests, and consumption patterns provided longitudinal data on meta-policy responsiveness to dynamic learning requirements. Analyzing trends over time and trigger-reaction lags quantified opportunities for improving deliberation latency and relevance. The simulated experiments demonstrate promising progress in computationally modeling the metacognitive capacity for resource-rational planning by strategically balancing plan quality and computational effort in education content recommendation.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View