Patel, Sagar; Jyothi, Sangeetha Abdu; Narodytska, Nina

doi:10.1609/aaai.v38i13.29372

This item is not available for download from eScholarship

CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems

2024

Published Web Location

https://ojs.aaai.org/index.php/AAAI/article/view/29372/30590

No data is associated with this publication.

Creative Commons 'BY-NC-ND' version 4.0 license

Abstract

We present CrystalBox, a novel, model-agnostic, posthoc explainability framework for Deep Reinforcement Learning (DRL) controllers in the large family of input-driven environments which includes computer systems. We combine the natural decomposability of reward functions in input-driven environments with the explanatory power of decomposed returns. We propose an efficient algorithm to generate future-based explanations across both discrete and continuous control environments. Using applications such as adaptive bitrate streaming and congestion control, we demonstrate CrystalBox's capability to generate high-fidelity explanations. We further illustrate its higher utility across three practical use cases: contrastive explanations, network observability, and guided reward design, as opposed to prior explainability techniques that identify salient features.

Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.

Item not freely available? Link broken?

Report a problem accessing this item

UC Irvine

CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems

Published Web Location