
Explaining Reinforcement Learning Agents by Policy Comparison

Description

Abstract:
Reinforcement learning (RL) techniques have led to remarkable results in challenging domains such as Atari games, Go, and StarCraft, suggesting that practical applications lie just over the horizon. Before we can trust decisions made by RL policies, however, we need more visibility into how they work. To explain a reinforcement-learning agent, I propose extending the power of counterfactual reasoning to sequential domains by comparing its policy to a baseline policy at a set of automatically identified decision points. My novel method for selecting important decision points considers a large pool of candidate states and decomposes the agent's value into the reward obtained before and after visiting each candidate state. A state is considered important if the accumulated reward obtained after switching to the baseline policy at that state differs most from the reward obtained by continuing with the agent's own policy. The engine of this computation is a decomposition of the occupancy frequencies of an agent's policy, which characterize the whereabouts of the agent before and after the policy change. Structuring the policy evaluation in this way provides a causal account for its outcome. I have demonstrated the approach on a set of standard RL benchmark domains, providing explanations using the decomposed occupancy frequencies.
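To make the selection criterion concrete, the sketch below is a minimal, hypothetical illustration rather than the dissertation's actual implementation: instead of the occupancy-frequency decomposition described in the abstract, it estimates the post-state return under the agent's policy and under the baseline policy by Monte Carlo rollouts, then ranks candidate states by the gap between the two. All names (`env_step`, `agent_policy`, `baseline_policy`, `candidate_states`) are placeholders supplied by the reader's own environment and policies.

```python
def rollout_return(env_step, policy, state, horizon=50, gamma=0.99):
    """Discounted return from `state`, following `policy` for up to `horizon` steps.

    `env_step(state, action)` is assumed to return (next_state, reward, done).
    """
    total, discount, s = 0.0, 1.0, state
    for _ in range(horizon):
        a = policy(s)
        s, r, done = env_step(s, a)
        total += discount * r
        discount *= gamma
        if done:
            break
    return total


def rank_decision_points(candidate_states, env_step, agent_policy,
                         baseline_policy, n_rollouts=20):
    """Score each candidate state by how much the expected return after that
    state changes when the agent switches to the baseline policy there."""
    scores = {}
    for s in candidate_states:
        v_agent = sum(rollout_return(env_step, agent_policy, s)
                      for _ in range(n_rollouts)) / n_rollouts
        v_baseline = sum(rollout_return(env_step, baseline_policy, s)
                         for _ in range(n_rollouts)) / n_rollouts
        scores[s] = abs(v_agent - v_baseline)
    # States with the largest gap are the most informative points of comparison.
    return sorted(scores, key=scores.get, reverse=True)
```

In this toy version, the states returned first are those where following the baseline policy instead of the agent's own policy would change the accumulated reward the most, which is the intuition behind the automatically identified decision points in the abstract.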
Notes:
Thesis (Ph. D.)--Brown University, 2022

Citation

Lee, Jun Ki, "Explaining Reinforcement Learning Agents by Policy Comparison" (2022). Computer Science Theses and Dissertations. Brown Digital Repository. Brown University Library. https://repository.library.brown.edu/studio/item/bdr:kuvehb7p/
