The paper, Rethinking Out-of-Distribution (OOD) Detection for Reinforcement Learning, presents new benchmarks for evaluating RL agents’ ability to detect novel scenarios.
This paper challenges existing OOD detection techniques and introduces DEXTER as a potential solution, promising more robust response mechanisms for RL agents in unanticipated environments.