Robustness in the Face of Partial Identifiability in Reward Learning
arXiv:2501.06376v2 Announce Type: replace-cross Abstract: In Reward Learning (ReL), we are given feedback on an unknown target reward, and the goal is to use this information to recover it in order to carry out some downstream application, e.g., planning. When…
