Structural Reward Model: Enhancing Interpretability, Efficiency, and Scalability in Reward Modeling
arXiv:2509.25361v1 Announce Type: new Abstract: Reward Models (RMs) are key components for evaluating and guiding language model outputs. However, traditional scalar RMs often struggle with incorporating contextual and background information during inference, leading to incomplete evaluations. Generative RMs (GRMs) attempt…
