VORTEX: Aligning Task Utility and Human Preferences through LLM-Guided Reward Shaping
arXiv:2509.16399v1 Announce Type: new Abstract: In social impact optimization, AI decision systems often rely on solvers that optimize well-calibrated mathematical objectives. However, these solvers cannot directly accommodate evolving human preferences, typically expressed in natural language rather than formal constraints. Recent…
