Power Constrained Nonstationary Bandits with Habituation and Recovery Dynamics
arXiv:2511.02944v1
Abstract: A common challenge for decision makers is selecting actions whose rewards are unknown and evolve over time based on prior policies. For instance, repeated use may reduce an action’s effectiveness (habituation), while inactivity may restore…
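To make the habituation and recovery effects mentioned in the abstract concrete, here is a minimal toy sketch of a single action whose effectiveness drops each time it is used and drifts back toward full strength while it rests. The class name `HabituatingArm`, the parameters `habituation` and `recovery`, and the multiplicative update rules are all illustrative assumptions; the abstract is truncated and does not specify the paper's actual model, so this should not be read as the authors' formulation.

```python
from dataclasses import dataclass


@dataclass
class HabituatingArm:
    """Toy arm whose effectiveness decays with use and recovers with rest.

    Every name and update rule here is a hypothetical illustration,
    not the dynamics defined in the paper.
    """
    base_reward: float = 1.0    # expected reward at full effectiveness
    habituation: float = 0.3    # fraction of effectiveness lost per pull
    recovery: float = 0.2       # fraction of the gap to full strength regained per idle step
    effectiveness: float = 1.0  # current multiplier in [0, 1]

    def step(self, pulled: bool) -> float:
        """Advance one time step and return the expected reward collected."""
        if pulled:
            reward = self.base_reward * self.effectiveness
            # Habituation: using the action makes the next use less effective.
            self.effectiveness *= 1.0 - self.habituation
            return reward
        # Recovery: resting closes part of the gap back to full effectiveness.
        self.effectiveness += self.recovery * (1.0 - self.effectiveness)
        return 0.0


def total_reward(schedule: list[bool]) -> float:
    """Sum of expected rewards for a fixed pull/rest schedule on a fresh arm."""
    arm = HabituatingArm()
    return sum(arm.step(pulled) for pulled in schedule)


if __name__ == "__main__":
    print(total_reward([True] * 6))         # pull at every step
    print(total_reward([True, False] * 3))  # alternate pulling and resting
```

The sketch uses deterministic expected rewards rather than noisy samples so that the effect of a schedule is easy to inspect; a learning algorithm would not know `habituation` or `recovery` in advance and would have to estimate them from observed rewards.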
