Quantum Reinforcement Learning with Dynamic-Circuit Qubit Reuse and Grover-Based Trajectory Optimization
arXiv:2509.16002v1 Announce Type: cross Abstract: A fully quantum reinforcement learning framework is developed that integrates a quantum Markov decision process, dynamic circuit-based qubit reuse, and Grover’s algorithm for trajectory optimization. The framework encodes states, actions, rewards, and transitions entirely within…
