Quantum Gradient Estimation And Its Application To Quantum Reinforcement Learning