Stationary Deep Reinforcement Learning with Quantum K-spin Hamiltonian Regularization