Quantum Policy Gradient Algorithm with Optimized Action Decoding

Nico Meyer, D. D. Scherer, A. Plinge, Christopher Mutschler, M. Hartmann · December 13, 2022 · DOI: 10.48550/arXiv.2212.06663
Computer Science, Physics

Abstract

Quantum machine learning implemented by variational quantum circuits (VQCs) is considered a promising concept for the noisy intermediate-scale quantum computing era. Focusing on applications in quantum reinforcement learning, we propose a specific action decoding procedure for a quantum policy gradient approach. We introduce a novel quality measure that enables us to optimize the classical post-processing required for action selection, inspired by local and global quantum measurements. The resulting algorithm demonstrates a significant performance improvement in several benchmark environments. With this technique, we successfully execute a full training routine on a 5-qubit hardware device. Our method introduces only negligible classical overhead and has the potential to improve VQC-based algorithms beyond the field of quantum reinforcement learning.
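
To make the idea of action decoding concrete, the following is a minimal sketch of how classical post-processing might turn a distribution over measured bitstrings into a policy over actions. It is an illustration under stated assumptions, not the procedure from the paper: the function names (policy_from_bitstring_probs, decode_local, decode_global) are hypothetical, the "local" rule is assumed to read only the first qubit, the "global" rule is assumed to use the parity of all qubits, and the input distribution is hand-written rather than sampled from a VQC.

```python
from itertools import product


def policy_from_bitstring_probs(probs, decode, num_actions):
    """Sum bitstring probabilities into action probabilities.

    probs: dict mapping a bitstring (tuple of 0/1) to its probability.
    decode: hypothetical post-processing function, bitstring -> action index.
    """
    policy = [0.0] * num_actions
    for bits, p in probs.items():
        policy[decode(bits)] += p
    return policy


def decode_local(bits):
    # Assumed "local" rule: the action depends on the first qubit only.
    return bits[0]


def decode_global(bits):
    # Assumed "global" rule: the action is the parity of all measured bits.
    return sum(bits) % 2


n_qubits = 3
bitstrings = list(product((0, 1), repeat=n_qubits))

# Hand-written stand-in for a measured distribution: uniform over all bitstrings.
uniform = {b: 1.0 / len(bitstrings) for b in bitstrings}
local_policy = policy_from_bitstring_probs(uniform, decode_local, 2)
global_policy = policy_from_bitstring_probs(uniform, decode_global, 2)

# A distribution concentrated on a single bitstring shows the two rules can disagree.
peaked = {(1, 1, 0): 1.0}
peaked_local = policy_from_bitstring_probs(peaked, decode_local, 2)
peaked_global = policy_from_bitstring_probs(peaked, decode_global, 2)
```

Both rules partition the same set of bitstrings into actions, so the only extra classical cost in this sketch is one pass over the measured outcomes. Which partition works best is the question the paper's quality measure is meant to address; this sketch does not compute that measure.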
