Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kunal Sharma

Shallow-circuit Supervised Learning on a Quantum Processor

Jan 06, 2026

Luca Candelori, Swarnadeep Majumder, Antonio Mezzacapo, Javier Robledo Moreno, Kharen Musaelian, Santhanam Nagarajan, Sunil Pinnamaneni, Kunal Sharma, Dario Villani

Abstract:Quantum computing has long promised transformative advances in data analysis, yet practical quantum machine learning has remained elusive due to fundamental obstacles such as a steep quantum cost for the loading of classical data and poor trainability of many quantum machine learning algorithms designed for near-term quantum hardware. In this work, we show that one can overcome these obstacles by using a linear Hamiltonian-based machine learning method which provides a compact quantum representation of classical data via ground state problems for k-local Hamiltonians. We use the recent sample-based Krylov quantum diagonalization method to compute low-energy states of the data Hamiltonians, whose parameters are trained to express classical datasets through local gradients. We demonstrate the efficacy and scalability of the methods by performing experiments on benchmark datasets using up to 50 qubits of an IBM Heron quantum processor.

Via

Access Paper or Ask Questions

A Review of Barren Plateaus in Variational Quantum Computing

May 01, 2024

Martin Larocca, Supanut Thanasilp, Samson Wang, Kunal Sharma, Jacob Biamonte, Patrick J. Coles, Lukasz Cincio, Jarrod R. McClean, Zoë Holmes, M. Cerezo

Abstract:Variational quantum computing offers a flexible computational paradigm with applications in diverse areas. However, a key obstacle to realizing their potential is the Barren Plateau (BP) phenomenon. When a model exhibits a BP, its parameter optimization landscape becomes exponentially flat and featureless as the problem size increases. Importantly, all the moving pieces of an algorithm -- choices of ansatz, initial state, observable, loss function and hardware noise -- can lead to BPs when ill-suited. Due to the significant impact of BPs on trainability, researchers have dedicated considerable effort to develop theoretical and heuristic methods to understand and mitigate their effects. As a result, the study of BPs has become a thriving area of research, influencing and cross-fertilizing other fields such as quantum optimal control, tensor networks, and learning theory. This article provides a comprehensive review of the current understanding of the BP phenomenon.

* 21 pages, 10 boxes

Via

Access Paper or Ask Questions

An analytic theory for the dynamics of wide quantum neural networks

Mar 30, 2022

Junyu Liu, Khadijeh Najafi, Kunal Sharma, Francesco Tacchino, Liang Jiang, Antonio Mezzacapo

Figure 1 for An analytic theory for the dynamics of wide quantum neural networks

Figure 2 for An analytic theory for the dynamics of wide quantum neural networks

Figure 3 for An analytic theory for the dynamics of wide quantum neural networks

Figure 4 for An analytic theory for the dynamics of wide quantum neural networks

Abstract:Parametrized quantum circuits can be used as quantum neural networks and have the potential to outperform their classical counterparts when trained for addressing learning problems. To date, much of the results on their performance on practical problems are heuristic in nature. In particular, the convergence rate for the training of quantum neural networks is not fully understood. Here, we analyze the dynamics of gradient descent for the training error of a class of variational quantum machine learning models. We define wide quantum neural networks as parameterized quantum circuits in the limit of a large number of qubits and variational parameters. We then find a simple analytic formula that captures the average behavior of their loss function and discuss the consequences of our findings. For example, for random quantum circuits, we predict and characterize an exponential decay of the residual training error as a function of the parameters of the system. We finally validate our analytic results with numerical experiments.

* 26 pages, 5 figures. Comments welcome

Via

Access Paper or Ask Questions

Generalization in quantum machine learning from few training data

Nov 09, 2021

Matthias C. Caro, Hsin-Yuan Huang, M. Cerezo, Kunal Sharma, Andrew Sornborger, Lukasz Cincio, Patrick J. Coles

Figure 1 for Generalization in quantum machine learning from few training data

Figure 2 for Generalization in quantum machine learning from few training data

Figure 3 for Generalization in quantum machine learning from few training data

Figure 4 for Generalization in quantum machine learning from few training data

Abstract:Modern quantum machine learning (QML) methods involve variationally optimizing a parameterized quantum circuit on a training data set, and subsequently making predictions on a testing data set (i.e., generalizing). In this work, we provide a comprehensive study of generalization performance in QML after training on a limited number $N$ of training data points. We show that the generalization error of a quantum machine learning model with $T$ trainable gates scales at worst as $\sqrt{T/N}$. When only $K \ll T$ gates have undergone substantial change in the optimization process, we prove that the generalization error improves to $\sqrt{K / N}$. Our results imply that the compiling of unitaries into a polynomial number of native gates, a crucial application for the quantum computing industry that typically uses exponential-size training data, can be sped up significantly. We also show that classification of quantum states across a phase transition with a quantum convolutional neural network requires only a very small training data set. Other potential applications include learning quantum error correcting codes or quantum dynamical simulation. Our work injects new hope into the field of QML, as good generalization is guaranteed from few training data.

* 14+25 pages, 4+1 figures

Via

Access Paper or Ask Questions

Connecting ansatz expressibility to gradient magnitudes and barren plateaus

Jan 06, 2021

Zoë Holmes, Kunal Sharma, M. Cerezo, Patrick J. Coles

Figure 1 for Connecting ansatz expressibility to gradient magnitudes and barren plateaus

Figure 2 for Connecting ansatz expressibility to gradient magnitudes and barren plateaus

Figure 3 for Connecting ansatz expressibility to gradient magnitudes and barren plateaus

Figure 4 for Connecting ansatz expressibility to gradient magnitudes and barren plateaus

Abstract:Parameterized quantum circuits serve as ans\"{a}tze for solving variational problems and provide a flexible paradigm for programming near-term quantum computers. Ideally, such ans\"{a}tze should be highly expressive so that a close approximation of the desired solution can be accessed. On the other hand, the ansatz must also have sufficiently large gradients to allow for training. Here, we derive a fundamental relationship between these two essential properties: expressibility and trainability. This is done by extending the well established barren plateau phenomenon, which holds for ans\"{a}tze that form exact 2-designs, to arbitrary ans\"{a}tze. Specifically, we calculate the variance in the cost gradient in terms of the expressibility of the ansatz, as measured by its distance from being a 2-design. Our resulting bounds indicate that highly expressive ans\"{a}tze exhibit flatter cost landscapes and therefore will be harder to train. Furthermore, we provide numerics illustrating the effect of expressiblity on gradient scalings, and we discuss the implications for designing strategies to avoid barren plateaus.

* Main text: 10 pages, 4 figures. Appendices: 10 pages, 2 figures

Via

Access Paper or Ask Questions

Noise-Induced Barren Plateaus in Variational Quantum Algorithms

Jul 28, 2020

Samson Wang, Enrico Fontana, M. Cerezo, Kunal Sharma, Akira Sone, Lukasz Cincio, Patrick J. Coles

Figure 1 for Noise-Induced Barren Plateaus in Variational Quantum Algorithms

Figure 2 for Noise-Induced Barren Plateaus in Variational Quantum Algorithms

Figure 3 for Noise-Induced Barren Plateaus in Variational Quantum Algorithms

Figure 4 for Noise-Induced Barren Plateaus in Variational Quantum Algorithms

Abstract:Variational Quantum Algorithms (VQAs) may be a path to quantum advantage on Noisy Intermediate-Scale Quantum (NISQ) computers. A natural question is whether the noise on NISQ devices places any fundamental limitations on the performance of VQAs. In this work, we rigorously prove a serious limitation for noisy VQAs, in that the noise causes the training landscape to have a barren plateau (i.e., vanishing gradient). Specifically, for the local Pauli noise considered, we prove that the gradient vanishes exponentially in the number of layers $L$. This implies exponential decay in the number of qubits $n$ when $L$ scales as $\operatorname{poly}(n)$, for sufficiently large coefficients in the polynomial. These noise-induced barren plateaus (NIBPs) are conceptually different from noise-free barren plateaus, which are linked to random parameter initialization. Our result is formulated for an abstract ansatz that includes as special cases the Quantum Alternating Operator Ansatz (QAOA) and the Unitary Coupled Cluster Ansatz, among others. In the case of the QAOA, we implement numerical heuristics that confirm the NIBP phenomenon for a realistic hardware noise model.

* 9+11 pages, 4+1 figures

Via

Access Paper or Ask Questions

Reformulation of the No-Free-Lunch Theorem for Entangled Data Sets

Jul 09, 2020

Kunal Sharma, M. Cerezo, Zoë Holmes, Lukasz Cincio, Andrew Sornborger, Patrick J. Coles

Figure 1 for Reformulation of the No-Free-Lunch Theorem for Entangled Data Sets

Figure 2 for Reformulation of the No-Free-Lunch Theorem for Entangled Data Sets

Figure 3 for Reformulation of the No-Free-Lunch Theorem for Entangled Data Sets

Figure 4 for Reformulation of the No-Free-Lunch Theorem for Entangled Data Sets

Abstract:The No-Free-Lunch (NFL) theorem is a celebrated result in learning theory that limits one's ability to learn a function with a training data set. With the recent rise of quantum machine learning, it is natural to ask whether there is a quantum analog of the NFL theorem, which would restrict a quantum computer's ability to learn a unitary process (the quantum analog of a function) with quantum training data. However, in the quantum setting, the training data can possess entanglement, a strong correlation with no classical analog. In this work, we show that entangled data sets lead to an apparent violation of the (classical) NFL theorem. This motivates a reformulation that accounts for the degree of entanglement in the training set. As our main result, we prove a quantum NFL theorem whereby the fundamental limit on the learnability of a unitary is reduced by entanglement. We employ Rigetti's quantum computer to test both the classical and quantum NFL theorems. Our work establishes that entanglement is a commodity in quantum machine learning.

* 7+13 pages, 4+2 figures

Via

Access Paper or Ask Questions

Trainability of Dissipative Perceptron-Based Quantum Neural Networks

May 26, 2020

Kunal Sharma, M. Cerezo, Lukasz Cincio, Patrick J. Coles

Figure 1 for Trainability of Dissipative Perceptron-Based Quantum Neural Networks

Figure 2 for Trainability of Dissipative Perceptron-Based Quantum Neural Networks

Figure 3 for Trainability of Dissipative Perceptron-Based Quantum Neural Networks

Abstract:Several architectures have been proposed for quantum neural networks (QNNs), with the goal of efficiently performing machine learning tasks on quantum data. Rigorous scaling results are urgently needed for specific QNN constructions to understand which, if any, will be trainable at a large scale. Here, we analyze the gradient scaling (and hence the trainability) for a recently proposed architecture that we called dissipative QNNs (DQNNs), where the input qubits of each layer are discarded at the layer's output. We find that DQNNs can exhibit barren plateaus, i.e., gradients that vanish exponentially in the number of qubits. Moreover, we provide quantitative bounds on the scaling of the gradient for DQNNs under different conditions, such as different cost functions and circuit depths, and show that trainability is not always guaranteed.

* 5 + 18 pages, 3 figures

Via

Access Paper or Ask Questions