Kernel Descent¶
Implements Kernel Descent, an optimizer for variational quantum algorithms that minimizes via reproducing-kernel-Hilbert-space local approximations.
The objective is the expectation \(f(\theta) = \langle \psi(\theta) | M | \psi(\theta) \rangle\) of an observable \(M\) over the state produced by a parametrized circuit. Such functions lie in a finite-dimensional reproducing kernel Hilbert space \(H\) of trigonometric polynomials, with kernel
At each step, kernel descent evaluates \(f\) on a grid around the current point \(\theta_t\) and assembles a local surrogate \(\tilde{f}_t\) in \(H\) that matches \(f\) exactly along low-dimensional coordinate subspaces. A hyperparameter \(L\) (\(1 \le L \le m\)) sets the order: the evaluation offsets \(q_1, \dots, q_D\) are the distinct points of \(\{-\tfrac{2\pi}{3}, 0, \tfrac{2\pi}{3}\}^m\) with at most \(L\) nonzero entries. Because that grid makes the kernel Gram matrix the identity, the surrogate has a closed form with no linear solve, and the next iterate comes from classically minimizing it.
where \(\theta\) are the circuit parameters, \(\theta_t\) the iterate at step \(t\), \(f\) the expectation-value objective, \(\tilde{K}\) the RKHS kernel, \(q_1, \dots, q_D \in \{-\tfrac{2\pi}{3}, 0, \tfrac{2\pi}{3}\}^m\) the parameter-shift offsets with at most \(L\) nonzero entries, \(D = \sum_{k=0}^{L} 2^k \binom{m}{k}\) the number of circuit evaluations per iteration, and \(L\) the approximation order.
Reference: Lars Simon, Holger Eble, Manuel Radons, "Introducing the Kernel Descent Optimizer for Variational Quantum Algorithms", Scientific Reports 2025. https://arxiv.org/abs/2409.10257