AClipped-dpSGD¶
Implements AClipped-dpSGD, differentially private SGD for heavy-tailed data using one-time clipping with output averaging.
For stochastic convex optimization on heavy-tailed data, each noisy stochastic gradient is clipped a single time per step to bound its sensitivity, calibrated Gaussian noise is added for \((\epsilon,\delta)\)-differential privacy, and a plain SGD step is taken. The estimator returned is the uniform average of all iterates, which together with the one-time clipping (rather than repeated multi-pass clipping) yields an efficient, privacy-calibrated method with sharp convergence rates.
where \(g_t\) is the stochastic gradient, \(\lambda\) the clipping threshold, \(\gamma\) the step size, \(T\) the number of iterations, \(n\) the dataset size, \(m\) the mini-batch size, \((\epsilon,\delta)\) the privacy budget, \(c\) a constant from the privacy analysis, \(I_d\) the \(d\)-dimensional identity, and \(\bar\theta\) the averaged output. In the constrained case the update step is followed by a projection onto the feasible set \(\mathcal{X}\).
Reference: Chenhan Jin, Kaiwen Zhou, Bo Han, Ming-Chang Yang, James Cheng, "Efficient Private SCO for Heavy-Tailed Data via Averaged Clipping", Machine Learning 2024. https://arxiv.org/abs/2206.13011