Hyperparameter-free DP optimization (GeN-DP)¶
Implements HyFreeDP, a hyperparameter-free differentially private optimizer that sets its own learning rate via a privatized generalized-Newton step.
Standard DP training fixes a learning rate schedule by grid search, which is costly and itself leaks information through the data-dependent choice. HyFreeDP removes this by combining automatic per-sample gradient clipping with the generalized Newton (GeN) learning rate, adapted to DP. Each step privatizes both the gradient direction \(m_t\) and a few loss probes along that direction; fitting a quadratic to the probed losses yields a privatized slope and curvature whose ratio is the learning rate. The direction can be passed through any base optimizer (momentum, Adam-style preconditioning, weight decay) as DP post-processing.
where \(g_i,L_i\) are the per-sample gradient and loss, \(R_g,R_l\) the gradient and loss clipping norms, \(\sigma_g,\sigma_l\) the noise multipliers, \(B\) the batch size, and \(z_g,z_l\) Gaussian DP noise; \(\tilde{L}\) is the privatized loss probed at offsets \(\eta_j\) along \(-m_t\), and the least-squares fit returns the privatized directional slope \(b=(G^{\top}m)_{\mathrm{DP}}\) and curvature \(a=(m^{\top}Hm)_{\mathrm{DP}}\), giving the generalized-Newton step \(\eta_t=b/a\).
Reference: Zhiqi Bu, Ruixuan Liu, "Towards hyperparameter-free optimization with differential privacy", ICLR 2025. https://arxiv.org/abs/2503.00703