DP-PCA: Statistically Optimal and Differentially Private PCA

Xiyang Liu; Weihao Kong; Prateek Jain; Sewoong Oh

Back to NeurIPS

NeurIPS 2022

DP-PCA: Statistically Optimal and Differentially Private PCA

Conference Paper Main Conference Track Artificial Intelligence · Machine Learning

PDF Details

Abstract

We study the canonical statistical task of computing the principal component from i. i. d. ~data under differential privacy. Although extensively studied in literature, existing solutions fall short on two key aspects: ($i$) even for Gaussian data, existing private algorithms require the number of samples $n$ to scale super-linearly with $d$, i. e. , $n=\Omega(d^{3/2})$, to obtain non-trivial results while non-private PCA requires only $n=O(d)$, and ($ii$) existing techniques suffer from a large error even when the variance in each data point is small. We propose DP-PCA method that uses a single-pass minibatch gradient descent style algorithm to overcome the above limitations. For sub-Gaussian data, we provide nearly optimal statistical error rates even for $n=O(d \log d)$.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue: Annual Conference on Neural Information Processing Systems
Archive span: 1987-2025
Indexed papers: 30776
Paper id: 548735637311204272