ε-Samples for Kernels

Jeff M. Phillips

SODA 2013

Conference Paper Session 8C Algorithms and Complexity · Theoretical Computer Science

Details

Abstract

We study the worst-case error of kernel density estimates via subset approximation. A kernel density estimate of a distribution is the convolution of that distribution with a fixed kernel (e.g., a Gaussian kernel). Given a subset (i.e., a point set) of the input distribution, we can compare the kernel density estimate of the input distribution with that of the subset and bound the worst-case error. If the maximum error is ε, then this subset can be thought of as an ε-sample (aka an ε-approximation) of the range space defined with the input distribution as the ground set and the fixed kernel representing the family of ranges. Interestingly, in this case the ranges are not binary, but have a continuous range (for simplicity we focus on kernels with range of [0, 1]); these allow for smoother notions of range spaces. It turns out that the use of this smoother family of range spaces has the added benefit of greatly decreasing the size required for ε-samples. For instance, in the plane the size is $O((1/\varepsilon^{4/3}) \log^{2/3}(1/\varepsilon))$ for disks (based on VC-dimension arguments) but is only $O((1/\varepsilon) \sqrt{\log(1/\varepsilon)})$ for Gaussian kernels and for kernels with bounded slope that only affect a bounded domain. These bounds are accomplished by studying the discrepancy of these "kernel" range spaces, and here the improvement in bounds is even more pronounced. In the plane, we show the discrepancy is $O(\sqrt{\log n})$ for these kernels, whereas for balls there is a lower bound of $\Omega(n^{1/4})$.
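
The error measure described in the abstract can be illustrated with a small sketch. It is not taken from the paper: the function names, the bandwidth parameter `sigma`, the averaged form of the kernel density estimate, and the use of a uniform random subset are all assumptions made for illustration. The paper's bounds come from discrepancy-based constructions, not from random sampling, and the maximum below is taken over a finite query grid, so it only gives a lower bound on the true worst-case error.

```python
import math
import random


def gaussian_kernel(p, q, sigma=1.0):
    """Gaussian kernel with values in (0, 1]; equals 1 when p == q."""
    d2 = sum((a - b) ** 2 for a, b in zip(p, q))
    return math.exp(-d2 / (2.0 * sigma ** 2))


def kde(points, x, sigma=1.0):
    """Kernel density estimate at x, taken here as the average kernel value."""
    return sum(gaussian_kernel(p, x, sigma) for p in points) / len(points)


def max_kde_error(points, subset, queries, sigma=1.0):
    """Largest |kde_points(x) - kde_subset(x)| over the given query points.

    The true worst-case error is a supremum over all x, so a finite set of
    queries can only underestimate it.
    """
    return max(
        abs(kde(points, x, sigma) - kde(subset, x, sigma)) for x in queries
    )


rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical input: 2000 points in the plane drawn from a standard Gaussian.
P = [(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in range(2000)]

# Uniform random subset of 200 points (an assumption, not the paper's method).
S = rng.sample(P, 200)

# Query grid over [-3, 3] x [-3, 3] with spacing 0.5.
Q = [(i / 2.0, j / 2.0) for i in range(-6, 7) for j in range(-6, 7)]

err = max_kde_error(P, S, Q)
print(f"estimated max KDE error on the grid: {err:.4f}")
```

If the reported value is at most ε, the subset behaves like an ε-sample on the tested queries; certifying it for all query points would need an argument beyond this numerical check.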

Authors

Jeff M. Phillips

Context

Venue: ACM-SIAM Symposium on Discrete Algorithms
Archive span: 1990-2025
Indexed papers: 4674
Paper id: 56647354006359735