Randomized Composable Core-sets for Distributed Submodular Maximization

Vahab Mirrokni; Morteza Zadimoghaddam

Back to STOC

STOC 2015

Randomized Composable Core-sets for Distributed Submodular Maximization

Conference Paper Session 2B Algorithms and Complexity · Theoretical Computer Science

Details

Abstract

An effective technique for solving optimization problems over massive data sets is to partition the data into smaller pieces, solve the problem on each piece and compute a representative solution from it, and finally obtain a solution inside the union of the representative solutions for all pieces. This technique can be captured via the concept of composable core-sets , and has been recently applied to solve diversity maximization problems as well as several clustering problems [7,15,8]. However, for coverage and submodular maximization problems, impossibility bounds are known for this technique [15]. In this paper, we focus on efficient construction of a randomized variant of composable core-sets where the above idea is applied on a random clustering of the data. We employ this technique for the coverage, monotone and non-monotone submodular maximization problems. Our results significantly improve upon the hardness results for non-randomized core-sets, and imply improved results for submodular maximization in a distributed and streaming settings. The effectiveness of this technique has been confirmed empirically for several machine learning applications [22], and our proof provides a theoretical foundation to this idea.

Randomized Composable Core-sets for Distributed Submodular Maximization

Abstract

Authors

Keywords

Context