Arrow Research search

Author name cluster

Siddharth Bhandari

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

4 papers
2 author rows

Possible papers

4

NeurIPS Conference 2025 Conference Paper

Replicable Online Learning

  • Saba Ahmadi
  • Siddharth Bhandari
  • Avrim Blum

We investigate the concept of algorithmic replicability introduced by Impagliazzo et al. (2022) in an online setting. In our model, the input sequence received by the online learner is generated from time-varying distributions chosen by an adversary (obliviously). Our objective is to design low-regret online algorithms that, with high probability, produce the \emph{exact same sequence} of actions when run on two independently sampled input sequences generated as described above. We refer to such algorithms as adversarially replicable. Previous works explored replicability in the online setting under inputs generated independently from a fixed distribution; we term this notion as iid-replicability. Our model generalizes to capture both adversarial and iid input sequences, as well as their mixtures, which can be modeled by setting certain distributions as point-masses. We demonstrate adversarially replicable online learning algorithms for online linear optimization and the experts problem that achieve sub-linear regret. Additionally, we propose a general framework for converting an online learner into an adversarially replicable one within our setting, bounding the new regret in terms of the original algorithm’s regret. We also present a nearly optimal (in terms of regret) iid-replicable online algorithm for the experts problem, highlighting the distinction between the iid and adversarial notions of replicability. Finally, we establish lower bounds on the regret (in terms of the replicability parameter and time) that any replicable online algorithm must incur.

STOC Conference 2020 Conference Paper

Improved bounds for perfect sampling of k-colorings in graphs

  • Siddharth Bhandari
  • Sayantan Chakraborty 0002

We present a randomized algorithm that takes as input an undirected n -vertex graph G with maximum degree Δ and an integer k > 3Δ, and returns a random proper k -coloring of G . The distribution of the coloring is perfectly uniform over the set of all proper k -colorings; the expected running time of the algorithm is poly ( k , n )= O ( n Δ 2 · log( k )). This improves upon a result of Huber (STOC 1998) who obtained a polynomial time perfect sampling algorithm for k >Δ 2 +2Δ. Prior to our work, no algorithm with expected running time poly ( k , n ) was known to guarantee perfectly sampling with sub-quadratic number of colors in general. Our algorithm (like several other perfect sampling algorithms including Huber’s) is based on the Coupling from the Past method. Inspired by the bounding chain approach, pioneered independently by Huber (STOC 1998) and H'aggstr'om & Nelander (Scand. J. Statist., 1999), we employ a novel bounding chain to derive our result for the graph coloring problem.