NormFit: A Lightweight Solution for Few-Shot Federated Learning with Non-IID Data

Azadeh Motamedi; Jae-Mo Kang; Il-Min Kim

Back to NeurIPS

NeurIPS 2025

NormFit: A Lightweight Solution for Few-Shot Federated Learning with Non-IID Data

Conference Paper Main Conference Track Artificial Intelligence · Machine Learning

PDF Details

Abstract

Vision–Language Models (VLMs) have recently attracted considerable attention in Federated Learning (FL) due to their strong and robust performance. In particular, few-shot adaptation with pre-trained VLMs like CLIP enhances the performance of downstream tasks. However, existing methods still suffer from substantial communication overhead, high local computational demands, and suboptimal performance under non-IID user data. To simultaneously address all those limitations, we propose NormFit, a lightweight solution that selectively fine-tunes only a very small portion of the model parameters, specifically only the Pre-LayerNorm parameters of the vision encoder within a VLM. Overcoming the existing tradeoff between performance and communication/computation efficiency in few-shot FL, NormFit sets a new benchmark by simultaneously achieving superior accuracy and substantially reduced communication and computational demands. Theoretically, we show that NormFit yields a considerably smaller generalization gap compared to tuning all LayerNorm parameters. Importantly, NormFit can function effectively as a standalone solution or integrate seamlessly with existing few-shot fine-tuning methods to further enhance their performance. Notably, NormFit offers implementation simplicity, achieving these improvements without any algorithmic modifications, changes to the underlying model architecture, or the addition of external parameters.

NormFit: A Lightweight Solution for Few-Shot Federated Learning with Non-IID Data

Abstract

Authors

Keywords

Context