Model Editing for Vision Transformers

Xinyi Huang; Kangfei Zhao; Long-Kai Huang

Back to NeurIPS

NeurIPS 2025

Model Editing for Vision Transformers

Conference Paper Main Conference Track Artificial Intelligence · Machine Learning

PDF Details

Abstract

Model editing offers a promising paradigm for efficiently and precisely updating knowledge in pre-trained transformers without costly retraining. While extensively studied in language models (LMs), model editing for vision transformers (ViTs) remains underexplored. Existing methods typically adapt LM-based techniques by modifying the multi-layer perceptron (MLP) modules, overlooking the unique characteristics of ViTs. In this work, we show that ViT predictions are more strongly influenced by the multi-head self-attention (MSA) modules than by the MLPs. Building on this observation, we propose a two-stage framework for editing ViTs. First, we identify which attention heads are most responsible for incorrect predictions. Next, we selectively remove the corresponding features to correct the model’s prediction. To further balance error correction with predictive stability on unrelated data, we learn a projection matrix that refines the image representations. Extensive experiments across multiple real-world datasets and model editing benchmarks demonstrate that our method consistently outperforms existing model editing methods for ViTs, achieving superior generalization and locality. Our code is available at https: //github. com/shanghxy/Model-editing-for-vision-transformers.

Model Editing for Vision Transformers

Abstract

Authors

Keywords

Context