Arrow Research search

Author name cluster

Xiaohui Wang

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

5 papers
2 author rows

Possible papers

5

NeurIPS Conference 2025 Conference Paper

Breaking the Compression Ceiling: Data-Free Pipeline for Ultra-Efficient Delta Compression

  • Xiaohui Wang
  • Peng Ye
  • Chenyu Huang
  • Shenghe Zheng
  • Bo Zhang
  • Lei Bai
  • Wanli Ouyang
  • Tao Chen

With the rise of the fine-tuned–pretrained paradigm, storing numerous fine-tuned models for multi-tasking creates significant storage overhead. Delta compression alleviates this by storing only the pretrained model and the highly compressed delta weights (the differences between fine-tuned and pretrained model weights). However, existing methods fail to maintain both high compression and performance, and often rely on data. To address these challenges, we propose UltraDelta, the first data-free delta compression pipeline that achieves both ultra-high compression and strong performance. UltraDelta is designed to minimize redundancy, maximize information, and stabilize performance across inter-layer, intra-layer, and global dimensions, using three key components: (1) Variance-Based Mixed Sparsity Allocation assigns sparsity based on variance, giving lower sparsity to high-variance layers to preserve inter-layer information. (2) Distribution-Aware Compression applies uniform quantization and then groups parameters by value, followed by group-wise pruning, to better preserve intra-layer distribution. (3) Trace-Norm-Guided Rescaling uses the trace norm of delta weights to estimate a global rescaling factor, improving model stability under higher compression. Extensive experiments across (a) large language models (fine-tuned on LLaMA-2 7B and 13B) with up to 50$\times$ compression, (b) general NLP models (RoBERTa-base, T5-base) with up to 224$\times$ compression, (c) vision models (ViT-B/32, ViT-L/14) with up to 132$\times$ compression, and (d) multi-modal models (BEiT-3) with 18$\times$ compression, demonstrate that UltraDelta consistently outperforms existing methods, especially under ultra-high compression. Code is available at https: //github. com/xiaohuiwang000/UltraDelta.

NeurIPS Conference 2025 Conference Paper

Object Concepts Emerge from Motion

  • Haoqian Liang
  • Xiaohui Wang
  • Zhichao Li
  • Ya Yang
  • Naiyan Wang

Object concepts play a foundational role in human visual cognition, enabling perception, memory, and interaction in the physical world. Inspired by findings in developmental psychology—where infants are shown to acquire object understanding through observation of motion—we propose a biologically inspired framework for learning object-centric visual representations in an unsupervised manner. We were inspired by the insight that motion boundary serves as a strong signal for object-level grouping, which can be used to derive pseudo-instance supervision from raw videos. Concretely, we generate motion-based instance masks using off-the-shelf optical flow and clustering algorithms, and use them to train visual encoders via contrastive learning. Our framework is fully label-free and does not rely on camera calibration, making it scalable to large-scale unstructured video data. We evaluate our approach on three downstream tasks spanning both low-level (monocular depth estimation) and high-level (3D object detection and occupancy prediction) vision. Our models outperform previous supervised and self-supervised baselines and demonstrate strong generalization to unseen scenes. These results suggest that motion-induced object representations offer a compelling alternative to existing vision foundation models, capturing a crucial but overlooked level of abstraction: the visual instance. The implementation can be found here: https: //github. com/yulemao/Object Concepts Emerge from Motion

ECAI Conference 2020 Conference Paper

An Efficient Agreement Mechanism in CapsNets by Pairwise Product

  • Lei Zhao
  • Xiaohui Wang
  • Lei Huang 0015

Capsule networks (CapsNets) are capable of modeling visual hierarchical relationships, which is achieved by the “routing-by-agreement” mechanism. This paper proposes a pairwise agreement mechanism to build capsules, inspired by the feature interactions of factorization machines (FMs). The proposed method has a much lower computation complexity. We further proposed a new CapsNet architecture that combines the strengths of residual networks in representing low-level visual features and CapsNets in modeling the relationships of parts to wholes. We conduct comprehensive experiments to compare the routing algorithms, including dynamic routing, EM routing, and our proposed FM agreement, based on both architectures of original CapsNet and our proposed one, and the results show that our method achieves both excellent performance and efficiency under a variety of situations.

IROS Conference 2006 Conference Paper

Study on Kinematics Decoupling for Parallel Manipulator with Perpendicular Structures

  • Jianjun Zhang 0003
  • Weimin Li
  • Xiaohui Wang
  • Feng Gao 0011

Kinematics decoupling characteristics (KDCs) for parallel manipulators simplify the kinematics models, and make them easier in calibration and control. This paper focuses on KDCs for parallel manipulators which are described in detail. The relationship between input and output of a system with KDCs is discussed and the characteristics of its transfer matrix are educed. Then, the kinematics of two parallel manipulators, a 6-PSS parallel micro-manipulator with perpendicular structures (PMMWPS) and a 6-PP+S parallel manipulator with perpendicular structures (PMWPS), is analyzed. Also, the kinematics models are obtained. In the course of the analysis, conditional decoupling is defined. The results show that the two proposed PMWPSs have KDCs. The KDCs for PMWPSs would offer a new idea in the area of parallel manipulators.