Arrow Research search

Author name cluster

Sen Wang

Papers possibly associated with this exact author name in Arrow. This page groups case-insensitive exact name matches; it is not a full identity-disambiguation profile.

31 papers
2 author rows

Possible papers

31

EAAI Journal 2026 Journal Article

A printed circuit board surface defect detection method for long-tail and multi-scale scenarios

  • Xuangang Li
  • Sen Wang
  • Liying Zhu
  • Aiping Shen
  • Dianlu Hu

Surface defects on Printed Circuit Boards (PCBs) in industrial production occur randomly, with uneven category distributions, variable scales, and minute dimensions, increasing the difficulty of quality inspection. To achieve multi-scale defect detection on PCB surfaces in long-tail, small-target scenarios, the Long-Tail Dynamic Multi-Scale Printed Circuit Board (LDM-PCB) detection approach proposed in this paper employs the Long-Tail Feature Extraction Network (LTFE-Net) as the backbone, enhances the representation of tail defects through the Adaptive Tail Attention (ATA) module, improves the model's ability to quickly capture low-frequency defect features, and effectively addresses the imbalance in feature learning under long-tail data distributions. The Dynamic Multi-Scale Fusion (DMS-Fuse) architecture dynamically adjusts feature-fusion weights for defects of varying sizes through adaptive weighting strategies, enabling feature interaction across scales. A dynamic prediction layer is designed to preserve high-resolution defect features, directly outputting dynamic information to mitigate detail degradation in deep networks and improve localization accuracy for subtle defects. On a self-built long-tail defect dataset, LDM-PCB achieves 99.1% mean Average Precision at an Intersection-over-Union threshold of 0.5 (mAP0.5) with only 8.61 million (M) parameters, surpassing baseline models by 1.8 percentage points. The detection speed reaches 100 frames per second (FPS), balancing accuracy and speed with results superior to other algorithms. Generalization experiments on public PCB datasets further demonstrate the optimal performance of LDM-PCB, and deployment results on edge devices indicate industrial deployment potential.

EAAI Journal 2026 Journal Article

Zero-velocity update-aided navigation method for miniature quadruped robot based on adapted virtual inertial measurement unit

  • Siwei Tang
  • Weixing Qian
  • Sen Wang
  • Feng Yang
  • Xinyuan Wang
  • Weinan Gao
  • Pengyu Liu

Addressing the challenges associated with installing inertial measurement units (IMUs) on the feet of miniature quadruped robots, this paper proposes a zero-velocity update (ZUPT) method based on an adaptive virtual inertial measurement unit (VIMU). This approach eliminates the reliance of existing ZUPT methods for inertial navigation systems on foot-mounted IMUs and gait recognition. Using the IMU outputs from the legs and feet of a quadruped robot as the training dataset, an innovative Convolutional Neural Network (CNN)-Bidirectional Gated Recurrent Unit (BiGRU)-Attention hybrid network is constructed to establish a nonlinear mapping between the multiple IMUs. In practical applications, the foot-mounted VIMU can be generated solely from the leg-mounted IMU data, and the corrected navigation parameters are then output through a ZUPT algorithm to achieve accurate positioning of the quadruped robot. Experimental results demonstrate that the positioning error of this method is about 1.34% of the total path under diverse terrain conditions, including slopes, stairs, and grasslands, outperforming gait-recognition-dependent methods in accuracy. This approach effectively implements the inertial navigation function of quadruped robots and enhances the adaptability of the ZUPT method to unstructured and unknown terrains. It has great potential to improve the Global Navigation Satellite System (GNSS)-denied positioning performance of quadruped robots in complex environments without the assistance of visual sensors or Light Detection and Ranging (LiDAR).

EAAI Journal 2025 Journal Article

A lightweight vision transformer with embedded hybrid attention for quick response code defect classification

  • Dianlu Hu
  • Lun Zhao
  • Yu Ren
  • Sen Wang
  • Xuanlin Ye
  • Haohan Zhang
  • Changqing Peng

Quick Response (QR) code label printing quality is crucial to product quality control. Automated visual inspection faces challenges due to the limited number of defect samples, unclear features, and the need to detect a large number of labels in real time. For efficient and accurate automated visual defect recognition in printed QR code production, we propose a lightweight Vision Transformer network, the Vision Transformer with Embedded Hybrid Attention (ViT-EHA). First, the Mixed Depthwise Convolution Block (MDConvBlock) is introduced to capture QR code defect details and feature information while also reducing the number of model parameters and the computational cost. Furthermore, the LeAttention-Local Convolution-Multilayer Perceptron (LeALCM) module is proposed to enhance the model's ability to capture global information and improve the recognition of minor defects. Finally, a hybrid attention (HA) module is integrated to enhance the processing of low-level image features and strengthen the interplay between shallow and deep features. Experimental results verify the validity and generalization of the model: the proposed ViT-EHA achieves an accuracy of 99.00% with a parameter count of 4.198 million (M) on the self-constructed dataset Code-10 (QR Code Dataset with 10 Classes), and accuracies of 98.33% and 97.73% on the public datasets NEU-CLS (Northeastern University Classification Dataset) and NEU-CLS-64 (Northeastern University Classification Dataset with 64 × 64 images), respectively.

IROS Conference 2025 Conference Paper

An Inflatable Deployable Origami Grasper for Adaptive and High-Load Grasping

  • Peng Yan
  • Guang Liang
  • Sen Wang
  • Hailin Huang
  • Wei Wang
  • Xu Li
  • Bing Li

Robotic graspers are essential for enhancing the efficiency and versatility of robots in grasping tasks. In this paper, we propose a novel inflatable deployable origami grasper with a rigid-flexible coupling structure. The proposed grasper can achieve multiple deployment configurations under a single pneumatic actuation, enabling both deployment and grasping operations while also allowing for passive self-folding during deflation. The design and fabrication of the grasper are presented. Then, the stiffness model for the inflatable deployable origami unit is developed based on the equivalent truss method. Experimental results show that the grasper successfully grasps objects of various shapes and sizes in both enveloping and fingertip grasping modes, using either two or four fingers. With its simple mechanical system and high deploy/fold ratio, the proposed grasper holds significant potential for applications in industrial automation and space exploration.

NeurIPS Conference 2025 Conference Paper

DynaRend: Learning 3D Dynamics via Masked Future Rendering for Robotic Manipulation

  • Jingyi Tian
  • Le Wang
  • Sanping Zhou
  • Sen Wang
  • Gang Hua

Learning generalizable robotic manipulation policies remains a key challenge due to the scarcity of diverse real-world training data. While recent approaches have attempted to mitigate this through self-supervised representation learning, most either rely on 2D vision pretraining paradigms such as masked image modeling, which primarily focus on static semantics or scene geometry, or utilize large-scale video prediction models that emphasize 2D dynamics, thus failing to jointly learn the geometry, semantics, and dynamics required for effective manipulation. In this paper, we present DynaRend, a representation learning framework that learns 3D-aware and dynamics-informed triplane features via masked reconstruction and future prediction using differentiable volumetric rendering. By pretraining on multi-view RGB-D video data, DynaRend jointly captures spatial geometry, future dynamics, and task semantics in a unified triplane representation. The learned representations can be effectively transferred to downstream robotic manipulation tasks via action value map prediction. We evaluate DynaRend on two challenging benchmarks, RLBench and Colosseum, as well as in real-world robotic experiments, demonstrating substantial improvements in policy success rate, generalization to environmental perturbations, and real-world applicability across diverse manipulation tasks.

IJCAI Conference 2025 Conference Paper

Multimodal Retina Image Analysis Survey: Datasets, Tasks and Methods

  • Hongwei Sheng
  • Heming Du
  • Xin Shen
  • Sen Wang
  • Xin Yu

Retina images provide a noninvasive view of the central nervous system and microvasculature, making them essential for clinical applications. Changes in the retina often indicate both ophthalmic and systemic diseases, aiding in diagnosis and early intervention. While deep learning algorithms have advanced retina image analysis, a comprehensive review of related datasets, tasks, and benchmarking is still lacking. In this survey, we systematically categorize existing retina image datasets based on their available data modalities, and review the tasks these datasets support in multimodal retina image analysis. We also explain key evaluation metrics used in various retina image analysis benchmarks. By thoroughly examining current datasets and methods, we highlight the challenges and limitations in existing benchmarks and discuss potential research topics in the field. We hope this work will guide future retina analysis methods and promote the shared use of existing data across different tasks.

NeurIPS Conference 2025 Conference Paper

SAMPO: Scale-wise Autoregression with Motion Prompt for Generative World Models

  • Sen Wang
  • Jingyi Tian
  • Le Wang
  • Zhimin Liao
  • Huaiyi Dong
  • Kun Xia
  • Sanping Zhou
  • Wei Tang

World models allow agents to simulate the consequences of actions in imagined environments for planning, control, and long-horizon decision-making. However, existing autoregressive world models struggle with visually coherent predictions due to disrupted spatial structure, inefficient decoding, and inadequate motion modeling. In response, we propose Scale-wise Autoregression with Motion PrOmpt (SAMPO), a hybrid framework that combines visual autoregressive modeling for intra-frame generation with causal modeling for next-frame generation. Specifically, SAMPO integrates temporal causal decoding with bidirectional spatial attention, which preserves spatial locality and supports parallel decoding within each scale. This design significantly enhances both temporal consistency and rollout efficiency. To further improve dynamic scene understanding, we devise an asymmetric multi-scale tokenizer that preserves spatial details in observed frames and extracts compact dynamic representations for future frames, optimizing both memory usage and model performance. Additionally, we introduce a trajectory-aware motion prompt module that injects spatiotemporal cues about object and robot trajectories, focusing attention on dynamic regions and improving temporal consistency and physical realism. Extensive experiments show that SAMPO achieves competitive performance in action-conditioned video prediction and model-based control, improving generation quality with 4.4× faster inference. We also evaluate SAMPO's zero-shot generalization and scaling behavior, demonstrating its ability to generalize to unseen tasks and benefit from larger model sizes.

EAAI Journal 2025 Journal Article

Subtle Defect Detection Network: More accurately detect subtle defects on the Printed Circuit Board surface

  • Liying Zhu
  • Sen Wang
  • Mingfang Chen
  • Yang Zhu
  • Kaizhe Xing
  • Aiping Shen

Printed circuit boards (PCBs) are the hardware foundation of large-scale integrated circuits, where surface quality inspection plays a critical role in manufacturing reliability. We propose a subtle defect detection network (SDD-Net) to address the difficulties of PCB surface defect detection, such as complex backgrounds, difficulty in distinguishing foreground from background, and the random shape, area, and position of defects. A lightweight receptive field augmentation network (LRFA-Net) is proposed as the backbone; it effectively augments the receptive field, reduces parameters, and enhances feature extraction. A more lightweight multi-scale feature and coordinate information interaction mechanism is designed to enhance the network's capacity to discern small targets against complex backgrounds. The combination of Varifocal Loss and Complete Intersection over Union (CIoU) Loss addresses the foreground-background distinction and adapts to PCB surface defects with variable shapes and positions. A lightweight omni-dimensional dynamic convolutional prediction head (OD-Head) introduces multi-dimensional attention to effectively perceive small defects on the PCB surface. Compared with other algorithms, SDD-Net achieves a mean average precision (mAP0.5) of 99.6% on the PCB Defect Augmented dataset at a detection speed of 53 frames per second, balancing accuracy and speed with results better than other algorithms. SDD-Net is also experimentally verified on a real PCB surface welding defect dataset, where it likewise achieves the best detection performance.

NeurIPS Conference 2025 Conference Paper

When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions

  • Zhuo Cao
  • Heming Du
  • Bingqing Zhang
  • Xin Yu
  • Xue Li
  • Sen Wang

Existing moment retrieval (MR) methods focus on Single-Moment Retrieval (SMR). However, in real-world applications one query can correspond to multiple relevant moments, making existing datasets and methods insufficient for video temporal grounding. Revisiting the gap between current MR tasks and real-world applications, we introduce a high-quality dataset called the QVHighlights Multi-Moment Dataset (QV-M$^2$), along with new evaluation metrics tailored for multi-moment retrieval (MMR). QV-M$^2$ consists of 2,212 annotations covering 6,384 video segments. Building on existing efforts in MMR, we propose a framework called FlashMMR. Specifically, we propose a multi-moment post-verification module to refine moment boundaries: we introduce constrained temporal adjustment and subsequently leverage a verification module to re-evaluate the candidate segments. Through this filtering pipeline, low-confidence proposals are pruned and robust multi-moment alignment is achieved. We retrain and evaluate 6 existing MR methods on QV-M$^2$ and QVHighlights under both SMR and MMR settings. Results show that QV-M$^2$ serves as an effective benchmark for training and evaluating MMR models, while FlashMMR provides a strong baseline. Specifically, on QV-M$^2$ it improves over the prior SOTA method by 3.00% on G-mAP, 2.70% on mAP@3+tgt, and 2.56% on mR@3. The proposed benchmark and method establish a foundation for advancing research in more realistic and challenging video temporal grounding scenarios. Code is released at https://github.com/Zhuo-Cao/QV-M2.

IJCAI Conference 2024 Conference Paper

Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts

  • Haodong Hong
  • Sen Wang
  • Zi Huang
  • Qi Wu
  • Jiajun Liu

Current Vision-and-Language Navigation (VLN) tasks mainly employ textual instructions to guide agents. However, being inherently abstract, the same textual instruction can be associated with different visual signals, causing severe ambiguity and limiting the transfer of prior knowledge in the vision domain from the user to the agent. To fill this gap, we propose Vision-and-Language Navigation with Multi-modal Prompts (VLN-MP), a novel task augmenting traditional VLN by integrating both natural language and images in instructions. VLN-MP not only maintains backward compatibility by effectively handling text-only prompts but also consistently shows advantages with different quantities and relevance of visual prompts. Possible forms of visual prompts include both exact and similar object images, providing adaptability and versatility in diverse navigation scenarios. To evaluate VLN-MP under a unified framework, we implement a new benchmark that offers: (1) a training-free pipeline to transform textual instructions into multi-modal forms with landmark images; (2) diverse datasets with multi-modal instructions for different downstream tasks; (3) a novel module designed to process various image prompts for seamless integration with state-of-the-art VLN models. Extensive experiments on four VLN benchmarks (R2R, RxR, REVERIE, CVDN) show that incorporating visual prompts significantly boosts navigation performance. While maintaining efficiency with text-only prompts, VLN-MP enables agents to navigate in the pre-explore setting and outperform text-based models, showing its broader applicability. Code is available at https://github.com/honghd16/VLN-MP.

ICRA Conference 2023 Conference Paper

BAMF-SLAM: Bundle Adjusted Multi-Fisheye Visual-Inertial SLAM Using Recurrent Field Transforms

  • Wei Zhang 0334
  • Sen Wang
  • Xingliang Dong
  • Rongwei Guo
  • Norbert Haala

In this paper, we present BAMF-SLAM, a novel multi-fisheye visual-inertial SLAM system that utilizes Bundle Adjustment (BA) and recurrent field transforms (RFT) to achieve accurate and robust state estimation in challenging scenarios. First, our system directly operates on raw fisheye images, enabling us to fully exploit the wide Field-of-View (FoV) of fisheye cameras. Second, to overcome the low-texture challenge, we explore the tightly-coupled integration of multi-camera inputs and complementary inertial measurements via a unified factor graph and jointly optimize the poses and dense depth maps. Third, for global consistency, the wide FoV of the fisheye camera allows the system to find more potential loop closures, and powered by the broad convergence basin of RFT, our system can perform very wide baseline loop closing with little overlap. Furthermore, we introduce a semi-pose-graph BA method to avoid the expensive full global BA. By combining relative pose factors with loop closure factors, the global states can be adjusted efficiently with a modest memory footprint while maintaining high accuracy. Evaluations on the TUM-VI, Hilti-Oxford and Newer College datasets show the superior performance of the proposed system over prior works. In the Hilti SLAM Challenge 2022, our VIO version achieves second place. In a subsequent submission, our complete system, including the global BA backend, outperforms the winning approach.

ECAI Conference 2023 Conference Paper

MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient

  • Sen Wang
  • Jin Zheng

Monocular 3D object detection is an inherently ill-posed problem, as it is challenging to predict accurate 3D localization from a single image. Existing monocular 3D detection knowledge distillation methods usually project the LiDAR onto the image plane and train the teacher network accordingly. Transferring LiDAR-based model knowledge to RGB-based models is more complex, so a general distillation strategy is needed. To alleviate the cross-modal problem, we propose MonoSKD, a novel knowledge distillation framework for monocular 3D detection based on the Spearman correlation coefficient, which learns the relative correlation between cross-modal features. Considering the large gap between these features, strict alignment of features may mislead the training, so we propose a looser Spearman loss. Furthermore, by selecting appropriate distillation locations and removing redundant modules, our scheme saves more GPU resources and trains faster than existing methods. Extensive experiments verify the effectiveness of our framework on the challenging KITTI 3D object detection benchmark. Our method achieves state-of-the-art performance as of submission, with no additional inference computational cost. Our code is available at https://github.com/Senwang98/MonoSKD.
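As a sketch of the core idea, a Spearman-style loss compares the rank ordering of student and teacher features rather than their raw values, so it tolerates scale differences between modalities. The function name and NumPy rank trick below are illustrative, not the paper's implementation, which would additionally need a differentiable ranking to train with:

```python
import numpy as np

def spearman_loss(student_feat, teacher_feat):
    """Illustrative Spearman-style distillation loss: 1 minus the rank
    correlation between flattened student and teacher feature vectors."""
    s = np.asarray(student_feat, dtype=float).ravel()
    t = np.asarray(teacher_feat, dtype=float).ravel()

    def ranks(x):
        # Assign each value its position in the sorted order (no tie handling).
        r = np.empty_like(x)
        r[np.argsort(x)] = np.arange(len(x), dtype=float)
        return r

    # Pearson correlation of the ranks equals the Spearman correlation.
    rho = np.corrcoef(ranks(s), ranks(t))[0, 1]
    return 1.0 - rho

# Perfectly rank-aligned features give zero loss despite different scales.
print(spearman_loss([0.1, 0.5, 0.9], [10, 50, 90]))  # → 0.0
```

Because only the ordering matters, the loss is a "looser" alignment target than an L2 match between raw cross-modal features.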

NeurIPS Conference 2023 Conference Paper

RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation

  • Md Wahiduzzaman Khan
  • Hongwei Sheng
  • Hu Zhang
  • Heming Du
  • Sen Wang
  • Minas Coroneo
  • Farshid Hajati
  • Sahar Shariflou

Retinal vessel segmentation is generally grounded in image-based datasets collected with bench-top devices. The static images naturally lose the dynamic characteristics of retina fluctuation, resulting in diminished dataset richness, and the usage of bench-top devices further restricts dataset scalability due to their limited accessibility. Considering these limitations, we introduce the first video-based retinal dataset, acquired with handheld devices. The dataset comprises 635 smartphone-based fundus videos collected from four different clinics, involving 415 patients aged 50 to 75. It delivers comprehensive and precise annotations of retinal structures in both spatial and temporal dimensions, aiming to advance the landscape of vasculature segmentation. Specifically, the dataset provides three levels of spatial annotations: binary vessel masks for overall retinal structure delineation, general vein-artery masks for distinguishing veins from arteries, and fine-grained vein-artery masks for further characterizing the granularities of each artery and vein. In addition, the dataset offers temporal annotations that capture vessel pulsation characteristics, assisting in detecting ocular diseases that require fine-grained recognition of hemodynamic fluctuation. In application, our dataset exhibits a significant domain shift with respect to data captured by bench-top devices, posing great challenges to existing methods. Thanks to its rich annotations and data scale, our dataset potentially paves the way for more advanced retinal analysis and accurate disease diagnosis. In the experiments, we provide evaluation metrics and benchmark results on our dataset, reflecting both the potential and the challenges it offers for vessel segmentation tasks. We hope this challenging dataset will significantly contribute to the development of eye disease diagnosis and early prevention.

NeurIPS Conference 2022 Conference Paper

Improved Feature Distillation via Projector Ensemble

  • Yudong Chen
  • Sen Wang
  • Jiajun Liu
  • Xuwei Xu
  • Frank de Hoog
  • Zi Huang

In knowledge distillation, previous feature distillation methods mainly focus on the design of loss functions and the selection of the distilled layers, while the effect of the feature projector between the student and the teacher remains under-explored. In this paper, we first discuss a plausible mechanism of the projector with empirical evidence and then propose a new feature distillation method based on a projector ensemble for further performance improvement. We observe that the student network benefits from a projector even if the feature dimensions of the student and the teacher are the same. Training a student backbone without a projector can be considered a multi-task learning process: achieving discriminative feature extraction for classification and feature matching between the student and the teacher for distillation at the same time. We hypothesize and empirically verify that without a projector, the student network tends to overfit the teacher's feature distributions despite having a different architecture and weight initialization. This degrades the quality of the student's deep features that are eventually used in classification. Adding a projector, on the other hand, disentangles the two learning tasks and helps the student network focus on the main feature extraction task while still being able to utilize teacher features as guidance through the projector. Motivated by the positive effect of the projector in feature distillation, we propose an ensemble of projectors to further improve the quality of student features. Experimental results on different datasets with a series of teacher-student pairs illustrate the effectiveness of the proposed method. Code is available at https://github.com/chenyd7/PEFD.
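The projector-ensemble idea can be sketched as mapping a student feature through several projectors and averaging the outputs before feature matching. The dimensions and the plain linear projectors below are hypothetical placeholders for illustration, not the paper's trained modules:

```python
import numpy as np

rng = np.random.default_rng(0)

def project_with_ensemble(student_feat, projectors):
    """Average the outputs of several projectors applied to one student
    feature, mimicking an ensemble-of-projectors distillation setup."""
    outs = [W @ student_feat for W in projectors]
    return np.mean(outs, axis=0)

# Hypothetical sizes: 64-dim student features mapped into the teacher's
# 128-dim space by an ensemble of 3 (here random, untrained) projectors.
d_student, d_teacher, n_proj = 64, 128, 3
projectors = [rng.standard_normal((d_teacher, d_student)) for _ in range(n_proj)]
student_feat = rng.standard_normal(d_student)
teacher_feat = rng.standard_normal(d_teacher)

projected = project_with_ensemble(student_feat, projectors)
# A simple L2 feature-matching loss between projected student and teacher.
loss = np.sum((projected - teacher_feat) ** 2)
print(projected.shape)  # → (128,)
```

In training, each projector would be a small learned network and the averaged output would be matched to the teacher feature by the distillation loss.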

IJCAI Conference 2021 Conference Paper

Self-Supervised Adversarial Distribution Regularization for Medication Recommendation

  • Yanda Wang
  • Weitong Chen
  • Dechang Pi
  • Lin Yue
  • Sen Wang
  • Miao Xu

Medication recommendation is a significant healthcare application due to its promise in effectively prescribing medications. Avoiding fatal side effects related to Drug-Drug Interaction (DDI) is among the critical challenges. Most existing methods try to mitigate the problem by providing models with extra DDI knowledge, making them complicated, while treating all patients with different DDI properties as a single cohort places strict demands on models' generalization performance. In pursuit of a valuable model for safe recommendation, we propose the Self-Supervised Adversarial Regularization Model for Medication Recommendation (SARMR). SARMR obtains the target distribution associated with safe medication combinations from raw patient records for adversarial regularization. In this way, the model can shape the distributions of patient representations to achieve DDI reduction. To obtain accurate self-supervision information, SARMR models interactions between physicians and patients by building a key-value memory neural network and carrying out multi-hop reading to obtain contextual information for patient representations. SARMR outperforms all baseline methods in experiments on a real-world clinical dataset. The model achieves DDI reduction across different numbers of DDI types, demonstrating the robustness of adversarial regularization for safe medication recommendation.

AAAI Conference 2020 Conference Paper

Adaptive Two-Dimensional Embedded Image Clustering

  • Zhihui Li
  • Lina Yao
  • Sen Wang
  • Salil Kanhere
  • Xue Li
  • Huaxiang Zhang

With the rapid development of mobile devices, people generate huge volumes of image data every day for sharing on social media, drawing much research attention to understanding the contents of images. Image clustering plays an important role in image understanding systems. Most existing image clustering algorithms flatten digital images, originally represented as matrices, into 1D vectors as the image representation for subsequent learning. The drawbacks of vector-based algorithms include limited consideration of the spatial relationships between pixels and high computational complexity, both attributable to the simple vectorized representation. To overcome these drawbacks, we propose a novel image clustering framework that works directly on image matrices instead of flattened vectors. Specifically, the proposed algorithm simultaneously learns the clustering results and preserves the original correlation information within the image matrix. To solve the challenging objective function, we propose a fast iterative solution. Extensive experiments on various benchmark datasets confirm the superiority of the proposed algorithm.

AAAI Conference 2020 Conference Paper

One-Shot Learning for Long-Tail Visual Relation Detection

  • Weitao Wang
  • Meng Wang
  • Sen Wang
  • Guodong Long
  • Lina Yao
  • Guilin Qi
  • Yang Chen

The aim of visual relation detection is to provide a comprehensive understanding of an image by describing all the objects within the scene and how they relate to each other, in the form of <subject, predicate, object> triplets. This ability is vital for image captioning, visual question answering, and many other applications. However, visual relationships have long-tailed distributions and, thus, the limited availability of training samples is hampering the practicability of conventional detection approaches. With this in mind, we designed a novel model for visual relation detection that works in one-shot settings. The embeddings of objects and predicates are extracted through a network that includes a feature-level attention mechanism. Attention alleviates some of the problems with feature sparsity, and the resulting representations capture more discriminative latent features. The core of our model is a dual graph neural network that passes and aggregates the context information of predicates and objects in an episodic training scheme to improve recognition of the one-shot predicates and then generate the triplets. To the best of our knowledge, we are the first to center on the viability of one-shot learning for visual relation detection. Extensive experiments on two newly-constructed datasets show that our model significantly improved performance on the two tasks PredCls and SGCls, by 2.8% to 12.2% compared with state-of-the-art baselines.

IJCAI Conference 2020 Conference Paper

Quadratic Sparse Gaussian Graphical Model Estimation Method for Massive Variables

  • Jiaqi Zhang
  • Meng Wang
  • Qinchi Li
  • Sen Wang
  • Xiaojun Chang
  • Beilun Wang

We consider the problem of estimating a sparse Gaussian Graphical Model with a special graph topological structure and more than a million variables. Most previous scalable estimators still contain expensive calculation steps (e.g., matrix inversion or Hessian matrix calculation) and become infeasible in high-dimensional scenarios, where p (the number of variables) is larger than n (the number of samples). To overcome this challenge, we propose a novel method, called the Fast and Scalable Inverse Covariance Estimator by Thresholding (FST). FST first obtains a graph structure by applying a generalized threshold to the sample covariance matrix. Then, it solves multiple block-wise subproblems via element-wise thresholding. By using matrix thresholding instead of matrix inversion as the computational bottleneck, FST reduces its computational complexity to a much lower order of magnitude, O(p²). We show that FST obtains the same sharp convergence rate O(√(log max{p, n}/n)) as other state-of-the-art methods. We validate the method empirically, on multiple simulated datasets and one real-world dataset, and show that FST is two times faster than the four baselines while achieving a lower error rate under both the Frobenius norm and the max norm.
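A minimal sketch of the first FST step, assuming element-wise soft-thresholding as the "generalized threshold" (one common choice of thresholding operator; the paper may use a different one). Small covariance entries are zeroed, exposing a sparse graph structure without any matrix inversion:

```python
import numpy as np

def soft_threshold(A, lam):
    """Element-wise soft-thresholding: shrink each entry toward zero by lam,
    setting entries with magnitude below lam to exactly zero."""
    return np.sign(A) * np.maximum(np.abs(A) - lam, 0.0)

# Toy 3x3 sample covariance matrix (illustrative values).
S = np.array([[1.00, 0.40, 0.05],
              [0.40, 1.00, 0.02],
              [0.05, 0.02, 1.00]])

# Thresholding kills the weak (0.05, 0.02) entries: variable 2 is
# disconnected from the others in the recovered graph structure.
T = soft_threshold(S, lam=0.1)
print(T)
```

The nonzero pattern of `T` defines the graph, whose connected blocks can then be handled as independent subproblems.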

IROS Conference 2020 Conference Paper

Robot Calligraphy using Pseudospectral Optimal Control in Conjunction with a Novel Dynamic Brush Model

  • Sen Wang
  • Jiaqi Chen
  • Xuanliang Deng
  • Seth Hutchinson 0001
  • Frank Dellaert

Chinese calligraphy is a unique art form with great artistic value but difficult to master. In this paper, we formulate the calligraphy writing problem as a trajectory optimization problem, and propose an improved virtual brush model for simulating the real writing process. Our approach is inspired by pseudospectral optimal control in that we parameterize the actuator trajectory for each stroke as a Chebyshev polynomial. The proposed dynamic virtual brush model plays a key role in formulating the objective function to be optimized. Our approach shows excellent performance in drawing aesthetically pleasing characters, and does so much more efficiently than previous work, opening up the possibility to achieve real-time closed-loop control.
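The Chebyshev parameterization of an actuator trajectory can be sketched with NumPy's Chebyshev utilities; the coefficients below are arbitrary illustrative values, not fitted strokes, and in the pseudospectral setting they would be the decision variables of the optimizer:

```python
import numpy as np
from numpy.polynomial import chebyshev as C

# Hypothetical example: a 2-D stroke parameterized by low-order Chebyshev
# coefficients, evaluated on a normalized time grid t in [-1, 1].
coeffs_x = [0.0, 1.0, 0.0, -0.2]  # assumed coefficients, for illustration
coeffs_y = [0.5, 0.0, 0.3]

t = np.linspace(-1.0, 1.0, 50)
x = C.chebval(t, coeffs_x)       # evaluate the Chebyshev series at each t
y = C.chebval(t, coeffs_y)
trajectory = np.stack([x, y], axis=1)  # 50 waypoints of the stroke
print(trajectory.shape)  # → (50, 2)
```

Optimizing a handful of coefficients per stroke, instead of every waypoint, is what keeps the trajectory optimization compact.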

AAAI Conference 2019 Conference Paper

Distributionally Robust Semi-Supervised Learning for People-Centric Sensing

  • Kaixuan Chen
  • Lina Yao
  • Dalin Zhang
  • Xiaojun Chang
  • Guodong Long
  • Sen Wang

Semi-supervised learning is crucial for alleviating labelling burdens in people-centric sensing. However, human-generated data inherently suffer from distribution shift in semi-supervised learning due to the diverse biological conditions and behavior patterns of humans. To address this problem, we propose a generic distributionally robust model for semi-supervised learning on distributionally shifted data. Considering both the discrepancy and the consistency between the labeled data and the unlabeled data, we learn the latent features that reduce person-specific discrepancy and preserve task-specific consistency. We evaluate our model in a variety of people-centric recognition tasks on real-world datasets, including intention recognition, activity recognition, muscular movement recognition and gesture recognition. The experiment results demonstrate that the proposed model outperforms the state-of-the-art methods.

NeurIPS Conference 2019 Conference Paper

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds

  • Bo Yang
  • Jianan Wang
  • Ronald Clark
  • Qingyong Hu
  • Sen Wang
  • Andrew Markham
  • Niki Trigoni

We propose a novel, conceptually simple and general framework for instance segmentation on 3D point clouds. Our method, called 3D-BoNet, follows the simple design philosophy of per-point multilayer perceptrons (MLPs). The framework directly regresses 3D bounding boxes for all instances in a point cloud, while simultaneously predicting a point-level mask for each instance. It consists of a backbone network followed by two parallel network branches for 1) bounding box regression and 2) point mask prediction. 3D-BoNet is single-stage, anchor-free and end-to-end trainable. Moreover, it is remarkably computationally efficient as, unlike existing approaches, it does not require any post-processing steps such as non-maximum suppression, feature sampling, clustering or voting. Extensive experiments show that our approach surpasses existing work on both ScanNet and S3DIS datasets while being approximately 10x more computationally efficient. Comprehensive ablation studies demonstrate the effectiveness of our design.
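The per-point MLP design philosophy that 3D-BoNet follows can be sketched as a shared MLP applied to every point independently, with a symmetric pooling producing a global feature. The layer sizes and random weights below are arbitrary stand-ins; the real backbone and the two prediction branches are far richer.

```python
import numpy as np

rng = np.random.default_rng(1)
P = rng.normal(size=(1024, 3))            # toy point cloud, N x 3

# Shared per-point MLP: the same weights act on every point, so the
# features are computed independently of point order.
W1 = rng.normal(scale=0.1, size=(3, 64))
W2 = rng.normal(scale=0.1, size=(64, 32))
F = np.maximum(np.maximum(P @ W1, 0.0) @ W2, 0.0)  # per-point features, N x 32
g = F.max(axis=0)                                   # global feature via max-pool
```

The max-pool is a symmetric function, so the global feature is invariant to any permutation of the input points — the property that makes per-point MLPs suitable for unordered point clouds.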

IJCAI Conference 2018 Conference Paper

3D-PhysNet: Learning the Intuitive Physics of Non-Rigid Object Deformations

  • Zhihua Wang
  • Stefano Rosa
  • Bo Yang
  • Sen Wang
  • Niki Trigoni
  • Andrew Markham

The ability to interact and understand the environment is a fundamental prerequisite for a wide range of applications from robotics to augmented reality. In particular, predicting how deformable objects will react to applied forces in real time is a significant challenge. This is further confounded by the fact that shape information about encountered objects in the real world is often impaired by occlusions, noise and missing regions, e.g., a robot manipulating an object will only be able to observe a partial view of the entire solid. In this work we present a framework, 3D-PhysNet, which is able to predict how a three-dimensional solid will deform under an applied force using intuitive physics modelling. In particular, we propose a new method to encode the physical properties of the material and the applied force, enabling generalisation over materials. The key is to combine deep variational autoencoders with adversarial training, conditioned on the applied force and the material properties. We further propose a cascaded architecture that takes a single 2.5D depth view of the object and predicts its deformation. Training data is provided by a physics simulator. The network is fast enough to be used in real-time applications from partial views. Experimental results show the viability and the generalisation properties of the proposed architecture.

IJCAI Conference 2018 Conference Paper

A Comparative Study of Transactional and Semantic Approaches for Predicting Cascades on Twitter

  • Yunwei Zhao
  • Can Wang
  • Chi-Hung Chi
  • Kwok-Yan Lam
  • Sen Wang

The availability of massive social media data has enabled the prediction of people's future behavioral trends at an unprecedented large scale. Information cascades study on Twitter has been an integral part of behavior analysis. A number of methods based on transactional features (such as keyword frequency) and semantic features (such as sentiment) have been proposed to predict future cascading trends. However, an in-depth understanding of the pros and cons of semantic and transactional models is lacking. This paper conducts a comparative study of both approaches in predicting information diffusion with three mechanisms: retweet cascade, url cascade, and hashtag cascade. Experiments on Twitter data show that the semantic model outperforms the transactional model if the exterior pattern is less directly observable (i.e., hashtag cascade). When it becomes more directly observable, the semantic method delivers only comparable accuracy (url cascade) or even worse accuracy (retweet cascade). Further, we demonstrate that the transactional and semantic models are not independent, and the performance is greatly enhanced when combining both.

AAAI Conference 2018 Conference Paper

Cascade and Parallel Convolutional Recurrent Neural Networks on EEG-based Intention Recognition for Brain Computer Interface

  • Dalin Zhang
  • Lina Yao
  • Xiang Zhang
  • Sen Wang
  • Weitong Chen
  • Robert Boots
  • Boualem Benatallah

Brain-Computer Interface (BCI) is a system empowering humans to communicate with or control the outside world with exclusively brain intentions. Electroencephalography (EEG) based BCIs are promising solutions due to their convenient and portable instruments. Despite the extensive research of EEG in recent years, it is still challenging to interpret EEG signals effectively due to the massive noise in EEG signals (e.g., low signal-to-noise ratio and incomplete EEG signals) and difficulties in capturing the inconspicuous relationships between EEG signals and certain brain activities. Most existing works either only consider EEG as chain-like sequences, neglecting the complex dependencies between adjacent signals, or require preprocessing such as transforming EEG waves into images. In this paper, we introduce both cascade and parallel convolutional recurrent neural network models for precisely identifying human intended movements and instructions by effectively learning the compositional spatio-temporal representations of raw EEG streams. Extensive experiments on a large scale movement intention EEG dataset (108 subjects, 3,145,160 EEG records) have demonstrated that both models achieve high accuracy near 98.3% and outperform a set of baseline methods and most recent deep learning based EEG recognition models, yielding a significant accuracy increase of 18% in the cross-subject validation scenario. The developed models are further evaluated with a real-world BCI and achieve a recognition accuracy of 93% over five instruction intentions. This suggests the proposed models are able to generalize over different kinds of intentions and BCI systems.
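The cascade idea — convolutional features extracted per EEG frame, then a recurrent layer aggregating across frames — can be sketched in plain NumPy. The kernel bank, the Elman cell, and all shapes below are illustrative stand-ins for the paper's convolutional and recurrent layers, not the trained architecture.

```python
import numpy as np

def spatial_conv(frame, kernels):
    """Valid 1-D cross-correlation of one EEG frame (electrode values)
    with a bank of kernels; a stand-in for the convolutional stage."""
    k = kernels.shape[1]
    windows = np.lib.stride_tricks.sliding_window_view(frame, k)
    return (windows @ kernels.T).ravel()

def elman_rnn(features, Wx, Wh):
    """Minimal Elman recurrence over per-frame feature vectors;
    a stand-in for the recurrent stage of the cascade model."""
    h = np.zeros(Wh.shape[0])
    for x in features:
        h = np.tanh(Wx @ x + Wh @ h)
    return h

rng = np.random.default_rng(2)
eeg = rng.normal(size=(20, 64))               # 20 frames, 64 electrodes
kernels = rng.normal(scale=0.1, size=(4, 5))  # 4 kernels of width 5
feats = np.array([spatial_conv(f, kernels) for f in eeg])
Wx = rng.normal(scale=0.1, size=(16, feats.shape[1]))
Wh = rng.normal(scale=0.1, size=(16, 16))
h = elman_rnn(feats, Wx, Wh)                  # final hidden state, fed to a classifier
```

The spatial stage captures dependencies between adjacent electrodes within each frame, while the recurrent stage composes those features over time — the "compositional spatio-temporal representation" the abstract refers to.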

IJCAI Conference 2018 Conference Paper

Multi-modality Sensor Data Classification with Selective Attention

  • Xiang Zhang
  • Lina Yao
  • Chaoran Huang
  • Sen Wang
  • Mingkui Tan
  • Guodong Long
  • Can Wang

Multimodal wearable sensor data classification plays an important role in ubiquitous computing and has a wide range of applications in various scenarios, from healthcare to entertainment. However, most of the existing work in this field employs domain-specific approaches and is thus ineffective in complex situations where multi-modality sensor data is collected. Moreover, wearable sensor data is less informative than conventional data such as texts or images. In this paper, to improve the adaptability of such classification methods across different application contexts, we turn this classification task into a game and apply a deep reinforcement learning scheme to dynamically deal with complex situations. We also introduce a selective attention mechanism into the reinforcement learning scheme to focus on the crucial dimensions of the data. This mechanism helps to capture extra information from the signal and can thus significantly improve the discriminative power of the classifier. We carry out several experiments on three wearable sensor datasets and demonstrate competitive performance of the proposed approach compared to several state-of-the-art baselines.

IJCAI Conference 2018 Conference Paper

NeuRec: On Nonlinear Transformation for Personalized Ranking

  • Shuai Zhang
  • Lina Yao
  • Aixin Sun
  • Sen Wang
  • Guodong Long
  • Manqing Dong

Modeling user-item interaction patterns is an important task for personalized recommendations. Many recommender systems are based on the assumption that there exists a linear relationship between users and items, neglecting the intricacy and non-linearity of real-life historical interactions. In this paper, we propose a neural network based recommendation model (NeuRec) that untangles the complexity of user-item interactions and establishes an integrated network to combine non-linear transformation with latent factors. We further design two variants of NeuRec, user-based NeuRec and item-based NeuRec, by focusing on different aspects of the interaction matrix. Extensive experiments on four real-world datasets demonstrate their superior performance on the personalized ranking task.
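The user-based variant can be sketched as a forward pass in which an MLP maps each user's row of the interaction matrix to a latent vector, which is then scored against item factors. The layer sizes, the single hidden layer, and the random weights below are illustrative assumptions, not the published architecture.

```python
import numpy as np

rng = np.random.default_rng(3)
R = (rng.random((6, 8)) > 0.7).astype(float)  # toy binary user-item matrix

W1 = rng.normal(scale=0.1, size=(8, 16))      # hidden layer weights
W2 = rng.normal(scale=0.1, size=(16, 4))      # projection to latent space
Q = rng.normal(scale=0.1, size=(8, 4))        # item latent factors

H = np.maximum(R @ W1, 0.0)                   # non-linear hidden layer (ReLU)
U = H @ W2                                    # user latent vectors
scores = U @ Q.T                              # predicted ranking scores, 6 x 8
```

The ReLU between the interaction row and the latent vector is what lets the model express the non-linear user-item relationships that a purely linear latent factor model cannot.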

IJCAI Conference 2018 Conference Paper

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

  • Tao Shen
  • Tianyi Zhou
  • Guodong Long
  • Jing Jiang
  • Sen Wang
  • Chengqi Zhang

Many natural language processing tasks solely rely on sparse dependencies between a few tokens in a sentence. Soft attention mechanisms show promising performance in modeling local/global dependencies by soft probabilities between every two tokens, but they are not effective and efficient when applied to long sentences. By contrast, hard attention mechanisms directly select a subset of tokens but are difficult and inefficient to train due to their combinatorial nature. In this paper, we integrate both soft and hard attention into one context fusion model, "reinforced self-attention (ReSA)", for the mutual benefit of each other. In ReSA, a hard attention trims a sequence for a soft self-attention to process, while the soft attention feeds reward signals back to facilitate the training of the hard one. For this purpose, we develop a novel hard attention called "reinforced sequence sampling (RSS)", selecting tokens in parallel and trained via policy gradient. Using two RSS modules, ReSA efficiently extracts the sparse dependencies between each pair of selected tokens. We finally propose an RNN/CNN-free sentence-encoding model, "reinforced self-attention network (ReSAN)", solely based on ReSA. It achieves state-of-the-art performance on both the Stanford Natural Language Inference (SNLI) and the Sentences Involving Compositional Knowledge (SICK) datasets.
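The trim-then-attend interaction between the hard and soft mechanisms can be sketched as follows. The fixed boolean mask stands in for the learned RSS sampler (which in the paper is trained via policy gradient), and the single scaled dot-product attention stands in for the full ReSA block.

```python
import numpy as np

def soft_self_attention(H):
    """Scaled dot-product soft self-attention over a token sequence."""
    logits = H @ H.T / np.sqrt(H.shape[1])
    w = np.exp(logits - logits.max(axis=1, keepdims=True))  # stable softmax
    w /= w.sum(axis=1, keepdims=True)
    return w @ H

rng = np.random.default_rng(4)
H = rng.normal(size=(12, 8))        # 12 tokens, 8-dim features

# Stand-in for the RSS hard sampler: a boolean mask that trims the
# sequence before the soft attention runs over it.
keep = rng.random(12) > 0.5
out = soft_self_attention(H[keep])  # soft attention over selected tokens only
```

Because the soft attention only ever sees the trimmed sequence, its cost is quadratic in the number of *selected* tokens rather than the full sentence length — the efficiency gain the hybrid design targets.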

AAAI Conference 2018 Conference Paper

Trace Ratio Optimization With Feature Correlation Mining for Multiclass Discriminant Analysis

  • Forough Rezaei Boroujeni
  • Sen Wang
  • Zhihui Li
  • Nicholas West
  • Bela Stantic
  • Lina Yao
  • Guodong Long

Fisher’s linear discriminant analysis is a widely accepted dimensionality reduction method, which aims to find a transformation matrix to convert the feature space to a smaller space by maximising the between-class scatter matrix while minimising the within-class scatter matrix. Although the fast and easy process of finding the transformation matrix has made this method attractive, overemphasizing the large class distances makes the criterion of this method suboptimal. In this case, close class pairs tend to overlap in the subspace. Although different weighting methods have been developed to overcome this problem, there is still room for improvement. In this work, we study a weighted trace ratio by maximising the harmonic mean of the multiple objective reciprocals. To further improve the performance, we enforce the ℓ2,1-norm on the developed objective function. Additionally, we propose an iterative algorithm to optimise this objective function. The proposed method avoids the domination problem of the largest objective, and guarantees that no objectives will be too small. This method can be more beneficial if the number of classes is large. The extensive experiments on different datasets show the effectiveness of our proposed method when compared with four state-of-the-art methods.
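The scatter matrices and the trace-ratio quantity underlying this line of work can be sketched in NumPy. This shows only the standard Fisher ingredients; the weighted harmonic-mean objective, the ℓ2,1 regularization, and the iterative solver proposed in the paper are not reproduced here.

```python
import numpy as np

def scatter_matrices(X, y):
    """Between-class (Sb) and within-class (Sw) scatter matrices.

    They satisfy Sb + Sw = St, the total scatter about the global mean.
    """
    mu = X.mean(axis=0)
    d = X.shape[1]
    Sb, Sw = np.zeros((d, d)), np.zeros((d, d))
    for c in np.unique(y):
        Xc = X[y == c]
        mc = Xc.mean(axis=0)
        Sb += len(Xc) * np.outer(mc - mu, mc - mu)   # class-mean spread
        Sw += (Xc - mc).T @ (Xc - mc)                # within-class spread
    return Sb, Sw

def trace_ratio(W, Sb, Sw):
    """tr(W'SbW) / tr(W'SwW): the criterion a transformation W maximizes."""
    return np.trace(W.T @ Sb @ W) / np.trace(W.T @ Sw @ W)
```

Maximizing a single trace ratio lets one large between-class distance dominate; the paper's harmonic-mean weighting counters exactly this, keeping every pairwise objective bounded away from zero.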

AAAI Conference 2017 Conference Paper

Multi-View Correlated Feature Learning by Uncovering Shared Component

  • Xiaowei Xue
  • Feiping Nie
  • Sen Wang
  • Xiaojun Chang
  • Bela Stantic
  • Min Yao

Learning multiple heterogeneous features from different data sources is challenging. One research topic is how to exploit and utilize the correlations among various features across multiple views with the aim of improving the performance of learning tasks, such as classification. In this paper, we propose a new multi-view feature learning algorithm that simultaneously analyzes features from different views. Compared to most of the existing subspace learning methods that only focus on exploiting a shared latent subspace, our algorithm not only learns individual information in each view but also captures feature correlations among multiple views by learning a shared component. By assuming that such a component is shared by all views, we simultaneously exploit the shared component and individual information of each view in a batch mode. Since the objective function is non-smooth and difficult to solve, we propose an efficient iterative algorithm for optimization with guaranteed convergence. Extensive experiments are conducted on several benchmark datasets. The results demonstrate that our proposed algorithm performs better than all the compared multi-view learning algorithms.

AAAI Conference 2017 Conference Paper

VINet: Visual-Inertial Odometry as a Sequence-to-Sequence Learning Problem

  • Ronald Clark
  • Sen Wang
  • Hongkai Wen
  • Andrew Markham
  • Niki Trigoni

In this paper we present an on-manifold sequence-to-sequence learning approach to motion estimation using visual and inertial sensors. It is, to the best of our knowledge, the first end-to-end trainable method for visual-inertial odometry which performs fusion of the data at an intermediate feature-representation level. Our method has numerous advantages over traditional approaches. Specifically, it eliminates the need for tedious manual synchronization of the camera and IMU as well as eliminating the need for manual calibration between the IMU and camera. A further advantage is that our model naturally and elegantly incorporates domain specific information which significantly mitigates drift. We show that our approach is competitive with state-of-the-art traditional methods when accurate calibration data is available and can be trained to outperform them in the presence of calibration and synchronization errors.