Arrow Research search

Author name cluster

Wei Wu

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

89 papers
2 author rows

Possible papers

JBHI Journal 2026 Journal Article

Efficient Sleep Staging With Bayesian Uncertainty-Guided Active Learning

  • Tianyou Yu
  • Rui Huang
  • Fei Wang
  • Jun Zhang
  • Wei Wu
  • Zhuliang Yu
  • Yuanqing Li
  • Jun Xiao

Automated sleep staging is essential for large-scale and home-based sleep monitoring; however, in routine clinical practice, sleep annotation remains largely dependent on experienced experts performing time-consuming and labor-intensive manual scoring. Existing automatic systems often struggle to adapt reliably to new subjects, limiting their clinical adoption and reinforcing the reliance on expert review. This creates a strong demand for adaptive and efficient sleep staging systems that can substantially reduce annotation workload while preserving expert-level accuracy. We propose BayesSleepNet, a novel framework that integrates Bayesian uncertainty quantification with active learning for adaptive sleep staging. BayesSleepNet employs principled Bayesian modeling by placing distributions over network weights and performing Monte Carlo sampling at inference, enabling explicit quantification of model (epistemic) uncertainty. These uncertainty estimates drive a two-stage sample selection strategy that first fine-tunes the model using representative epochs and subsequently prioritizes persistently uncertain samples for expert review. Across four public sleep datasets, BayesSleepNet consistently improves performance—by 7.60% in accuracy, 8.27% in macro-F1, and 0.104 in Cohen's $\kappa$—while requiring manual annotation of only 20% of data from new subjects. Despite its adaptive learning capability, BayesSleepNet remains computationally lightweight, using substantially fewer parameters than representative high-capacity state-of-the-art models. These results demonstrate the clinical promise of uncertainty-aware active learning as a practical and cost-efficient paradigm for semi-automated sleep staging. Code is available at https://github.com/yuty2009/bayesugal.
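The uncertainty-guided selection described above can be illustrated with a toy sketch. The scoring rule here (predictive entropy of the mean distribution over Monte Carlo passes) is a standard choice for epistemic uncertainty, but BayesSleepNet's exact two-stage criterion may differ:

```python
import numpy as np

def predictive_entropy(mc_probs):
    """Entropy of the mean predictive distribution over T stochastic passes.

    mc_probs: array of shape (T, n_classes) holding class probabilities from
    T Monte Carlo forward passes (e.g. with sampled weights).
    """
    mean_p = mc_probs.mean(axis=0)
    return float(-(mean_p * np.log(mean_p + 1e-12)).sum())

def select_uncertain(mc_probs_per_epoch, budget):
    """Rank sleep epochs by predictive entropy and return the indices of the
    `budget` most uncertain ones for expert review."""
    scores = np.array([predictive_entropy(p) for p in mc_probs_per_epoch])
    return np.argsort(scores)[::-1][:budget]

rng = np.random.default_rng(0)
# Epoch 0: confident and stable across passes; epoch 1: highly ambiguous.
confident = np.tile([0.9, 0.05, 0.05], (10, 1))
uncertain = rng.dirichlet([1, 1, 1], size=10)
picked = select_uncertain([confident, uncertain], budget=1)
assert picked[0] == 1  # the ambiguous epoch is queried first
```

With a 20% annotation budget, only the epochs at the top of this ranking would ever reach the expert.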

AAAI Conference 2026 Conference Paper

Explicit Modeling of Causal Factors and Confounders for Image Classification

  • Wei Wu
  • Lei Meng
  • Zhuang Qi
  • Zixuan Li
  • Yachong Zhang
  • Xiaoshuo Yan
  • Xiangxu Meng

Causal inference has emerged as a promising approach for identifying decisive semantic factors and eliminating spurious correlations in visual representation learning. However, most existing methods rely on latent, data-driven confounder modeling, normally attributing the source of bias to background information while neglecting object-level semantic confusions that commonly occur in complex scenes. This limits their effectiveness in disentangling causal factors from confounding semantics. To address this challenge, we propose an explicit modeling approach for both causal factors and confounders, termed Explicit Modeling Causal Model (EMCM). The proposed framework consists of three key components. The Features Stability Estimation module explicitly models the relationship between visual semantics and class labels by leveraging clustering patterns to perform class-aware separation of causal and confounding factors. It produces class-specific causal factors and confounding factors linked to ambiguous categories. Subsequently, the Discriminative Features Enhancing module integrates causal factors into fused patch features via front-door intervention for stable semantics. In parallel, the Explicit Confounder Modeling and Debiasing Module learns confounders under clear label guidance and derives debiased context features by TDE modeling. This framework leverages two complementary causal perspectives to construct a unified semantic representation that facilitates improved generalization. Extensive experiments on two datasets demonstrate that EMCM effectively disentangles causal and confounding factors in complex scenarios, consistently outperforming state-of-the-art causal debiasing methods and text-guided methods in all metrics.

AAAI Conference 2026 Conference Paper

Introducing Decomposed Causality with Spatiotemporal Object-Centric Representation for Video Classification

  • Yachong Zhang
  • Lei Meng
  • Shuo Xu
  • Zhuang Qi
  • Wei Wu
  • Lei Wu
  • Xiangxu Meng

Video classification requires event-level representations of objects and their interactions. Existing methods typically rely on data-driven approaches, which either learn such features from whole frames or object-centric visual regions. Therefore, the modeling of spatiotemporal interactions among objects is usually overlooked. To address this issue, this paper presents a Decomposition of Synergistic, Unique, and Redundant Causal Representations Learning (SurdCRL) model for video classification, which introduces a newly-proposed SURD causal theory to model the spatiotemporal features of both object dynamics and their in- and cross-frame interactions. Specifically, SurdCRL employs three modules to model the object-centric spatiotemporal dynamics using distinct types of causal components, where the first module Spatial-Temporal Entity Modeling decouples the frame into object and context entities, and employs a temporal message passing block to capture object state changes over time, generating spatiotemporal features as basic causal variables. Second, the Dual-Path Causal Inference module mitigates confounders among causal variables by front-door and back-door interventions, thus enabling the subsequent causal components to reflect their intrinsic effects. Finally, the Causal Composition and Selection module employs the compositional structure-aware attention to project the causal variables and their high-order interactions into the synergistic, unique, and redundant components. Experiments on two benchmarking datasets verify that SurdCRL better captures event-relevant object-centric representation by decomposing spatiotemporal object interactions into three types of causal components.

JBHI Journal 2026 Journal Article

MB-STFormer: A Multi-Band Spectral-Temporal Transformer with Efficient Attention for Enhanced EEG-Based Fatigue Detection

  • Ke Liu
  • Lilong Sun
  • Wenlong Wang
  • Zhenghui Gu
  • Zhuliang Yu
  • Wei Wu

Accurate detection of driver fatigue is critical for preventing traffic accidents. Although electroencephalogram (EEG) signals provide a robust physiological indicator of fatigue, effectively capturing their intricate spatiotemporal-spectral dynamics poses significant challenges. In this paper, we propose MB-STFormer, a novel deep neural network designed for EEG-based fatigue detection, which systematically integrates neurophysiological priors into deep feature learning. The proposed MB-STFormer employs a multi-branch frequency-aware module to extract spatiotemporal features from EEG signals, with each branch dedicated to a distinct frequency sub-band. By leveraging adaptive temporal convolution kernel sizes tailored to each sub-band, the model adeptly captures the inherent rhythmic patterns and temporal dynamics unique to different frequency components. Additionally, we introduce an Efficient Additive Attention mechanism to aggregate global contextual information, thereby addressing the over-smoothing of subtle yet critical features often encountered with conventional transformer self-attention mechanisms. Extensive experiments conducted on three publicly available datasets demonstrate that MB-STFormer achieves state-of-the-art performance while maintaining superior interpretability and generalizability. The proposed framework offers a promising solution for real-world fatigue monitoring systems.

AAAI Conference 2026 Conference Paper

MetaAct-RL: Training Language Models for Reasoning Through Meta-Action-Based Reinforcement Learning

  • Zhiheng Xi
  • Yuhui Wang
  • Yiwen Ding
  • Guanyu Li
  • Senjie Jin
  • Shichun Liu
  • Jixuan Huang
  • Dingwen Yang

Outcome-based reinforcement learning has made notable advances in training language models (LMs) for reasoning. However, without explicit incentives and controls, this paradigm has limitations and instability in eliciting high-quality reasoning trajectories with diverse actions—particularly for models whose pretraining lacked extensive reasoning-related data. To this end, we introduce MetaAct-RL, a new RL framework that frames LMs’ thinking as sequential decision making over meta-actions. In this framework, the model chooses and executes a high-level action at each step—such as forward reasoning, critique, or refinement—to gradually reach the correct answer. To encourage deeper exploration and richer action diversity, and to improve sampling efficiency in the RL optimization process, MetaAct-RL incorporates appropriate length-based reward and regularization, and a key-state restart mechanism. Extensive experiments across six benchmarks show that MetaAct-RL improves reasoning performance by 7.99 points on Llama3.2-1B and 7.17 points on Llama3.1-8B relative to the vanilla RL method. Moreover, on the challenging AIME-2024, our method outperforms vanilla RL by 7.5 points with Qwen2.5-1.5B.

JBHI Journal 2025 Journal Article

A Novel Approach to Explore Internal Cardiac Electrophysiological Pattern under Emotional Stress

  • Hanrui Dong
  • Shijie He
  • Wei Wu
  • Xianbin Zhang
  • Ming Li
  • Richard Millham
  • Guibin Bian
  • Wanqing Wu

Numerous psychological and clinical studies have confirmed a correlation between mental and cardiac health. We aim to explore this relationship further by examining how emotions influence cardiac health. By collecting body surface potential and utilizing the electrocardiographic imaging (ECGI) model, we can noninvasively and continuously reconstruct internal cardiac electrical activity. To enhance the existing ECGI model on various datasets, we propose an information fusion strategy called Emotional Potential Conversion CycleGAN. It enables data alignment across diverse datasets while preserving emotional information, allowing us to reconstruct cardiac electrical activity in various emotional states. Our results demonstrate successful data conversion while maintaining emotional integrity, achieving an impressive 91.92% accuracy in emotion recognition. We further validated this approach using publicly available datasets, WESAD and SWELL, which yielded consistent results. Additionally, we conducted preliminary investigations into the correlation and variability of cardiac activity across different sites under stress. The correlation study indicates a generalized association among various regions of the heart, while variability studies reveal that fluctuations in cardiac electrical activity during stress are primarily concentrated around the atrioventricular node and Purkinje fibers. This suggests a potential risk for pre-excitation syndrome, possibly due to the possible presence of a Kent bundle. Overall, we present a practical approach for studying the interplay between emotional states and cardiac health. Our findings indicate a potential relationship under stress that may provide valuable insights for future research.

JBHI Journal 2025 Journal Article

ADMM-ESINet: A Deep Unrolling Network for EEG Extended Source Imaging

  • Ke Liu
  • Hang Jiang
  • Hu Yang
  • Jun Zhang
  • Zhenghui Gu
  • Zhuliang Yu
  • Yu Zhang
  • Bin Xiao

Electroencephalography (EEG) source imaging (ESI) methods aim to reconstruct cortical sources from scalp EEG signals, a crucial task for understanding the normal brain as well as brain disorders. Traditional model-driven ESI methods face challenges in real-time reconstruction, while deep neural network (DNN)-based ESI methods often struggle with generalization to new data. To address these issues, we propose ADMM-ESINet, a novel deep unfolding neural network for robust and efficient reconstruction of EEG extended sources. ADMM-ESINet leverages a structured sparsity constraint within a regularization framework and employs the Alternating Direction Method of Multipliers (ADMM) to achieve iterative solutions. By unrolling the ADMM algorithm into a cascaded network architecture, ADMM-ESINet effectively integrates prior knowledge, enabling end-to-end, real-time ESI. Crucially, both the regularization parameters and the spatial transform operator are learned directly from the training data. Numerical results demonstrate that ADMM-ESINet surpasses traditional DNN-based methods in generalization ability and accurately reconstructs the location, extent, and temporal dynamics of extended sources, establishing ADMM-ESINet as a promising method for real-time ESI.
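The ADMM iteration that such unrolling networks are built from can be sketched for the classic $\ell_1$-regularized least-squares case. This is a simplification: ADMM-ESINet uses a structured-sparsity regularizer and learns the regularization parameters and transform from data, whereas here `lam` and `rho` are fixed by hand:

```python
import numpy as np

def admm_lasso(A, y, lam=0.1, rho=1.0, n_iter=100):
    """ADMM iterations for min_x 0.5*||Ax - y||^2 + lam*||x||_1.

    Unrolling methods turn this loop into network layers (one layer per
    iteration), with lam/rho and the sparsifying transform learned end-to-end.
    """
    n = A.shape[1]
    x, z, u = np.zeros(n), np.zeros(n), np.zeros(n)
    AtA = A.T @ A + rho * np.eye(n)   # system matrix for the x-update
    Aty = A.T @ y
    for _ in range(n_iter):
        x = np.linalg.solve(AtA, Aty + rho * (z - u))                # x-update
        z = np.sign(x + u) * np.maximum(np.abs(x + u) - lam / rho, 0)  # soft-threshold
        u = u + x - z                                                # dual update
    return z

rng = np.random.default_rng(1)
A = rng.standard_normal((50, 20))           # toy lead-field-like operator
x_true = np.zeros(20); x_true[[2, 7]] = [1.5, -2.0]
y = A @ x_true + 0.01 * rng.standard_normal(50)
x_hat = admm_lasso(A, y, lam=0.05)
assert np.argmax(np.abs(x_hat)) == 7  # dominant source location recovered
```

A cascaded network replaces each of these three updates with a learnable layer, which is what makes the reconstruction both real-time and data-adaptive.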

NeurIPS Conference 2025 Conference Paper

Beyond Node-Centric Modeling: Sketching Signed Networks with Simplicial Complexes

  • Wei Wu
  • Xuan Tan
  • Yan Peng
  • Ling Chen
  • Fangfang Li
  • Chuan Luo

Signed networks can reflect more complex connections through positive and negative edges, and cost-effective signed network sketching can significantly benefit an important link sign prediction task in the era of big data. Existing signed network embedding algorithms mainly learn node representation in the Graph Neural Network (GNN) framework with the balance theory. However, the node-wise representation learning methods either limit the representational power because they primarily rely on node pairwise relationship in the network, or suffer from severe efficiency issues. Recent research has explored simplicial complexes to capture higher-order interactions and integrated them into GNN frameworks. Motivated by that, we propose EdgeSketch+, a simple and effective edge embedding algorithm beyond traditional node-centric modeling that directly represents edges as low-dimensional vectors without transitioning from node embeddings. The proposed approach maintains a good balance between accuracy and efficiency by exploiting the Locality Sensitive Hashing (LSH) technique to swiftly capture the higher-order information derived from the simplicial complex without any learning process. Experiments show that EdgeSketch+ matches state-of-the-art accuracy while significantly reducing runtime, achieving speedups of up to $546.07\times$ compared to GNN-based methods.
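The LSH idea behind this kind of learning-free sketching can be illustrated with plain SimHash (cosine LSH) over hypothetical edge feature vectors; EdgeSketch+'s actual hash construction over simplicial-complex features may differ:

```python
import numpy as np

def simhash(vec, planes):
    """SimHash: the sign pattern of random projections. Vectors with high
    cosine similarity agree on most bits, so comparing short bit sketches
    approximates comparing the full feature vectors -- with no training."""
    return (planes @ vec) >= 0

def hamming_sim(a, b):
    """Fraction of agreeing bits between two sketches."""
    return float((a == b).mean())

rng = np.random.default_rng(42)
planes = rng.standard_normal((64, 100))    # 64-bit sketch of 100-dim features
e1 = rng.standard_normal(100)              # hypothetical edge feature vector
e2 = e1 + 0.05 * rng.standard_normal(100)  # near-duplicate edge
e3 = rng.standard_normal(100)              # unrelated edge
s1, s2, s3 = simhash(e1, planes), simhash(e2, planes), simhash(e3, planes)
assert hamming_sim(s1, s2) > hamming_sim(s1, s3)
```

Because hashing replaces any learning process, building all sketches is a single pass of matrix multiplications, which is the source of the reported speedups.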

ICLR Conference 2025 Conference Paper

BLEND: Behavior-guided Neural Population Dynamics Modeling via Privileged Knowledge Distillation

  • Zhengrui Guo
  • Fangxu Zhou
  • Wei Wu
  • Qichen Sun
  • Lishuang Feng
  • Jinzhuo Wang
  • Hao Chen 0011

Modeling the nonlinear dynamics of neuronal populations represents a key pursuit in computational neuroscience. Recent research has increasingly focused on jointly modeling neural activity and behavior to unravel their interconnections. Despite significant efforts, these approaches often necessitate either intricate model designs or oversimplified assumptions. Given the frequent absence of perfectly paired neural-behavioral datasets in real-world scenarios when deploying these models, a critical yet understudied research question emerges: how to develop a model that performs well using only neural activity as input at inference, while benefiting from the insights gained from behavioral signals during training? To this end, we propose **BLEND**, the **B**ehavior-guided neura**L** population dynamics mod**E**lling framework via privileged k**N**owledge **D**istillation. By considering behavior as privileged information, we train a teacher model that takes both behavior observations (privileged features) and neural activities (regular features) as inputs. A student model is then distilled using only neural activity. Unlike existing methods, our framework is model-agnostic and avoids making strong assumptions about the relationship between behavior and neural activity. This allows BLEND to enhance existing neural dynamics modeling architectures without developing specialized models from scratch. Extensive experiments across neural population activity modeling and transcriptomic neuron identity prediction tasks demonstrate strong capabilities of BLEND, reporting over 50% improvement in behavioral decoding and over 15% improvement in transcriptomic neuron identity prediction after behavior-guided distillation. Furthermore, we empirically explore various behavior-guided distillation strategies within the BLEND framework and present a comprehensive analysis of effectiveness and implications for model performance. 
Code will be made available at https://github.com/dddavid4real/BLEND.
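The privileged-distillation setup can be sketched with a toy linear model: a teacher that uses both neural and behavior features produces soft targets, and a student that sees only neural features is fitted to them. This is illustrative only, not BLEND's actual architecture:

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def mean_kl(p, q):
    """Average KL divergence between row-wise distributions."""
    return float((p * np.log((p + 1e-12) / (q + 1e-12))).sum(axis=1).mean())

rng = np.random.default_rng(0)
n, d_neural, d_behav, k = 200, 8, 4, 3
X = rng.standard_normal((n, d_neural))                        # neural activity (regular feature)
B = X[:, :d_behav] @ rng.standard_normal((d_behav, d_behav))  # behavior (privileged, correlated)
W_teacher = rng.standard_normal((d_neural + d_behav, k))
teacher_p = softmax(np.hstack([X, B]) @ W_teacher)            # teacher sees both inputs

# Student: softmax-linear model on neural activity alone, trained on the
# teacher's soft targets (cross-entropy to soft targets == KL up to a constant).
W_student = np.zeros((d_neural, k))
kl_before = mean_kl(teacher_p, softmax(X @ W_student))
for _ in range(300):
    p = softmax(X @ W_student)
    W_student -= 0.5 * X.T @ (p - teacher_p) / n   # gradient step on soft cross-entropy
kl_after = mean_kl(teacher_p, softmax(X @ W_student))
assert kl_after < kl_before  # student inherits behavior-informed structure
```

At inference the student needs only neural activity, which is exactly the deployment regime the paper targets.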

AAAI Conference 2025 Conference Paper

Causal Inference over Visual-Semantic-Aligned Graph for Image Classification

  • Lei Meng
  • Xiangxian Li
  • Xiaoshuo Yan
  • Haokai Ma
  • Zhuang Qi
  • Wei Wu
  • Xiangxu Meng

Incorporating tagging information to regularize the representation learning of images usually leads to improved performance in image classification by aligning the visual features with the textual ones of higher discriminative power. Existing methods typically follow the predictive approach, which uses tags as the semantic labels for visual input to make predictions. However, they typically face the problem of handling the heterogeneity between modalities. In order to learn accurate visual-semantic mapping, this paper presents a visual-semantic causal association modeling framework termed VSCNet. It aligns visual regions with tags, uses a pre-learned hierarchy of visual and semantic exemplars to refine tag predictions and constructs an augmented heterogeneous graph to perform causal intervention. Specifically, the fine-grained visual-semantic alignment (FVA) module adaptively locates the semantic-intensive regions corresponding to tags. The heterogeneous association refinement (HAR) module associates the visual regions, semantic elements and pre-learned visual prototypes in a heterogeneous graph to filter the error predictions and enrich the information. The causal inference with graphical masking (CIM) module applies self-learned masks to discover the causal nodes and edges in the heterogeneous graph to address the spurious association, forming robust causal representations. Experimental results from two benchmarking datasets show that VSCNet effectively builds the visual-semantic associations from images and leads to better performance than the state-of-the-art methods with enriched predictive information.

JBHI Journal 2025 Journal Article

DMSACNN: Deep Multiscale Attentional Convolutional Neural Network for EEG-Based Motor Decoding

  • Ke Liu
  • Xin Xing
  • Tao Yang
  • Zhuliang Yu
  • Bin Xiao
  • Guoyin Wang
  • Wei Wu

Objective: Accurate decoding of electroencephalogram (EEG) signals has become more significant for the brain-computer interface (BCI). Specifically, motor imagery and motor execution (MI/ME) tasks enable the control of external devices by decoding EEG signals during imagined or real movements. However, accurately decoding MI/ME signals remains a challenge due to the limited utilization of temporal information and ineffective feature selection methods. Methods: This paper introduces DMSACNN, an end-to-end deep multiscale attention convolutional neural network for MI/ME-EEG decoding. DMSACNN incorporates a deep multiscale temporal feature extraction module to capture temporal features at various levels. These features are then processed by a spatial convolutional module to extract spatial features. Finally, a local and global feature fusion attention module is utilized to combine local and global information and extract the most discriminative spatiotemporal features. Main results: DMSACNN achieves impressive accuracies of 78.20%, 96.34% and 70.90% for hold-out analysis on the BCI-IV-2a, High Gamma and OpenBMI datasets, respectively, outperforming most of the state-of-the-art methods. Conclusion and significance: These results highlight the potential of DMSACNN in robust BCI applications. Our proposed method provides a valuable solution to improve the accuracy of the MI/ME-EEG decoding, which can pave the way for more efficient and reliable BCI systems.

NeurIPS Conference 2025 Conference Paper

DynaAct: Large Language Model Reasoning with Dynamic Action Spaces

  • Xueliang Zhao
  • Wei Wu
  • Jian Guan
  • Qintong Li
  • Lingpeng Kong

In modern sequential decision-making systems, the construction of an optimal candidate action space is critical to efficient inference. However, existing approaches either rely on manually defined action spaces that lack scalability or utilize unstructured spaces that render exhaustive search computationally prohibitive. In this paper, we propose a novel framework named \textsc{DynaAct} for automatically constructing a compact action space to enhance sequential reasoning in complex problem-solving scenarios. Our method first estimates a proxy for the complete action space by extracting general sketches observed in a corpus covering diverse complex reasoning problems using large language models. We then formulate a submodular function that jointly evaluates candidate actions based on their utility to the current state and their diversity, and employ a greedy algorithm to select an optimal candidate set. Extensive experiments on six diverse standard benchmarks demonstrate that our approach significantly improves overall performance, while maintaining efficient inference without introducing substantial latency. The implementation is available at \url{https://github.com/zhaoxlpku/DynaAct}.
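The utility-plus-diversity selection can be sketched as greedy maximization of a monotone submodular objective. The facility-location diversity term below is an assumed concrete choice; the paper's actual submodular function may differ:

```python
import numpy as np

def greedy_select(utility, sim, k, lam=1.0):
    """Greedy maximization of a monotone submodular objective
        f(S) = sum_{a in S} utility[a] + lam * sum_j max_{a in S} sim[j, a],
    i.e. per-action utility plus a facility-location coverage/diversity term.
    Greedy selection enjoys the classic (1 - 1/e) approximation guarantee."""
    n = len(utility)
    selected, best_cov = [], np.zeros(n)
    for _ in range(k):
        gains = np.full(n, -np.inf)
        for a in range(n):
            if a in selected:
                continue
            new_cov = np.maximum(best_cov, sim[:, a])
            gains[a] = utility[a] + lam * (new_cov - best_cov).sum()
        a_star = int(np.argmax(gains))
        selected.append(a_star)
        best_cov = np.maximum(best_cov, sim[:, a_star])
    return selected

# Toy case: two near-duplicate high-utility actions plus one distinct action.
utility = np.array([1.0, 0.95, 0.6])
sim = np.array([[1.0, 0.98, 0.1],
                [0.98, 1.0, 0.1],
                [0.1, 0.1, 1.0]])
assert greedy_select(utility, sim, k=2) == [0, 2]  # diversity beats the duplicate
```

The diminishing-returns structure is what lets a cheap greedy pass replace exhaustive search over candidate action sets.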

ICML Conference 2025 Conference Paper

Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modeling

  • Xiang Hu
  • Zhihao Teng
  • Jun Zhao
  • Wei Wu
  • Kewei Tu

Despite the success of Transformers, handling longer contexts remains challenging due to the limited length generalization and quadratic complexity of self-attention, which often requires post-training with a larger attention window, significantly increasing computational and memory costs. In this paper, we propose a novel attention mechanism based on dynamic context, Grouped Cross Attention (GCA), which can generalize to 1000 $\times$ the pre-training context length while maintaining the ability to access distant information with a constant attention window size. For a given input sequence, we split it into chunks and use each chunk to retrieve top-$k$ relevant past chunks for subsequent text generation. Specifically, unlike most previous works that use an off-the-shelf retriever, our key innovation allows the retriever to learn how to retrieve past chunks that better minimize the auto-regressive loss of subsequent tokens in an end-to-end manner, which adapts better to causal language models. Such a mechanism accommodates retrieved chunks with a fixed-size attention window to achieve long-range information access, significantly reducing computational and memory costs during training and inference. Experiments show that GCA-based models achieve near-perfect accuracy in passkey retrieval for 16M context lengths, which is $1000 \times$ the training length.
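The chunk-retrieval step can be sketched with mean-pooled chunk representations and dot-product scoring. This is a simplification: GCA learns the retriever end-to-end through the auto-regressive loss rather than using fixed pooling:

```python
import numpy as np

def topk_past_chunks(hidden, chunk, k):
    """For each chunk, score all strictly earlier chunks by the dot product
    of mean-pooled chunk representations and return the top-k indices."""
    n_chunks = hidden.shape[0] // chunk
    pooled = hidden[: n_chunks * chunk].reshape(n_chunks, chunk, -1).mean(axis=1)
    picks = []
    for i in range(n_chunks):
        if i == 0:
            picks.append([])           # first chunk has no past to retrieve
            continue
        scores = pooled[:i] @ pooled[i]
        order = np.argsort(scores)[::-1][: min(k, i)]
        picks.append(sorted(order.tolist()))
    return picks

rng = np.random.default_rng(3)
h = rng.standard_normal((8 * 4, 16))   # 8 chunks of 4 tokens, 16-dim states
h[6 * 4 : 7 * 4] = h[1 * 4 : 2 * 4]    # chunk 6 repeats chunk 1 (a "passkey")
picks = topk_past_chunks(h, chunk=4, k=2)
assert 1 in picks[6]  # the matching past chunk is retrieved
```

Because each chunk attends only to its own window plus `k` retrieved chunks, the attention cost stays constant regardless of how far back the relevant information lies.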

IJCAI Conference 2025 Conference Paper

Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification

  • Xiaoshuo Yan
  • Zhaochuan Li
  • Lei Meng
  • Zhuang Qi
  • Wei Wu
  • Zixuan Li
  • Xiangxu Meng

Causal inference has emerged as a promising approach to mitigate long-tail classification by handling the biases introduced by class imbalance. However, along with the change of advanced backbone models from Convolutional Neural Networks (CNNs) to Visual Transformers (ViT), existing causal models may not achieve an expected performance gain. This paper investigates the influence of existing causal models on CNNs and ViT variants, highlighting that ViT's global feature representation makes it hard for causal methods to model associations between fine-grained features and predictions, which leads to difficulties in classifying tail classes with similar visual appearance. To address these issues, this paper proposes TSCNet, a two-stage causal modeling method to discover fine-grained causal associations through multi-scale causal interventions. Specifically, in the hierarchical causal representation learning stage (HCRL), it decouples the background and objects, applying backdoor interventions at both the patch and feature level to prevent the model from using class-irrelevant areas to infer labels, which enhances fine-grained causal representation. In the counterfactual logits' bias calibration stage (CLBC), it refines the optimization of the model's decision boundary by adaptively constructing a counterfactual balanced data distribution to remove the spurious associations in the logits caused by data distribution. Extensive experiments conducted on various long-tail benchmarks demonstrate that the proposed TSCNet can eliminate multiple biases introduced by data imbalance, which outperforms existing methods.

NeurIPS Conference 2025 Conference Paper

Generalized and Invariant Single-Neuron In-Vivo Activity Representation Learning

  • Wei Wu
  • Yuxing Lu
  • Zhengrui Guo
  • Chi Zhang
  • Can Liao
  • Yifan Bu
  • Fangxu Zhou
  • Jinzhuo Wang

In computational neuroscience, models representing single-neuron in-vivo activity have become essential for understanding the functional identities of individual neurons. These models, such as implicit representation methods based on Transformer architectures, contrastive learning frameworks, and variational autoencoders, aim to capture the invariant and intrinsic computational features of single neurons. The learned single-neuron computational role representations should remain invariant across changing environments and are affected by their molecular expression and location. Thus, the representations allow for in vivo prediction of the molecular cell types and anatomical locations of single neurons, facilitating advanced closed-loop experimental designs. However, current models face the problem of limited generalizability. This is due to batch effects caused by differences in experimental design, animal subjects, and recording platforms. These confounding factors often lead to overfitting, reducing the robustness and practical utility of the models across various experimental scenarios. Previous studies have not rigorously evaluated how well the models generalize to new animals or stimulus conditions, creating a significant gap in the field. To solve this issue, we present a comprehensive experimental protocol that explicitly evaluates model performance on unseen animals and stimulus types. Additionally, we propose a model-agnostic adversarial training strategy. In this strategy, a discriminator network is used to eliminate batch-related information from the learned representations. The adversarial framework forces the representation model to focus on the intrinsic properties of neurons, thereby enhancing generalizability. Our approach is compatible with all major single-neuron representation models and significantly improves model robustness.
This work emphasizes the importance of generalization in single-neuron representation models and offers an effective solution, paving the way for the practical application of computational models in vivo. It also shows potential for building unified atlases based on single-neuron in vivo activity.

NeurIPS Conference 2025 Conference Paper

Hardware-aligned Hierarchical Sparse Attention for Efficient Long-term Memory Access

  • Xiang Hu
  • Jiaqi Leng
  • Jun Zhao
  • Kewei Tu
  • Wei Wu

A key advantage of Recurrent Neural Networks (RNNs) over Transformers is that their linear computational and space complexity enables faster training and inference for long sequences. However, RNNs are fundamentally unable to randomly access historical context, and simply integrating attention mechanisms may undermine their efficiency advantages. To overcome this limitation, we propose \textbf{H}ierarchical \textbf{S}parse \textbf{A}ttention (HSA), a novel attention mechanism that enhances RNNs with long-range random access flexibility while preserving their merits in efficiency and length generalization. HSA divides inputs into chunks, selects the top-$k$ chunks, and hierarchically aggregates information. The core innovation lies in learning token-to-chunk relevance based on fine-grained token-level information inside each chunk. This approach enhances the precision of chunk selection across both in-domain and out-of-domain context lengths. To make HSA efficient, we further introduce a hardware-aligned kernel design. By combining HSA with Mamba, we introduce RAMba, which achieves perfect accuracy in passkey retrieval across 64 million contexts despite pre-training on only 4K-length contexts, and significant improvements on various downstream tasks, with nearly constant memory footprint. These results show RAMba's huge potential in long-context modeling.

NeurIPS Conference 2025 Conference Paper

KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment

  • Yuxing Lu
  • Wei Wu
  • Xukai Zhao
  • Rui Peng
  • Jinzhuo Wang

Maintaining comprehensive and up-to-date knowledge graphs (KGs) is critical for modern AI systems, but manual curation struggles to scale with the rapid growth of scientific literature. This paper presents KARMA, a novel framework employing multi-agent large language models (LLMs) to automate KG enrichment through structured analysis of unstructured text. Our approach employs nine collaborative agents, spanning entity discovery, relation extraction, schema alignment, and conflict resolution, that iteratively parse documents, verify extracted knowledge, and integrate it into existing graph structures while adhering to domain-specific schema. Experiments on 1,200 PubMed articles from three different domains demonstrate the effectiveness of KARMA in knowledge graph enrichment, with the identification of up to 38,230 new entities while achieving 83.1% LLM-verified correctness and reducing conflict edges by 18.6% through multi-layer assessments.

ICLR Conference 2025 Conference Paper

Neuron Platonic Intrinsic Representation From Dynamics Using Contrastive Learning

  • Wei Wu
  • Can Liao
  • Zizhen Deng
  • Zhengrui Guo
  • Jinzhuo Wang

The Platonic Representation Hypothesis posits that behind different modalities of data (what we sense or detect), there exists a universal, modality-independent representation of reality. Inspired by this, we treat each neuron as a system, where we can detect the neuron’s multi-segment activity data under different peripheral conditions. We believe that, similar to the Platonic idea, there exists a time-invariant representation behind the different segments of the same neuron, which reflects the intrinsic properties of the neuron’s system. Intrinsic properties include the molecular profiles, brain regions and morphological structure, etc. The optimization objective for obtaining the intrinsic representation of neurons should satisfy two criteria: (I) segments from the same neuron should have a higher similarity than segments from different neurons; (II) the representations should generalize well to out-of-domain data. To achieve this, we employ contrastive learning, treating different segments from the same neuron as positive pairs and segments from different neurons as negative pairs. During the implementation, we chose VICReg, which uses only positive pairs for optimization but indirectly separates dissimilar samples via regularization terms. To validate the efficacy of our method, we first applied it to simulated neuron population dynamics data generated using the Izhikevich model. We successfully confirmed that our approach captures the type of each neuron as defined by preset hyperparameters. We then applied our method to two real-world neuron dynamics datasets, including spatial transcriptomics-derived neuron type annotations and the brain regions where each neuron is located. The learned representations from our model not only predict neuron type and location but also show robustness when tested on out-of-domain data (unseen animals).
This demonstrates the potential of our approach in advancing the understanding of neuronal systems and offers valuable insights for future neuroscience research.
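
The positive-pair objective described above can be sketched as a VICReg-style loss in plain NumPy; the loss weights, batch shapes, and hinge target below are illustrative assumptions, not the paper's settings:

```python
import numpy as np

def vicreg_loss(z_a, z_b, sim_w=25.0, var_w=25.0, cov_w=1.0, eps=1e-4):
    """VICReg-style objective on two batches of segment embeddings
    (rows: neurons, columns: embedding dimensions)."""
    n, d = z_a.shape
    # Invariance: pull embeddings of segments from the same neuron together.
    sim = np.mean((z_a - z_b) ** 2)
    var = cov = 0.0
    for z in (z_a, z_b):
        # Variance: hinge keeps each dimension's std above 1 (anti-collapse).
        std = np.sqrt(z.var(axis=0) + eps)
        var += np.mean(np.maximum(0.0, 1.0 - std))
        # Covariance: penalize squared off-diagonal covariances (decorrelate).
        zc = z - z.mean(axis=0)
        c = (zc.T @ zc) / (n - 1)
        cov += (np.sum(c ** 2) - np.trace(c ** 2)) / d
    return sim_w * sim + var_w * var + cov_w * cov
```

Because only positive pairs enter the invariance term, the variance and covariance regularizers are what keep embeddings of different neurons from collapsing onto a single point.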

AAAI Conference 2025 Conference Paper

ProtCLIP: Function-Informed Protein Multi-Modal Learning

  • Hanjing Zhou
  • Mingze Yin
  • Wei Wu
  • Mingyang Li
  • Kun Fu
  • Jintai Chen
  • Jian Wu
  • Zheng Wang

Multi-modality pre-training paradigms that align protein sequences and biological descriptions have learned general protein representations and achieved promising performance in various downstream applications. However, these works have been unable to replicate the extraordinary success of language-supervised visual foundation models, due to the ineffective usage of aligned protein-text paired data and the lack of an effective function-informed pre-training paradigm. To address these issues, this paper curates a large-scale protein-text paired dataset called ProtAnno with a property-driven sampling strategy, and introduces a novel function-informed protein pre-training paradigm. Specifically, the sampling strategy determines the selection probability based on sample confidence and property coverage, balancing data quality and data quantity in the face of large-scale noisy data. Furthermore, motivated by the significance of protein-specific functional mechanisms, the proposed paradigm explicitly models static and dynamic protein functional segments with two segment-wise pre-training objectives, injecting fine-grained information in a function-informed manner. Leveraging all these innovations, we develop ProtCLIP, a multi-modality foundation model that comprehensively represents function-aware protein embeddings. On 22 different protein benchmarks within 5 types, including protein functionality classification, mutation effect prediction, cross-modal transformation, semantic similarity inference and protein-protein interaction prediction, ProtCLIP consistently achieves SOTA performance, with remarkable improvements of 75% on average across five cross-modal transformation benchmarks, 59.9% in GO-CC and 39.7% in GO-BP protein function prediction. The experimental results verify the extraordinary potential of ProtCLIP as a protein multi-modality foundation model.

IROS Conference 2025 Conference Paper

Real-time Whole-body Motion Planning Based on Optimized NMPC in Static and Dynamic Environments for Mobile Manipulator

  • Wei Wu
  • Ximeng Zhou
  • Fei Yan
  • Shouxing Zhang
  • Yan Zhuang
  • Guiyang Xin

Research on mobile manipulators has recently attracted increasing attention. Ensuring that mobile manipulators can meet obstacle avoidance constraints and efficiently accomplish assigned tasks in dynamic environments remains a significant challenge. To address this issue, this paper proposes an integrated framework for environment perception, real-time planning, and control optimization. Firstly, we develop a fusion map that combines a Euclidean signed distance field (ESDF) with clustered point clouds occupying cubes, enabling robots to perceive more precise environmental information in complex and changing conditions. Secondly, we introduce a novel rapid generation strategy for 6-DOF guide point sequences, which directs the mobile manipulator to follow the most efficient path to the target location while making real-time adjustments to avoid dynamic obstacles. Additionally, utilizing optimized nonlinear model predictive control (NMPC), we design a whole-body motion controller for the mobile manipulator to prevent the system from becoming trapped in local optima, thereby allowing the manipulator to promptly adjust its state while tracking guide points in complex indoor environments. Finally, the proposed algorithm was implemented on a mobile manipulator with an Ackermann base and tested through both simulations and real-world experiments.

NeurIPS Conference 2025 Conference Paper

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

  • Junfei Wu
  • Jian Guan
  • Kaituo Feng
  • Qiang Liu
  • Shu Wu
  • Liang Wang
  • Wei Wu
  • Tieniu Tan

As textual reasoning with large language models (LLMs) has advanced significantly, there has been growing interest in enhancing the multimodal reasoning capabilities of large vision-language models (LVLMs). However, existing methods primarily approach multimodal reasoning in a straightforward, text-centric manner, where both reasoning and answer derivation are conducted purely through text, with the only difference being the presence of multimodal input. As a result, these methods often encounter fundamental limitations in spatial reasoning tasks that demand precise geometric understanding and continuous spatial tracking, capabilities that humans achieve through mental visualization and manipulation. To address these limitations, we propose drawing to reason in space, a novel paradigm that enables LVLMs to reason through elementary drawing operations in the visual space. By equipping models with basic drawing operations, including annotating bounding boxes and drawing auxiliary lines, we empower them to express and analyze spatial relationships through direct visual manipulation, while avoiding the performance ceiling imposed by specialized perception tools in previous tool-integrated reasoning approaches. To cultivate this capability, we develop a three-stage training framework: cold-start training with synthetic data to establish basic drawing abilities, reflective rejection sampling to enhance self-reflection behaviors, and reinforcement learning to directly optimize for target rewards. Extensive experiments demonstrate that our model, named Spark, consistently outperforms existing methods across diverse spatial reasoning benchmarks involving maze navigation, static spatial reasoning, video-based reasoning and multi-view-based reasoning tasks, with an average improvement of 11.5%. Ablation studies reveal the critical role of each training stage, with reflective rejection sampling particularly enhancing the model's self-correction capabilities and reasoning potential.

NeurIPS Conference 2025 Conference Paper

RoboScape: Physics-informed Embodied World Model

  • Yu Shang
  • Xin Zhang
  • Yinzhou Tang
  • Lei Jin
  • Chen Gao
  • Wei Wu
  • Yong Li

World models have become indispensable tools for embodied intelligence, serving as powerful simulators capable of generating realistic robotic videos while addressing critical data scarcity challenges. However, current embodied world models exhibit limited physical awareness, particularly in modeling 3D geometry and motion dynamics, resulting in unrealistic video generation for contact-rich robotic scenarios. In this paper, we present RoboScape, a unified physics-informed world model that jointly learns RGB video generation and physics knowledge within an integrated framework. We introduce two key physics-informed joint training tasks: temporal depth prediction that enhances 3D geometric consistency in video rendering, and keypoint dynamics learning that implicitly encodes physical properties (e.g., object shape and material characteristics) while improving complex motion modeling. Extensive experiments demonstrate that RoboScape generates videos with superior visual fidelity and physical plausibility across diverse robotic scenarios. We further validate its practical utility through downstream applications including robotic policy training with generated data and policy evaluation. Our work provides new insights for building efficient physics-informed world models to advance embodied intelligence research. Our code and demos are available at: https://github.com/tsinghua-fib-lab/RoboScape.

IJCAI Conference 2025 Conference Paper

Semantic-Space-Intervened Diffusive Alignment for Visual Classification

  • Zixuan Li
  • Lei Meng
  • Guoqing Chao
  • Wei Wu
  • Yimeng Yang
  • Xiaoshuo Yan
  • Zhuang Qi
  • Xiangxu Meng

Cross-modal alignment is an effective approach to improving visual classification. Existing studies typically enforce a one-step mapping that uses deep neural networks to project the visual features to mimic the distribution of textual features. However, they typically face difficulties in finding such a projection due to differences between the two modalities in both the distribution of class-wise samples and the range of their feature values. To address this issue, this paper proposes a novel Semantic-Space-Intervened Diffusive Alignment method, termed SeDA, which models a semantic space as a bridge in the visual-to-textual projection, considering that both types of features share the same class-level information in classification. More importantly, a bi-stage diffusion framework is developed to enable progressive alignment between the two modalities. Specifically, SeDA first employs a Diffusion-Controlled Semantic Learner to model the semantic feature space of visual features by constraining the interactive features of the diffusion model and the category centers of visual features. In the later stage of SeDA, the Diffusion-Controlled Semantic Translator focuses on learning the distribution of textual features from the semantic space. Meanwhile, the Progressive Feature Interaction Network introduces stepwise feature interactions at each alignment step, progressively integrating textual information into mapped features. Experimental results show that SeDA achieves stronger cross-modal feature alignment, leading to superior performance over existing methods across multiple scenarios.

AAAI Conference 2025 Conference Paper

Synergy of GFlowNet and Protein Language Model Makes a Diverse Antibody Designer

  • Mingze Yin
  • Hanjing Zhou
  • Yiheng Zhu
  • Jialu Wu
  • Wei Wu
  • Mingyang Li
  • Kun Fu
  • Zheng Wang

Antibodies defend our health by binding to antigens with high specificity and potency, primarily relying on the Complementarity-Determining Region (CDR). Yet, current experimental methods for discovering new antibody CDRs are heavily time-consuming. Computational design could alleviate this burden; in particular, protein language models have proven quite beneficial in many recent studies. However, most existing models solely focus on antibody potency and struggle to encapsulate the diverse range of plausible CDR candidates, limiting their effectiveness in real-world scenarios where binding is only one factor among the multitude of drug-forming criteria. In this paper, we introduce PG-AbD, a framework uniting Generative Flow Networks (GFlowNets) and pretrained Protein Language Models (PLMs) to generate highly potent, diverse and novel antibody candidates. We innovatively construct a Product of Experts (PoE) composed of the global-distribution-modeling PLM and the local-distribution-modeling Potts model to serve as the reward function of the GFlowNet. A joint training paradigm is introduced, where the PoE is trained by contrastive divergence with negative samples generated by the GFlowNet, and then guides the GFlowNet to sample diverse antibody candidates. We evaluate PG-AbD on extensive antibody design benchmarks. It significantly outperforms existing methods in diversity (13.5% on RabDab, 31.1% on SabDab) while maintaining optimal potency and novelty. Generated antibodies are also found to form stable, regular 3D structures with their corresponding antigens, demonstrating the great potential of PG-AbD to accelerate real-world antibody discovery.
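
The Product-of-Experts reward can be illustrated generically: two expert distributions over the same candidate set are combined by a normalized element-wise product. This is a sketch only; the paper's experts are a trained PLM and a Potts model, not the toy vectors here:

```python
import numpy as np

def product_of_experts(p_global, p_local):
    """Combine two expert distributions over the same candidates via a
    normalized element-wise product (generic PoE; illustrative only)."""
    scores = np.asarray(p_global, dtype=float) * np.asarray(p_local, dtype=float)
    return scores / scores.sum()
```

A candidate only scores highly when both experts assign it mass, which is the property a PoE reward exploits to balance global plausibility against local sequence statistics.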

NeurIPS Conference 2025 Conference Paper

Theoretical Benefit and Limitation of Diffusion Language Model

  • Guhao Feng
  • Yihan Geng
  • Jian Guan
  • Wei Wu
  • Liwei Wang
  • Di He

Diffusion language models have emerged as a new approach for text generation. By enabling the parallel sampling of multiple tokens in each diffusion step, they appear to offer a more efficient alternative to auto-regressive models. However, our observations show that current open-source diffusion language models require more sampling steps to achieve comparable accuracy on representative tasks, resulting in even higher inference costs than their auto-regressive counterparts. To investigate whether this is an inherent limitation, we conduct a rigorous theoretical analysis of a widely adopted variant: the Masked Diffusion Model (MDM). Surprisingly, our analysis reveals that the conclusion is highly sensitive to the choice of evaluation metric. Under mild conditions, we prove that when the target is near-optimal perplexity, MDMs can achieve this goal in a constant number of sampling steps, independent of sequence length. This result demonstrates that efficiency can, in principle, be attained without compromising generation quality. However, when targeting a low sequence error rate, which is important for assessing the "correctness" of a generated sequence such as a reasoning chain, we show that in the worst case the required sampling steps must scale linearly with sequence length, thereby eliminating the efficiency advantage. Our analysis establishes the first theoretical foundation for understanding the comparative strengths and limitations of MDMs, offering practical guidance on when to favor MDMs over auto-regressive models and vice versa.

IROS Conference 2025 Conference Paper

TIETracker: A CLIP-based RGB-T Tracking via Feature Interaction and Semantic Enhancement

  • Weidai Xia
  • Xingliang Mao
  • Wei Wu
  • Chengzhang Zhu
  • Fangfang Li

The goal of RGB-T tracking is to enhance accuracy and robustness by leveraging the complementary features of the RGB and TIR modalities in complex scenarios. Previous methods have overlooked the power of semantic features in extracting valuable information from different modalities and improving interactions across them. Moreover, using Bounding Boxes (BBox) for target initialization can cause issues like bounding box blurring and tracking drift when the target's appearance changes or it gets occluded. To address these challenges, we propose the CLIP-based RGB-T tracking algorithm TIETracker, which aims to exploit the complementary advantages of multimodality more effectively using textual information. Textual descriptions direct the backbone network to learn target representations across modalities and facilitate the interaction of multi-modal features. Additionally, in scenarios of occlusion and scale transformation that lead to missing or altered target features, textual information adaptively supplements the target representation. This approach also improves the response in the image region of the target, addressing issues with bounding box accuracy and tracking drift. Our extensive evaluation on three leading RGB-T tracking benchmarks demonstrates that TIETracker achieves competitive performance compared to state-of-the-art methods, effectively countering feature loss from changes in target appearance and occlusion.

NeurIPS Conference 2025 Conference Paper

Towards Doctor-Like Reasoning: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients

  • Yuxing Lu
  • Gecheng Fu
  • Wei Wu
  • Xukai Zhao
  • Sin Yee Goi
  • Jinzhuo Wang

Existing medical RAG systems mainly leverage knowledge from medical knowledge bases, neglecting the crucial role of experiential knowledge derived from similar patient cases, a key component of human clinical reasoning. To bridge this gap, we propose DoctorRAG, a RAG framework that emulates doctor-like reasoning by integrating both explicit clinical knowledge and implicit case-based experience. DoctorRAG enhances retrieval precision by first allocating conceptual tags to queries and knowledge sources, together with a hybrid retrieval mechanism that draws on both relevant knowledge and similar patient cases. In addition, a Med-TextGrad module using multi-agent textual gradients is integrated to ensure that the final output adheres to the retrieved knowledge and the patient query. Comprehensive experiments on multilingual, multitask datasets demonstrate that DoctorRAG significantly outperforms strong baseline RAG models and gains improvements from iterative refinement. Our approach generates more accurate, relevant, and comprehensive responses, taking a step towards more doctor-like medical reasoning systems.

NeurIPS Conference 2025 Conference Paper

Understanding Parametric and Contextual Knowledge Reconciliation within Large Language Models

  • Jun Zhao
  • Yongzhuo Yang
  • Xiang Hu
  • Jingqi Tong
  • Yi Lu
  • Wei Wu
  • Tao Gui
  • Qi Zhang

Retrieval-Augmented Generation (RAG) provides additional contextual knowledge to complement the parametric knowledge in Large Language Models (LLMs). These two sources of knowledge interweave to enhance the accuracy and timeliness of LLM responses. However, the internal mechanisms by which LLMs utilize this knowledge remain unclear. We propose modeling the forward propagation of knowledge as an entity flow, employing this framework to trace LLMs' internal behaviors when processing mixed-source knowledge. Linear probing utilizes a trainable linear classifier to detect specific attributes in hidden layers. However, once trained, a probe cannot adapt to dynamically specified entities. To address this challenge, we construct an entity-aware probe, which introduces special tokens to mark probing targets and employs a small trainable rank-8 LoRA update to process these special markers. We first verify this approach through an attribution experiment, demonstrating that it can accurately detect information about ad-hoc entities in complex hidden states. Next, we trace entity flows across layers to understand how LLMs reconcile conflicting knowledge internally. Our probing results reveal that contextual and parametric knowledge are routed between tokens through distinct sets of attention heads, supporting attention competition only within knowledge types. While conflicting knowledge maintains a residual presence across layers, aligned knowledge from multiple sources gradually accumulates, with the magnitude of this accumulation directly determining its influence on final outputs.
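
The rank-8 update mentioned for the entity-aware probe follows the standard LoRA form; a minimal sketch in NumPy, where the matrix sizes are assumptions chosen for illustration:

```python
import numpy as np

def lora_apply(W, A, B, alpha=1.0):
    """Additive low-rank adaptation of a frozen weight matrix:
    W' = W + alpha * A @ B, with A of shape (d_out, r) and B of
    shape (r, d_in), so the update has rank at most r."""
    return W + alpha * (A @ B)
```

Only A and B (here rank r = 8) would be trained, which is why such a probe stays small compared to the frozen base weights.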

NeurIPS Conference 2024 Conference Paper

AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback

  • Jian Guan
  • Wei Wu
  • Zujie Wen
  • Peng Xu
  • Hongning Wang
  • Minlie Huang

The notable success of large language models (LLMs) has sparked an upsurge in building language agents to complete various complex tasks. We present AMOR, an agent framework based on open-source LLMs, which reasons with external knowledge bases and adapts to specific domains through human supervision of the reasoning process. AMOR builds its reasoning logic over a finite state machine (FSM) that solves problems through autonomous executions and transitions over disentangled modules. This allows humans to provide direct feedback to the individual modules, and thus naturally forms process supervision. Based on this reasoning and feedback framework, we develop AMOR through two-stage fine-tuning: warm-up and adaptation. The former fine-tunes the LLM with examples automatically constructed from various public datasets, enabling AMOR to generalize across different knowledge environments, while the latter tailors AMOR to specific domains using process feedback. Extensive experiments across multiple domains demonstrate the advantage of AMOR over strong baselines, thanks to its FSM-based reasoning and process feedback mechanism. The code and data are publicly available at https://github.com/JianGuanTHU/AMOR.

ICLR Conference 2024 Conference Paper

Augmenting Transformers with Recursively Composed Multi-grained Representations

  • Xiang Hu
  • Qingyang Zhu
  • Kewei Tu
  • Wei Wu

We present ReCAT, a recursive composition augmented Transformer that is able to explicitly model hierarchical syntactic structures of raw texts without relying on gold trees during both learning and inference. Existing research along this line restricts data to follow a hierarchical tree structure and thus lacks inter-span communications. To overcome the problem, we propose a novel contextual inside-outside (CIO) layer that learns contextualized representations of spans through bottom-up and top-down passes, where a bottom-up pass forms representations of high-level spans by composing low-level spans, while a top-down pass combines information inside and outside a span. By stacking several CIO layers between the embedding layer and the attention layers in Transformer, the ReCAT model can perform both deep intra-span and deep inter-span interactions, and thus generate multi-grained representations fully contextualized with other spans. Moreover, the CIO layers can be jointly pre-trained with Transformers, making ReCAT enjoy scaling ability, strong performance, and interpretability at the same time. We conduct experiments on various sentence-level and span-level tasks. Evaluation results indicate that ReCAT can significantly outperform vanilla Transformer models on all span-level tasks and recursive models on natural language inference tasks. More interestingly, the hierarchical structures induced by ReCAT exhibit strong consistency with human-annotated syntactic trees, indicating good interpretability brought by the CIO layers.

NeurIPS Conference 2024 Conference Paper

Bridge-IF: Learning Inverse Protein Folding with Markov Bridges

  • Yiheng Zhu
  • Jialu Wu
  • Qiuyi Li
  • Jiahuan Yan
  • Mingze Yin
  • Wei Wu
  • Mingyang Li
  • Jieping Ye

Inverse protein folding is a fundamental task in computational protein design, which aims to design protein sequences that fold into the desired backbone structures. While the development of machine learning algorithms for this task has seen significant success, the prevailing approaches, which predominantly employ a discriminative formulation, frequently encounter the error accumulation issue and often fail to capture the extensive variety of plausible sequences. To fill these gaps, we propose Bridge-IF, a generative diffusion bridge model for inverse folding, which is designed to learn the probabilistic dependency between the distributions of backbone structures and protein sequences. Specifically, we harness an expressive structure encoder to propose a discrete, informative prior derived from structures, and establish a Markov bridge to connect this prior with native sequences. During the inference stage, Bridge-IF progressively refines the prior sequence, culminating in a more plausible design. Moreover, we introduce a reparameterization perspective on Markov bridge models, from which we derive a simplified loss function that facilitates more effective training. We also modulate protein language models (PLMs) with structural conditions to precisely approximate the Markov bridge process, thereby significantly enhancing generation performance while maintaining parameter-efficient training. Extensive experiments on well-established benchmarks demonstrate that Bridge-IF predominantly surpasses existing baselines in sequence recovery and excels in the design of plausible proteins with high foldability. The code is available at https://github.com/violet-sto/Bridge-IF.

NeurIPS Conference 2024 Conference Paper

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

  • Wei Wu
  • Kecheng Zheng
  • Shuailei Ma
  • Fan Lu
  • Yuxin Guo
  • Yifei Zhang
  • Wei Chen
  • Qingpei Guo

In this work, we empirically confirm that the key reason causing such an issue is that training images are usually paired with short captions, leaving certain tokens easily overshadowed by salient tokens. Towards this problem, our initial attempt is to relabel the data with long captions; however, directly learning from these may lead to performance degradation in understanding short text (e.g., in the image classification task). Then, after incorporating corner tokens to aggregate diverse textual information, we manage to help the model catch up to its original level of short text understanding while greatly enhancing its capability for long text understanding. We further look into whether the model can continuously benefit from longer captions and notice a clear trade-off between performance and efficiency. Finally, we validate the effectiveness of our approach using a self-constructed large-scale dataset, which consists of 100M long-caption-oriented text-image pairs. Our method achieves superior performance in long-text-image retrieval tasks. The project page is available at https://wuw2019.github.io/lot-lip.

JBHI Journal 2024 Journal Article

MSVTNet: Multi-Scale Vision Transformer Neural Network for EEG-Based Motor Imagery Decoding

  • Ke Liu
  • Tao Yang
  • Zhuliang Yu
  • Weibo Yi
  • Hong Yu
  • Guoyin Wang
  • Wei Wu

Objective: Transformer-based neural networks have been applied to electroencephalography (EEG) decoding for motor imagery (MI). However, most networks focus on applying the self-attention mechanism to extract global temporal information, while the cross-frequency coupling features between different frequencies have been neglected. Additionally, effectively integrating different neural networks poses challenges for the advanced design of decoding algorithms. Methods: This study proposes a novel end-to-end Multi-Scale Vision Transformer Neural Network (MSVTNet) for MI-EEG classification. MSVTNet first extracts local spatio-temporal features at different filtered scales through convolutional neural networks (CNNs). Then, these features are concatenated along the feature dimension to form local multi-scale spatio-temporal feature tokens. Finally, Transformers are utilized to capture cross-scale interaction information and global temporal correlations, providing more distinguishable feature embeddings for classification. Moreover, auxiliary branch loss is leveraged for intermediate supervision to ensure the effective integration of CNNs and Transformers. Results: The performance of MSVTNet was assessed through subject-dependent (session-dependent and session-independent) and subject-independent experiments on three MI datasets, i.e., the BCI Competition IV 2a, 2b and OpenBMI datasets. The experimental results demonstrate that MSVTNet achieves state-of-the-art performance in all analyses. Conclusion: MSVTNet shows superiority and robustness in enhancing MI decoding performance.

JBHI Journal 2024 Journal Article

Sleep Stage Classification Via Multi-View Based Self-Supervised Contrastive Learning of EEG

  • Chen Zhao
  • Wei Wu
  • Haoyi Zhang
  • Ruiyan Zhang
  • Xinyue Zheng
  • Xiangzeng Kong

Self-supervised learning (SSL) is a challenging task in sleep stage classification (SSC) that is capable of mining valuable representations from unlabeled data. However, traditional SSL methods typically focus on single-view learning and do not fully exploit the interactions among information across multiple views. In this study, we focused on a multi-domain view of the same EEG signal and developed a self-supervised multi-view representation learning framework via time series and time–frequency contrasting (MV-TTFC). In the MV-TTFC framework, we built in a cross-domain view contrastive learning prediction task to establish connections between the temporal view and the time–frequency (TF) view, thereby enhancing the information exchange between multiple views. In addition, to improve the quality of the TF view inputs, we introduced an enhanced multisynchrosqueezing transform, which can create high-energy-concentration TF image views to compensate for the inaccurate representations of traditional TF processing techniques. Finally, integrating temporal, TF, and fusion space contrastive learning effectively captured the latent features in EEG signals. We evaluated MV-TTFC on two real-world SSC datasets (SleepEDF-78 and SHHS) and compared it with baseline methods in downstream tasks. Our method exhibited state-of-the-art performance, achieving accuracies of 78.64% and 81.45% on SleepEDF-78 and SHHS, respectively, and macro F1-scores of 70.39% on SleepEDF-78 and 70.47% on SHHS.

NeurIPS Conference 2024 Conference Paper

SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction

  • Wei Wu
  • Xiaoxin Feng
  • Ziyan Gao
  • Yuheng Kan

Data-driven autonomous driving motion generation tasks are frequently impacted by the limitations of dataset size and the domain gap between datasets, which precludes their extensive application in real-world scenarios. To address this issue, we introduce SMART, a novel autonomous driving motion generation paradigm that models vectorized map and agent trajectory data as discrete sequence tokens. These tokens are then processed through a decoder-only transformer architecture trained on the next-token prediction task across spatial-temporal series. This GPT-style method allows the model to learn the motion distribution of real driving scenarios. SMART achieves state-of-the-art performance across most of the metrics on the generative Sim Agents challenge, ranking 1st on the leaderboard of the Waymo Open Motion Dataset (WOMD) while demonstrating remarkable inference speed. Moreover, SMART, as a generative model in the autonomous driving motion domain, exhibits zero-shot generalization capabilities: using only the NuPlan dataset for training and WOMD for validation, SMART achieved a competitive score of 0.72 on the Sim Agents challenge. Lastly, we have collected over 1 billion motion tokens from multiple datasets, validating the model's scalability. These results suggest that SMART has initially demonstrated two important properties, scalability and zero-shot generalization, and preliminarily meets the needs of large-scale real-time simulation applications. We have released all the code to promote the exploration of models for motion generation in the autonomous driving field. The source code is available at https://github.com/rainmaker22/SMART.
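
Mapping continuous trajectories to a discrete vocabulary for next-token prediction can be sketched as per-step displacement binning; the bin layout and token encoding below are assumptions for illustration, not SMART's actual motion vocabulary:

```python
import numpy as np

def tokenize_motion(xy, bins):
    """Map per-step (dx, dy) displacements of a trajectory to discrete
    token ids by binning each axis independently and pairing the bins."""
    deltas = np.diff(np.asarray(xy, dtype=float), axis=0)
    ix = np.digitize(deltas[:, 0], bins)  # bin index of dx per step
    iy = np.digitize(deltas[:, 1], bins)  # bin index of dy per step
    # One id per timestep; vocabulary size is (len(bins) + 1) ** 2.
    return ix * (len(bins) + 1) + iy
```

Once motion is tokenized this way, a decoder-only transformer can be trained on the resulting sequences exactly as a language model is.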

AAAI Conference 2024 Conference Paper

SwiftPillars: High-Efficiency Pillar Encoder for Lidar-Based 3D Detection

  • Xin Jin
  • Kai Liu
  • Cong Ma
  • Ruining Yang
  • Fei Hui
  • Wei Wu

Lidar-based 3D detection is one of the significant components of autonomous driving. However, current methods over-focus on improving the performance of 3D Lidar perception, which causes network architectures to become complicated and hard to deploy, making the methods difficult to apply in autonomous driving for real-time processing. In this paper, we propose a high-efficiency network, SwiftPillars, which includes a Swift Pillar Encoder (SPE) and a Multi-scale Aggregation Decoder (MAD). The SPE is constructed from a concise Dual-attention Module with lightweight operators. The Dual-attention Module utilizes feature pooling, matrix multiplication, etc., to speed up point-wise and channel-wise attention extraction and fusion. The MAD interconnects multiple scale features extracted by the SPE with minimal computational cost to improve performance. In our experiments, our proposal achieves 61.3% NDS and 53.2% mAP on the nuScenes dataset. In addition, we evaluate inference time on several platforms (P4, T4, A2, MLU370, RTX3080), where SwiftPillars achieves up to 13.3 ms (75 FPS) on an NVIDIA Tesla T4. Compared with PointPillars, SwiftPillars is on average 26.58% faster in inference speed with equivalent GPUs and achieves a higher mAP of approximately 3.2% on the nuScenes dataset.

NeurIPS Conference 2024 Conference Paper

Tackling Uncertain Correspondences for Multi-Modal Entity Alignment

  • Liyi Chen
  • Ying Sun
  • Shengzhe Zhang
  • Yuyang Ye
  • Wei Wu
  • Hui Xiong

Recently, multi-modal entity alignment has emerged as a pivotal endeavor for the integration of Multi-Modal Knowledge Graphs (MMKGs) originating from diverse data sources. Existing works primarily focus on fully depicting entity features by designing various modality encoders or fusion approaches. However, uncertain correspondences between inter-modal or intra-modal cues, such as weak inter-modal associations, description diversity, and modality absence, still severely hinder the effective exploration of aligned entity similarities. To this end, in this paper, we propose a novel Tackling uncertain correspondences method for Multi-modal Entity Alignment (TMEA). Specifically, to handle diverse attribute knowledge descriptions, we design alignment-augmented abstract representation that incorporates the large language model and in-context learning into attribute alignment and filtering for generating and embedding the attribute abstract. In order to mitigate the influence of the modality absence, we propose to unify all modality features into a shared latent subspace and generate pseudo features via variational autoencoders according to existing modal features. Then, we develop an inter-modal commonality enhancement mechanism based on cross-attention with orthogonal constraints, to address weak semantic associations between modalities. Extensive experiments on two real-world datasets validate the effectiveness of TMEA with a clear improvement over competitive baselines.

ICRA Conference 2023 Conference Paper

A Hybrid Quadratic Programming Framework for Real-Time Embedded Safety-Critical Control

  • Ryan M. Bena
  • Sushmit Hossain
  • Buyun Chen
  • Wei Wu
  • Quan Nguyen 0004

We present a new framework for implementing real-time embedded safety-critical controllers which utilizes hybrid computing to address the issue of limited computational resources, a problem that is particularly prevalent in microrobotics. In our approach, the nominal stabilizing control algorithm is implemented digitally while the safety-critical quadratic program is solved via a dedicated analog resistor array. We apply this hybrid computing architecture to a simulated collision avoidance task for a micro-aerial vehicle and show the benefit relative to a purely-digital implementation. By leveraging analog quadratic programming on the Crazyflie 2.1 micro quadrotor, a reduction in overall processing time from 8.9 ms to 0.6 ms is estimated for this computationally-limited system. We further display the viability of our proposed safety-critical control framework through real-time flight demonstrations, utilizing a novel prototype analog circuit tethered to the Crazyflie. The flight results confirm the functionality of the control structure and prototype circuit while highlighting the overall capabilities of hybrid computing.
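In its simplest form, a safety-critical quadratic program of the kind described above minimally modifies a nominal control input subject to a linear safety constraint; with a single active constraint the solution has a closed form. The sketch below is a minimal illustration of that general idea only (the function name and interface are hypothetical), not the paper's analog-circuit implementation:

```python
def safety_filter(u_nom, a, b):
    """Solve min ||u - u_nom||^2 subject to a.u >= b (one safety constraint).

    Closed form: if the nominal input already satisfies the constraint,
    keep it; otherwise project u_nom onto the boundary hyperplane a.u = b.
    """
    dot = sum(ai * ui for ai, ui in zip(a, u_nom))
    if dot >= b:
        return list(u_nom)  # nominal input is already safe
    norm2 = sum(ai * ai for ai in a)
    lam = (b - dot) / norm2  # multiplier of the active constraint
    return [ui + lam * ai for ui, ai in zip(u_nom, a)]

# A nominal input violating the constraint gets minimally corrected.
u_safe = safety_filter([1.0, 0.0], a=[0.0, 1.0], b=0.5)
```

Real safety filters stack many such constraints into a full QP; the single-constraint case is what makes a compact analog realization plausible.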

IJCAI Conference 2023 Conference Paper

Intent-aware Recommendation via Disentangled Graph Contrastive Learning

  • Yuling Wang
  • Xiao Wang
  • Xiangzhou Huang
  • Yanhua Yu
  • Haoyang Li
  • Mengdi Zhang
  • Zirui Guo
  • Wei Wu

Graph neural network (GNN) based recommender systems have become one of the mainstream trends due to their powerful ability to learn from user behavior data. Understanding user intents from behavior data is the key to recommender systems, which poses two basic requirements for GNN-based recommender systems. One is how to learn complex and diverse intents, especially when user behavior data is usually inadequate in reality. The other is that different behaviors have different intent distributions, so how to establish the relations between them for a more explainable recommender system. In this paper, we present Intent-aware Recommendation via Disentangled Graph Contrastive Learning (IDCL), which simultaneously learns interpretable intents and behavior distributions over those intents. Specifically, we first model the user behavior data as a user-item-concept graph, and design a GNN-based behavior disentangling module to learn the different intents. Then we propose intent-wise contrastive learning to enhance the intent disentangling and meanwhile infer the behavior distributions. Finally, coding rate reduction regularization is introduced to make the behaviors of different intents orthogonal. Extensive experiments demonstrate the effectiveness of IDCL in terms of both substantial improvement and interpretability.

IJCAI Conference 2023 Conference Paper

Local and Global: Temporal Question Answering via Information Fusion

  • Yonghao Liu
  • Di Liang
  • Mengyu Li
  • Fausto Giunchiglia
  • Ximing Li
  • Sirui Wang
  • Wei Wu
  • Lan Huang

Many models that leverage knowledge graphs (KGs) have recently demonstrated remarkable success in question answering (QA) tasks. In the real world, many facts contained in KGs are time-constrained thus temporal KGQA has received increasing attention. Despite the fruitful efforts of previous models in temporal KGQA, they still have several limitations. (I) They neither emphasize the graph structural information between entities in KGs nor explicitly utilize a multi-hop relation path through graph neural networks to enhance answer prediction. (II) They adopt pre-trained language models (LMs) to obtain question representations, focusing merely on the global information related to the question while not highlighting the local information of the entities in KGs. To address these limitations, we introduce a novel model that simultaneously explores both Local information and Global information for the task of temporal KGQA (LGQA). Specifically, we first introduce an auxiliary task in the temporal KG embedding procedure to make timestamp embeddings time-order aware. Then, we design information fusion layers that effectively incorporate local and global information to deepen question understanding. We conduct extensive experiments on two benchmarks, and LGQA significantly outperforms previous state-of-the-art models, especially in difficult questions. Moreover, LGQA can generate interpretable and trustworthy predictions.

IJCAI Conference 2022 Conference Paper

Ensemble Multi-Relational Graph Neural Networks

  • Yuling Wang
  • Hao Xu
  • Yanhua Yu
  • Mengdi Zhang
  • Zhenhao Li
  • Yuji Yang
  • Wei Wu

It is well established that graph neural networks (GNNs) can be interpreted and designed from the perspective of an optimization objective. With this clear optimization objective, the deduced GNN architecture has a sound theoretical foundation, which is able to flexibly remedy the weaknesses of GNNs. However, this optimization objective has only been proved for GNNs on single-relational graphs. Can we infer a new type of GNN for multi-relational graphs by extending this optimization objective, so as to simultaneously solve the issues in previous multi-relational GNNs, e.g., over-parameterization? In this paper, we propose novel ensemble multi-relational GNNs by designing an ensemble multi-relational (EMR) optimization objective. This EMR optimization objective is able to derive an iterative updating rule, which can be formalized as an ensemble message passing (EnMP) layer with multiple relations. We further analyze the nice properties of the EnMP layer, e.g., its relationship with multi-relational personalized PageRank. Finally, new multi-relational GNNs that alleviate the over-smoothing and over-parameterization issues are proposed. Extensive experiments conducted on four benchmark datasets demonstrate the effectiveness of the proposed model.

NeurIPS Conference 2022 Conference Paper

Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models

  • Biru Zhu
  • Yujia Qin
  • Ganqu Cui
  • Yangyi Chen
  • Weilin Zhao
  • Chong Fu
  • Yangdong Deng
  • Zhiyuan Liu

Despite the great success of pre-trained language models (PLMs) in a large set of natural language processing (NLP) tasks, there has been a growing concern about their security in real-world applications. Backdoor attack, which poisons a small number of training samples by inserting backdoor triggers, is a typical threat to security. Trained on the poisoned dataset, a victim model would perform normally on benign samples but predict the attacker-chosen label on samples containing pre-defined triggers. The vulnerability of PLMs under backdoor attacks has been proved with increasing evidence in the literature. In this paper, we present several simple yet effective training strategies that could effectively defend against such attacks. To the best of our knowledge, this is the first work to explore the possibility of backdoor-free adaptation for PLMs. Our motivation is based on the observation that, when trained on the poisoned dataset, the PLM's adaptation follows a strict order of two stages: (1) a moderate-fitting stage, where the model mainly learns the major features corresponding to the original task instead of subsidiary features of backdoor triggers, and (2) an overfitting stage, where both features are learned adequately. Therefore, if we could properly restrict the PLM's adaptation to the moderate-fitting stage, the model would neglect the backdoor triggers but still achieve satisfying performance on the original task. To this end, we design three methods to defend against backdoor attacks by reducing the model capacity, training epochs, and learning rate, respectively. Experimental results demonstrate the effectiveness of our methods in defending against several representative NLP backdoor attacks. We also perform visualization-based analysis to attain a deeper understanding of how the model learns different features, and explore the effect of the poisoning ratio. 
Finally, we explore whether our methods could defend against backdoor attacks for pre-trained CV models. The code is publicly available at https://github.com/thunlp/Moderate-fitting.

IJCAI Conference 2022 Conference Paper

Searching for Optimal Subword Tokenization in Cross-domain NER

  • Ruotian Ma
  • Yiding Tan
  • Xin Zhou
  • Xuanting Chen
  • Di Liang
  • Sirui Wang
  • Wei Wu
  • Tao Gui

Input distribution shift is one of the vital problems in unsupervised domain adaptation (UDA). The most popular UDA approaches focus on domain-invariant representation learning (DIRL), trying to align the features from different domains into a similar feature distribution. However, these approaches ignore the direct alignment of input word distributions between domains, which is a vital factor in word-level classification tasks such as cross-domain NER. In this work, we shed new light on cross-domain NER by introducing a subword-level solution, X-Piece, for input word-level distribution shift in NER. Specifically, we re-tokenize the input words of the source domain to approach the target subword distribution, which is formulated and solved as an optimal transport problem. As this approach focuses on the input level, it can also be combined with previous DIRL methods for further improvement. Experimental results show the effectiveness of the proposed method based on BERT-tagger on four benchmark NER datasets. Also, the proposed method is shown to benefit DIRL methods such as DANN.
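The optimal-transport view can be illustrated with a tiny entropic-regularized transport solver: given a source word distribution, a target subword distribution, and a cost matrix, Sinkhorn scaling produces a transport plan whose marginals match the two distributions. This is a generic sketch of the OT machinery only; X-Piece's actual formulation and solver differ in detail:

```python
import math

def sinkhorn(src, tgt, cost, reg=0.1, iters=200):
    """Entropic-regularized transport plan between two distributions.

    Alternately rescales rows and columns of the Gibbs kernel
    K = exp(-cost/reg) until the plan's marginals approach `src`
    (row sums) and `tgt` (column sums).
    """
    n, m = len(src), len(tgt)
    K = [[math.exp(-cost[i][j] / reg) for j in range(m)] for i in range(n)]
    u, v = [1.0] * n, [1.0] * m
    for _ in range(iters):
        u = [src[i] / sum(K[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [tgt[j] / sum(K[i][j] * u[i] for i in range(n)) for j in range(m)]
    return [[u[i] * K[i][j] * v[j] for j in range(m)] for i in range(n)]

# Mass flows along the cheap (diagonal) cells of the cost matrix.
plan = sinkhorn([0.5, 0.5], [0.5, 0.5], [[0.0, 1.0], [1.0, 0.0]])
```

In a re-tokenization setting, the plan would say how much of each source word's mass to route to each candidate subword sequence.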

IJCAI Conference 2021 Conference Paper

A Survey on Response Selection for Retrieval-based Dialogues

  • Chongyang Tao
  • Jiazhan Feng
  • Rui Yan
  • Wei Wu
  • Daxin Jiang

Building an intelligent dialogue system capable of naturally and coherently conversing with humans has been a long-standing goal of artificial intelligence. In the past decade, with the development of machine/deep learning technology and the explosive growth of available conversation data in social media, numerous neural models have been developed for context-response matching tasks in retrieval-based dialogue systems, with more fluent and informative responses compared with generative models. This paper presents a comprehensive survey of recent advances in response selection for retrieval-based dialogues. In particular, we first formulate the problem of response selection and review state-of-the-art context-response matching models categorized by their architecture. Then we summarize some recent advances on the research of response selection, including incorporation with extra knowledge and exploration on more effective model learning. Finally, we highlight the challenges which are not yet well addressed in this task and present future research directions.

AAAI Conference 2021 Conference Paper

BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation

  • Haisheng Su
  • Weihao Gan
  • Wei Wu
  • Yu Qiao
  • Junjie Yan

Generating human action proposals in untrimmed videos is an important yet challenging task with wide applications. Current methods often suffer from noisy boundary locations and the inferior quality of the confidence scores used for proposal retrieval. In this paper, we present BSN++, a new framework which exploits a complementary boundary regressor and relation modeling for temporal proposal generation. First, we propose a novel boundary regressor based on the complementary characteristics of both starting and ending boundary classifiers. Specifically, we utilize a U-shaped architecture with nested skip connections to capture rich contexts and introduce a bi-directional boundary matching mechanism to improve boundary precision. Second, to account for the proposal-proposal relations ignored in previous methods, we devise a proposal relation block which includes two self-attention modules, from the aspects of position and channel. Furthermore, we find that there inevitably exist data imbalance problems in the positive/negative proposals and temporal durations, which harm the model performance on tail distributions. To relieve this issue, we introduce a scale-balanced re-sampling strategy. Extensive experiments are conducted on two popular benchmarks, ActivityNet-1.3 and THUMOS14, which demonstrate that BSN++ achieves state-of-the-art performance. Not surprisingly, the proposed BSN++ ranked 1st in the CVPR19 ActivityNet challenge leaderboard on the temporal action localization task.

TIST Journal 2021 Journal Article

Conditional Text Generation for Harmonious Human-Machine Interaction

  • Bin Guo
  • Hao Wang
  • Yasan Ding
  • Wei Wu
  • Shaoyang Hao
  • Yueqi Sun
  • Zhiwen Yu

In recent years, with the development of deep learning, text-generation technology has undergone great changes and now provides many kinds of services for human beings, such as restaurant reservation and daily communication. Automatically generated text is becoming more and more fluent, so researchers have begun to consider more anthropomorphic text-generation technology, that is, conditional text generation, including emotional text generation, personalized text generation, and so on. Conditional Text Generation (CTG) has thus become a research hotspot. As a promising research field, CTG has attracted considerable attention. We therefore aim to give a comprehensive review of the new research trends of CTG. We first summarize several key techniques and illustrate the technical evolution route in the field of neural text generation, based on the concept model of CTG. We further investigate existing CTG fields and propose several general learning models for CTG. Finally, we discuss the open issues and promising research directions of CTG.

AAAI Conference 2021 Conference Paper

Context-Aware Graph Convolution Network for Target Re-identification

  • Deyi Ji
  • Haoran Wang
  • Hanzhe Hu
  • Weihao Gan
  • Wei Wu
  • Junjie Yan

Most existing re-identification methods focus on learning robust and discriminative features with deep convolution networks. However, many of them consider content similarity separately and fail to utilize the context information of the query and gallery sets, e.g., probe-gallery and gallery-gallery relations, so hard samples may not be well handled due to the limited or even misleading information. In this paper, we present a novel Context-Aware Graph Convolution Network (CAGCN), where the probe-gallery relations are encoded into the graph nodes and the graph edge connections are well controlled by the gallery-gallery relations. In this way, hard samples can be addressed with the context information flowing among other easy samples during graph reasoning. Specifically, we adopt an effective hard gallery sampler to obtain high recall for positive samples while keeping a reasonable graph size, which can also mitigate the imbalance problem in the training process with low computation complexity. Experiments show that the proposed method achieves state-of-the-art performance on both person and vehicle re-identification datasets in a plug-and-play fashion with limited overhead.

AAAI Conference 2021 Conference Paper

Correlation-Aware Heuristic Search for Intelligent Virtual Machine Provisioning in Cloud Systems

  • Chuan Luo
  • Bo Qiao
  • Wenqian Xing
  • Xin Chen
  • Pu Zhao
  • Chao Du
  • Randolph Yao
  • Hongyu Zhang

The optimization of resources is crucial for the operation of public cloud systems such as Microsoft Azure, as well as servers dedicated to the workloads of large customers such as Microsoft 365. These optimization tasks often need to take unknown parameters into consideration and can be formulated as Prediction+Optimization problems. This paper proposes a new Prediction+Optimization method named Correlation-Aware Heuristic Search (CAHS) that is capable of accounting for the uncertainty in unknown parameters and delivering effective solutions to difficult optimization problems. We apply this method to solving the predictive virtual machine (VM) provisioning (PreVMP) problem, where the VM provisioning plans are optimized based on the predicted demands of different VM types, to ensure rapid provisioning upon customers’ requests and to pursue high resource utilization. Unlike the current state-of-the-art PreVMP approaches that assume independence among the demands for different VM types, CAHS incorporates demand correlation when conducting prediction and optimization in a novel and effective way. Our experiments on two public benchmarks and one industrial benchmark demonstrate that CAHS can achieve better performance than its nine state-of-the-art competitors. CAHS has been successfully deployed in Microsoft Azure and significantly improved its performance. The main ideas of CAHS have also been leveraged to improve the efficiency and the reliability of the cloud services provided by Microsoft 365.

AAAI Conference 2021 Conference Paper

Empowering Conversational AI is a Trip to Mars: Progress and Future of Open Domain Human-Computer Dialogues

  • Rui Yan
  • Wei Wu

Dialogue systems powered by conversational artificial intelligence (AI) have never been so popular. Interacting with computers through language offers a more natural interface for giving orders and acquiring information---just like human communication. Due to their promising potential as virtual assistants and/or social bots, major NLP, AI and even Search & Mining communities are explicitly calling out for contributions of conversational studies. Learning towards real conversational intelligence is a trip to Mars; perhaps we are yet on Earth. We have achieved substantial progress from recent research outputs, but major obstacles remain to be overcome. In this paper, we present an overview of progress and look forward to future trends so as to shed light on possible directions towards success.

AAAI Conference 2021 Conference Paper

Explaining A Black-box By Using A Deep Variational Information Bottleneck Approach

  • Seojin Bang
  • Pengtao Xie
  • Heewook Lee
  • Wei Wu
  • Eric Xing

Interpretable machine learning has gained much attention recently. Briefness and comprehensiveness are necessary in order to provide a large amount of information concisely when explaining a black-box decision system. However, existing interpretable machine learning methods fail to consider briefness and comprehensiveness simultaneously, leading to redundant explanations. We propose the variational information bottleneck for interpretation, VIBI, a system-agnostic interpretable method that provides a brief but comprehensive explanation. VIBI adopts an information-theoretic principle, the information bottleneck principle, as a criterion for finding such explanations. For each instance, VIBI selects key features that are maximally compressed about an input (briefness), and informative about the decision made by a black-box system on that input (comprehensiveness). We evaluate VIBI on three datasets and compare with state-of-the-art interpretable machine learning methods in terms of both interpretability and fidelity evaluated by humans and quantitative metrics.

AAAI Conference 2021 Conference Paper

Open Domain Dialogue Generation with Latent Images

  • Ze Yang
  • Wei Wu
  • Huang Hu
  • Can Xu
  • Wei Wang
  • Zhoujun Li

We consider grounding open domain dialogues with images. Existing work assumes that both an image and a textual context are available, but image-grounded dialogues by nature are more difficult to obtain than textual dialogues. Thus, we propose learning a response generation model with both image-grounded dialogues and textual dialogues by assuming that the visual scene information at the time of a conversation can be represented by an image, and trying to recover the latent images of the textual dialogues through text-to-image generation techniques. The likelihood of the two types of dialogues is then formulated by a response generator and an image reconstructor that are learned within a conditional variational auto-encoding framework. Empirical studies are conducted in both image-grounded conversation and text-based conversation. In the first scenario, image-grounded dialogues, especially under a low-resource setting, can be effectively augmented by textual dialogues with latent images; while in the second scenario, latent images can enrich the content of responses and at the same time keep them relevant to contexts.

AAAI Conference 2021 Conference Paper

PULNS: Positive-Unlabeled Learning with Effective Negative Sample Selector

  • Chuan Luo
  • Pu Zhao
  • Chen Chen
  • Bo Qiao
  • Chao Du
  • Hongyu Zhang
  • Wei Wu
  • Shaowei Cai

Positive-unlabeled learning (PU learning) is an important case of binary classification where the training data only contains positive and unlabeled samples. The current state-of-the-art approach for PU learning is the cost-sensitive approach, which casts PU learning as a cost-sensitive classification problem and relies on an unbiased risk estimator to correct the bias introduced by the unlabeled samples. However, this approach requires knowledge of the class prior and is subject to potential label noise. In this paper, we propose a novel PU learning approach dubbed PULNS, equipped with an effective negative sample selector, which is optimized by reinforcement learning. Our PULNS approach employs an effective negative sample selector as the agent responsible for selecting negative samples from the unlabeled data. While the selected, likely negative samples can be used to improve the classifier, the performance of the classifier is also used as the reward to improve the selector through the REINFORCE algorithm. By alternating the updates of the selector and the classifier, the performance of both is improved. Extensive experimental studies on 7 real-world application benchmarks demonstrate that PULNS consistently outperforms the current state-of-the-art methods in PU learning, and our experimental results also confirm the effectiveness of the negative sample selector underlying PULNS.
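A REINFORCE-trained selector of this kind can be sketched in a few lines: a Bernoulli policy decides per sample whether to treat it as a likely negative, and the classifier's resulting performance acts as the reward. This is a toy, generic REINFORCE step under assumed linear policy features; the names and interface are illustrative, not PULNS's actual architecture:

```python
import math
import random

def reinforce_step(theta, features, reward_fn, lr=0.1, rng=random):
    """One REINFORCE update of a Bernoulli negative-sample selector.

    Each unlabeled sample x is selected as a likely negative with
    probability sigmoid(theta . x); the downstream classifier's
    performance (abstracted here as reward_fn) reinforces the
    selections that led to a better classifier.
    """
    grads = [0.0] * len(theta)
    actions = []
    for x in features:
        z = sum(t * xi for t, xi in zip(theta, x))
        p = 1.0 / (1.0 + math.exp(-z))
        a = 1 if rng.random() < p else 0
        actions.append(a)
        coef = a - p  # d log pi(a|x) / dz for a Bernoulli policy
        for k, xi in enumerate(x):
            grads[k] += coef * xi
    r = reward_fn(actions)  # e.g. validation score of the retrained classifier
    return [t + lr * r * g for t, g in zip(theta, grads)], actions

rng = random.Random(0)
theta, actions = reinforce_step([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]],
                                reward_fn=sum, rng=rng)
```

Alternating such selector updates with classifier retraining mirrors the alternating scheme described in the abstract.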

IJCAI Conference 2020 Conference Paper

Intelligent Virtual Machine Provisioning in Cloud Computing

  • Chuan Luo
  • Bo Qiao
  • Xin Chen
  • Pu Zhao
  • Randolph Yao
  • Hongyu Zhang
  • Wei Wu
  • Andrew Zhou

Virtual machine (VM) provisioning is a common and critical problem in cloud computing. In industrial cloud platforms, there are a huge number of VMs provisioned per day. Due to the complexity and resource constraints, provisioning needs to be carefully optimized to make cloud platforms effectively utilize their resources. Moreover, in practice, provisioning a VM from scratch requires a fairly long time, which would degrade the customer experience. Hence, it is advisable to provision VMs ahead of upcoming demands. In this work, we formulate the practical scenario as the predictive VM provisioning (PreVMP) problem, where upcoming demands are unknown and need to be predicted in advance, and then the VM provisioning plan is optimized based on the predicted demands. Further, we propose Uncertainty-Aware Heuristic Search (UAHS) for solving the PreVMP problem. UAHS first models the prediction uncertainty, and then utilizes the prediction uncertainty in optimization. Moreover, UAHS leverages Bayesian optimization to couple prediction and optimization to improve its practical performance. Extensive experiments show that UAHS performs much better than state-of-the-art competitors on two public datasets and an industrial dataset. UAHS has been successfully applied in Microsoft Azure and brought practical benefits in real-world applications.

IJCAI Conference 2020 Conference Paper

Retrieve, Program, Repeat: Complex Knowledge Base Question Answering via Alternate Meta-learning

  • Yuncheng Hua
  • Yuan-Fang Li
  • Gholamreza Haffari
  • Guilin Qi
  • Wei Wu

A compelling approach to complex question answering is to convert the question to a sequence of actions, which can then be executed on the knowledge base to yield the answer, aka the programmer-interpreter approach. Using training questions similar to the test question, meta-learning enables the programmer to adapt quickly to unseen questions and tackle potential distributional biases. However, this comes at the cost of manually labeling similar questions to learn a retrieval model, which is tedious and expensive. In this paper, we present a novel method that automatically learns a retrieval model alternately with the programmer from weak supervision, i.e., the system’s performance with respect to the produced answers. To the best of our knowledge, this is the first attempt to train the retrieval model jointly with the programmer. Our system leads to state-of-the-art performance on a large-scale task for complex question answering over knowledge bases. We have released our code at https://github.com/DevinJake/MARL.

NeurIPS Conference 2020 Conference Paper

Zero-Resource Knowledge-Grounded Dialogue Generation

  • Linxiao Li
  • Can Xu
  • Wei Wu
  • Yufan Zhao
  • Xueliang Zhao
  • Chongyang Tao

While neural conversation models have shown great potentials towards generating informative and engaging responses via introducing external knowledge, learning such a model often requires knowledge-grounded dialogues that are difficult to obtain. To overcome the data challenge and reduce the cost of building a knowledge-grounded dialogue system, we explore the problem under a zero-resource setting by assuming no context-knowledge-response triples are needed for training. To this end, we propose representing the knowledge that bridges a context and a response and the way that the knowledge is expressed as latent variables, and devise a variational approach that can effectively estimate a generation model from independent dialogue corpora and knowledge corpora. Evaluation results on three benchmarks of knowledge-grounded dialogue generation indicate that our model can achieve comparable performance with state-of-the-art methods that rely on knowledge-grounded dialogues for training, and exhibits a good generalization ability over different datasets.

IJCAI Conference 2019 Conference Paper

A Document-grounded Matching Network for Response Selection in Retrieval-based Chatbots

  • Xueliang Zhao
  • Chongyang Tao
  • Wei Wu
  • Can Xu
  • Dongyan Zhao
  • Rui Yan

We present a document-grounded matching network (DGMN) for response selection that can power a knowledge-aware retrieval-based chatbot system. The challenges of building such a model lie in how to ground conversation contexts with background documents and how to recognize important information in the documents for matching. To overcome the challenges, DGMN fuses information in a document and a context into representations of each other, and dynamically determines if grounding is necessary and importance of different parts of the document and the context through hierarchical interaction with a response at the matching step. Empirical studies on two public data sets indicate that DGMN can significantly improve upon state-of-the-art methods and at the same time enjoys good interpretability.

NeurIPS Conference 2019 Conference Paper

Glyce: Glyph-vectors for Chinese Character Representations

  • Yuxian Meng
  • Wei Wu
  • Fei Wang
  • Xiaoya Li
  • Ping Nie
  • Fan Yin
  • Muyu Li
  • Qinghong Han

It is intuitive that NLP tasks for logographic languages like Chinese should benefit from the use of the glyph information in those languages. However, due to the lack of rich pictographic evidence in glyphs and the weak generalization ability of standard computer vision models on character data, an effective way to utilize the glyph information remains to be found. In this paper, we address this gap by presenting Glyce, the glyph-vectors for Chinese character representations. We make three major innovations: (1) We use historical Chinese scripts (e.g., bronzeware script, seal script, traditional Chinese, etc.) to enrich the pictographic evidence in characters; (2) We design CNN structures (called tianzege-CNN) tailored to Chinese character image processing; and (3) We use image classification as an auxiliary task in a multi-task learning setup to increase the model's ability to generalize. We show that glyph-based models are able to consistently outperform word/char ID-based models in a wide range of Chinese NLP tasks. When combined with BERT, we are able to set new state-of-the-art results for a variety of Chinese NLP tasks, including language modeling, tagging (NER, CWS, POS), sentence pair classification (BQ, LCQMC, XNLI, NLPCC-DBQA), single sentence classification tasks (ChnSentiCorp, the Fudan corpus, iFeng), dependency parsing, and semantic role labeling. For example, the proposed model achieves an F1 score of 81.6 on the OntoNotes dataset of NER, +1.5 over BERT; it achieves an almost perfect accuracy of 99.8% on the Fudan corpus for text classification.

IJCAI Conference 2018 Conference Paper

Efficient Attributed Network Embedding via Recursive Randomized Hashing

  • Wei Wu
  • Bin Li
  • Ling Chen
  • Chengqi Zhang

Attributed network embedding aims to learn a low-dimensional representation for each node of a network, considering both attributes and structure information of the node. However, the learning based methods usually involve substantial cost in time, which makes them impractical without the help of a powerful workhorse. In this paper, we propose a simple yet effective algorithm, named NetHash, to solve this problem only with moderate computing capacity. NetHash employs the randomized hashing technique to encode shallow trees, each of which is rooted at a node of the network. The main idea is to efficiently encode both attributes and structure information of each node by recursively sketching the corresponding rooted tree from bottom (i.e., the predefined highest-order neighboring nodes) to top (i.e., the root node), and particularly, to preserve as much information closer to the root node as possible. Our extensive experimental results show that the proposed algorithm, which does not need learning, runs significantly faster than the state-of-the-art learning-based network embedding methods while achieving competitive or even better performance in accuracy.
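The bottom-up recursive sketching can be illustrated with a toy min-hash of rooted trees: each node's signature min-hashes its own attribute tokens together with its children's signatures, so that information nearer the root dominates. This is a simplified illustration of the idea; NetHash's actual hashing scheme is more refined, and the helper names here are hypothetical:

```python
import hashlib

def _h(seed, token):
    # Deterministic 64-bit hash of a token under a given seed.
    return int.from_bytes(
        hashlib.md5(f"{seed}:{token}".encode()).digest()[:8], "big")

def node_sketch(node, attrs, children, depth, num_perm=4):
    """Min-hash sketch of the rooted tree at `node`.

    Min-hashes the node's own attribute tokens together with the
    (stringified) sketches of its children, recursing `depth` hops,
    so the root's neighborhood shapes the signature the most.
    """
    tokens = list(attrs[node])
    if depth > 0:
        for c in children.get(node, []):
            child = node_sketch(c, attrs, children, depth - 1, num_perm)
            tokens.append("#" + ",".join(map(str, child)))
    return [min(_h(s, t) for t in tokens) for s in range(num_perm)]

def similarity(s1, s2):
    # Estimated Jaccard similarity from two min-hash sketches.
    return sum(a == b for a, b in zip(s1, s2)) / len(s1)

attrs = {"a": ["x", "y"], "b": ["x"], "c": ["y"]}
children = {"a": ["b", "c"]}
sketch_a = node_sketch("a", attrs, children, depth=1)
```

Because no parameters are learned, sketching a node costs only a pass over its rooted tree, which is the source of the speedup claimed above.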

IJCAI Conference 2018 Conference Paper

Get The Point of My Utterance! Learning Towards Effective Responses with Multi-Head Attention Mechanism

  • Chongyang Tao
  • Shen Gao
  • Mingyue Shang
  • Wei Wu
  • Dongyan Zhao
  • Rui Yan

The attention mechanism has become a popular and widely used component in sequence-to-sequence models. However, previous research on neural generative dialogue systems tends to generate universal responses, and the attention distribution learned by the model always attends to the same semantic aspect. To solve this problem, in this paper, we propose a novel Multi-Head Attention Mechanism (MHAM) for generative dialog systems, which aims at capturing multiple semantic aspects from the user utterance. Further, a regularizer is formulated to force different attention heads to concentrate on distinct aspects. The proposed mechanism leads to more informative, diverse, and relevant responses. Experimental results show that our proposed model outperforms several strong baselines.
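One generic way to push different attention heads toward different semantic aspects is to penalize overlap between their attention distributions, sketched below. The paper's exact regularizer may differ; this is only an illustration of the mechanism:

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def head_overlap_penalty(attn):
    """Grows when different heads attend to the same utterance positions.

    `attn` holds H attention distributions over the same utterance;
    the penalty sums inner products between distinct heads, so adding
    it to the training loss pushes heads apart.
    """
    pen = 0.0
    for i, ai in enumerate(attn):
        for j, aj in enumerate(attn):
            if i != j:
                pen += sum(x * y for x, y in zip(ai, aj))
    return pen

# Heads focused on different words incur almost no penalty.
heads = [softmax([5.0, 0.0, 0.0]), softmax([0.0, 0.0, 5.0])]
```

Identical heads maximize the penalty, while heads with disjoint support incur none, which is exactly the behavior a "concentrate on different aspects" regularizer needs.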

AAAI Conference 2018 Conference Paper

Hierarchical Recurrent Attention Network for Response Generation

  • Chen Xing
  • Yu Wu
  • Wei Wu
  • Yalou Huang
  • Ming Zhou

We study multi-turn response generation in chatbots where a response is generated according to a conversation context. Existing work has modeled the hierarchy of the context, but does not pay enough attention to the fact that words and utterances in the context are differentially important. As a result, they may lose important information in context and generate irrelevant responses. We propose a hierarchical recurrent attention network (HRAN) to model both the hierarchy and the importance variance in a unified framework. In HRAN, a hierarchical attention mechanism attends to important parts within and among utterances with word level attention and utterance level attention respectively. Empirical studies on both automatic evaluation and human judgment show that HRAN can significantly outperform state-of-the-art models for context based response generation.

AAAI Conference 2018 Conference Paper

Knowledge Enhanced Hybrid Neural Network for Text Matching

  • Yu Wu
  • Wei Wu
  • Can Xu
  • Zhoujun Li

Long text brings a big challenge to neural network based text matching approaches due to their complicated structures. To tackle the challenge, we propose a knowledge enhanced hybrid neural network (KEHNN) that leverages prior knowledge to identify useful information and filter out noise in long text and performs matching from multiple perspectives. The model fuses prior knowledge into word representations by knowledge gates and establishes three matching channels with words, sequential structures of text given by Gated Recurrent Units (GRUs), and knowledge enhanced representations. The three channels are processed by a convolutional neural network to generate high level features for matching, and the features are synthesized as a matching score by a multilayer perceptron. In this paper, we focus on exploring the use of taxonomy knowledge for text matching. Evaluation results from extensive experiments on public data sets of question answering and conversation show that KEHNN can significantly outperform state-of-the-art matching models and particularly improve matching accuracy on pairs with long text.
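A knowledge gate of the kind described can be sketched as a per-dimension sigmoid interpolation between a word embedding and its knowledge vector. The parameter shapes here are hypothetical; KEHNN's actual gates and matching channels are more elaborate.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def knowledge_gate(word_vec, know_vec, gate_weights, gate_bias):
    # one gate value per output dimension, driven by the word embedding;
    # the gate decides how much prior knowledge to fuse into the representation
    gates = [sigmoid(sum(w * x for w, x in zip(row, word_vec)) + gate_bias)
             for row in gate_weights]
    return [g * k + (1.0 - g) * w for g, k, w in zip(gates, know_vec, word_vec)]
```

With a saturated gate the output reduces to the knowledge vector; with a closed gate it falls back to the plain word embedding.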

AAAI Conference 2018 Conference Paper

Neural Response Generation With Dynamic Vocabularies

  • Yu Wu
  • Wei Wu
  • Dejian Yang
  • Can Xu
  • Zhoujun Li

We study response generation for open-domain conversation in chatbots. Existing methods assume that words in responses are generated from an identical vocabulary regardless of the input, which not only makes them vulnerable to generic patterns and irrelevant noise, but also incurs a high decoding cost. We propose a dynamic vocabulary sequence-to-sequence (DVS2S) model which allows each input to possess its own vocabulary in decoding. In training, vocabulary construction and response generation are jointly learned by maximizing a lower bound of the true objective with a Monte Carlo sampling method. In inference, the model dynamically allocates a small vocabulary for an input with the word prediction model, and conducts decoding only with the small vocabulary. Because of the dynamic vocabulary mechanism, DVS2S eludes many generic patterns and irrelevant words in generation, and enjoys efficient decoding at the same time. Experimental results on both automatic metrics and human annotations show that DVS2S can significantly outperform state-of-the-art methods in terms of response quality, while requiring only 60% of the decoding time of the most efficient baseline.
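The inference-time mechanism can be sketched as two steps: a word predictor scores each content word for the current input, a per-input vocabulary is assembled (function words always kept), and decoding is restricted to that vocabulary. The thresholding scheme and names below are illustrative assumptions, not the paper's exact procedure.

```python
def allocate_vocabulary(word_probs, always_keep, threshold=0.5):
    # build a per-input decoding vocabulary: function words are always kept,
    # content words only when the word-prediction model scores them highly
    vocab = set(always_keep)
    vocab.update(w for w, p in word_probs.items() if p >= threshold)
    return vocab

def restricted_argmax(scores, vocab):
    # one greedy decoding step limited to the dynamic vocabulary
    candidates = {w: s for w, s in scores.items() if w in vocab}
    return max(candidates, key=candidates.get)
```

Because the decoder never scores words outside the small vocabulary, both generic-pattern leakage and per-step decoding cost drop.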

NeurIPS Conference 2018 Conference Paper

PointCNN: Convolution On X-Transformed Points

  • Yangyan Li
  • Rui Bu
  • Mingchao Sun
  • Wei Wu
  • Xinhan Di
  • Baoquan Chen

We present a simple and general framework for feature learning from point clouds. The key to the success of CNNs is the convolution operator, which is capable of leveraging spatially local correlation in data represented densely in grids (e.g., images). However, point clouds are irregular and unordered, so directly convolving kernels against the features associated with the points discards shape information while remaining variant to point order. To address these problems, we propose to learn an X-transformation from the input points, which is used to simultaneously weight the input features associated with the points and permute them into a latent, potentially canonical order. Then the element-wise product and sum operations of the typical convolution operator are applied to the X-transformed features. The proposed method is a generalization of typical CNNs to feature learning from point clouds; thus we call it PointCNN. Experiments show that PointCNN achieves on-par or better performance than state-of-the-art methods on multiple challenging benchmark datasets and tasks.
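The core operator can be illustrated with plain lists: a learned K x K matrix X weights and permutes the K neighbor features, and a kernel is then applied to the transformed features. In PointCNN, X is predicted by a small network from the neighbor coordinates; here it is simply passed in.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def x_conv(x_matrix, neighbor_feats, kernel):
    # sketch of the X-Conv idea: weight-and-permute the K neighbor features
    # (rows of neighbor_feats, one feature vector per neighbor) with the
    # learned K x K matrix X, then apply the convolution kernel (one weight
    # per neighbor slot) to the transformed features
    transformed = matmul(x_matrix, neighbor_feats)  # K x C
    return [sum(kernel[i] * row[j] for i, row in enumerate(transformed))
            for j in range(len(neighbor_feats[0]))]
```

If X is a permutation matrix, the operator becomes invariant to the original neighbor ordering, which is the property motivating the design.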

AAAI Conference 2017 Conference Paper

Topic Aware Neural Response Generation

  • Chen Xing
  • Wei Wu
  • Yu Wu
  • Jie Liu
  • Yalou Huang
  • Ming Zhou
  • Wei-Ying Ma

We consider incorporating topic information into a sequence-to-sequence framework to generate informative and interesting responses for chatbots. To this end, we propose a topic aware sequence-to-sequence (TA-Seq2Seq) model. The model utilizes topics to simulate the prior human knowledge that guides people to form informative and interesting responses in conversation, and leverages topic information in generation through a joint attention mechanism and a biased generation probability. The joint attention mechanism summarizes the hidden vectors of an input message as context vectors by message attention and synthesizes topic vectors by topic attention from the topic words of the message obtained from a pre-trained LDA model, with these vectors jointly affecting the generation of words in decoding. To increase the possibility of topic words appearing in responses, the model modifies the generation probability of topic words by adding an extra probability item to bias the overall distribution. Empirical studies on both automatic evaluation metrics and human annotations show that TA-Seq2Seq can generate more informative and interesting responses, significantly outperforming state-of-the-art response generation models.
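The biased generation probability can be sketched as adding extra mass to topic words in the output distribution and renormalizing. The flat additive bias below is a simplification of the model's actual probability item.

```python
def bias_topic_words(probs, topic_words, bias=0.1):
    # add an extra probability item to topic words, then renormalize so the
    # result is still a distribution; topic words become more likely to be
    # emitted during decoding
    boosted = {w: p + (bias if w in topic_words else 0.0)
               for w, p in probs.items()}
    total = sum(boosted.values())
    return {w: p / total for w, p in boosted.items()}
```

Non-topic words keep their relative order; only the mass assigned to topic words grows.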

AAAI Conference 2016 Conference Paper

Improving Recommendation of Tail Tags for Questions in Community Question Answering

  • Yu Wu
  • Wei Wu
  • Zhoujun Li
  • Ming Zhou

We study tag recommendation for questions in community question answering (CQA). Tags, which represent the semantic summarization of questions, are useful for navigation and expert finding in CQA and can facilitate content consumption such as searching and mining on these web sites. The task is challenging, as both questions and tags are short and a large fraction of tags are tail tags that occur very infrequently. To solve these problems, we propose matching questions and tags not only by themselves, but also by similar questions and similar tags. The idea is then formalized as a model in which we calculate question-tag similarity using a linear combination of similarity with similar questions and tags, weighted by tag importance. Question similarity, tag similarity, and tag importance are learned in a supervised random walk framework by fusing multiple features. Our model can thus not only accurately identify question-tag similarity for head tags, but also improve the accuracy of recommendation for tail tags. Experimental results show that the proposed method significantly outperforms state-of-the-art methods on tag recommendation for questions. In particular, it improves tail tag recommendation accuracy by a large margin.
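The scoring idea can be sketched as a weighted linear combination: direct question-tag similarity plus evidence propagated from similar questions and similar tags, scaled by tag importance. The weights and averaging scheme here are illustrative assumptions; in the paper all three components are learned in a supervised random walk framework.

```python
def question_tag_score(direct_sim, similar_q_sims, similar_t_sims,
                       tag_importance, alpha=0.5, beta=0.25):
    # direct_sim: similarity between the question and the tag themselves;
    # similar_q_sims / similar_t_sims: similarities contributed by similar
    # questions and similar tags; tag_importance scales the whole score
    q_term = sum(similar_q_sims) / len(similar_q_sims) if similar_q_sims else 0.0
    t_term = sum(similar_t_sims) / len(similar_t_sims) if similar_t_sims else 0.0
    return tag_importance * (alpha * direct_sim + beta * q_term + beta * t_term)
```

The point for tail tags: even when `direct_sim` is near zero (the tag rarely co-occurs with anything), support from similar questions or tags still yields a positive score.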

ECAI Conference 2016 Conference Paper

ShapeLearner: Towards Shape-Based Visual Knowledge Harvesting

  • Huayong Xu
  • Yafang Wang
  • Kang Feng
  • Gerard de Melo
  • Wei Wu
  • Andrei Sharf
  • Baoquan Chen

The deluge of images on the Web has led to a number of efforts to organize images semantically and mine visual knowledge. Despite enormous progress on categorizing entire images or bounding boxes, only few studies have targeted fine-grained image understanding at the level of specific shape contours. For instance, beyond recognizing that an image portrays a cat, we may wish to distinguish its legs, head, tail, and so on. To this end, we present ShapeLearner, a system that acquires such visual knowledge about object shapes and their parts in a semantic taxonomy, and then is able to exploit this hierarchy in order to analyze new kinds of objects that it has not observed before. ShapeLearner jointly learns this knowledge from sets of segmented images. The space of label and segmentation hypotheses is pruned and then evaluated using Integer Linear Programming. Experiments on a variety of shape classes show the accuracy and effectiveness of our method.

AAAI Conference 2015 Conference Paper

Mining Query Subtopics from Questions in Community Question Answering

  • Yu Wu
  • Wei Wu
  • Zhoujun Li
  • Ming Zhou

This paper proposes mining query subtopics from questions in community question answering (CQA). The subtopics are represented as a number of clusters of questions with keywords summarizing the clusters. The task is unique in that the subtopics from questions can not only facilitate user browsing in CQA search, but also describe aspects of queries from a question-answering perspective. The challenges of the task include how to group semantically similar questions and how to find keywords capable of summarizing the clusters. We formulate the subtopic mining task as a non-negative matrix factorization (NMF) problem and further extend the model of NMF to incorporate question similarity estimated from metadata of CQA into learning. Compared with existing methods, our method can jointly optimize question clustering and keyword extraction and encourage the former task to enhance the latter. Experimental results on large scale real world CQA datasets show that the proposed method significantly outperforms the existing methods in terms of keyword extraction, while achieving a comparable performance to the state-of-the-art methods for question clustering.
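The base model the paper extends is standard NMF: factor a nonnegative question-term matrix V into W (question-cluster memberships) and H (cluster-keyword weights). Below is the plain multiplicative-update algorithm in pure Python; the paper's similarity-regularized extension is not shown.

```python
import random

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(r) for r in zip(*a)]

def nmf(v, k, steps=200, eps=1e-9):
    # basic Lee-Seung multiplicative updates: V (m x n) ~ W (m x k) @ H (k x n)
    random.seed(0)
    m, n = len(v), len(v[0])
    w = [[random.random() + 0.1 for _ in range(k)] for _ in range(m)]
    h = [[random.random() + 0.1 for _ in range(n)] for _ in range(k)]
    for _ in range(steps):
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(n)]
             for i in range(k)]
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(m)]
    return w, h

def recon_error(v, w, h):
    r = matmul(w, h)
    return sum((v[i][j] - r[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0])))
```

Rows of H with the largest weights give the cluster keywords; rows of W assign questions to subtopic clusters, which is why the two subtasks can be optimized jointly.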

AAAI Conference 2014 Conference Paper

Double Configuration Checking in Stochastic Local Search for Satisfiability

  • Chuan Luo
  • Shaowei Cai
  • Wei Wu
  • Kaile Su

Stochastic local search (SLS) algorithms have shown effectiveness on satisfiable instances of the Boolean satisfiability (SAT) problem. However, their performance is still unsatisfactory on random k-SAT at the phase transition, which is of significance and is one of the empirically hardest distributions of SAT instances. In this paper, we propose a new heuristic called DCCA, which combines two configuration checking (CC) strategies with different definitions of configuration in a novel way. We use the DCCA heuristic to design an efficient SLS solver for SAT dubbed DCCASat. The experiments show that the DCCASat solver significantly outperforms a number of state-of-the-art solvers on extensive random k-SAT benchmarks at the phase transition. Moreover, DCCASat shows good performance on structured benchmarks, and a combination of DCCASat with a complete solver achieves state-of-the-art performance on structured benchmarks.
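The core configuration checking rule can be sketched directly: a variable becomes ineligible to flip right after it is flipped, and becomes eligible again only once a neighboring variable flips (i.e., its "configuration" changes). DCCA combines two such rules with different configuration definitions; only the single basic rule is shown here.

```python
def make_cc_state(n_vars):
    # all variables start as 'configuration changed' = eligible to flip;
    # variables are 1-indexed, slot 0 is unused
    return [True] * (n_vars + 1)

def flip(var, conf_changed, neighbors):
    # after flipping `var`, it is frozen until a neighbor flips,
    # while its neighbors' configurations are marked as changed
    conf_changed[var] = False
    for u in neighbors[var]:
        conf_changed[u] = True

def cc_candidates(conf_changed):
    # variables the CC heuristic currently allows the SLS solver to pick
    return [v for v in range(1, len(conf_changed)) if conf_changed[v]]
```

This simple bookkeeping is what prevents the solver from immediately re-flipping a variable in an unchanged neighborhood, reducing cycling.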

IJCAI Conference 2013 Conference Paper

Automatic Name-Face Alignment to Enable Cross-Media News Retrieval

  • Yuejie Zhang
  • Wei Wu
  • Yang Li
  • Cheng Jin
  • Xiangyang Xue
  • Jianping Fan

A new algorithm is developed in this paper to support automatic name-face alignment for more accurate cross-media news retrieval. We focus on extracting valuable information from large amounts of news images and their captions, where multi-level image-caption pairs are constructed to characterize both significant names with higher salience and their cohesion with human faces extracted from news images. To remedy the lack of related information for rare names, Web mining is introduced to acquire extra multimodal information. We also emphasize an optimization mechanism based on our Improved Self-Adaptive Simulated Annealing Genetic Algorithm to verify the feasibility of alignment combinations. Our experiments have obtained very positive results.

JMLR Journal 2013 Journal Article

Learning Bilinear Model for Matching Queries and Documents

  • Wei Wu
  • Zhengdong Lu
  • Hang Li

The task of matching data from two heterogeneous domains naturally arises in various areas such as web search, collaborative filtering, and drug design. In web search, existing work has designed relevance models to match queries and documents by exploiting either user clicks or content of queries and documents. To the best of our knowledge, however, there has been little work on principled approaches to leveraging both clicks and content to learn a matching model for search. In this paper, we propose a framework for learning to match heterogeneous objects. The framework learns two linear mappings for two objects respectively, and matches them via the dot product of their images after mapping. Moreover, when different regularizations are enforced, the framework renders a rich family of matching models. With orthonormal constraints on mapping functions, the framework subsumes Partial Least Squares (PLS) as a special case. Alternatively, with an $\ell_1$+$\ell_2$ regularization, we obtain a new model called Regularized Mapping to Latent Structures (RMLS). RMLS enjoys many advantages over PLS, including lower time complexity and easy parallelization. To further understand the matching framework, we conduct generalization analysis and apply the result to both PLS and RMLS. We apply the framework to web search and implement both PLS and RMLS using a click-through bipartite with metadata representing features of queries and documents. We test the efficacy and scalability of RMLS and PLS on large scale web search problems. The results show that both PLS and RMLS can significantly outperform baseline methods, while RMLS substantially speeds up the learning process.
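The scoring function shared by PLS and RMLS is a plain bilinear match: map the query and the document into a common latent space with two learned linear maps, then take the dot product of the images. A minimal sketch (the maps are given here rather than learned):

```python
def map_vec(matrix, vec):
    # apply a linear mapping (list of rows) to a feature vector
    return [sum(m * x for m, x in zip(row, vec)) for row in matrix]

def match_score(l_q, l_d, query_vec, doc_vec):
    # bilinear matching: dot product of the two images in the latent space,
    # i.e. score = (L_q q) . (L_d d)
    q_img = map_vec(l_q, query_vec)
    d_img = map_vec(l_d, doc_vec)
    return sum(a * b for a, b in zip(q_img, d_img))
```

The regularization placed on `l_q` and `l_d` during learning is what distinguishes the family members: orthonormal constraints recover PLS, while $\ell_1$+$\ell_2$ penalties give RMLS.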

IJCAI Conference 2011 Conference Paper

Fusion of Multiple Features and Supervised Learning for Chinese OOV Term Detection and POS Guessing

  • Yuejie Zhang
  • Lei Cen
  • Wei Wu
  • Cheng Jin
  • Xiangyang Xue

In this paper, to support more precise Chinese Out-of-Vocabulary (OOV) term detection and Part-of-Speech (POS) guessing, a unified mechanism is proposed and formulated based on the fusion of multiple features and supervised learning. Besides all the traditional features, the new features for statistical information and global contexts are introduced, as well as some constraints and heuristic rules, which reveal the relationships among OOV term candidates. Our experiments on the Chinese corpora from both People's Daily and SIGHAN 2005 have achieved the consistent results, which are better than those acquired by pure rule-based or statistics-based models. From the experimental results for combining our model with Chinese monolingual retrieval on the data sets of TREC-9, it is found that the obvious improvement for the retrieval performance can also be obtained.

JMLR Journal 2011 Journal Article

Learning a Robust Relevance Model for Search Using Kernel Methods

  • Wei Wu
  • Jun Xu
  • Hang Li
  • Satoshi Oyama

This paper points out that many search relevance models in information retrieval, such as the Vector Space Model, BM25 and Language Models for Information Retrieval, can be viewed as a similarity function between pairs of objects of different types, referred to as an S-function. An S-function is specifically defined as the dot product between the images of two objects in a Hilbert space mapped from two different input spaces. One advantage of taking this view is that one can take a unified and principled approach to address the issues with regard to search relevance. The paper then proposes employing a kernel method to learn a robust relevance model as an S-function, which can effectively deal with the term mismatch problem, one of the biggest challenges in search. The kernel method exploits a positive semi-definite kernel referred to as an S-kernel. The paper shows that when using an S-kernel the model learned by the kernel method is guaranteed to be an S-function. The paper then gives more general principles for constructing S-kernels. A specific implementation of the kernel method is proposed using the Ranking SVM techniques and click-through data. The proposed approach is employed to learn a relevance model as an extension of BM25, referred to as Robust BM25. Experimental results on web search and enterprise search data show that Robust BM25 significantly outperforms baseline methods and can successfully tackle the term mismatch problem.
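The S-function view can be made concrete with explicit feature maps: a query-side image and a document-side image living in the same term-indexed space, scored by their dot product. The BM25-flavored maps below (idf weighting on the query side, saturated term frequency with `k1` on the document side, length normalization omitted) are an illustrative instance, not the paper's learned model.

```python
def phi_query(tf, idf):
    # query-side image: idf-weighted term presence
    return {t: idf.get(t, 0.0) for t in tf}

def psi_doc(tf, k1=1.2):
    # document-side image: BM25-style saturated term frequency
    return {t: (f * (k1 + 1.0)) / (f + k1) for t, f in tf.items()}

def s_function(q_img, d_img):
    # an S-function: dot product of the two images in the shared space
    return sum(w * d_img.get(t, 0.0) for t, w in q_img.items())
```

Under this view, fixing term mismatch amounts to learning images in which related-but-different terms (e.g., "ny" and "new york") end up close, which is what the S-kernel machinery enables.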

AAAI Conference 2011 Conference Paper

Multi-Task Learning in Square Integrable Space

  • Wei Wu
  • Hang Li
  • Yunhua Hu
  • Rong Jin

Several kernel based methods for multi-task learning have been proposed, which leverage relations among tasks as regularization to enhance the overall learning accuracies. These methods assume that the tasks share the same kernel, which could limit their applications because in practice different tasks may need different kernels. The main challenge of introducing multiple kernels into multiple tasks is that models from different Reproducing Kernel Hilbert Spaces (RKHSs) are not comparable, making it difficult to exploit relations among tasks. This paper addresses the challenge by formalizing the problem in the Square Integrable Space (SIS). Specifically, it proposes a kernel based method which makes use of a regularization term defined in the SIS to represent task relations. We prove a new representer theorem for the proposed approach in SIS. We further derive a practical method for solving the learning problem and conduct consistency analysis of the method. We discuss the relations between our method and an existing method. We also give an SVM based implementation of our method for multi-label classification. Experiments on two real-world data sets show that the proposed method performs better than the existing method.

NeurIPS Conference 2011 Conference Paper

Signal Estimation Under Random Time-Warpings and Nonlinear Signal Alignment

  • Sebastian Kurtek
  • Anuj Srivastava
  • Wei Wu

While signal estimation under random amplitudes, phase shifts, and additive noise is studied frequently, the problem of estimating a deterministic signal under random time-warpings has been relatively unexplored. We present a novel framework for estimating the unknown signal that utilizes the action of the warping group to form an equivalence relation between signals. First, we derive an estimator for the equivalence class of the unknown signal using the notion of Karcher mean on the quotient space of equivalence classes. This step requires the use of the Fisher-Rao Riemannian metric and a square-root representation of signals to enable computations of distances and means under this metric. Then, we define a notion of the center of a class and show that the center of the estimated class is a consistent estimator of the underlying unknown signal. This estimation algorithm has many applications: (1) registration/alignment of functional data, (2) separation of phase/amplitude components of functional data, (3) joint demodulation and carrier estimation, and (4) sparse modeling of functional data. Here we demonstrate only (1) and (2): given signals are temporally aligned using nonlinear warpings and, thus, separated into their phase and amplitude components. The proposed method for signal alignment is shown to have state-of-the-art performance using Berkeley growth, handwritten signatures, and neuroscience spike train data.
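The square-root representation the abstract refers to can be sketched for sampled signals: the square-root slope function q(t) = sign(f'(t)) sqrt(|f'(t)|), computed with finite differences. Under this representation the Fisher-Rao distance between functions reduces to an ordinary L2 distance, which is what makes Karcher means computable. This is a discretized sketch only; the alignment (warping optimization) step is not shown.

```python
import math

def srsf(f, dt):
    # square-root slope function of a uniformly sampled signal f:
    # q_i = sign(f') * sqrt(|f'|) with a forward finite-difference derivative
    q = []
    for i in range(len(f) - 1):
        d = (f[i + 1] - f[i]) / dt
        q.append(math.copysign(math.sqrt(abs(d)), d))
    return q

def l2_dist(q1, q2, dt):
    # L2 distance between two SRSFs; equals the Fisher-Rao distance
    # between the underlying signals (up to discretization)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(q1, q2)) * dt)
```

Comparing signals through their SRSFs rather than their raw values is the step that makes the metric invariant to simultaneous time-warping of both signals.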