Arrow Research search

Author name cluster

Wei Hu

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

56 papers
2 author rows

Possible papers (56)

AAAI Conference 2026 Conference Paper

Answering the Unanswerable Is to Err Knowingly: Analyzing and Mitigating Abstention Failures in Large Reasoning Models

  • Yi Liu
  • Xiangyu Liu
  • Zequn Sun
  • Wei Hu

Large reasoning models (LRMs) have shown remarkable progress on complex reasoning tasks. However, some questions posed to LRMs are inherently unanswerable, such as math problems lacking sufficient conditions. We find that LRMs consistently fail to provide appropriate abstentions when confronted with these unanswerable questions. In this paper, we systematically analyze and resolve this issue, an important step toward trustworthy AI. We first conduct a detailed analysis of the distinct response behaviors of LRMs when facing unanswerable questions. Then, we show that LRMs possess sufficient cognitive capabilities to recognize the flaws in these questions. However, they fail to exhibit appropriate abstention behavior, revealing a misalignment between their internal cognition and external response. Finally, to resolve this issue, we propose a lightweight, two-stage method that combines cognitive monitoring with inference-time intervention. Experimental results demonstrate that our method significantly improves the abstention rate while maintaining reasoning performance.

JBHI Journal 2026 Journal Article

BECM-Net: A Multi-granularity Collaborative Framework for Semi-Supervised Fetal Ultrasound Segmentation

  • Wei Hu
  • Cong Tan
  • Wendong Wang
  • Zeheng Wang
  • Qibing Qin
  • Wenfeng Zhang
  • Haibo Ni

Accurate segmentation of fetal ultrasound (US) images is essential for measuring the Angle of Progression (AoP) and assessing fetal head descent during labor. However, conventional semi-supervised learning (SSL) for ultrasound segmentation is challenged by inaccurate pseudo-labeling at blurred or low-contrast boundaries and by limited enforcement of consistency. To address these challenges, we propose the Boundary-Enhanced Collaborative Multi-granularity Network (BECM-Net), which, from a multi-granularity modeling perspective, can be interpreted as a unified framework that jointly optimizes pixel-level, region-level, and structure-level representations. Specifically, at the pixel level, a novel DirDiff-Conv module enhances boundary perception and texture representation through multi-orientation differential filtering, enabling fine-grained modeling of local structures. At the region level, the Uncertainty-Confidence Aligned Mix (UCA-Mix) strategy performs uncertainty-guided bidirectional region-level mixing, facilitating semantic alignment and reducing pseudo-label noise. At the structure level, the ContourRefine branch models object contours by integrating deep semantic features with shallow boundary cues while coupling boundary learning with pseudo-label supervision, thereby enforcing structural-level consistency in global shape and boundary continuity. Through collaborative optimization across multiple granularities, BECM-Net provides more reliable supervision and robust feature learning under limited annotations. Extensive experiments on fetal ultrasound datasets demonstrate that BECM-Net achieves state-of-the-art performance, with particularly notable gains in challenging regions with ambiguous pubic symphysis and fetal head boundaries.

AAAI Conference 2026 Conference Paper

Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute

  • Jianhao Chen
  • Zishuo Xun
  • Bocheng Zhou
  • Han Qi
  • Hangfan Zhang
  • Qiaosheng Zhang
  • Yang Chen
  • Wei Hu

This paper presents a simple, effective, and cost-efficient strategy, named ModelSwitch, to improve LLM performance by scaling test-time compute. ModelSwitch builds upon the repeated-sampling-then-voting framework, with a novel twist: incorporating multiple models, even weaker ones, to leverage their complementary strengths that potentially arise from diverse training data and paradigms. By using sample consistency as a signal, our strategy dynamically switches between models. Theoretical analysis highlights the efficiency and performance advantages of our strategy. Extensive experiments on seven datasets demonstrate that our strategy not only outperforms self-consistency and state-of-the-art multi-agent debate approaches, but also significantly reduces inference costs. Additionally, our strategy requires only a few comparable LLMs to achieve optimal performance and can be extended with verification methods, demonstrating the potential of leveraging multiple LLMs in the generation-verification paradigm.
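
For intuition, here is a minimal Python sketch of the consistency-based switching loop the abstract describes. The `sample` callback, sample budget `k`, and `threshold` are illustrative assumptions, not the paper's settings.

```python
from collections import Counter

def model_switch(models, question, sample, k=8, threshold=0.6):
    """Query models in order; stop as soon as one model's k sampled
    answers agree strongly enough (majority fraction >= threshold)."""
    best_answer, best_score = None, -1.0
    for model in models:
        answers = [sample(model, question) for _ in range(k)]
        answer, count = Counter(answers).most_common(1)[0]
        score = count / k                 # sample-consistency signal
        if score >= threshold:
            return answer                 # consistent enough: no switch needed
        if score > best_score:
            best_answer, best_score = answer, score
    return best_answer                    # fall back to most consistent answer
```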

JBHI Journal 2026 Journal Article

MsGA: Gestational Age Estimation with Multi-plane Unified Measurements Driven by Anatomic Segmentation

  • Mingjun Huang
  • Junbo Zhang
  • Wei Hu
  • Chao Sun
  • Xiantao Cai
  • Bo Du

An accurate estimation of gestational age is critical for prenatal care and clinical decision-making. Existing ultrasound-based gestational age estimation methods are limited by the insufficient information representation capacity of conventional medical segmentation models, noise interference in ultrasound images, and inter-observer variability in traditional geometry-based measurement methods. To address these challenges, we propose the MsGA model to estimate gestational age with multi-plane unified measurements driven by anatomic segmentation. In the anatomic segmentation stage, a lightweight and high-performance LGF-UNet module is proposed, which utilizes the Deep Patch Embedding module to expand the receptive field, the Local-Global Fusion Transformer block to enhance local-global feature fusion, and the Focusing Attention Bottleneck module to suppress ultrasound noise via an adaptive threshold. In the measurement stage, a Point Regression module is introduced to refine biometric landmark localization. Furthermore, we create a fully annotated ultrasound plane dataset for the estimation of gestational age across various gestational stages. Extensive experiments on the dataset have demonstrated the effectiveness of the whole model and each module. Our MsGA model is superior to existing models with fewer parameters and achieves state-of-the-art performance on the Gestational Age Estimation task.

AAAI Conference 2026 Conference Paper

ProtSAE: Disentangling and Interpreting Protein Language Models via Semantically-Guided Sparse Autoencoders

  • Xiangyu Liu
  • Haodi Lei
  • Yi Liu
  • Yang Liu
  • Wei Hu

Sparse Autoencoder (SAE) has emerged as a powerful tool for mechanistic interpretability of large language models. Recent works apply SAE to protein language models (PLMs), aiming to extract and analyze biologically meaningful features from their latent spaces. However, SAE suffers from semantic entanglement, where individual neurons often mix multiple nonlinear concepts, making it difficult to reliably interpret or manipulate model behaviors. In this paper, we propose a semantically-guided SAE, called ProtSAE. Unlike existing SAEs, which require annotation datasets to filter and interpret activations, we guide semantic disentanglement during training using both annotation datasets and domain knowledge to mitigate the effects of entangled attributes. We design interpretability experiments showing that ProtSAE learns more biologically relevant and interpretable hidden features compared to previous methods. Performance analyses further demonstrate that ProtSAE maintains high reconstruction fidelity while achieving better results in interpretable probing. We also show the potential of ProtSAE in steering PLMs for downstream generation tasks.
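
As background, a minimal PyTorch sketch of the plain SAE this work builds on: an overcomplete ReLU latent trained with reconstruction plus L1 sparsity. ProtSAE's semantic guidance from annotations and domain knowledge is not modeled here, and the sizes and penalty weight are placeholders.

```python
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    """Plain SAE over model activations (no semantic guidance)."""
    def __init__(self, d_model: int, d_latent: int):
        super().__init__()
        self.enc = nn.Linear(d_model, d_latent)   # typically d_latent >> d_model
        self.dec = nn.Linear(d_latent, d_model)

    def forward(self, h):
        z = torch.relu(self.enc(h))               # sparse feature activations
        return self.dec(z), z

def sae_loss(sae, h, l1=1e-3):
    """Reconstruction fidelity plus a sparsity penalty on the latent."""
    h_hat, z = sae(h)
    return ((h_hat - h) ** 2).mean() + l1 * z.abs().mean()
```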

TMLR Journal 2026 Journal Article

Sparse Mean Estimation in Adversarial Settings via Incremental Learning

  • Jianhao Ma
  • Rui Ray Chen
  • Yinghui He
  • Salar Fattahi
  • Wei Hu

In this paper, we study the problem of sparse mean estimation under adversarial corruptions, where the goal is to estimate the $k$-sparse mean of a heavy-tailed distribution from samples contaminated by adversarial noise. Existing methods face two key limitations: they require prior knowledge of the sparsity level $k$ and scale poorly in high-dimensional settings. We propose a simple and scalable estimator that addresses both challenges. Specifically, it learns the $k$-sparse mean without knowing $k$ in advance and operates in near-linear time and memory with respect to the ambient dimension. Under a moderate signal-to-noise ratio, our method achieves the optimal statistical rate, matching the information-theoretic lower bound. Extensive simulations corroborate our theoretical guarantees. At the heart of our approach is an incremental learning phenomenon: we show that a basic subgradient method applied to a nonconvex two-layer formulation with an $\ell_1$-loss can incrementally learn the $k$ nonzero components of the true mean while suppressing the rest. More broadly, our work is the first to reveal the incremental learning phenomenon of the subgradient method in the presence of heavy-tailed distributions and adversarial corruption.
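
A toy numpy sketch of the kind of nonconvex two-layer formulation plus subgradient method the abstract refers to, assuming a Hadamard-product parameterization theta = u * v of the mean and plain subgradient steps on the l1 loss; the paper's exact formulation and step sizes may differ.

```python
import numpy as np

def subgrad_sparse_mean(X, steps=2000, lr=0.01, init_scale=1e-3, seed=0):
    """Subgradient descent on f(u, v) = mean_i ||u * v - x_i||_1. Small
    initialization drives the coordinate-by-coordinate ("incremental
    learning") behavior the abstract describes."""
    n, d = X.shape
    rng = np.random.default_rng(seed)
    u = init_scale * rng.standard_normal(d)
    v = init_scale * rng.standard_normal(d)
    for _ in range(steps):
        g = np.sign(u * v - X).mean(axis=0)    # subgradient of the l1 loss in theta
        u, v = u - lr * g * v, v - lr * g * u  # chain rule through theta = u * v
    return u * v
```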

AAAI Conference 2026 Conference Paper

Spatial-Spectral Homogeneous Attacks on Physical-World Large Vision-Language Models

  • Daizong Liu
  • Baoquan Chen
  • Wei Hu

Although large vision-language models (LVLMs) have demonstrated promising versatile capabilities on various downstream tasks, they are shown to be susceptible to adversarial examples. Existing LVLM attackers simply implement adversarial patterns in an impracticable setting: i) they add digital global perturbations to the entire input image; ii) they access prior knowledge of LVLMs for optimization; iii) they do not consider realistic transformations. These assumptions make them difficult to deploy in physical-world attack scenarios. Motivated by this research gap, this paper proposes the first practical LVLM attack method based on a novel adversarial patch design, which works in both physical and digital attack settings without using any LVLM details. In particular, we introduce adversarial homogeneous constraints in both the spatial and spectral domains to improve the patch's stealthiness against potential real-world defenses. Besides, we also develop a new technique for synthesizing reasonably realistic transformations that capture the expected patch appearance variations in daily life. Extensive experiments are conducted to verify the strong adversarial capabilities of our proposed attack against prevalent LVLMs spanning a spectrum of tasks.

NeurIPS Conference 2025 Conference Paper

Benign Overfitting in Single-Head Attention

  • Roey Magen
  • Shuning Shang
  • Zhiwei Xu
  • Spencer Frei
  • Wei Hu
  • Gal Vardi

The phenomenon of benign overfitting, where a trained neural network perfectly fits noisy training data but still achieves near-optimal test performance, has been extensively studied in recent years for linear models and fully-connected/convolutional networks. In this work, we study benign overfitting in a single-head softmax attention model, which is the fundamental building block of Transformers. We prove that under appropriate conditions, the model exhibits benign overfitting in a classification setting already after two steps of gradient descent. Moreover, we show conditions where a minimum-norm/maximum-margin interpolator exhibits benign overfitting. We study how the overfitting behavior depends on the signal-to-noise ratio (SNR) of the data distribution, namely, the ratio between norms of signal and noise tokens, and prove that a sufficiently large SNR is both necessary and sufficient for benign overfitting.

AAAI Conference 2025 Conference Paper

Controllable Protein Sequence Generation with LLM Preference Optimization

  • Xiangyu Liu
  • Yi Liu
  • Silei Chen
  • Wei Hu

Designing proteins with specific attributes offers an important solution to address biomedical challenges. Pre-trained protein large language models (LLMs) have shown promising results on protein sequence generation. However, to control sequence generation for specific attributes, existing work still exhibits poor functionality and structural stability. In this paper, we propose a novel controllable protein design method called CtrlProt. We finetune a protein LLM with a new multi-listwise preference optimization strategy to improve generation quality and support multi-attribute controllable generation. Experiments demonstrate that CtrlProt can meet functionality and structural stability requirements effectively, achieving state-of-the-art performance in both single-attribute and multi-attribute protein sequence generation.

ICLR Conference 2025 Conference Paper

Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model

  • Zhiwei Xu
  • Zhiyu Ni
  • Yixin Wang
  • Wei Hu

"Grokking" is a phenomenon where a neural network first memorizes training data and generalizes poorly, but then suddenly transitions to near-perfect generalization after prolonged training. While intriguing, this delayed generalization phenomenon compromises predictability and efficiency. Ideally, models should generalize directly without delay. To this end, this paper proposes GrokTransfer, a simple and principled method for accelerating grokking in training neural networks, based on the key observation that data embedding plays a crucial role in determining whether generalization is delayed. GrokTransfer first trains a smaller, weaker model to reach a nontrivial (but far from optimal) test performance. Then, the learned input embedding from this weaker model is extracted and used to initialize the embedding in the target, stronger model. We rigorously prove that, on a synthetic XOR task where delayed generalization always occurs in normal training, GrokTransfer enables the target model to generalize directly without delay. Moreover, we demonstrate that, across empirical studies of different tasks, GrokTransfer effectively reshapes the training dynamics and eliminates delayed generalization, for both fully-connected neural networks and Transformers.
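
A minimal PyTorch sketch of the embedding-transfer step; the toy architecture, attribute names, and widths below are hypothetical stand-ins for the paper's weak and strong models.

```python
import torch
import torch.nn as nn

class TinyNet(nn.Module):
    """Toy model: input embedding followed by an MLP head."""
    def __init__(self, vocab=100, d_embed=32, width=64):
        super().__init__()
        self.embed = nn.Embedding(vocab, d_embed)
        self.head = nn.Sequential(
            nn.Linear(d_embed, width), nn.ReLU(), nn.Linear(width, vocab))

    def forward(self, tokens):
        return self.head(self.embed(tokens))

weak, strong = TinyNet(width=16), TinyNet(width=512)
# ... train `weak` to nontrivial (not necessarily good) test accuracy ...
with torch.no_grad():
    strong.embed.weight.copy_(weak.embed.weight)  # transfer the learned embedding
# ... then train `strong` from this initialization; the paper reports this
# removes the grokking delay on tasks where it otherwise always occurs ...
```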

ICLR Conference 2025 Conference Paper

Swing-by Dynamics in Concept Learning and Compositional Generalization

  • Yongyi Yang
  • Core Francisco Park
  • Ekdeep Singh Lubana
  • Maya Okawa
  • Wei Hu
  • Hidenori Tanaka

Prior work has shown that text-conditioned diffusion models can learn to identify and manipulate primitive concepts underlying a compositional data-generating process, enabling generalization to entirely novel, out-of-distribution compositions. Beyond performance evaluations, these studies develop a rich empirical phenomenology of learning dynamics, showing that models generalize sequentially, respecting the compositional hierarchy of the data-generating process. Moreover, concept-centric structures within the data significantly influence the speed at which a model learns to manipulate a concept. In this paper, we aim to better characterize these empirical results from a theoretical standpoint. Specifically, we propose an abstraction of prior work's compositional generalization problem by introducing a structured identity mapping (SIM) task, where a model is trained to learn the identity mapping on a Gaussian mixture with structurally organized centroids. We mathematically analyze the learning dynamics of neural networks trained on this SIM task and show that, despite its simplicity, SIM's learning dynamics capture and help explain key empirical observations on compositional generalization with diffusion models identified in prior work. Our theory also offers several new insights; e.g., we find a novel mechanism for non-monotonic learning dynamics of test loss in early phases of training. We validate our new predictions by training a text-conditioned diffusion model, bridging our simplified framework and complex generative models. Overall, this work establishes the SIM task as a meaningful theoretical abstraction of concept learning dynamics in modern generative models.

NeurIPS Conference 2025 Conference Paper

Towards Building Model/Prompt-Transferable Attackers against Large Vision-Language Models

  • Xiaowen Cai
  • Daizong Liu
  • Xiaoye Qu
  • Xiang Fang
  • Jianfeng Dong
  • Keke Tang
  • Pan Zhou
  • Lichao Sun

Although Large Vision-Language Models (LVLMs) exhibit impressive multimodal capabilities, their vulnerability to adversarial examples has raised serious security concerns. Existing LVLM attackers simply optimize adversarial images that easily overfit a certain model/prompt, making them ineffective once they are transferred to attack a different model/prompt. Motivated by this research gap, this paper aims to develop a more powerful attack that is transferable to black-box LVLM models of different structures and task-aware prompts of different semantics. Specifically, we introduce a new perspective of information theory to investigate LVLMs' transferable characteristics by exploring the relative dependence between outputs of the LVLM model and input adversarial samples. Our empirical observations suggest that enlarging/decreasing the mutual information between outputs and the disentangled adversarial/benign patterns of input images helps to generate more agnostic perturbations for misleading LVLMs' perception with better transferability. In particular, we formulate the complicated calculation of information gain as an estimation problem and incorporate such informative constraints into the adversarial learning process. Extensive experiments on various LVLM models/prompts demonstrate our significant transfer-attack performance.

JMLR Journal 2025 Journal Article

Understanding Deep Representation Learning via Layerwise Feature Compression and Discrimination

  • Peng Wang
  • Xiao Li
  • Can Yaras
  • Zhihui Zhu
  • Laura Balzano
  • Wei Hu
  • Qing Qu

Over the past decade, deep learning has proven to be a highly effective tool for learning meaningful features from raw data. However, it remains an open question how deep networks perform hierarchical feature learning across layers. In this work, we attempt to unveil this mystery by investigating the structures of intermediate features. Motivated by our empirical findings that linear layers mimic the roles of deep layers in nonlinear networks for feature learning, we explore how deep linear networks transform input data into output by investigating the output (i.e., features) of each layer after training in the context of multi-class classification problems. Toward this goal, we first define metrics to measure within-class compression and between-class discrimination of intermediate features, respectively. Through theoretical analysis of these two metrics, we show that the evolution of features follows a simple and quantitative pattern from shallow to deep layers when the input data is nearly orthogonal and the network weights are minimum-norm, balanced, and approximately low-rank: each layer of the linear network progressively compresses within-class features at a geometric rate and discriminates between-class features at a linear rate with respect to the number of layers that data have passed through. To the best of our knowledge, this is the first quantitative characterization of feature evolution in hierarchical representations of deep linear networks. Moreover, our extensive experiments not only validate our theoretical results but also reveal a similar pattern in deep nonlinear networks, which aligns well with recent empirical studies. Finally, we demonstrate the practical value of our results in transfer learning.
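
One generic way to instantiate such layerwise metrics is a within-class to between-class scatter ratio of a layer's features; this numpy sketch is a common variant, not necessarily the paper's exact definitions.

```python
import numpy as np

def within_between_ratio(features, labels):
    """Within-class scatter over between-class scatter for one layer's
    features of shape (n, d); a ratio that shrinks with depth indicates
    progressive compression and discrimination."""
    mu = features.mean(axis=0)
    within, between, n = 0.0, 0.0, len(features)
    for c in np.unique(labels):
        Fc = features[labels == c]
        mu_c = Fc.mean(axis=0)
        within += ((Fc - mu_c) ** 2).sum() / n
        between += (len(Fc) / n) * ((mu_c - mu) ** 2).sum()
    return within / between
```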

NeurIPS Conference 2025 Conference Paper

What Happens During the Loss Plateau? Understanding Abrupt Learning in Transformers

  • Pulkit Gopalani
  • Wei Hu

Training Transformers on algorithmic tasks frequently demonstrates an intriguing abrupt learning phenomenon: an extended performance plateau followed by a sudden, sharp improvement. This work investigates the underlying mechanisms for such dynamics, primarily in shallow Transformers. We reveal that during the plateau, the model often develops an interpretable partial solution while simultaneously exhibiting a strong repetition bias in its outputs. This output degeneracy is accompanied by internal representation collapse, where hidden states across different tokens become nearly parallel. We further identify the slow learning of optimal attention maps as a key bottleneck. Hidden progress in attention configuration during the plateau precedes the eventual rapid convergence, and directly intervening on attention significantly alters plateau duration and the severity of repetition bias and representational collapse. We validate that these identified phenomena—repetition bias and representation collapse—are not artifacts of toy setups but also manifest in the early pre-training stage of large language models like Pythia and OLMo.

NeurIPS Conference 2024 Conference Paper

A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context Reasoning

  • Yuanning Cui
  • Zequn Sun
  • Wei Hu

Extensive knowledge graphs (KGs) have been constructed to facilitate knowledge-driven tasks across various scenarios. However, existing work usually develops separate reasoning models for different KGs, lacking the ability to generalize and transfer knowledge across diverse KGs and reasoning settings. In this paper, we propose a prompt-based KG foundation model via in-context learning, namely KG-ICL, to achieve a universal reasoning ability. Specifically, we introduce a prompt graph centered with a query-related example fact as context to understand the query relation. To encode prompt graphs with the generalization ability to unseen entities and relations in queries, we first propose a unified tokenizer that maps entities and relations in prompt graphs to predefined tokens. Then, we propose two message passing neural networks to perform prompt encoding and KG reasoning, respectively. We conduct evaluation on 43 different KGs in both transductive and inductive settings. Results indicate that the proposed KG-ICL outperforms baselines on most datasets, showcasing its outstanding generalization and universal reasoning capabilities. The source code is accessible on GitHub: https://github.com/nju-websoft/KG-ICL.

NeurIPS Conference 2024 Conference Paper

Abrupt Learning in Transformers: A Case Study on Matrix Completion

  • Pulkit Gopalani
  • Ekdeep S. Lubana
  • Wei Hu

Recent analysis on the training dynamics of Transformers has unveiled an interesting characteristic: the training loss plateaus for a significant number of training steps, and then suddenly (and sharply) drops to near-optimal values. To understand this phenomenon in depth, we formulate the low-rank matrix completion problem as a masked language modeling (MLM) task, and show that it is possible to train a BERT model to solve this task to low error. Furthermore, the loss curve shows a plateau early in training followed by a sudden drop to near-optimal values, despite no changes in the training procedure or hyper-parameters. To gain interpretability insights into this sudden drop, we examine the model's predictions, attention heads, and hidden states before and after this transition. Concretely, we observe that (a) the model transitions from simply copying the masked input to accurately predicting the masked entries; (b) the attention heads transition to interpretable patterns relevant to the task; and (c) the embeddings and hidden states encode information relevant to the problem. We also analyze the training dynamics of individual model components to understand the sudden drop in loss.
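
To make the task setup concrete, a small numpy sketch of one masked matrix-completion instance; the BERT-style tokenization of matrix entries is omitted, and the [MASK] stand-in and sizes are illustrative.

```python
import numpy as np

def masked_completion_instance(n=7, rank=2, p_mask=0.3, seed=0):
    """One low-rank matrix completion instance posed as a masked-token task:
    the model sees the matrix with some entries masked out and is trained
    to predict the masked entries."""
    rng = np.random.default_rng(seed)
    M = rng.standard_normal((n, rank)) @ rng.standard_normal((rank, n))
    mask = rng.random((n, n)) < p_mask
    inputs = M.copy()
    inputs[mask] = np.nan          # stand-in for the [MASK] token
    return inputs, mask, M[mask]   # observed matrix, mask, prediction targets
```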

ICLR Conference 2024 Conference Paper

Benign Overfitting and Grokking in ReLU Networks for XOR Cluster Data

  • Zhiwei Xu
  • Yutong Wang 0002
  • Spencer Frei
  • Gal Vardi
  • Wei Hu

Neural networks trained by gradient descent (GD) have exhibited a number of surprising generalization behaviors. First, they can achieve a perfect fit to noisy training data and still generalize near-optimally, showing that overfitting can sometimes be benign. Second, they can undergo a period of classical, harmful overfitting---achieving a perfect fit to training data with near-random performance on test data---before transitioning ("grokking") to near-optimal generalization later in training. In this work, we show that both of these phenomena provably occur in two-layer ReLU networks trained by GD on XOR cluster data where a constant fraction of the training labels are flipped. In this setting, we show that after the first step of GD, the network achieves 100% training accuracy, perfectly fitting the noisy labels in the training data, but achieves near-random test accuracy. At a later training step, the network achieves near-optimal test accuracy while still fitting the random labels in the training data, exhibiting a "grokking" phenomenon. This provides the first theoretical result of benign overfitting in neural network classification when the data distribution is not linearly separable. Our proofs rely on analyzing the feature learning process under GD, which reveals that the network implements a non-generalizable linear classifier after one step and gradually learns generalizable features in later steps.
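
A small numpy sketch of XOR cluster data with flipped labels in the spirit of this setup; the cluster means, noise scale, and flip rate are illustrative choices, not the paper's.

```python
import numpy as np

def xor_cluster_data(n=500, d=50, flip=0.15, seed=0):
    """XOR cluster data with label noise: class +1 points near +/-mu1 and
    class -1 points near +/-mu2, with mu1 orthogonal to mu2 and a constant
    fraction of training labels flipped."""
    rng = np.random.default_rng(seed)
    mu1, mu2 = np.zeros(d), np.zeros(d)
    mu1[0] = mu2[1] = 4.0                          # illustrative signal strength
    is_pos = rng.random(n) < 0.5
    signs = rng.choice([-1.0, 1.0], size=n)
    centers = np.where(is_pos[:, None], mu1, mu2) * signs[:, None]
    X = centers + rng.standard_normal((n, d))
    y = np.where(is_pos, 1, -1)
    y[rng.random(n) < flip] *= -1                  # flipped labels
    return X, y
```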

TMLR Journal 2024 Journal Article

Bias Amplification Enhances Minority Group Performance

  • Gaotang Li
  • Jiarui Liu
  • Wei Hu

Neural networks produced by standard training are known to suffer from poor accuracy on rare subgroups despite achieving high accuracy on average, due to the correlations between certain spurious features and labels. Previous approaches based on worst-group loss minimization (e.g. Group-DRO) are effective in improving worst-group accuracy but require expensive group annotations for all the training samples. In this paper, we focus on the more challenging and realistic setting where group annotations are only available on a small validation set or are not available at all. We propose BAM, a novel two-stage training algorithm: in the first stage, the model is trained using a bias amplification scheme via introducing a learnable auxiliary variable for each training sample; in the second stage, we upweight the samples that the bias-amplified model misclassifies, and then continue training the same model on the reweighted dataset. Empirically, BAM achieves competitive performance compared with existing methods evaluated on spurious correlation benchmarks in computer vision and natural language processing. Moreover, we find a simple stopping criterion based on minimum class accuracy difference that can remove the need for group annotations, with little or no loss in worst-group accuracy. We perform extensive analyses and ablations to verify the effectiveness and robustness of our algorithm in varying class and group imbalance ratios.
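
A minimal PyTorch sketch of the two stages; the auxiliary-variable wiring and upweighting rule are assumptions about the general shape of the method, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

# aux = torch.zeros(n_train, n_classes, requires_grad=True)  # one row per sample

def stage1_loss(model, aux, x, y, idx, lam=0.5):
    """Stage 1 (bias amplification): each training sample i carries a
    learnable logit offset aux[i]; easy, bias-aligned samples get absorbed
    by the offsets, amplifying reliance on spurious features."""
    logits = model(x) + lam * aux[idx]
    return F.cross_entropy(logits, y)

def stage2_weights(model, x, y, upweight=20.0):
    """Stage 2: upweight the samples the bias-amplified model misclassifies,
    then continue training the same model on the reweighted data."""
    with torch.no_grad():
        wrong = (model(x).argmax(dim=1) != y).float()
    return 1.0 + (upweight - 1.0) * wrong          # per-sample loss weights
```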

ICML Conference 2024 Conference Paper

DFlow: A Generative Model Combining Denoising AutoEncoder and Normalizing Flow for High Fidelity Waveform Generation

  • Chenfeng Miao
  • Qingying Zhu
  • Minchuan Chen
  • Wei Hu
  • Zijian Li
  • Shaojun Wang
  • Jing Xiao 0006

In this work, we present DFlow, a novel generative framework that combines Normalizing Flow (NF) with a Denoising AutoEncoder (DAE), for high-fidelity waveform generation. With a carefully designed structure, DFlow seamlessly integrates the capabilities of both NF and DAE, resulting in a significantly improved performance compared to the standard NF models. Experimental results showcase DFlow’s superiority, achieving the highest MOS score among the existing methods on commonly used datasets and the fastest synthesis speed among all likelihood models. We further demonstrate the generalization ability of DFlow by generating high-quality out-of-distribution audio samples, such as singing and music audio. Additionally, we extend the model capacity of DFlow by scaling up both the model size and training set size. Our large-scale universal vocoder, DFlow-XL, achieves highly competitive performance against the best universal vocoder, BigVGAN.

AAAI Conference 2024 Conference Paper

DHGCN: Dynamic Hop Graph Convolution Network for Self-Supervised Point Cloud Learning

  • Jincen Jiang
  • Lizhi Zhao
  • Xuequan Lu
  • Wei Hu
  • Imran Razzak
  • Meili Wang

Recent works attempt to extend Graph Convolution Networks (GCNs) to point clouds for classification and segmentation tasks. These works tend to sample and group points to create smaller point sets locally and mainly focus on extracting local features through GCNs, while ignoring the relationship between point sets. In this paper, we propose the Dynamic Hop Graph Convolution Network (DHGCN) for explicitly learning the contextual relationships between the voxelized point parts, which are treated as graph nodes. Motivated by the intuition that the contextual information between point parts lies in the pairwise adjacent relationship, which can be depicted by the hop distance of the graph quantitatively, we devise a novel self-supervised part-level hop distance reconstruction task and design a novel loss function accordingly to facilitate training. In addition, we propose the Hop Graph Attention (HGA), which takes the learned hop distance as input for producing attention weights to allow edge features to contribute distinctively in aggregation. Eventually, the proposed DHGCN is a plug-and-play module that is compatible with point-based backbone networks. Comprehensive experiments on different backbones and tasks demonstrate that our self-supervised method achieves state-of-the-art performance. Our source codes are available at: https://github.com/Jinec98/DHGCN.

AAAI Conference 2024 Conference Paper

Explicitly Perceiving and Preserving the Local Geometric Structures for 3D Point Cloud Attack

  • Daizong Liu
  • Wei Hu

Deep learning models for point clouds have been shown to be vulnerable to adversarial attacks, which have received increasing attention in various safety-critical applications such as autonomous driving, robotics, and surveillance. Existing 3D attack methods generally employ global distance losses to implicitly constrain the point-wise perturbations for optimization. However, these simple losses are quite difficult to accurately measure and restrict the proper 3D geometry as point clouds are highly structured. Although a few recent works try to exploit additional shape-aware surface knowledge to globally constrain the point position, they still fail to preserve the detailed point-to-point geometric dependency in different local regions. To this end, in this paper, we propose a novel Multi-grained Geometry-aware Attack (MGA), which explicitly captures the local topology characteristics in different 3D regions for adversarial constraint. Specifically, we first develop multi-scale spectral local filter banks adapting to different 3D object shapes to explore potential geometric structures in local regions. Considering that objects may contain complex geometries, we then extend each filter bank into multi-layer ones to gradually capture the topology contexts of the same region in a coarse-to-fine manner. Hence, the focused local geometric structures will be highlighted in the coefficients calculated by the filtering process. At last, by restricting these coefficients between benign and adversarial samples, our MGA is able to properly measure and preserve the detailed geometry contexts in the whole 3D object with trivial perturbations. Extensive experiments demonstrate that our attack can achieve superior performance on various 3D classification models, with satisfying adversarial imperceptibility and strong resistance to different defense methods.

ICLR Conference 2024 Conference Paper

How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations

  • Tianyu Guo 0004
  • Wei Hu
  • Song Mei
  • Huan Wang 0016
  • Caiming Xiong
  • Silvio Savarese
  • Yu Bai 0017

While large language models based on the transformer architecture have demonstrated remarkable in-context learning (ICL) capabilities, understanding of such capabilities is still at an early stage, where existing theory and mechanistic understanding focus mostly on simple scenarios such as learning simple function classes. This paper takes initial steps toward understanding ICL in more complex scenarios, by studying learning with representations. Concretely, we construct synthetic in-context learning problems with a compositional structure, where the label depends on the input through a possibly complex but fixed representation function, composed with a linear function that differs in each instance. By construction, the optimal ICL algorithm first transforms the inputs by the representation function, and then performs linear ICL on top of the transformed dataset. We show theoretically the existence of transformers that approximately implement such algorithms with mild depth and size. Empirically, we find trained transformers consistently achieve near-optimal ICL performance in this setting, and exhibit the desired dissection where lower layers transform the dataset and upper layers perform linear ICL. Through extensive probing and a new pasting experiment, we further reveal several mechanisms within the trained transformers, such as concrete copying behaviors on both the inputs and the representations, linear ICL capability of the upper layers alone, and a post-ICL representation selection mechanism in a harder mixture setting. These observed mechanisms align well with our theory and may shed light on how transformers perform ICL in more realistic scenarios.
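
A tiny numpy sketch of the synthetic construction: a representation function that is fixed across instances, composed with a linear map drawn fresh per instance. The tanh representation here is an arbitrary stand-in, not the paper's choice.

```python
import numpy as np

def icl_instance(n_examples=20, d=10, rep_seed=0, task_seed=1):
    """One synthetic ICL instance: label = linear(rep(input)), where rep is
    shared across instances (fixed rep_seed) and the linear map w is fresh
    per instance (task_seed varies)."""
    R = np.random.default_rng(rep_seed).standard_normal((d, d))  # fixed rep params
    rng = np.random.default_rng(task_seed)
    w = rng.standard_normal(d)                 # per-instance linear map
    X = rng.standard_normal((n_examples, d))
    y = np.tanh(X @ R) @ w                     # label = linear(rep(input))
    return X, y
```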

AAAI Conference 2024 Conference Paper

Knowledge Graph Error Detection with Contrastive Confidence Adaption

  • Xiangyu Liu
  • Yang Liu
  • Wei Hu

Knowledge graphs (KGs) often contain various errors. Previous works on detecting errors in KGs mainly rely on triplet embedding from graph structure. We conduct an empirical study and find that these works struggle to discriminate noise from semantically-similar correct triplets. In this paper, we propose a KG error detection model CCA to integrate both textual and graph structural information from triplet reconstruction for better distinguishing semantics. We design interactive contrastive learning to capture the differences between textual and structural patterns. Furthermore, we construct realistic datasets with semantically-similar noise and adversarial noise. Experimental results demonstrate that CCA outperforms state-of-the-art baselines, especially on semantically-similar noise and adversarial noise.

JBHI Journal 2024 Journal Article

S3-Net: A Self-Supervised Dual-Stream Network for Radiology Report Generation

  • Renjie Pan
  • Ruisheng Ran
  • Wei Hu
  • Wenfeng Zhang
  • Qibing Qin
  • Shaoguo Cui

Intelligent medicine is eager to automatically generate radiology reports to ease the tedious work of radiologists. Previous research mainly focused on text generation with an encoder-decoder structure, while CNN-based visual feature extractors ignored long-range dependencies correlated with textual information. Besides, few studies exploit cross-modal mappings to promote radiology report generation. To alleviate the above problems, we propose a novel end-to-end radiology report generation model dubbed Self-Supervised dual-Stream Network (S3-Net). Specifically, a Dual-Stream Visual Feature Extractor (DSVFE) composed of ResNet and SwinTransformer is proposed to capture more abundant and effective visual features, where the former focuses on local response and the latter explores long-range dependencies. Then, we introduce the Fusion Alignment Module (FAM) to fuse the dual-stream visual features and facilitate alignment between visual features and text features. Furthermore, Self-Supervised Learning with Mask (SSLM) is introduced to further enhance the visual feature representation ability. Experimental results on two mainstream radiology reporting datasets (IU X-ray and MIMIC-CXR) show that our proposed approach outperforms previous models in terms of language generation metrics.

AAAI Conference 2024 Conference Paper

Understanding Surprising Generalization Phenomena in Deep Learning

  • Wei Hu

Deep learning has exhibited a number of surprising generalization phenomena that are not captured by classical statistical learning theory. This talk will survey some of my work on the theoretical characterizations of several such intriguing phenomena: (1) Implicit regularization: A major mystery in deep learning is that deep neural networks can often generalize well despite their excessive expressive capacity. Towards explaining this mystery, it has been suggested that commonly used gradient-based optimization algorithms enforce certain implicit regularization which effectively constrains the model capacity. (2) Benign overfitting: In certain scenarios, a model can perfectly fit noisily labeled training data, but still achieves near-optimal test error at the same time, which is very different from the classical notion of overfitting. (3) Grokking: In certain scenarios, a model initially achieves perfect training accuracy but no generalization (i.e., no better than a random predictor), and upon further training, transitions to almost perfect generalization. Theoretically establishing these properties often involves making appropriate high-dimensional assumptions on the problem as well as a careful analysis of the training dynamics.

ICML Conference 2023 Conference Paper

Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations

  • Yongyi Yang
  • Jacob Steinhardt
  • Wei Hu

Recent work has observed an intriguing "Neural Collapse" phenomenon in well-trained neural networks, where the last-layer representations of training samples with the same label collapse into each other. This appears to suggest that the last-layer representations are completely determined by the labels, and do not depend on the intrinsic structure of the input distribution. We provide evidence that this is not a complete description, and that the apparent collapse hides important fine-grained structure in the representations. Specifically, even when representations apparently collapse, the small amount of remaining variation can still faithfully and accurately capture the intrinsic structure of the input distribution. As an example, if we train on CIFAR-10 using only 5 coarse-grained labels (by combining two classes into one super-class) until convergence, we can reconstruct the original 10-class labels from the learned representations via unsupervised clustering. The reconstructed labels achieve 93% accuracy on the CIFAR-10 test set, nearly matching the normal CIFAR-10 accuracy for the same architecture. We also provide an initial theoretical result showing the fine-grained representation structure in a simplified synthetic setting. Our results show concretely how the structure of input data can play a significant role in determining the fine-grained structure of neural representations, going beyond what Neural Collapse predicts.
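
The reconstruction experiment is easy to sketch: cluster the residual within-class variation of the last-layer features. This hypothetical helper uses scikit-learn's KMeans; feature extraction and the coarse-label training run are assumed done elsewhere.

```python
import numpy as np
from sklearn.cluster import KMeans

def recover_fine_labels(feats, coarse_labels, k_per_class=2):
    """Cluster features within each coarse class; in the abstract's CIFAR-10
    experiment (5 super-classes, k_per_class=2) this recovers the original
    10 classes from last-layer representations."""
    fine = np.zeros(len(coarse_labels), dtype=int)
    for c in np.unique(coarse_labels):
        idx = np.where(coarse_labels == c)[0]
        km = KMeans(n_clusters=k_per_class, n_init=10).fit(feats[idx])
        fine[idx] = c * k_per_class + km.labels_
    return fine   # evaluate against true fine labels up to label permutation
```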

IJCAI Conference 2023 Conference Paper

Enabling Abductive Learning to Exploit Knowledge Graph

  • Yu-Xuan Huang
  • Zequn Sun
  • Guangyao Li
  • Xiaobin Tian
  • Wang-Zhou Dai
  • Wei Hu
  • Yuan Jiang
  • Zhi-Hua Zhou

Most systems integrating data-driven machine learning with knowledge-driven reasoning usually rely on a specifically designed knowledge base to enable efficient symbolic inference. However, it could be cumbersome for the nonexpert end-users to prepare such a knowledge base in real tasks. Recent years have witnessed the success of large-scale knowledge graphs, which could be ideal domain knowledge resources for real-world machine learning tasks. However, these large-scale knowledge graphs usually contain much information that is irrelevant to a specific learning task. Moreover, they often contain a certain degree of noise. Existing methods can hardly make use of them because the large-scale probabilistic logical inference is usually intractable. To address these problems, we present ABductive Learning with Knowledge Graph (ABL-KG) that can automatically mine logic rules from knowledge graphs during learning, using a knowledge forgetting mechanism for filtering out irrelevant information. Meanwhile, these rules can form a logic program that enables efficient joint optimization of the machine learning model and logic inference within the Abductive Learning (ABL) framework. Experiments on four different tasks show that ABL-KG can automatically extract useful rules from large-scale and noisy knowledge graphs, and significantly improve the performance of machine learning with only a handful of labeled data.

NeurIPS Conference 2023 Conference Paper

Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity

  • Zhanpeng Zhou
  • Yongyi Yang
  • Xiaojiang Yang
  • Junchi Yan
  • Wei Hu

Recent work has revealed many intriguing empirical phenomena in neural network training, despite the poorly understood and highly complex loss landscapes and training dynamics. One of these phenomena, Linear Mode Connectivity (LMC), has gained considerable attention due to the intriguing observation that different solutions can be connected by a linear path in the parameter space while maintaining near-constant training and test losses. In this work, we introduce a stronger notion of linear connectivity, Layerwise Linear Feature Connectivity (LLFC), which says that the feature maps of every layer in different trained networks are also linearly connected. We provide comprehensive empirical evidence for LLFC across a wide range of settings, demonstrating that whenever two trained networks satisfy LMC (via either spawning or permutation methods), they also satisfy LLFC in nearly all the layers. Furthermore, we delve deeper into the underlying factors contributing to LLFC, which reveal new insights into the permutation approaches. The study of LLFC transcends and advances our understanding of LMC by adopting a feature-learning perspective.
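
A minimal PyTorch check of LLFC at one layer, assuming the caller has already extracted feature maps from the two endpoint networks and from the network with midpoint-interpolated weights; cosine similarity is used as a scale-insensitive proxy.

```python
import torch
import torch.nn.functional as F

def llfc_score(feats_a, feats_b, feats_mid):
    """LLFC test at one layer: features of the midpoint-interpolated network
    should match (up to scale) the average of the two endpoint networks'
    features; a mean cosine similarity near 1 at every layer supports LLFC."""
    target = 0.5 * (feats_a + feats_b)
    return F.cosine_similarity(
        feats_mid.flatten(1), target.flatten(1), dim=1).mean()
```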

ICLR Conference 2023 Conference Paper

Implicit Bias in Leaky ReLU Networks Trained on High-Dimensional Data

  • Spencer Frei
  • Gal Vardi
  • Peter L. Bartlett
  • Nathan Srebro
  • Wei Hu

The implicit biases of gradient-based optimization algorithms are conjectured to be a major factor in the success of modern deep learning. In this work, we investigate the implicit bias of gradient flow and gradient descent in two-layer fully-connected neural networks with leaky ReLU activations when the training data are nearly-orthogonal, a common property of high-dimensional data. For gradient flow, we leverage recent work on the implicit bias for homogeneous neural networks to show that asymptotically, gradient flow produces a neural network with rank at most two. Moreover, this network is an $\ell_2$-max-margin solution (in parameter space), and has a linear decision boundary that corresponds to an approximate-max-margin linear predictor. For gradient descent, provided the random initialization variance is small enough, we show that a single step of gradient descent suffices to drastically reduce the rank of the network, and that the rank remains small throughout training. We provide experiments which suggest that a small initialization scale is important for finding low-rank neural networks with gradient descent.

AAAI Conference 2023 Conference Paper

Lifelong Embedding Learning and Transfer for Growing Knowledge Graphs

  • Yuanning Cui
  • Yuxin Wang
  • Zequn Sun
  • Wenqiang Liu
  • Yiqiao Jiang
  • Kexin Han
  • Wei Hu

Existing knowledge graph (KG) embedding models have primarily focused on static KGs. However, real-world KGs do not remain static, but rather evolve and grow in tandem with the development of KG applications. Consequently, new facts and previously unseen entities and relations continually emerge, necessitating an embedding model that can quickly learn and transfer new knowledge through growth. Motivated by this, we delve into an expanding field of KG embedding in this paper, i.e., lifelong KG embedding. We consider knowledge transfer and retention of the learning on growing snapshots of a KG without having to learn embeddings from scratch. The proposed model includes a masked KG autoencoder for embedding learning and update, with an embedding transfer strategy to inject the learned knowledge into the new entity and relation embeddings, and an embedding regularization method to avoid catastrophic forgetting. To investigate the impacts of different aspects of KG growth, we construct four datasets to evaluate the performance of lifelong KG embedding. Experimental results show that the proposed model outperforms the state-of-the-art inductive and lifelong embedding baselines.

AAAI Conference 2022 Conference Paper

Ensemble Semi-supervised Entity Alignment via Cycle-Teaching

  • Kexuan Xin
  • Zequn Sun
  • Wen Hua
  • Bing Liu
  • Wei Hu
  • Jianfeng Qu
  • Xiaofang Zhou

Entity alignment is to find identical entities in different knowledge graphs. Although embedding-based entity alignment has recently achieved remarkable progress, training data insufficiency remains a critical challenge. Conventional semi-supervised methods also suffer from incorrect entity alignment in newly proposed training data. To resolve these issues, we design an iterative cycle-teaching framework for semi-supervised entity alignment. The key idea is to train multiple entity alignment models (called aligners) simultaneously and let each aligner iteratively teach its successor the proposed new entity alignment. We propose a diversity-aware alignment selection method to choose reliable entity alignment for each aligner. We also design a conflict resolution mechanism to resolve the alignment conflict when combining the new alignment of an aligner and that from its teacher. Besides, considering the influence of cycle-teaching order, we elaborately design a strategy to arrange the optimal order that can maximize the overall performance of multiple aligners. The cycle-teaching process can break the limitations of each model’s learning capability and reduce the noise in new training data, leading to improved performance. Extensive experiments on benchmark datasets demonstrate the effectiveness of the proposed cycle-teaching framework, which significantly outperforms the state-of-the-art models when the training data is insufficient and the new entity alignment has much noise.

ICML Conference 2022 Conference Paper

More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize

  • Alexander Wei 0001
  • Wei Hu
  • Jacob Steinhardt

Of theories for why large-scale machine learning models generalize despite being vastly overparameterized, which of their assumptions are needed to capture the qualitative phenomena of generalization in the real world? On one hand, we find that most theoretical analyses fall short of capturing these qualitative phenomena even for kernel regression, when applied to kernels derived from large-scale neural networks (e.g., ResNet-50) and real data (e.g., CIFAR-100). On the other hand, we find that the classical GCV estimator (Craven and Wahba, 1978) accurately predicts generalization risk even in such overparameterized settings. To bolster this empirical finding, we prove that the GCV estimator converges to the generalization risk whenever a local random matrix law holds. Finally, we apply this random matrix theory lens to explain why pretrained representations generalize better as well as what factors govern scaling laws for kernel regression. Our findings suggest that random matrix theory, rather than just being a toy model, may be central to understanding the properties of neural representations in practice.
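
The GCV estimator itself is compact for kernel ridge regression; a numpy sketch, where K is the n-by-n kernel matrix (e.g., derived from a neural network) and lam the ridge parameter.

```python
import numpy as np

def gcv_risk(K, y, lam):
    """Classical generalized cross-validation estimate:
    GCV = (1/n) * ||(I - S) y||^2 / (1 - tr(S)/n)^2,
    where S = K (K + lam*I)^{-1} is the smoother matrix."""
    n = len(y)
    S = K @ np.linalg.inv(K + lam * np.eye(n))
    resid = y - S @ y
    return (resid @ resid / n) / (1.0 - np.trace(S) / n) ** 2
```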

NeurIPS Conference 2022 Conference Paper

Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation

  • Botao Yu
  • Peiling Lu
  • Rui Wang
  • Wei Hu
  • Xu Tan
  • Wei Ye
  • Shikun Zhang
  • Tao Qin

Symbolic music generation aims to generate music scores automatically. A recent trend is to use Transformer or its variants in music generation, which is, however, suboptimal, because the full attention cannot efficiently model the typically long music sequences (e.g., over 10,000 tokens), and the existing models have shortcomings in generating musical repetition structures. In this paper, we propose Museformer, a Transformer with a novel fine- and coarse-grained attention for music generation. Specifically, with the fine-grained attention, a token of a specific bar directly attends to all the tokens of the bars that are most relevant to music structures (e.g., the previous 1st, 2nd, 4th and 8th bars, selected via similarity statistics); with the coarse-grained attention, a token only attends to the summarization of the other bars rather than each token of them so as to reduce the computational cost. The advantages are two-fold. First, it can capture both music structure-related correlations via the fine-grained attention, and other contextual information via the coarse-grained attention. Second, it is efficient and can model over 3X longer music sequences compared to its full-attention counterpart. Both objective and subjective experimental results demonstrate its ability to generate long music sequences with high quality and better structures.

TMLR Journal 2022 Journal Article

Representation Alignment in Neural Networks

  • Ehsan Imani
  • Wei Hu
  • Martha White

It is now a standard for neural network representations to be trained on large, publicly available datasets, and used for new problems. The reasons for why neural network representations have been so successful for transfer, however, are still not fully understood. In this paper we show that, after training, neural network representations align their top singular vectors to the targets. We investigate this representation alignment phenomenon in a variety of neural network architectures and find that (a) alignment emerges across a variety of different architectures and optimizers, with more alignment arising from depth; (b) alignment increases for layers closer to the output; and (c) existing high-performance deep CNNs exhibit high levels of alignment. We then highlight why alignment between the top singular vectors and the targets can speed up learning and show in a classic synthetic transfer problem that representation alignment correlates with positive and negative transfer to similar and dissimilar tasks.
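
One way to quantify this alignment (a generic variant, not necessarily the paper's exact measure): project the targets onto the left singular vectors of the feature matrix and see how much of their energy the top directions capture.

```python
import numpy as np

def alignment_curve(features, y):
    """Cumulative fraction of the targets' energy captured by the top-k left
    singular vectors of the (n, d) feature matrix, for k = 1..min(n, d).
    A curve that rises quickly indicates top-singular-vector alignment."""
    U, _, _ = np.linalg.svd(features, full_matrices=False)
    energy = (U.T @ y) ** 2        # energy of y along each singular direction
    return np.cumsum(energy) / (y @ y)
```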

ICML Conference 2021 Conference Paper

A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning

  • Nikunj Saunshi
  • Arushi Gupta
  • Wei Hu

An effective approach in meta-learning is to utilize multiple “train tasks” to learn a good initialization for model parameters that can help solve unseen “test tasks” with very few samples by fine-tuning from this initialization. Although successful in practice, theoretical understanding of such methods is limited. This work studies an important aspect of these methods: splitting the data from each task into train (support) and validation (query) sets during meta-training. Inspired by recent work (Raghu et al., 2020), we view such meta-learning methods through the lens of representation learning and argue that the train-validation split encourages the learned representation to be low-rank without compromising on expressivity, as opposed to the non-splitting variant that encourages high-rank representations. Since sample efficiency benefits from low-rankness, the splitting strategy will require very few samples to solve unseen test tasks. We present theoretical results that formalize this idea for linear representation learning on a subspace meta-learning instance, and experimentally verify this practical benefit of splitting in simulations and on standard meta-learning benchmarks.

AAAI Conference 2020 Conference Paper

Knowledge Graph Alignment Network with Gated Multi-Hop Neighborhood Aggregation

  • Zequn Sun
  • Chengming Wang
  • Wei Hu
  • Muhao Chen
  • Jian Dai
  • Wei Zhang
  • Yuzhong Qu

Graph neural networks (GNNs) have emerged as a powerful paradigm for embedding-based entity alignment due to their capability of identifying isomorphic subgraphs. However, in real knowledge graphs (KGs), the counterpart entities usually have non-isomorphic neighborhood structures, which easily causes GNNs to yield different representations for them. To tackle this problem, we propose a new KG alignment network, namely AliNet, aiming at mitigating the non-isomorphism of neighborhood structures in an end-to-end manner. As the direct neighbors of counterpart entities are usually dissimilar due to the schema heterogeneity, AliNet introduces distant neighbors to expand the overlap between their neighborhood structures. It employs an attention mechanism to highlight helpful distant neighbors and reduce noises. Then, it controls the aggregation of both direct and distant neighborhood information using a gating mechanism. We further propose a relation loss to refine entity representations. We perform thorough experiments with detailed ablation studies and analyses on five entity alignment datasets, demonstrating the effectiveness of AliNet.
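
A minimal PyTorch sketch of the gated combination of direct and distant neighborhood aggregations; the gate parameterization here is an assumption in the spirit of AliNet, not its exact layer.

```python
import torch
import torch.nn as nn

class GatedAggregation(nn.Module):
    """Gated combination of a node's direct (1-hop) aggregation and its
    attention-weighted distant (multi-hop) aggregation."""
    def __init__(self, d):
        super().__init__()
        self.gate = nn.Linear(d, d)

    def forward(self, h_direct, h_distant):
        g = torch.sigmoid(self.gate(h_direct))       # per-dimension gate
        return g * h_direct + (1.0 - g) * h_distant  # controlled mixing
```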

ICLR Conference 2020 Conference Paper

Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks

  • Wei Hu
  • Lechao Xiao
  • Jeffrey Pennington

The selection of initial parameter values for gradient-based optimization of deep neural networks is one of the most impactful hyperparameter choices in deep learning systems, affecting both convergence times and model performance. Yet despite significant empirical and theoretical analysis, relatively little has been proved about the concrete effects of different initialization schemes. In this work, we analyze the effect of initialization in deep linear networks, and provide for the first time a rigorous proof that drawing the initial weights from the orthogonal group speeds up convergence relative to the standard Gaussian initialization with iid weights. We show that for deep networks, the width needed for efficient convergence to a global minimum with orthogonal initializations is independent of the depth, whereas the width needed for efficient convergence with Gaussian initializations scales linearly in the depth. Our results demonstrate how the benefits of a good initialization can persist throughout learning, suggesting an explanation for the recent empirical successes found by initializing very deep non-linear networks according to the principle of dynamical isometry.
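
For reference, the standard recipe for drawing a weight matrix from the orthogonal group, essentially what torch.nn.init.orthogonal_ does; a numpy sketch.

```python
import numpy as np

def orthogonal_init(m, n, seed=0):
    """Draw an m x n matrix (m >= n) with orthonormal columns via QR of a
    Gaussian matrix, with a sign fix so the draw is uniform (Haar) over
    the orthogonal group."""
    rng = np.random.default_rng(seed)
    q, r = np.linalg.qr(rng.standard_normal((m, n)))
    return q * np.sign(np.diag(r))   # flip column signs to de-bias QR
```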

IS Journal 2020 Journal Article

Study on the Situational Awareness System of Mine Fire Rescue Using Faster Ross Girshick-Convolutional Neural Network

  • Jiuling Zhang
  • Yang Jia
  • Ding Zhu
  • Wei Hu
  • Zhenling Tang

With the continuous development of society and the advent of the big-data era, situational awareness systems have become increasingly prominent. Built on safety-related big data, they provide environmental, dynamic, and holistic awareness of security risks. This article therefore applies a situational awareness system to the mine fire rescue problem, with the aim of reducing the casualties and economic losses caused by mine fires. On this basis, a convolutional neural network is used for situational awareness: by progressively optimizing the algorithm from the region-based convolutional neural network (R-CNN) model to the Fast R-CNN model, an optimal Faster R-CNN model is finally proposed and applied to mine fire rescue.

NeurIPS Conference 2020 Conference Paper

The Surprising Simplicity of the Early-Time Learning Dynamics of Neural Networks

  • Wei Hu
  • Lechao Xiao
  • Ben Adlam
  • Jeffrey Pennington

Modern neural networks are often regarded as complex black-box functions whose behavior is difficult to understand owing to their nonlinear dependence on the data and the nonconvexity in their loss landscapes. In this work, we show that these common perceptions can be completely false in the early phase of learning. In particular, we formally prove that, for a class of well-behaved input distributions, the early-time learning dynamics of a two-layer fully-connected neural network can be mimicked by training a simple linear model on the inputs. We additionally argue that this surprising simplicity can persist in networks with more layers and with convolutional architecture, which we verify empirically. Key to our analysis is to bound the spectral norm of the difference between the Neural Tangent Kernel (NTK) and an affine transform of the data kernel; however, unlike many previous results utilizing the NTK, we do not require the network to have disproportionately large width, and the network is allowed to escape the kernel regime later in training.

IROS Conference 2020 Conference Paper

π-Map: A Decision-Based Sensor Fusion with Global Optimization for Indoor Mapping

  • Zhiliu Yang
  • Bo Yu 0014
  • Wei Hu
  • Jie Tang 0003
  • Shaoshan Liu
  • Chen Liu 0001

In this paper, we propose π-map, a tightly coupled fusion mechanism that dynamically consumes LiDAR and sonar data to generate reliable and scalable indoor maps for autonomous robot navigation. The key novelty of π-map over previous attempts is the utilization of a fusion mechanism that works in three stages: the first LiDAR scan matching stage efficiently generates initial key localization poses; the second optimization stage is used to eliminate errors accumulated from the previous stage and guarantees that accurate large-scale maps can be generated; then the final revisit scan fusion stage effectively fuses the LiDAR map and the sonar map to generate a highly accurate representation of the indoor environment. We evaluate π-map on both large and small environments and verify its superiority over existing fusion methods.
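
π-Map's fusion mechanism is more elaborate than any toy rule; purely to illustrate what decision-based fusion of two sensor grids means, here is a minimal per-cell sketch (the preference and tie-breaking rules are assumptions, not the authors' algorithm):

```python
import numpy as np

def fuse_grids(lidar, sonar, unknown=-1.0):
    """Toy per-cell decision fusion of two occupancy grids.

    Cells hold an occupancy probability in [0, 1], or `unknown` where the
    sensor produced no measurement (e.g., LiDAR through glass).
    """
    fused = np.where(lidar != unknown, lidar, sonar)         # prefer LiDAR where it saw the cell
    both = (lidar != unknown) & (sonar != unknown)
    fused = np.where(both, np.maximum(lidar, sonar), fused)  # be conservative about obstacles
    return fused

lidar = np.array([[0.1, -1.0], [0.9, 0.2]])
sonar = np.array([[0.2, 0.8], [-1.0, 0.1]])
print(fuse_grids(lidar, sonar))
```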

JBHI Journal 2019 Journal Article

Automated Layer Segmentation of Retinal Optical Coherence Tomography Images Using a Deep Feature Enhanced Structured Random Forests Classifier

  • Xiaoming Liu
  • Tianyu Fu
  • Zhifang Pan
  • Dong Liu
  • Wei Hu
  • Jun Liu
  • Kai Zhang

Optical coherence tomography (OCT) is a high-resolution and noninvasive imaging modality that has become one of the most prevalent techniques for ophthalmic diagnosis. Retinal layer segmentation is crucial for doctors to diagnose and study retinal diseases. However, manual segmentation is often a time-consuming and subjective process. In this work, we propose a new method for automatically segmenting retinal OCT images, which integrates deep features and hand-designed features to train a structured random forests classifier. The deep convolutional features are learned from a deep residual network. With the trained classifier, we can get the contour probability graph of each layer; finally, the shortest path is employed to achieve the final layer segmentation. The experimental results show that our method achieves good results, with a mean layer contour error of 1.215 pixels, whereas that of the state of the art is 1.464 pixels, and an F1-score of 0.885, which is also better than the 0.863 obtained by the state-of-the-art method.
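
The final shortest-path step can be illustrated with a simple column-wise dynamic program over a contour probability map (a sketch of the generic idea; the smoothness penalty is an assumption, not the authors' exact graph construction):

```python
import numpy as np

def shortest_path_boundary(prob, smooth=0.5):
    """Trace one boundary row per column through a (rows x cols) probability map.

    A pixel costs 1 - prob; moving between adjacent columns pays `smooth`
    per row of vertical jump. Returns the chosen row index per column.
    """
    rows, cols = prob.shape
    cost = 1.0 - prob
    acc = cost.copy()
    back = np.zeros((rows, cols), dtype=int)
    jump = smooth * np.abs(np.arange(rows)[:, None] - np.arange(rows)[None, :])
    for c in range(1, cols):
        total = acc[:, c - 1][None, :] + jump      # from every previous row to every row
        back[:, c] = np.argmin(total, axis=1)
        acc[:, c] = cost[:, c] + np.min(total, axis=1)
    path = np.empty(cols, dtype=int)
    path[-1] = int(np.argmin(acc[:, -1]))
    for c in range(cols - 1, 0, -1):               # backtrack the optimal path
        path[c - 1] = back[path[c], c]
    return path

prob = np.random.rand(64, 128)                     # stand-in for a contour probability map
print(shortest_path_boundary(prob)[:10])
```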

NeurIPS Conference 2019 Conference Paper

Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets

  • Rohith Kuditipudi
  • Xiang Wang
  • Holden Lee
  • Yi Zhang
  • Zhiyuan Li
  • Wei Hu
  • Rong Ge
  • Sanjeev Arora

Mode connectivity is a surprising phenomenon in the loss landscape of deep nets. Optima, at least those discovered by gradient-based optimization, turn out to be connected by simple paths on which the loss function is almost constant. Often, these paths can be chosen to be piecewise linear, with as few as two segments. We give mathematical explanations for this phenomenon, assuming generic properties (such as dropout stability and noise stability) of well-trained deep nets, which have previously been identified as part of understanding the generalization properties of deep nets. Our explanation holds for realistic multilayer nets, and experiments are presented to verify the theory.
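
The phenomenon is easy to probe numerically. The sketch below (illustrative, not the paper's construction) evaluates the loss along the straight segment between two trained copies of the same architecture; mode connectivity concerns the existence of low-loss paths with one or two bends precisely where this straight segment typically fails:

```python
import copy
import torch

def loss_along_segment(model_a, model_b, X, y, steps=11):
    """MSE loss at points interpolating model_a -> model_b in weight space."""
    losses = []
    probe = copy.deepcopy(model_a)
    pa, pb = list(model_a.parameters()), list(model_b.parameters())
    for t in torch.linspace(0, 1, steps):
        with torch.no_grad():
            for p, a, b in zip(probe.parameters(), pa, pb):
                p.copy_((1 - t) * a + t * b)       # point on the straight segment
            losses.append(torch.nn.functional.mse_loss(probe(X), y).item())
    return losses

net_a = torch.nn.Sequential(torch.nn.Linear(8, 32), torch.nn.ReLU(), torch.nn.Linear(32, 1))
net_b = torch.nn.Sequential(torch.nn.Linear(8, 32), torch.nn.ReLU(), torch.nn.Linear(32, 1))
X, y = torch.randn(128, 8), torch.randn(128, 1)
print(loss_along_segment(net_a, net_b, X, y))
```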

NeurIPS Conference 2019 Conference Paper

Implicit Regularization in Deep Matrix Factorization

  • Sanjeev Arora
  • Nadav Cohen
  • Wei Hu
  • Yuping Luo

Efforts to understand the generalization mystery in deep learning have led to the belief that gradient-based optimization induces a form of implicit regularization, a bias towards models of low "complexity." We study the implicit regularization of gradient descent over deep linear neural networks for matrix completion and sensing, a model referred to as deep matrix factorization. Our first finding, supported by theory and experiments, is that adding depth to a matrix factorization enhances an implicit tendency towards low-rank solutions, oftentimes leading to more accurate recovery. Secondly, we present theoretical and empirical arguments questioning a nascent view by which implicit regularization in matrix factorization can be captured using simple mathematical norms. Our results point to the possibility that the language of standard regularizers may not be rich enough to fully encompass the implicit regularization brought forth by gradient-based optimization.
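
A small experiment conveys the finding (a sketch under illustrative hyperparameters that may need tuning, not the paper's setup): run gradient descent on a depth-N factorization for matrix completion and inspect the singular values of the end-to-end matrix:

```python
import numpy as np

rng = np.random.default_rng(0)
n, true_rank, depth, lr = 30, 2, 2, 1.0     # depth > 1 strengthens the low-rank bias
M = rng.standard_normal((n, true_rank)) @ rng.standard_normal((true_rank, n))
mask = rng.random((n, n)) < 0.3             # observed entries for matrix completion

def product(mats):
    out = np.eye(n)
    for m in mats:
        out = out @ m
    return out

Ws = [0.1 * rng.standard_normal((n, n)) for _ in range(depth)]
for _ in range(20000):
    # Gradient of the mean squared loss on observed entries.
    R = np.where(mask, product(Ws) - M, 0.0) / mask.sum()
    grads = [product(Ws[:i]).T @ R @ product(Ws[i + 1:]).T for i in range(depth)]
    for W, g in zip(Ws, grads):
        W -= lr * g

# With small initialization, the spectrum tends toward a few dominant values.
print(np.round(np.linalg.svd(product(Ws), compute_uv=False)[:6], 3))
```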

IJCAI Conference 2019 Conference Paper

Multi-view Knowledge Graph Embedding for Entity Alignment

  • Qingheng Zhang
  • Zequn Sun
  • Wei Hu
  • Muhao Chen
  • Lingbing Guo
  • Yuzhong Qu

We study the problem of embedding-based entity alignment between knowledge graphs (KGs). Previous works mainly focus on the relational structure of entities. Some further incorporate another type of features, such as attributes, for refinement. However, a vast number of entity features remain unexplored or are not treated equally, which impairs the accuracy and robustness of embedding-based entity alignment. In this paper, we propose a novel framework that unifies multiple views of entities to learn embeddings for entity alignment. Specifically, we embed entities based on the views of entity names, relations and attributes, with several combination strategies. Furthermore, we design some cross-KG inference methods to enhance the alignment between two KGs. Our experiments on real-world datasets show that the proposed framework significantly outperforms the state-of-the-art embedding-based entity alignment methods. The selected views, cross-KG inference and combination strategies all contribute to the performance improvement.
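
As a rough illustration of the view-combination idea (not the paper's actual strategies), per-view embeddings could be normalized, averaged, and aligned by nearest neighbor under cosine similarity:

```python
import numpy as np

def combine_views(views):
    """L2-normalize each view's embedding matrix, then average the views."""
    normed = [v / np.linalg.norm(v, axis=1, keepdims=True) for v in views]
    combo = np.mean(normed, axis=0)
    return combo / np.linalg.norm(combo, axis=1, keepdims=True)

def align(e1, e2):
    """For each KG1 entity, the most similar KG2 entity (cosine on unit vectors)."""
    return np.argmax(e1 @ e2.T, axis=1)

rng = np.random.default_rng(0)
name_v, rel_v, attr_v = (rng.standard_normal((100, 64)) for _ in range(3))
kg1 = combine_views([name_v, rel_v, attr_v])
kg2 = kg1 + 0.05 * rng.standard_normal(kg1.shape)    # noisy copy as the second KG
kg2 = kg2 / np.linalg.norm(kg2, axis=1, keepdims=True)
print((align(kg1, kg2) == np.arange(100)).mean())    # alignment accuracy
```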

NeurIPS Conference 2019 Conference Paper

On Exact Computation with an Infinitely Wide Neural Net

  • Sanjeev Arora
  • Simon Du
  • Wei Hu
  • Zhiyuan Li
  • Russ Salakhutdinov
  • Ruosong Wang

How well does a classic deep net architecture like AlexNet or VGG19 classify on a standard dataset such as CIFAR-10 when its “width” (namely, the number of channels in convolutional layers and the number of nodes in fully-connected internal layers) is allowed to increase to infinity? Such questions have come to the forefront in the quest to theoretically understand deep learning and its mysteries about optimization and generalization. They also connect deep learning to notions such as Gaussian processes and kernels. A recent paper [Jacot et al., 2018] introduced the Neural Tangent Kernel (NTK) which captures the behavior of fully-connected deep nets in the infinite width limit trained by gradient descent; this object was implicit in some other recent papers. An attraction of such ideas is that a pure kernel-based method is used to capture the power of a fully-trained deep net of infinite width. The current paper gives the first efficient exact algorithm for computing the extension of NTK to convolutional neural nets, which we call Convolutional NTK (CNTK), as well as an efficient GPU implementation of this algorithm. This results in a significant new benchmark for performance of a pure kernel-based method on CIFAR-10, being 10% higher than the methods reported in [Novak et al., 2019], and only 6% lower than the performance of the corresponding finite deep net architecture (once batch normalization etc. are turned off). Theoretically, we also give the first non-asymptotic proof showing that a fully-trained sufficiently wide net is indeed equivalent to the kernel regression predictor using NTK.
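
The convolutional case (CNTK) is the paper's technical contribution; the fully-connected NTK recursion it extends can be sketched in a few lines (standard arc-cosine formulas for ReLU with He scaling; the inputs below are illustrative):

```python
import numpy as np

def relu_ntk(x1, x2, depth):
    """Infinite-width NTK of a fully-connected ReLU network (He scaling, c = 2),
    via the standard arc-cosine kernel recursion."""
    s11, s22 = x1 @ x1, x2 @ x2        # diagonal entries are preserved by this scaling
    s12 = x1 @ x2
    theta = s12                        # NTK starts from the input kernel
    for _ in range(depth):
        lam = np.clip(s12 / np.sqrt(s11 * s22), -1.0, 1.0)
        ang = np.arccos(lam)
        sdot = (np.pi - ang) / np.pi   # kernel of the ReLU derivative
        s12 = np.sqrt(s11 * s22) * (np.sin(ang) + (np.pi - ang) * lam) / np.pi
        theta = s12 + sdot * theta     # Theta^(h) = Sigma^(h) + Sigma_dot^(h) * Theta^(h-1)
    return theta

x1, x2 = np.random.randn(10), np.random.randn(10)
print(relu_ntk(x1, x2, depth=3))
```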

NeurIPS Conference 2018 Conference Paper

Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced

  • Simon Du
  • Wei Hu
  • Jason Lee

We study the implicit regularization imposed by gradient descent for learning multi-layer homogeneous functions including feed-forward fully connected and convolutional deep neural networks with linear, ReLU or Leaky ReLU activation. We rigorously prove that gradient flow (i.e., gradient descent with infinitesimal step size) effectively enforces the differences between squared norms across different layers to remain invariant without any explicit regularization. This result implies that if the weights are initially small, gradient flow automatically balances the magnitudes of all layers. Using a discretization argument, we analyze gradient descent with positive step size for the non-convex low-rank asymmetric matrix factorization problem without any regularization. Inspired by our findings for gradient flow, we prove that gradient descent with step sizes $\eta_t = O(t^{-(1/2+\delta)})$ for $0 < \delta \le 1/2$ automatically balances two low-rank factors and converges to a bounded global optimum. Furthermore, for rank-1 asymmetric matrix factorization we give a finer analysis showing gradient descent with constant step size converges to the global minimum at a globally linear rate. We believe that the idea of examining the invariance imposed by first order algorithms in learning homogeneous models could serve as a fundamental building block for studying optimization for learning deep models.
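
The balancedness invariant is easy to verify numerically. Below is a minimal sketch (illustrative sizes and step size) for a two-layer linear model, where gradient descent with a tiny step approximates gradient flow:

```python
import numpy as np

rng = np.random.default_rng(0)
d, lr = 20, 1e-3                       # tiny step size approximates gradient flow
U = 0.1 * rng.standard_normal((d, d))  # two-layer linear model f(x) = U V x
V = 0.1 * rng.standard_normal((d, d))
target = rng.standard_normal((d, d))

for step in range(5001):
    R = U @ V - target                 # residual of 0.5 * ||UV - target||_F^2
    # Simultaneous update: grad_U = R V^T, grad_V = U^T R.
    U, V = U - lr * R @ V.T, V - lr * U.T @ R
    if step % 1000 == 0:
        # The squared-norm difference stays (nearly) constant along the trajectory.
        print(step, np.linalg.norm(U) ** 2 - np.linalg.norm(V) ** 2)
```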

IJCAI Conference 2018 Conference Paper

Bootstrapping Entity Alignment with Knowledge Graph Embedding

  • Zequn Sun
  • Wei Hu
  • Qingheng Zhang
  • Yuzhong Qu

Embedding-based entity alignment represents different knowledge graphs (KGs) as low-dimensional embeddings and finds entity alignment by measuring the similarities between entity embeddings. Existing approaches have achieved promising results; however, they are still challenged by the lack of enough prior alignment as labeled training data. In this paper, we propose a bootstrapping approach to embedding-based entity alignment. It iteratively labels likely entity alignment as training data for learning alignment-oriented KG embeddings. Furthermore, it employs an alignment editing method to reduce error accumulation during iterations. Our experiments on real-world datasets showed that the proposed approach significantly outperformed the state-of-the-art embedding-based ones for entity alignment. The proposed alignment-oriented KG embedding, bootstrapping process and alignment editing method all contributed to the performance improvement.
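
As a toy rendering of the bootstrapping loop (not the paper's method; the threshold and the frozen embeddings are simplifying assumptions), one round of labeling might look like this:

```python
import numpy as np

def bootstrap_align(e1, e2, seed_pairs, rounds=5, threshold=0.9):
    """Toy bootstrapping loop: iteratively promote confident matches to labels.

    e1, e2: unit-norm embedding matrices for the two KGs.
    seed_pairs: iterable of (i, j) prior alignments.
    """
    labeled = set(seed_pairs)
    for _ in range(rounds):
        sim = e1 @ e2.T                          # cosine similarity
        cand_j = sim.argmax(axis=1)
        for i, j in enumerate(cand_j):
            if sim[i, j] >= threshold:
                labeled.add((i, int(j)))         # likely alignment becomes a label
        # The real method retrains the embeddings on `labeled` here (and edits
        # conflicting labels); this sketch keeps them fixed and only grows the set.
    return labeled

rng = np.random.default_rng(0)
e1 = rng.standard_normal((50, 16))
e1 /= np.linalg.norm(e1, axis=1, keepdims=True)
e2 = e1 + 0.05 * rng.standard_normal(e1.shape)
e2 /= np.linalg.norm(e2, axis=1, keepdims=True)
print(len(bootstrap_align(e1, e2, seed_pairs=[(0, 0)])))
```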

NeurIPS Conference 2018 Conference Paper

Online Improper Learning with an Approximation Oracle

  • Elad Hazan
  • Wei Hu
  • Yuanzhi Li
  • Zhiyuan Li

We study the following question: given an efficient approximation algorithm for an optimization problem, can we learn efficiently in the same setting? We give a formal affirmative answer to this question in the form of a reduction from online learning to offline approximate optimization using an efficient algorithm that guarantees near optimal regret. The algorithm is efficient in terms of the number of oracle calls to a given approximation oracle – it makes only logarithmically many such calls per iteration. This resolves an open question by Kalai and Vempala, and by Garber. Furthermore, our result applies to the more general improper learning problems.

NeurIPS Conference 2017 Conference Paper

Linear Convergence of a Frank-Wolfe Type Algorithm over Trace-Norm Balls

  • Zeyuan Allen-Zhu
  • Elad Hazan
  • Wei Hu
  • Yuanzhi Li

We propose a rank-k variant of the classical Frank-Wolfe algorithm to solve convex optimization over a trace-norm ball. Our algorithm replaces the top singular-vector computation (1-SVD) in Frank-Wolfe with a top-k singular-vector computation (k-SVD), which can be done by repeatedly applying 1-SVD k times. Alternatively, our algorithm can be viewed as a rank-k restricted version of projected gradient descent. We show that our algorithm has a linear convergence rate when the objective function is smooth and strongly convex, and the optimal solution has rank at most k. This improves the convergence rate and the total time complexity of the Frank-Wolfe method and its variants.
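
For orientation, here is the classical 1-SVD Frank-Wolfe baseline over the trace-norm ball that the paper improves on (a sketch; the paper's rank-k variant replaces the 1-SVD with a k-SVD and a refined update to obtain linear convergence, and the trace-norm budget below is chosen with oracle knowledge for the demo):

```python
import numpy as np
from scipy.sparse.linalg import svds

def frank_wolfe_trace_ball(grad_fn, shape, tau, iters=200):
    """Classical Frank-Wolfe over the trace-norm ball {X : ||X||_* <= tau}.

    The linear minimization oracle is a 1-SVD of the gradient: the minimizer
    of <G, S> over the ball is S = -tau * u1 @ v1.T for the top singular pair.
    """
    X = np.zeros(shape)
    for t in range(iters):
        G = grad_fn(X)
        u, s, vt = svds(G, k=1)            # top singular pair of the gradient
        S = -tau * (u @ vt)
        eta = 2.0 / (t + 2.0)              # standard FW step size
        X = (1 - eta) * X + eta * S
    return X

# Matrix completion example: f(X) = 0.5 * ||mask * (X - M)||_F^2.
rng = np.random.default_rng(0)
M = rng.standard_normal((40, 2)) @ rng.standard_normal((2, 40))
mask = rng.random(M.shape) < 0.5
X = frank_wolfe_trace_ball(lambda X: np.where(mask, X - M, 0.0),
                           M.shape, tau=np.linalg.norm(M, "nuc"))
print(np.linalg.norm(np.where(mask, X - M, 0)) / np.linalg.norm(np.where(mask, M, 0)))
```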

NeurIPS Conference 2016 Conference Paper

Combinatorial Multi-Armed Bandit with General Reward Functions

  • Wei Chen
  • Wei Hu
  • Fu Li
  • Jian Li
  • Yu Liu
  • Pinyan Lu

In this paper, we study the stochastic combinatorial multi-armed bandit (CMAB) framework that allows a general nonlinear reward function, whose expected value may not depend only on the means of the input random variables but possibly on the entire distributions of these variables. Our framework enables a much larger class of reward functions such as the $\max()$ function and nonlinear utility functions. Existing techniques relying on accurate estimations of the means of random variables, such as the upper confidence bound (UCB) technique, do not work directly on these functions. We propose a new algorithm called stochastically dominant confidence bound (SDCB), which estimates the distributions of underlying random variables and their stochastically dominant confidence bounds. We prove that SDCB can achieve $O(\log T)$ distribution-dependent regret and $\tilde{O}(\sqrt{T})$ distribution-independent regret, where $T$ is the time horizon. We apply our results to the $K$-MAX problem and expected utility maximization problems. In particular, for $K$-MAX, we provide the first polynomial-time approximation scheme (PTAS) for its offline problem, and give the first $\tilde{O}(\sqrt{T})$ bound on the $(1-\epsilon)$-approximation regret of its online problem, for any $\epsilon>0$.
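
The heart of SDCB, an optimistic distribution that stochastically dominates the empirical one, can be sketched as follows (the support discretization and the confidence radius are illustrative assumptions):

```python
import numpy as np

def sdcb_cdf(samples, t, support):
    """Optimistic CDF: shift the empirical CDF down by a confidence radius,
    yielding a distribution that first-order stochastically dominates it."""
    radius = np.sqrt(3 * np.log(t) / (2 * len(samples)))   # usual sqrt(log t / n) shape
    emp = np.array([(samples <= x).mean() for x in support])
    opt = np.clip(emp - radius, 0.0, 1.0)
    opt[-1] = 1.0                       # stay a valid CDF on the truncated support
    return opt

support = np.linspace(0.0, 1.0, 101)
rng = np.random.default_rng(0)
f1 = sdcb_cdf(rng.random(50), t=1000, support=support)
f2 = sdcb_cdf(rng.random(80), t=1000, support=support)
cdf_max = f1 * f2                       # CDF of the max of two independent arms
pmf = np.diff(np.concatenate(([0.0], cdf_max)))
print((support * pmf).sum())            # optimistic estimate of E[max of the two arms]
```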

AAAI Conference 2015 Conference Paper

An EBMC-Based Approach to Selecting Types for Entity Filtering

  • Jiwei Ding
  • Wentao Ding
  • Wei Hu
  • Yuzhong Qu

The quantity of entities in the Linked Data is increasing rapidly. For entity search and browsing systems, filtering is very useful for users to find entities that they are interested in. Type is a kind of widely-used facet and can be easily obtained from knowledge bases, which makes it possible to create filters by selecting at most K types of an entity collection. However, existing approaches often fail to select high-quality type filters due to complex overlap between types. In this paper, we propose a novel type selection approach based upon Budgeted Maximum Coverage (BMC), which can achieve integral optimization for the coverage quality of type filters. Furthermore, we define a new optimization problem called Extended Budgeted Maximum Coverage (EBMC) and propose an EBMC-based approach, which enhances the BMC-based approach by incorporating the relevance between entities and types, so as to create sensible type filters. Our experimental results show that the EBMC-based approach performs best compared with several representative approaches.
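
The BMC side of the problem admits the textbook greedy heuristic, sketched below; the paper's EBMC formulation additionally weights the relevance between entities and types, which this sketch omits:

```python
def greedy_type_filters(type_to_entities, k):
    """Greedy budgeted max coverage: pick at most k types covering most entities.

    type_to_entities: dict mapping a type name to the set of entities having it.
    """
    chosen, covered = [], set()
    for _ in range(k):
        best = max(type_to_entities,
                   key=lambda t: len(type_to_entities[t] - covered),
                   default=None)
        if best is None or not (type_to_entities[best] - covered):
            break                                  # no type adds coverage
        chosen.append(best)
        covered |= type_to_entities[best]
    return chosen, covered

types = {"Person": {1, 2, 3}, "Athlete": {2, 3}, "City": {4, 5}, "Place": {4, 5, 6}}
print(greedy_type_filters(types, k=2))             # picks Person and Place, covering 6 entities
```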

SODA Conference 2014 Conference Paper

A Constant Factor Approximation Algorithm for Fault-Tolerant k-Median

  • MohammadTaghi Hajiaghayi
  • Wei Hu
  • Jian Li 0015
  • Shi Li 0001
  • Barna Saha

In this paper, we consider the fault-tolerant k-median problem and give the first constant factor approximation algorithm for it. In the fault-tolerant generalization of the classical k-median problem, each client j needs to be assigned to at least r_j ≥ 1 distinct open facilities. The service cost of j is the sum of its distances to the r_j facilities, and the k-median constraint restricts the number of open facilities to at most k. Previously, a constant factor was known only for the special case when all r_j are the same, and a logarithmic approximation ratio was known for the general case. In addition, we present the first polynomial time algorithm for the fault-tolerant k-median problem on a path or an HST by showing that the corresponding LP always has an integral optimal solution. We also consider the fault-tolerant facility location problem, where the service cost of j can be a weighted sum of its distances to the r_j facilities. We give a simple constant factor approximation algorithm, generalizing several previous results which only work for nonincreasing weight vectors.
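
For concreteness, the fault-tolerant objective itself is straightforward to state in code (a sketch; approximating its minimum is the hard part the paper addresses):

```python
import numpy as np

def ft_kmedian_cost(dist, open_facilities, r):
    """Fault-tolerant k-median service cost.

    dist: (clients x facilities) distance matrix; open_facilities: indices of the
    at-most-k open facilities; r[j]: how many distinct facilities client j needs.
    Each client pays the sum of distances to its r[j] nearest open facilities.
    """
    d_open = np.sort(dist[:, open_facilities], axis=1)
    return sum(d_open[j, :r[j]].sum() for j in range(dist.shape[0]))

rng = np.random.default_rng(0)
dist = rng.random((5, 4))                 # 5 clients, 4 candidate facilities
print(ft_kmedian_cost(dist, open_facilities=[0, 2, 3], r=[1, 2, 1, 3, 2]))
```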