Arrow Research

Author name cluster

Xin Yu

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

44 papers
2 author rows

Possible papers (44)

AAAI Conference 2026 Conference Paper

Decoupling Understanding from Reasoning via Problem Space Mapping for Small-Scale Model Reasoning

  • Li Wang
  • Changhao Zhang
  • Zengqi Xiu
  • Kai Lu
  • Xin Yu
  • Kui Zhang
  • Wenjun Wu

Despite recent advances in the reasoning capabilities of Large Language Models (LLMs), improving the reasoning ability of Small Language Models (SLMs, e.g., up to 1.5B parameters) remains challenging. A key obstacle lies in the complexity and variability of natural language: essentially equivalent problems often appear in diverse surface forms, frequently obscured by redundant or distracting details. This imposes a dual burden on SLMs: they must first extract the core problem from complex linguistic input, and then perform reasoning based on that understanding. The resulting vast and noisy problem space hinders optimization, particularly for models with limited capacity. To address this, we propose a new framework that decouples understanding from reasoning by mapping natural language problems into a canonical problem space, a semantically simplified yet expressive domain. This enables SLMs to focus on reasoning over standardized inputs, free from linguistic variability. Within this framework, we introduce DURIT (Decoupled Understanding from Reasoning via Iterative Training), a three-step algorithm that iteratively (1) maps natural language problems into the canonical space via reinforcement learning, (2) aligns reasoning trajectories through self-distillation, and (3) trains reasoning policies in the problem space. The mapper and reasoner are co-trained in an alternating loop throughout this process. Experiments show that DURIT substantially improves SLMs' performance on both in-domain and out-of-domain mathematical and logical reasoning tasks. Beyond improving reasoning capabilities, DURIT also improves the robustness of reasoning, validating decoupling understanding from reasoning as an effective strategy for strengthening SLMs.
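
For readers who want the shape of the algorithm, the sketch below is a minimal, hypothetical Python skeleton of the alternating loop the abstract describes; the mapper/reasoner objects and their update methods are assumed stand-ins, not the authors' code.

```python
# Hypothetical skeleton of a DURIT-style alternating loop (a sketch, not the
# authors' implementation). `mapper` and `reasoner` are assumed duck-typed
# objects exposing the methods used below.

def durit_loop(mapper, reasoner, problems, n_rounds=3):
    for _ in range(n_rounds):
        # Step 1: RL-train the mapper; reward a mapping if the current
        # reasoner can solve the mapped (canonical) problem.
        for p in problems:
            canonical = mapper.map(p.text)
            reward = float(reasoner.solve(canonical) == p.answer)
            mapper.rl_update(p.text, canonical, reward)
        # Step 2: self-distillation; trajectories generated on canonical
        # inputs supervise the reasoner's behaviour on raw inputs.
        for p in problems:
            teacher_traj = reasoner.generate_trajectory(mapper.map(p.text))
            reasoner.distill(p.text, teacher_traj)
        # Step 3: train the reasoning policy directly in the problem space.
        for p in problems:
            reasoner.policy_update(mapper.map(p.text), p.answer)
    return mapper, reasoner
```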

YNIMG Journal 2026 Journal Article

Resting-state fMRI coherence is selectively diminished around 0.1 Hz in patients with unilateral carotid artery stenosis

  • Sangcheon Choi
  • Gabriel Hoffmann
  • Sebastian Schneider
  • Stephan Kaczmarz
  • Xin Yu
  • Christine Preibisch
  • Christian Sorg

In the brain, vasomotor dynamics at infra-slow frequencies (∼0.1 Hz), driven by synchronized oscillations of smooth muscle cells in vessel walls, are thought to play a crucial role in regulating cerebral perfusion and underlie resting-state functional connectivity (FC), typically measured by correlated time courses of functional signals. In particular, rodent studies have demonstrated that vasomotor activity contributes to the coherence of blood oxygenation level dependent (BOLD) signal fluctuations. However, in humans, detecting this contribution non-invasively remains challenging due to the limited spatiotemporal sensitivity of functional magnetic resonance imaging (fMRI) to vasomotion. Given that prior studies have identified internal carotid artery stenosis (ICAS) as an informative conditional lesion model of vasomotor and hemodynamic impairments in humans, we investigated whether ICAS affects interhemispheric BOLD coherence at ∼0.1 Hz. Using a multi-modal fMRI framework integrating resting-state fMRI with quantitative mapping of cerebral blood volume, blood flow, oxygen metabolism, and BOLD time lag, we compared BOLD coherence between patients with asymptomatic unilateral ICAS and healthy controls. Frequency-specific analysis revealed significantly diminished interhemispheric BOLD coherence at ∼0.1 Hz across canonical resting-state networks in ICAS patients, while ultra-slow (<0.05 Hz) coherence remained largely preserved. This reduction was spatially widespread across brain networks and particularly pronounced in watershed areas, i.e., border zones between major vascular territories, associated with significantly increased lateralization of cerebral blood volume (p < 0.01). Notably, coherence-based FC patterns at ∼0.1 Hz were heterogeneous within watershed areas but homogeneous outside, suggesting an interplay between compensatory mechanisms and cerebrovascular impairment. Taken together, our findings demonstrate that ICAS induces subtle, frequency- and region-specific alterations in interhemispheric FC, consistent with a model in which impaired vasomotor activity and hemodynamic dysfunctions impact resting-state FC in the human brain.
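
As a loose illustration of the frequency-specific measure involved (not the study's actual pipeline), the sketch below computes magnitude-squared coherence between two synthetic BOLD-like signals sharing a ~0.1 Hz component; the sampling rate, duration, and window length are assumptions.

```python
# Toy coherence analysis around 0.1 Hz with scipy.signal.coherence.
import numpy as np
from scipy.signal import coherence

fs = 1.0                       # sampling rate in Hz (TR = 1 s, assumed)
t = np.arange(600) / fs        # 10 minutes of data
rng = np.random.default_rng(0)
vaso = np.sin(2 * np.pi * 0.1 * t)               # shared ~0.1 Hz vasomotor wave
left = vaso + 0.8 * rng.standard_normal(t.size)  # "left hemisphere" signal
right = 0.7 * vaso + 0.8 * rng.standard_normal(t.size)

f, Cxy = coherence(left, right, fs=fs, nperseg=128)
band = (f > 0.08) & (f < 0.12)
print(f"mean coherence near 0.1 Hz: {Cxy[band].mean():.2f}")
```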

TIST Journal 2026 Journal Article

ROIS: Role-Based Multi-Agent Collaboration by Context-Time-Aware Information Sharing

  • Hanwen Qi
  • Tinghuai Ma
  • Kexing Peng
  • Xin Yu

In complex cooperative tasks, Multi-Agent Reinforcement Learning (MARL) faces the dual challenges of an exponentially growing joint action space and the constraints of partial observability. While the Centralized Training with Decentralized Execution (CTDE) paradigm is widely adopted, it often leads to homogeneous policies that lack the specialization necessary for complex teamwork. Role-based methods encourage specialization, but they often lack mechanisms for inter-agent interaction; without rich information to guide role assignment, roles may be assigned ineffectively, hindering the convergence of the team policy to its optimum. To address this critical gap, we propose ROIS, a novel framework that enhances multi-agent collaboration by grounding dynamic role assignments in a context-time-aware information sharing mechanism. Our key insight is to leverage a dedicated information sharing module that captures multi-step temporal context, providing each agent with richer, tailored feedback from its teammates. This mechanism directly addresses the lack of inter-agent interaction, leading to more accurate and effective role assignments. The result is a more coherent task division, which guides specialized policies toward the optimal joint policy and drastically reduces ineffective exploration. We conduct extensive experiments on the demanding StarCraft II, SMACv2, and Multi-agent Particle Environment benchmarks. The results demonstrate that ROIS consistently achieves state-of-the-art performance, significantly outperforming a wide range of advanced baselines, particularly in scenarios requiring deep coordination and policy adaptation. Finally, comprehensive ablation studies confirm the essential contribution of each component to the framework's success.

EAAI Journal 2026 Journal Article

Spatial dependency learning for image-based anomaly detection in engine combustion

  • Luyun Miao
  • Dazhi Zhang
  • Zhen Cao
  • Zhichang Guo
  • Yao Li
  • Xun Yuan
  • Jangbo Peng
  • Chaobo Yang

Traditional scramjet anomaly detection methods are constrained by delayed pressure responses and handcrafted features that depend on expert experience. To address this issue, this paper proposes an intelligent situational awareness algorithm for engine anomaly detection based on chemiluminescence imaging of combustion processes. The model learns the spatial dependencies of local features in stable flame images, using a self-supervised learning framework to characterize the feature distribution of normal image patches and identify anomalies as deviations from this distribution. Experimental results demonstrate that the proposed method achieves 100.0% accuracy and 100.0% area under the receiver operating characteristic curve (AUROC) at the image level, and 90.9% accuracy with 94.8% AUROC at the pixel level. The algorithm is trained solely on normal images and is capable of simultaneously detecting both abnormal states and abnormal regions.

NeurIPS Conference 2025 Conference Paper

AltLoRA: Towards Better Gradient Approximation in Low-Rank Adaptation with Alternating Projections

  • Xin Yu
  • Yujia Wang
  • Jinghui Chen
  • Lingzhou Xue

Low-Rank Adaptation (LoRA) has emerged as an effective technique for reducing memory overhead in fine-tuning large language models. However, it often suffers from sub-optimal performance compared with full fine-tuning since the update is constrained to the low-rank space. Recent variants such as LoRA-Pro attempt to mitigate this by adjusting the gradients of the low-rank matrices to approximate the full gradient. However, LoRA-Pro's solution is not unique, and different solutions can lead to significantly varying performance in ablation studies. Besides, to incorporate momentum or adaptive optimization designs, approaches like LoRA-Pro must first compute the equivalent gradient, incurring a memory cost close to that of full fine-tuning. A key challenge remains in integrating momentum properly into the low-rank space with lower memory cost. In this work, we propose AltLoRA, an alternating projection method that avoids the difficulties in gradient approximation brought by the joint update design while integrating momentum without higher memory complexity. Our theoretical analysis provides convergence guarantees and further shows that AltLoRA enables stable feature learning and robustness to transformation invariance. Extensive experiments across multiple tasks demonstrate that AltLoRA outperforms LoRA and its variants, narrowing the gap toward full fine-tuning while preserving superior memory efficiency.
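
A minimal sketch of what an alternating low-rank update can look like, assuming the common LoRA parameterization W = W0 + BA and a toy quadratic loss: the full gradient is projected through the frozen factor to update the other, and momentum buffers stay at low-rank shapes. This illustrates the idea only; it is not the paper's exact algorithm.

```python
# Alternating low-rank updates in the spirit of AltLoRA (illustrative sketch).
import numpy as np

d_out, d_in, r, lr, beta = 64, 32, 4, 1e-2, 0.9
rng = np.random.default_rng(0)
A = 0.01 * rng.standard_normal((r, d_in))
B = np.zeros((d_out, r))
mA, mB = np.zeros_like(A), np.zeros_like(B)   # momentum kept at low-rank shapes
W0 = rng.standard_normal((d_out, d_in))

def full_grad(W):
    # Stand-in for dLoss/dW: toy quadratic loss 0.5 * ||W - 1||^2.
    return W - 1.0

for step in range(200):
    G = full_grad(W0 + B @ A)                 # (d_out, d_in) full gradient
    if step % 2 == 0:                         # A-step: B frozen
        gA = B.T @ G                          # (r, d_in) projected gradient
        mA = beta * mA + gA
        A -= lr * mA
    else:                                     # B-step: A frozen
        gB = G @ A.T                          # (d_out, r) projected gradient
        mB = beta * mB + gB
        B -= lr * mB
```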

NeurIPS Conference 2025 Conference Paper

Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning

  • Simin Li
  • Zihao Mao
  • Hanxiao Li
  • Zonglei Jing
  • Zhuohang bian
  • Jun Guo
  • Li Wang
  • Zhuoran Han

In cooperative Multi-Agent Reinforcement Learning (MARL), it is a common practice to tune hyperparameters in ideal simulated environments to maximize cooperative performance. However, policies tuned for cooperation often fail to maintain robustness and resilience under real-world uncertainties. Building trustworthy MARL systems requires a deep understanding of robustness, which ensures stability under uncertainties, and resilience, the ability to recover from disruptions, a concept extensively studied in control systems but largely overlooked in MARL. In this paper, we present a large-scale empirical study comprising over 82,620 experiments to evaluate cooperation, robustness, and resilience in MARL across 4 real-world environments, 13 uncertainty types, and 15 hyperparameters. Our key findings are: (1) Under mild uncertainty, optimizing cooperation improves robustness and resilience, but this link weakens as perturbations intensify. Robustness and resilience also vary by algorithm and uncertainty type. (2) Robustness and resilience do not generalize across uncertainty modalities or agent scopes: policies robust to action noise for all agents may fail under observation noise on a single agent. (3) Hyperparameter tuning is critical for trustworthy MARL: surprisingly, standard practices like parameter sharing, GAE, and PopArt can hurt robustness, while early stopping, high critic learning rates, and Leaky ReLU consistently help. By optimizing hyperparameters only, we observe substantial improvement in cooperation, robustness and resilience across all MARL backbones, with the phenomenon also generalizing to robust MARL methods across these backbones.

IJCAI Conference 2025 Conference Paper

Multimodal Retina Image Analysis Survey: Datasets, Tasks and Methods

  • Hongwei Sheng
  • Heming Du
  • Xin Shen
  • Sen Wang
  • Xin Yu

Retina images provide a noninvasive view of the central nervous system and microvasculature, making them essential for clinical applications. Changes in the retina often indicate both ophthalmic and systemic diseases, aiding in diagnosis and early intervention. While deep learning algorithms have advanced retina image analysis, a comprehensive review of related datasets, tasks, and benchmarking is still lacking. In this survey, we systematically categorize existing retina image datasets based on their available data modalities, and review the tasks these datasets support in multimodal retina image analysis. We also explain key evaluation metrics used in various retina image analysis benchmarks. By thoroughly examining current datasets and methods, we highlight the challenges and limitations in existing benchmarks and discuss potential research topics in the field. We hope this work will guide future retina analysis methods and promote the shared use of existing data across different tasks.

ICML Conference 2025 Conference Paper

Understanding the Statistical Accuracy-Communication Trade-off in Personalized Federated Learning with Minimax Guarantees

  • Xin Yu
  • Zelin He
  • Ying Sun
  • Lingzhou Xue
  • Runze Li

Personalized federated learning (PFL) offers a flexible framework for aggregating information across distributed clients with heterogeneous data. This work considers a personalized federated learning setting that simultaneously learns global and local models. While purely local training has no communication cost, collaborative learning among the clients can leverage shared knowledge to improve statistical accuracy, presenting an accuracy-communication trade-off in personalized federated learning. However, the theoretical analysis of how personalization quantitatively influences sample and algorithmic efficiency and their inherent trade-off is largely unexplored. This paper makes a contribution towards filling this gap by providing a quantitative characterization of how the personalization degree affects the trade-off. The results further offer theoretical insights for choosing the personalization degree. As a side contribution, we establish the minimax optimality in terms of statistical accuracy for a widely studied PFL formulation. The theoretical result is validated on both synthetic and real-world datasets and its generalizability is verified in a non-convex setting.
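
For context, one widely studied PFL formulation (an assumed form; the abstract does not state which formulation is analyzed) couples local models to a global model through a proximal penalty whose weight sets the personalization degree:

```latex
% Assumed proximal-penalty PFL objective: client models \theta_i are tied to
% a shared global model w; \lambda controls the personalization degree.
\min_{w,\,\theta_1,\dots,\theta_N}\;
  \frac{1}{N}\sum_{i=1}^{N}\Big( f_i(\theta_i)
    + \frac{\lambda}{2}\,\lVert \theta_i - w \rVert_2^2 \Big)
% \lambda \to 0 recovers purely local training (no communication benefit);
% \lambda \to \infty forces a single global model (maximal sharing).
```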

NeurIPS Conference 2025 Conference Paper

UniTok: a Unified Tokenizer for Visual Generation and Understanding

  • Chuofan Ma
  • Yi Jiang
  • Junfeng Wu
  • Jihan Yang
  • Xin Yu
  • Zehuan Yuan
  • BINGYUE PENG
  • Xiaojuan Qi

Visual generative and understanding models typically rely on distinct tokenizers to process images, presenting a key challenge for unifying them within a single framework. Recent studies attempt to address this by connecting the training of VQVAE (for autoregressive generation) and CLIP (for understanding) to build a unified tokenizer. However, directly combining these training objectives has been observed to cause severe loss conflicts. In this paper, we show that reconstruction and semantic supervision do not inherently conflict. Instead, the underlying bottleneck stems from the limited representational capacity of the discrete token space. Building on these insights, we introduce UniTok, a unified tokenizer featuring a novel multi-codebook quantization mechanism that effectively scales up the vocabulary size and bottleneck dimension. In terms of final performance, UniTok sets a new record of 0.38 rFID and 78.6% zero-shot accuracy on ImageNet. Besides, UniTok can be seamlessly integrated into MLLMs to unlock native visual generation capability, without compromising the understanding performance. Additionally, we show that UniTok favors cfg-free generation, reducing gFID from 14.6 to 2.5 on the ImageNet 256×256 benchmark. All codes and models have been made publicly available.
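
A minimal sketch of multi-codebook quantization as described above: the latent is split into chunks, each quantized against its own codebook, so the effective vocabulary is the product of the codebook sizes. Shapes and sizes here are illustrative assumptions.

```python
# Multi-codebook (product) quantization sketch in NumPy.
import numpy as np

def multi_codebook_quantize(z, codebooks):
    """z: (d,) latent vector; codebooks: list of (K, d_chunk) arrays."""
    chunks = np.split(z, len(codebooks))      # requires d divisible by n_books
    codes, quantized = [], []
    for chunk, cb in zip(chunks, codebooks):
        idx = np.argmin(((cb - chunk) ** 2).sum(axis=1))  # nearest code
        codes.append(idx)
        quantized.append(cb[idx])
    return np.array(codes), np.concatenate(quantized)

rng = np.random.default_rng(0)
d, n_books, K = 16, 4, 256
books = [rng.standard_normal((K, d // n_books)) for _ in range(n_books)]
codes, z_q = multi_codebook_quantize(rng.standard_normal(d), books)
print(codes, z_q.shape)   # 4 code indices; effective vocabulary 256**4
```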

NeurIPS Conference 2025 Conference Paper

When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions

  • Zhuo Cao
  • Heming Du
  • Bingqing Zhang
  • Xin Yu
  • Xue Li
  • Sen Wang

Existing moment retrieval (MR) methods focus on Single-Moment Retrieval (SMR). However, one query can correspond to multiple relevant moments in real-world applications. This makes the existing datasets and methods insufficient for video temporal grounding. By revisiting the gap between current MR tasks and real-world applications, we introduce a high-quality dataset called the QVHighlights Multi-Moment Dataset (QV-M²), along with new evaluation metrics tailored for multi-moment retrieval (MMR). QV-M² consists of 2,212 annotations covering 6,384 video segments. Building on existing efforts in MMR, we propose a framework called FlashMMR. Specifically, we propose a Multi-moment Post-verification module to refine the moment boundaries. We introduce constrained temporal adjustment and subsequently leverage a verification module to re-evaluate the candidate segments. Through this sophisticated filtering pipeline, low-confidence proposals are pruned, and robust multi-moment alignment is achieved. We retrain and evaluate 6 existing MR methods on QV-M² and QVHighlights under both SMR and MMR settings. Results show that QV-M² serves as an effective benchmark for training and evaluating MMR models, while FlashMMR provides a strong baseline. Specifically, on QV-M², it improves over the prior SOTA method by 3.00% on G-mAP, 2.70% on mAP@3+tgt, and 2.56% on mR@3. The proposed benchmark and method establish a foundation for advancing research in more realistic and challenging video temporal grounding scenarios. Code is released at https://github.com/Zhuo-Cao/QV-M2.

IJCAI Conference 2025 Conference Paper

Zero-Shot Machine Unlearning with Proxy Adversarial Data Generation

  • Huiqiang Chen
  • Tianqing Zhu
  • Xin Yu
  • Wanlei Zhou

Machine unlearning aims to remove the influence of specific samples from a trained model. A key challenge in this process is over-unlearning, where the model's performance on the remaining data significantly drops due to the change in the model's parameters. Existing unlearning algorithms depend on the remaining data to prevent this issue. As such, these methods are inapplicable in a more practical scenario, where only the unlearning samples are available (i.e., zero-shot unlearning). This paper presents a novel framework, ZS-PAG, to fill this gap. Our approach offers three key innovations: (1) we approximate the inaccessible remaining data by generating adversarial samples; (2) leveraging the generated samples, we pinpoint a specific subspace to perform the unlearning process, therefore preventing over-unlearning in the challenging zero-shot scenario; and (3) we consider the influence of the unlearning process on the remaining samples and design an influence-based pseudo-labeling strategy. As a result, our method further improves the model's performance after unlearning. The proposed method holds a theoretical guarantee, and experiments on various benchmarks validate the effectiveness and superiority of our proposed method over several baselines.

NeurIPS Conference 2024 Conference Paper

DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection

  • Jia S. Lim
  • Zhuoxiao Chen
  • Mahsa Baktashmotlagh
  • Zhi Chen
  • Xin Yu
  • Zi Huang
  • Yadan Luo

Class-agnostic object detection (OD) can be a cornerstone or a bottleneck for many downstream vision tasks. Despite considerable advancements in bottom-up and multi-object discovery methods that leverage basic visual cues to identify salient objects, consistently achieving a high recall rate remains difficult due to the diversity of object types and their contextual complexity. In this work, we investigate using vision-language models (VLMs) to enhance object detection via a self-supervised prompt learning strategy. Our initial findings indicate that manually crafted text queries often result in undetected objects, primarily because detection confidence diminishes when the query words exhibit semantic overlap. To address this, we propose a Dispersing Prompt Expansion (DiPEx) approach. DiPEx progressively learns to expand a set of distinct, non-overlapping hyperspherical prompts to enhance recall rates, thereby improving performance in downstream tasks such as out-of-distribution OD. Specifically, DiPEx initiates the process by self-training generic parent prompts and selecting the one with the highest semantic uncertainty for further expansion. The resulting child prompts are expected to inherit semantics from their parent prompts while capturing more fine-grained semantics. We apply dispersion losses to ensure high inter-class discrepancy among child prompts while preserving semantic consistency between parent-child prompt pairs. To prevent excessive growth of the prompt sets, we utilize the maximum angular coverage (MAC) of the semantic space as a criterion for early termination. We demonstrate the effectiveness of DiPEx through extensive class-agnostic OD and OOD-OD experiments on MS-COCO and LVIS, surpassing other prompting methods by up to 20.1% in AR and achieving a 21.3% AP improvement over SAM.
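
A hedged sketch of a dispersion-style loss: prompt embeddings are pushed apart on the unit hypersphere by penalizing pairwise cosine similarity. This illustrates the non-overlapping hyperspherical prompt idea; it is not the paper's exact loss.

```python
# Dispersion loss sketch in PyTorch: spread prompt embeddings on the sphere.
import torch

def dispersion_loss(prompts, temperature=0.1):
    """prompts: (n, d) learnable prompt embeddings."""
    p = torch.nn.functional.normalize(prompts, dim=-1)
    sim = p @ p.t()                                        # pairwise cosines
    off_diag = sim[~torch.eye(len(p), dtype=torch.bool)]   # drop self-similarity
    return torch.logsumexp(off_diag / temperature, dim=0)  # penalize overlap

prompts = torch.randn(8, 512, requires_grad=True)
loss = dispersion_loss(prompts)
loss.backward()   # gradients push the prompts apart
```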

AAAI Conference 2024 Conference Paper

Leveraging Partial Symmetry for Multi-Agent Reinforcement Learning

  • Xin Yu
  • Rongye Shi
  • Pu Feng
  • Yongkai Tian
  • Simin Li
  • Shuhao Liao
  • Wenjun Wu

Incorporating symmetry as an inductive bias into multi-agent reinforcement learning (MARL) has led to improvements in generalization, data efficiency, and physical consistency. While prior research has succeeded in using a perfect symmetry prior, the realm of partial symmetry in the multi-agent domain remains unexplored. To fill this gap, we introduce the partially symmetric Markov game, a new subclass of the Markov game. We then theoretically show that the performance error introduced by utilizing symmetry in MARL is bounded, implying that the symmetry prior can still be useful in MARL even in partial symmetry situations. Motivated by this insight, we propose the Partial Symmetry Exploitation (PSE) framework that is able to adaptively incorporate the symmetry prior in MARL under different symmetry-breaking conditions. Specifically, by adaptively adjusting the exploitation of symmetry, our framework is able to achieve superior sample efficiency and overall performance of MARL algorithms. Extensive experiments are conducted to demonstrate the superior performance of the proposed framework over baselines. Finally, we implement the proposed framework on a real-world multi-robot testbed to show its superiority.

IJCAI Conference 2024 Conference Paper

Machine Unlearning via Null Space Calibration

  • Huiqiang Chen
  • Tianqing Zhu
  • Xin Yu
  • Wanlei Zhou

Machine unlearning aims to enable models to forget specific data instances when receiving deletion requests. Current research centers on efficient unlearning to erase the influence of data from the model and neglects the subsequent impacts on the remaining data. Consequently, existing unlearning algorithms degrade the model's performance after unlearning, known as over-unlearning. This paper addresses this critical yet under-explored issue by introducing machine Unlearning via Null Space Calibration (UNSC), which can accurately unlearn target samples without over-unlearning. Better still, by calibrating the decision space during unlearning, UNSC can significantly improve the model's performance on the remaining samples. In particular, our approach hinges on confining the unlearning process to a specified null space tailored to the remaining samples, which is augmented by strategically pseudo-labeling the unlearning samples. Comparison against several established baselines affirms the superiority of our approach.
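
A NumPy sketch of the null-space idea, under stated assumptions: compute the null space of the remaining samples' feature matrix via SVD and project the unlearning update into it, so remaining-data activations are left (approximately) untouched. The projector and threshold are illustrative, not the paper's exact construction.

```python
# Null-space projection sketch: confine an update to directions with X v ~ 0.
import numpy as np

def null_space_projector(X, tol=1e-8):
    """X: (n_remaining, d) features of remaining samples.
    Returns P (d, d), the projector onto the null space of X."""
    _, s, Vt = np.linalg.svd(X, full_matrices=True)
    rank = int((s > tol * s.max()).sum())
    V_null = Vt[rank:].T              # directions annihilated by X
    return V_null @ V_null.T

rng = np.random.default_rng(0)
# Rank-deficient remaining-data features (rank 20 in a 50-dim space).
X = rng.standard_normal((100, 20)) @ rng.standard_normal((20, 50))
P = null_space_projector(X)
raw_update = rng.standard_normal(50)      # step proposed by the unlearning loss
safe_update = P @ raw_update              # X @ safe_update ~ 0
print(np.abs(X @ safe_update).max())      # near machine precision
```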

NeurIPS Conference 2024 Conference Paper

MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset

  • Xin Shen
  • Heming Du
  • Hongwei Sheng
  • Shuyun Wang
  • Hui Chen
  • Huiqiang Chen
  • Zhuojie Wu
  • Xiaobiao Du

Isolated Sign Language Recognition (ISLR) focuses on identifying individual sign language glosses. Considering the diversity of sign languages across geographical regions, developing region-specific ISLR datasets is crucial for supporting communication and research. Auslan, as a sign language specific to Australia, still lacks a dedicated large-scale word-level dataset for the ISLR task. To fill this gap, we curate the first large-scale Multi-view Multi-modal Word-Level Australian Sign Language recognition dataset, dubbed MM-WLAuslan. Compared to other publicly available datasets, MM-WLAuslan exhibits three significant advantages: (1) the largest amount of data, (2) the most extensive vocabulary, and (3) the most diverse range of multi-modal camera views. Specifically, we record 282K+ sign videos covering 3,215 commonly used Auslan glosses presented by 73 signers in a studio environment. Moreover, our filming system includes two different types of cameras, i.e., three Kinect-V2 cameras and a RealSense camera. We position cameras hemispherically around the front half of the signer and simultaneously record videos using all four cameras. Furthermore, we benchmark results with state-of-the-art methods for various multi-modal ISLR settings on MM-WLAuslan, including multi-view, cross-camera, and cross-view. Experiment results indicate that MM-WLAuslan is a challenging ISLR dataset, and we hope this dataset will contribute to the development of Auslan and the advancement of sign languages worldwide. All datasets and benchmarks are available at MM-WLAuslan.

NeurIPS Conference 2024 Conference Paper

TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot Learning

  • Hui Chen
  • Yanbin Liu
  • Yongqiang Ma
  • Nanning Zheng
  • Xin Yu

Pre-trained vision-language models (VLMs) such as CLIP have shown excellent performance for zero-shot classification. Based on CLIP, recent methods design various learnable prompts to evaluate the zero-shot generalization capability on a base-to-novel setting. This setting assumes test samples are already divided into either base or novel classes, limiting its application to realistic scenarios. In this paper, we focus on a more challenging and practical setting: generalized zero-shot learning (GZSL), i.e., testing with no information about the base/novel division. To address this challenging zero-shot problem, we introduce two unique designs that enable us to classify an image without the need of knowing whether it comes from seen or unseen classes. Firstly, most existing methods only adopt a single latent space to align visual and linguistic features, which has a limited ability to represent complex visual-linguistic patterns, especially for fine-grained tasks. Instead, we propose a dual-space feature alignment module that effectively augments the latent space with a novel attribute space induced by a well-devised attribute reservoir. In particular, the attribute reservoir consists of a static vocabulary and learnable tokens complementing each other for flexible control over feature granularity. Secondly, finetuning CLIP models (e.g., prompt learning) on seen base classes usually sacrifices the model's original generalization capability on unseen novel classes. To mitigate this issue, we present a new topology-preserving objective that can enforce feature topology structures of the combined base and novel classes to resemble the topology of CLIP. In this manner, our model will inherit the generalization ability of CLIP through maintaining the pairwise class angles in the attribute space. Extensive experiments on twelve object recognition datasets demonstrate that our model, termed Topology-Preserving Reservoir (TPR), outperforms strong baselines including both prompt learning and conventional generative-based zero-shot methods.
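
A hedged sketch of a topology-preserving objective: penalize drift in the pairwise class-angle (cosine) structure of tuned class embeddings relative to frozen CLIP embeddings. Illustrative only, not the paper's exact formulation.

```python
# Topology-preservation sketch in PyTorch: match pairwise cosine structures.
import torch

def topology_loss(tuned, frozen):
    """tuned, frozen: (n_classes, d) class embeddings."""
    t = torch.nn.functional.normalize(tuned, dim=-1)
    f = torch.nn.functional.normalize(frozen, dim=-1)
    return ((t @ t.t() - f @ f.t()) ** 2).mean()   # preserve pairwise angles

frozen = torch.randn(10, 512)             # e.g. frozen CLIP text embeddings
tuned = frozen.clone().requires_grad_(True)
loss = topology_loss(tuned, frozen)       # zero at init, grows if angles drift
loss.backward()
```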

NeurIPS Conference 2023 Conference Paper

Auslan-Daily: Australian Sign Language Translation for Daily Communication and News

  • Xin Shen
  • Shaozu Yuan
  • Hongwei Sheng
  • Heming Du
  • Xin Yu

Sign language translation (SLT) aims to convert a continuous sign language video clip into a spoken language. Considering different geographic regions generally have their own native sign languages, it is valuable to establish corresponding SLT datasets to support related communication and research. Auslan, as a sign language specific to Australia, still lacks a dedicated large-scale dataset for SLT. To fill this gap, we curate an Australian Sign Language translation dataset, dubbed Auslan-Daily, which is collected from the Auslan educational TV series and Auslan TV programs. The former involves daily communications among multiple signers in the wild, while the latter comprises sign language videos for up-to-date news, weather forecasts, and documentaries. In particular, Auslan-Daily has two main features: (1) the topics are diverse and signed by multiple signers, and (2) the scenes in our dataset are more complex, e.g., videos captured in various environments, gesture interference during multi-signer interactions, and various camera positions. With a collection of more than 45 hours of high-quality Auslan video materials, we invite Auslan experts to align different fine-grained visual and language pairs, including video ↔ fingerspelling, video ↔ gloss, and video ↔ sentence. As a result, Auslan-Daily contains multi-grained annotations that can be utilized to accomplish various fundamental sign language tasks, such as signer detection, sign spotting, fingerspelling detection, isolated sign language recognition, sign language translation and alignment. Moreover, we benchmark results with state-of-the-art models for each task in Auslan-Daily. Experiments indicate that Auslan-Daily is a highly challenging SLT dataset, and we hope this dataset will contribute to the development of Auslan and the advancement of sign languages worldwide in a broader context. All datasets and benchmarks are available at Auslan-Daily.

AAAI Conference 2023 Conference Paper

FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping

  • Hao Zeng
  • Wei Zhang
  • Changjie Fan
  • Tangjie Lv
  • Suzhen Wang
  • Zhimeng Zhang
  • Bowen Ma
  • Lincheng Li

In this work, we propose a semantic flow-guided two-stage framework for shape-aware face swapping, namely FlowFace. Unlike most previous methods that focus on transferring the source inner facial features but neglect facial contours, our FlowFace can transfer both of them to a target face, thus leading to more realistic face swapping. Concretely, our FlowFace consists of a face reshaping network and a face swapping network. The face reshaping network addresses the shape outline differences between the source and target faces. It first estimates a semantic flow (i.e., face shape differences) between the source and the target face, and then explicitly warps the target face shape with the estimated semantic flow. After reshaping, the face swapping network generates inner facial features that exhibit the identity of the source face. We employ a pre-trained face masked autoencoder (MAE) to extract facial features from both the source face and the target face. In contrast to previous methods that use identity embedding to preserve identity information, the features extracted by our encoder can better capture facial appearances and identity information. Then, we develop a cross-attention fusion module to adaptively fuse inner facial features from the source face with the target facial attributes, thus leading to better identity preservation. Extensive quantitative and qualitative experiments on in-the-wild faces demonstrate that our FlowFace outperforms the state-of-the-art significantly.

NeurIPS Conference 2023 Conference Paper

RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation

  • MD WAHIDUZZAMAN KHAN
  • Hongwei Sheng
  • Hu Zhang
  • Heming Du
  • Sen Wang
  • Minas Coroneo
  • Farshid Hajati
  • Sahar Shariflou

Retinal vessel segmentation is generally grounded in image-based datasets collected with bench-top devices. The static images naturally lose the dynamic characteristics of retina fluctuation, resulting in diminished dataset richness, and the usage of bench-top devices further restricts dataset scalability due to its limited accessibility. Considering these limitations, we introduce the first video-based retinal dataset by employing handheld devices for data acquisition. The dataset comprises 635 smartphone-based fundus videos collected from four different clinics, involving 415 patients from 50 to 75 years old. It delivers comprehensive and precise annotations of retinal structures in both spatial and temporal dimensions, aiming to advance the landscape of vasculature segmentation. Specifically, the dataset provides three levels of spatial annotations: binary vessel masks for overall retinal structure delineation, general vein-artery masks for distinguishing the vein and artery, and fine-grained vein-artery masks for further characterizing the granularities of each artery and vein. In addition, the dataset offers temporal annotations that capture the vessel pulsation characteristics, assisting in detecting ocular diseases that require fine-grained recognition of hemodynamic fluctuation. In application, our dataset exhibits a significant domain shift with respect to data captured by bench-top devices, thus posing great challenges to existing methods. Thanks to rich annotations and data scales, our dataset potentially paves the way for more advanced retinal analysis and accurate disease diagnosis. In the experiments, we provide evaluation metrics and benchmark results on our dataset, reflecting both the potential and challenges it offers for vessel segmentation tasks. We hope this challenging dataset will significantly contribute to the development of eye disease diagnosis and early prevention.

JBHI Journal 2023 Journal Article

Semantic-Aware Contrastive Learning for Multi-Object Medical Image Segmentation

  • Ho Hin Lee
  • Yucheng Tang
  • Qi Yang
  • Xin Yu
  • Leon Y. Cai
  • Lucas W. Remedios
  • Shunxing Bao
  • Bennett A. Landman

Medical image segmentation, or computing voxel-wise semantic masks, is a fundamental yet challenging task in the medical imaging domain. To increase the ability of encoder-decoder neural networks to perform this task across large clinical cohorts, contrastive learning provides an opportunity to stabilize model initialization and enhance downstream task performance without ground-truth voxel-wise labels. However, multiple target objects with different semantic meanings and contrast levels may exist in a single image, which poses a problem for adapting traditional contrastive learning methods from prevalent “image-level classification” to “pixel-level segmentation”. In this article, we propose a simple semantic-aware contrastive learning approach leveraging attention masks and image-wise labels to advance multi-object semantic segmentation. Briefly, we embed different semantic objects to different clusters rather than the traditional image-level embeddings. We evaluate our proposed method on a multi-organ medical image segmentation task with both in-house data and the MICCAI Challenge 2015 BTCV dataset. Compared with current state-of-the-art training strategies, our proposed pipeline yields substantial Dice score improvements of 5.53% and 6.09% on the two medical image segmentation cohorts, respectively (p-value < 0.01). The performance of the proposed method is further assessed on an external medical image cohort via the MICCAI Challenge FLARE 2021 dataset, achieving a substantial improvement from Dice 0.922 to 0.933 (p-value < 0.01).

NeurIPS Conference 2023 Conference Paper

Streaming Factor Trajectory Learning for Temporal Tensor Decomposition

  • Shikai Fang
  • Xin Yu
  • Shibo Li
  • Zheng Wang
  • Mike Kirby
  • Shandian Zhe

Practical tensor data often comes with time information. Most existing temporal decomposition approaches estimate a set of fixed factors for the objects in each tensor mode, and hence cannot capture the temporal evolution of the objects' representation. More importantly, we lack an effective approach to capture such evolution from streaming data, which is common in real-world applications. To address these issues, we propose Streaming Factor Trajectory Learning (SFTL) for temporal tensor decomposition. We use Gaussian processes (GPs) to model the trajectory of factors so as to flexibly estimate their temporal evolution. To address the computational challenges in handling streaming data, we convert the GPs into a state-space prior by constructing an equivalent stochastic differential equation (SDE). We develop an efficient online filtering algorithm to estimate a decoupled running posterior of the involved factor states upon receiving new data. The decoupled estimation enables us to conduct standard Rauch-Tung-Striebel smoothing to compute the full posterior of all the trajectories in parallel, without the need to revisit any previous data. We have shown the advantage of SFTL in both synthetic tasks and real-world applications.
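
To make the GP-to-SDE conversion concrete, here is an assumed one-dimensional special case: an exponential-kernel (Matern-1/2) GP over a factor trajectory is equivalent to an Ornstein-Uhlenbeck SDE, so each streaming observation can be absorbed with an O(1) Kalman update. The length scale and noise levels are toy values, not the paper's settings.

```python
# Streaming GP regression via its state-space (OU) form: O(1) per observation.
import numpy as np

ell, sig2, noise = 5.0, 1.0, 0.1   # GP length scale, variance, obs noise
m, P, t_prev = 0.0, sig2, 0.0      # running posterior mean/var of the factor

def kalman_step(m, P, t_prev, t, y):
    a = np.exp(-(t - t_prev) / ell)          # OU transition over gap t - t_prev
    m_pred = a * m
    P_pred = a * a * P + sig2 * (1 - a * a)  # process noise keeps var at sig2
    k = P_pred / (P_pred + noise)            # Kalman gain
    return m_pred + k * (y - m_pred), (1 - k) * P_pred, t

rng = np.random.default_rng(0)
for t in np.cumsum(rng.exponential(1.0, 50)):   # irregular arrival times
    y = np.sin(0.3 * t) + np.sqrt(noise) * rng.standard_normal()
    m, P, t_prev = kalman_step(m, P, t_prev, t, y)
print(f"final posterior: mean={m:.3f}, var={P:.3f}")
```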

AAAI Conference 2023 Conference Paper

StyleTalk: One-Shot Talking Head Generation with Controllable Speaking Styles

  • Yifeng Ma
  • Suzhen Wang
  • Zhipeng Hu
  • Changjie Fan
  • Tangjie Lv
  • Yu Ding
  • Zhidong Deng
  • Xin Yu

Different people speak with diverse personalized speaking styles. Although existing one-shot talking head methods have made significant progress in lip sync, natural facial expressions, and stable head motions, they still cannot generate diverse speaking styles in the final talking head videos. To tackle this problem, we propose a one-shot style-controllable talking face generation framework. In a nutshell, we aim to attain a speaking style from an arbitrary reference speaking video and then drive the one-shot portrait to speak with the reference speaking style and another piece of audio. Specifically, we first develop a style encoder to extract dynamic facial motion patterns of a style reference video and then encode them into a style code. Afterward, we introduce a style-controllable decoder to synthesize stylized facial animations from the speech content and style code. In order to integrate the reference speaking style into generated videos, we design a style-aware adaptive transformer, which enables the encoded style code to adjust the weights of the feed-forward layers accordingly. Thanks to the style-aware adaptation mechanism, the reference speaking style can be better embedded into synthesized videos during decoding. Extensive experiments demonstrate that our method is capable of generating talking head videos with diverse speaking styles from only one portrait image and an audio clip while achieving authentic visual effects. Project Page: https://github.com/FuxiVirtualHuman/styletalk.

NeurIPS Conference 2022 Conference Paper

Batch Multi-Fidelity Active Learning with Budget Constraints

  • Shibo Li
  • Jeff M Phillips
  • Xin Yu
  • Robert Kirby
  • Shandian Zhe

Learning functions with high-dimensional outputs is critical in many applications, such as physical simulation and engineering design. However, collecting training examples for these applications is often costly, e.g., by running numerical solvers. The recent work (Li et al., 2022) proposes the first multi-fidelity active learning approach for high-dimensional outputs, which can acquire examples at different fidelities to reduce the cost while improving the learning performance. However, this method only queries at one pair of fidelity and input at a time, and hence risks bringing in strongly correlated examples that reduce the learning efficiency. In this paper, we propose Batch Multi-Fidelity Active Learning with Budget Constraints (BMFAL-BC), which can promote the diversity of training examples to improve the benefit-cost ratio, while respecting a given budget constraint for batch queries. Hence, our method can be more practically useful. Specifically, we propose a novel batch acquisition function that measures the mutual information between a batch of multi-fidelity queries and the target function, so as to penalize highly correlated queries and encourage diversity. The optimization of the batch acquisition function is challenging in that it involves a combinatorial search over many fidelities subject to the budget constraint. To address this challenge, we develop a weighted greedy algorithm that can sequentially identify each (fidelity, input) pair, while achieving a near $(1 - 1/e)$-approximation of the optimum. We show the advantage of our method in several computational physics and engineering applications.
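
A sketch of a weighted greedy selection under a budget: each step picks the (fidelity, input) candidate with the best marginal-gain-to-cost ratio. The toy gain function below merely rewards fidelity and diversity; it stands in for the paper's mutual-information acquisition.

```python
# Weighted greedy batch selection under a budget constraint (illustrative).
import numpy as np

def greedy_batch(candidates, costs, gain, budget):
    batch, spent = [], 0.0
    remaining = list(range(len(candidates)))
    while remaining:
        ratios = [gain(batch, candidates[i]) / costs[i] for i in remaining]
        best = remaining[int(np.argmax(ratios))]
        if spent + costs[best] > budget:
            break  # simple stopping rule; a fuller version keeps scanning
        batch.append(candidates[best])
        spent += costs[best]
        remaining.remove(best)
    return batch, spent

def toy_gain(batch, cand):
    fidelity, x = cand
    base = 1.0 + fidelity               # higher fidelity, more information
    if not batch:
        return base
    closest = min(abs(x - bx) for _, bx in batch)
    return base * closest               # near-duplicates contribute little

cands = [(f, x) for f in (0, 1, 2) for x in np.linspace(0.0, 1.0, 5)]
costs = [1.0 + 2.0 * f for f, _ in cands]   # higher fidelity costs more
picked, spent = greedy_batch(cands, costs, toy_gain, budget=10.0)
print(len(picked), spent)
```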

YNIMG Journal 2022 Journal Article

Focal fMRI signal enhancement with implantable inductively coupled detectors

  • Yi Chen
  • Qi Wang
  • Sangcheon Choi
  • Hang Zeng
  • Kengo Takahashi
  • Chunqi Qian
  • Xin Yu

Despite extensive efforts to increase the signal-to-noise ratio (SNR) of fMRI images for brain-wide mapping, technical advances for focal brain signal enhancement are lacking, in particular for animal brain imaging. Emerging studies have combined fMRI with fiber optic-based optogenetics to decipher circuit-specific neuromodulation from meso to macroscales. High-resolution fMRI is needed to integrate hemodynamic responses into cross-scale functional dynamics, but the SNR remains a limiting factor given the complex implantation setup of animal brains. Here, we developed a multimodal fMRI imaging platform with an implanted inductive coil detector. This detector boosts the tSNR of MRI images, showing a 2-3-fold sensitivity gain over a conventional coil configuration. In contrast to cryoprobes or array coils with limited space for implanted brain interfaces, this setup offers a unique advantage for studying brain circuit connectivity with optogenetic stimulation and can be further extended to other multimodal fMRI mapping schemes.

IJCAI Conference 2022 Conference Paper

Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields

  • Guangming Yao
  • Hongzhi Wu
  • Yi Yuan
  • Lincheng Li
  • Kun Zhou
  • Xin Yu

In this paper, we present a novel double diffusion based neural radiance field, dubbed DD-NeRF, to reconstruct human body geometry and render the human body appearance in novel views from a sparse set of images. We first propose a double diffusion mechanism to achieve expressive representations of input images by fully exploiting human body priors and image appearance details at two levels. At the coarse level, we first model the coarse human body poses and shapes via an unclothed 3D deformable vertex model as guidance. At the fine level, we present a multi-view sampling network to capture subtle geometric deformations and image detailed appearances, such as clothing and hair, from multiple input views. Considering the sparsity of the two-level features, we diffuse them into feature volumes in the canonical space to construct neural radiance fields. Then, we present a signed distance function (SDF) regression network to construct body surfaces from the diffused features. Thanks to our double diffused representations, our method can even synthesize novel views of unseen subjects. Experiments on various datasets demonstrate that our approach outperforms the state-of-the-art in both geometric reconstruction and novel view synthesis.

AAAI Conference 2022 Conference Paper

Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion

  • Tianqi Tang
  • Heming Du
  • Xin Yu
  • Yi Yang

For a monocular camera-based navigation system, if we could effectively explore scene geometric cues from RGB images, the geometry information would significantly facilitate the efficiency of the navigation system. Motivated by this, we propose a highly efficient point-goal navigation framework, dubbed Geo-Nav. In a nutshell, Geo-Nav consists of two parts: a visual perception part and a navigation part. In the visual perception part, we first propose a Self-supervised Depth Estimation network (SDE) specially tailored for the monocular camera-based navigation agent. SDE learns a mapping from an RGB input image to its corresponding depth image by exploring scene geometric constraints in a self-consistent manner. Then, in order to achieve a representative visual representation from the RGB inputs and learned depth images, we propose a Cross-modality Pyramid Fusion module (CPF). Concretely, CPF computes a patch-wise cross-modality correlation between different modal features and exploits the correlation to fuse and enhance features at each scale. Thanks to the patch-wise nature of CPF, we can fuse feature maps at high resolution, allowing the visual network to perceive more image details. In the navigation part, the extracted visual representations are fed to a navigation policy network to learn how to map the visual representations to agent actions effectively. Extensive experiments on the Gibson benchmark demonstrate that Geo-Nav outperforms the state-of-the-art in terms of efficiency and effectiveness.

AAAI Conference 2022 Conference Paper

One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning

  • Suzhen Wang
  • Lincheng Li
  • Yu Ding
  • Xin Yu

Audio-driven one-shot talking face generation methods are usually trained on video resources of various persons. However, their created videos often suffer from unnatural mouth shapes and asynchronous lips because those methods struggle to learn a consistent speech style from different speakers. We observe that it would be much easier to learn a consistent speech style from a specific speaker, which leads to authentic mouth movements. Hence, we propose a novel one-shot talking face generation framework by exploring consistent correlations between audio and visual motions from a specific speaker and then transferring audio-driven motion fields to a reference image. Specifically, we develop an Audio-Visual Correlation Transformer (AVCT) that aims to infer talking motions represented by keypoint-based dense motion fields from an input audio. In particular, considering audio may come from different identities in deployment, we incorporate phonemes to represent audio signals. In this manner, our AVCT can inherently generalize to audio spoken by other identities. Moreover, as face keypoints are used to represent speakers, AVCT is agnostic to the appearance of the training speaker, and thus allows us to manipulate face images of different identities readily. Considering different face shapes lead to different motions, a motion field transfer module is exploited to reduce the audio-driven dense motion field gap between the training identity and the one-shot reference. Once we obtain the dense motion field of the reference image, we employ an image renderer to generate its talking face videos from an audio clip. Thanks to our learned consistent speaking style, our method generates authentic mouth shapes and vivid movements. Extensive experiments demonstrate that our synthesized videos outperform the state-of-the-art in terms of visual quality and lip-sync.

NeurIPS Conference 2022 Conference Paper

Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm

  • Aidan Good
  • Jiaqi Lin
  • Xin Yu
  • Hannah Sieg
  • Mikey Fergurson
  • Shandian Zhe
  • Jerzy Wieczorek
  • Thiago Serra

Pruning techniques have been successfully used in neural networks to trade accuracy for sparsity. However, the impact of network pruning is not uniform: prior work has shown that the recall for underrepresented classes in a dataset may be more negatively affected. In this work, we study such relative distortions in recall by hypothesizing an intensification effect that is inherent to the model. Namely, that pruning makes recall relatively worse for a class with recall below accuracy and, conversely, that it makes recall relatively better for a class with recall above accuracy. In addition, we propose a new pruning algorithm aimed at attenuating this effect. Through statistical analysis, we have observed that intensification is less severe with our algorithm but nevertheless more pronounced with relatively more difficult tasks, less complex models, and higher pruning ratios. More surprisingly, we conversely observe a de-intensification effect with lower pruning ratios.
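
One way to make the hypothesized effect measurable (a plain reading of the abstract, not the paper's exact statistic): compare per-class recall before and after pruning against overall accuracy, with positive values indicating intensification.

```python
# Sketch of an intensification measure from per-class recall shifts.
import numpy as np

def per_class_recall(y_true, y_pred, n_classes):
    # assumes every class appears at least once in y_true
    return np.array([(y_pred[y_true == c] == c).mean() for c in range(n_classes)])

def intensification(y_true, pred_dense, pred_pruned, n_classes):
    acc = (pred_dense == y_true).mean()
    r0 = per_class_recall(y_true, pred_dense, n_classes)
    r1 = per_class_recall(y_true, pred_pruned, n_classes)
    # hypothesis: classes with recall below accuracy get relatively worse,
    # classes above it get relatively better; both cases yield positive values
    return np.sign(r0 - acc) * (r1 - r0)

rng = np.random.default_rng(0)
y = rng.integers(0, 5, 1000)
dense = np.where(rng.random(1000) < 0.8, y, rng.integers(0, 5, 1000))
pruned = np.where(rng.random(1000) < 0.7, y, rng.integers(0, 5, 1000))
print(intensification(y, dense, pruned, 5))
```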

IJCAI Conference 2021 Conference Paper

Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion

  • Suzhen Wang
  • Lincheng Li
  • Yu Ding
  • Changjie Fan
  • Xin Yu

We propose an audio-driven talking-head method to generate photo-realistic talking-head videos from a single reference image. In this work, we tackle two key challenges: (i) producing natural head motions that match speech prosody, and (ii) maintaining the appearance of a speaker in a large head motion while stabilizing the non-face regions. We first design a head pose predictor by modeling rigid 6D head movements with a motion-aware recurrent neural network (RNN). In this way, the predicted head poses act as the low-frequency holistic movements of a talking head, thus allowing our latter network to focus on detailed facial movement generation. To depict the entire image motions arising from audio, we exploit a keypoint-based dense motion field representation. Then, we develop a motion field generator to produce the dense motion fields from input audio, head poses, and a reference image. As this keypoint-based representation models the motions of facial regions, head, and backgrounds integrally, our method can better constrain the spatial and temporal consistency of the generated videos. Finally, an image generation network is employed to render photo-realistic talking-head videos from the estimated keypoint-based motion fields and the input reference image. Extensive experiments demonstrate that our method produces videos with plausible head motions, synchronized facial expressions, and stable backgrounds and outperforms the state-of-the-art.

YNIMG Journal 2021 Journal Article

Contribution of animal models toward understanding resting state functional connectivity

  • Patricia Pais-Roldán
  • Celine Mateo
  • Wen-Ju Pan
  • Ben Acland
  • David Kleinfeld
  • Lawrence H. Snyder
  • Xin Yu
  • Shella Keilholz

Functional connectivity, which reflects the spatial and temporal organization of intrinsic activity throughout the brain, is one of the most studied measures in human neuroimaging research. The noninvasive acquisition of resting state functional magnetic resonance imaging (rs-fMRI) allows the characterization of features designated as functional networks, functional connectivity gradients, and time-varying activity patterns that provide insight into the intrinsic functional organization of the brain and potential alterations related to brain dysfunction. Functional connectivity, hence, captures dimensions of the brain's activity that have enormous potential for both clinical and preclinical research. However, the mechanisms underlying functional connectivity have yet to be fully characterized, hindering interpretation of rs-fMRI studies. As in other branches of neuroscience, the identification of the neurophysiological processes that contribute to functional connectivity largely depends on research conducted on laboratory animals, which provide a platform where specific, multi-dimensional investigations that involve invasive measurements can be carried out. These highly controlled experiments facilitate the interpretation of the temporal correlations observed across the brain. Indeed, information obtained from animal experimentation to date is the basis for our current understanding of the underlying basis for functional brain connectivity. This review presents a compendium of some of the most critical advances in the field based on the efforts made by the animal neuroimaging community.

AAAI Conference 2021 Conference Paper

Modeling the Probabilistic Distribution of Unlabeled Data for One-shot Medical Image Segmentation

  • Yuhang Ding
  • Xin Yu
  • Yi Yang

Existing image segmentation networks mainly leverage large-scale labeled datasets to attain high accuracy. However, labeling medical images is very expensive since it requires sophisticated expert knowledge. Thus, it is more desirable to employ only a few labeled data in pursuing high segmentation performance. In this paper, we develop a data augmentation method for one-shot brain magnetic resonance imaging (MRI) image segmentation which exploits only one labeled MRI image (named atlas) and a few unlabeled images. In particular, we propose to learn the probability distributions of deformations (including shapes and intensities) of different unlabeled MRI images with respect to the atlas via 3D variational autoencoders (VAEs). In this manner, our method is able to exploit the learned distributions of image deformations to generate new authentic brain MRI images, and the number of generated samples will be sufficient to train a deep segmentation network. Furthermore, we introduce a new standard segmentation benchmark to evaluate the generalization performance of a segmentation network through a cross-dataset setting (collected from different sources). Extensive experiments demonstrate that our method outperforms the state-of-the-art one-shot medical segmentation methods. Our code has been released at https://github.com/dyh127/Modeling-the-Probabilistic-Distribution-of-Unlabeled-Data.

NeurIPS Conference 2021 Conference Paper

Scaling Up Exact Neural Network Compression by ReLU Stability

  • Thiago Serra
  • Xin Yu
  • Abhinav Kumar
  • Srikumar Ramalingam

We can compress a rectifier network while exactly preserving its underlying functionality with respect to a given input domain if some of its neurons are stable. However, current approaches to determine the stability of neurons with Rectified Linear Unit (ReLU) activations require solving or finding a good approximation to multiple discrete optimization problems. In this work, we introduce an algorithm based on solving a single optimization problem to identify all stable neurons. Our approach is a median of 183 times faster than the state-of-the-art method on CIFAR-10, which allows us to explore exact compression on deeper (5 × 100) and wider (2 × 800) networks within minutes. For classifiers trained under an amount of L1 regularization that does not worsen accuracy, we can remove up to 56% of the connections on the CIFAR-10 dataset. The code is available at https://github.com/yuxwind/ExactCompression.
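
For intuition about what "stable" means, here is a deliberately cruder illustration than the paper's single-optimization-problem approach: interval bounds over a box input domain certify some neurons as stably active (ReLU acts as the identity, so they can be folded into the next layer) or stably inactive (they can be deleted outright).

```python
# ReLU stability via interval arithmetic over a box domain (illustrative).
import numpy as np

def interval_preactivation(W, b, lo, hi):
    """Elementwise bounds on W x + b for x in the box [lo, hi]."""
    W_pos, W_neg = np.maximum(W, 0), np.minimum(W, 0)
    lower = W_pos @ lo + W_neg @ hi + b
    upper = W_pos @ hi + W_neg @ lo + b
    return lower, upper

rng = np.random.default_rng(0)
W, b = rng.standard_normal((8, 4)), 2.0 * rng.standard_normal(8)
lo, hi = -np.ones(4), np.ones(4)
lower, upper = interval_preactivation(W, b, lo, hi)
stably_active = lower >= 0      # ReLU is linear here: fold into the next layer
stably_inactive = upper <= 0    # ReLU outputs zero here: remove the neuron
print(stably_active.sum(), stably_inactive.sum())
```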

AAAI Conference 2021 Conference Paper

Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation

  • Lincheng Li
  • Suzhen Wang
  • Zhimeng Zhang
  • Yu Ding
  • Yixing Zheng
  • Xin Yu
  • Changjie Fan

In this paper, we propose a novel text-based talking-head video generation framework that synthesizes high-fidelity facial expressions and head motions in accordance with contextual sentiments as well as speech rhythm and pauses. To be specific, our framework consists of a speaker-independent stage and a speaker-specific stage. In the speaker-independent stage, we design three parallel networks to generate animation parameters of the mouth, upper face, and head from texts, separately. In the speaker-specific stage, we present a 3D face model guided attention network to synthesize videos tailored for different individuals. It takes the animation parameters as input and exploits an attention mask to manipulate facial expression changes for the input individuals. Furthermore, to better establish authentic correspondences between visual motions (i.e., facial expression changes and head movements) and audios, we leverage a high-accuracy motion capture dataset instead of relying on long videos of specific individuals. After attaining the visual and audio correspondences, we can effectively train our network in an end-to-end fashion. Extensive qualitative and quantitative experiments demonstrate that our algorithm achieves high-quality photorealistic talking-head videos including various facial expressions and head motions according to speech rhythms, and outperforms the state-of-the-art.

AAAI Conference 2020 Conference Paper

Optimal Feature Transport for Cross-View Image Geo-Localization

  • Yujiao Shi
  • Xin Yu
  • Liu Liu
  • Tong Zhang
  • Hongdong Li

This paper addresses the problem of cross-view image geo-localization, where the geographic location of a ground-level street-view query image is estimated by matching it against a large-scale aerial map (e.g., a high-resolution satellite image). State-of-the-art deep-learning based methods tackle this problem as deep metric learning, which aims to learn global feature representations of the scene seen by the two different views. Although such deep metric learning methods obtain promising results, they fail to exploit a crucial cue relevant for localization, namely, the spatial layout of local features. Moreover, little attention is paid to the obvious domain gap (between aerial view and ground view) in the context of cross-view localization. This paper proposes a novel Cross-View Feature Transport (CVFT) technique to explicitly establish cross-view domain transfer that facilitates feature alignment between ground and aerial images. Specifically, we implement CVFT as network layers, which transport features from one domain to the other, leading to more meaningful feature similarity comparisons. Our model is differentiable and can be learned end-to-end. Experiments on large-scale datasets have demonstrated that our method remarkably boosts the state-of-the-art cross-view localization performance, e.g., on the CVUSA dataset, with significant improvements in top-1 recall from 40.79% to 61.43%, and in top-10 recall from 76.36% to 90.49%. We expect the key insight of the paper (i.e., explicitly handling the domain difference via domain transport) to prove useful for other similar problems in computer vision as well.
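
The transport idea can be illustrated with generic entropic optimal transport (Sinkhorn iterations) between flattened feature cells of the two views; CVFT learns its cost inside the network, so the cost matrix, marginals, and iteration count below are assumptions.

```python
# Entropic optimal transport (Sinkhorn) between two sets of feature cells.
import numpy as np

def sinkhorn(cost, eps=0.1, n_iters=200):
    """Returns a transport plan with uniform marginals."""
    n, m = cost.shape
    K = np.exp(-cost / eps)
    a, b = np.ones(n) / n, np.ones(m) / m
    v = np.ones(m)
    for _ in range(n_iters):
        u = a / (K @ v)
        v = b / (K.T @ u)
    return u[:, None] * K * v[None, :]   # plan = diag(u) K diag(v)

rng = np.random.default_rng(0)
ground = rng.standard_normal((16, 8))    # flattened ground-view feature cells
aerial = rng.standard_normal((16, 8))    # flattened aerial-view feature cells
cost = ((ground[:, None, :] - aerial[None, :, :]) ** 2).sum(-1)
cost = cost / cost.max()                 # normalize to keep exp(-cost/eps) stable
P = sinkhorn(cost)
transported = (P / P.sum(1, keepdims=True)) @ aerial  # aerial feats, ground layout
print(P.sum(), transported.shape)
```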

NeurIPS Conference 2020 Conference Paper

TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation

  • Dongxu Li
  • Chenchen Xu
  • Xin Yu
  • Kaihao Zhang
  • Benjamin Swift
  • Hanna Suominen
  • Hongdong Li

Sign language translation (SLT) aims to interpret sign video sequences into text-based natural language sentences. Sign videos consist of continuous sequences of sign gestures with no clear boundaries in between. Existing SLT models usually represent sign visual features in a frame-wise manner so as to avoid the need to explicitly segment the videos into isolated signs. However, these methods neglect the temporal information of signs and lead to substantial ambiguity in translation. In this paper, we explore the temporal semantic structures of sign videos to learn more discriminative features. To this end, we first present a novel sign video segment representation that takes into account multiple temporal granularities, thus alleviating the need for accurate video segmentation. Taking advantage of the proposed segment representation, we develop a novel hierarchical sign video feature learning method via a temporal semantic pyramid network, called TSPNet. Specifically, TSPNet introduces an inter-scale attention to evaluate and enhance local semantic consistency of sign segments and an intra-scale attention to resolve semantic ambiguity by using non-local video context. Experiments show that our TSPNet outperforms the state-of-the-art with significant improvements on the BLEU score (from 9.58 to 13.41) and ROUGE score (from 31.80 to 34.96) on the largest commonly used SLT dataset. Our implementation is available at https://github.com/verashira/TSPNet.
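
To make the multi-granularity segment representation concrete, the sketch below mean-pools sliding windows of several lengths over frame-level features, yielding one level of segment features per granularity. Window sizes and strides are assumptions, and TSPNet's inter-scale and intra-scale attention is omitted.

```python
# Sketch of a multi-granularity segment representation: mean-pool sliding
# windows of several lengths over frame features. Window sizes and strides
# are illustrative; TSPNet adds inter- and intra-scale attention on top.
import numpy as np

def segment_pyramid(frames, window_sizes=(8, 12, 16), stride=2):
    """frames: (T, D) features -> one (num_segments, D) array per scale."""
    levels = []
    for w in window_sizes:
        starts = range(0, frames.shape[0] - w + 1, stride)
        levels.append(np.stack([frames[s:s + w].mean(axis=0) for s in starts]))
    return levels

video = np.random.default_rng(0).standard_normal((120, 512))  # 120 frames
for w, level in zip((8, 12, 16), segment_pyramid(video)):
    print(f"window {w}: {level.shape[0]} segment features")
```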

YNIMG Journal 2019 Journal Article

Multimodal assessment of recovery from coma in a rat model of diffuse brainstem tegmentum injury

  • Patricia Pais-Roldán
  • Brian L. Edlow
  • Yuanyuan Jiang
  • Johannes Stelzer
  • Ming Zou
  • Xin Yu

Despite the association between brainstem lesions and coma, a mechanistic understanding of coma pathogenesis and recovery is lacking. We developed a coma model in the rat mimicking human brainstem coma, which allowed multimodal analysis of a brainstem tegmentum lesion's effects on behavior, cortical electrophysiology, and global brain functional connectivity. After coma induction, we observed a transient period (∼1h) of unresponsiveness accompanied by cortical burst-suppression. Comatose rats then gradually regained behavioral responsiveness concurrent with emergence of delta/theta-predominant cortical rhythms in primary somatosensory cortex. During the acute stage of coma recovery (∼1–8h), longitudinal resting-state functional MRI revealed an increase in functional connectivity between subcortical arousal nuclei in the thalamus, basal forebrain, and basal ganglia and cortical regions implicated in awareness. This rat coma model provides an experimental platform to systematically study network-based mechanisms of coma pathogenesis and recovery, as well as to test targeted therapies aimed at promoting recovery of consciousness after coma.

NeurIPS Conference 2019 Conference Paper

Spatial-Aware Feature Aggregation for Image based Cross-View Geo-Localization

  • Yujiao Shi
  • Liu Liu
  • Xin Yu
  • Hongdong Li

In this paper, we develop a new deep network to explicitly address the inherent differences between ground and aerial views. We observe that there exist approximate domain correspondences between ground and aerial images: pixels lying along the same azimuth direction in an aerial image approximately correspond to a vertical image column in the ground-view image. Thus, we propose a two-step approach to exploit this prior knowledge. The first step is to apply a regular polar transform to warp an aerial image such that its domain is closer to that of a ground-view panorama. Note that the polar transform, as a pure geometric transformation, is agnostic to scene content and hence cannot bring the two domains into full alignment. We therefore add a subsequent spatial-attention mechanism that further brings corresponding deep features closer in the embedding space. To improve the robustness of the feature representation, we introduce a feature aggregation strategy via learning multiple spatial embeddings. With this two-step approach, we achieve more discriminative deep representations, making cross-view geo-localization more accurate. Our experiments on standard benchmark datasets show significant performance gains, more than doubling the recall rate compared with the previous state of the art.
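
The polar transform step is straightforward to sketch. The NumPy example below resamples a square aerial patch so that azimuth about the image center maps to the horizontal axis, roughly matching the geometry of a ground-view panorama; the output resolution, orientation convention, and nearest-neighbour sampling are illustrative choices, not the paper's exact recipe.

```python
# Sketch of a polar transform: resample a square overhead image so that
# azimuth around the center maps to the horizontal axis.
import numpy as np

def polar_transform(aerial, out_h=128, out_w=512):
    """aerial: (S, S, C) overhead image -> (out_h, out_w, C) polar image."""
    S = aerial.shape[0]
    cx = cy = (S - 1) / 2.0
    rows = np.arange(out_h)[:, None]          # radius index (0 = outer edge)
    cols = np.arange(out_w)[None, :]          # azimuth index
    radius = (S / 2.0) * (out_h - rows) / out_h
    theta = 2.0 * np.pi * cols / out_w
    # nearest-neighbour sampling of the source pixel per (radius, azimuth)
    x = np.clip(np.round(cx + radius * np.sin(theta)).astype(int), 0, S - 1)
    y = np.clip(np.round(cy - radius * np.cos(theta)).astype(int), 0, S - 1)
    return aerial[y, x]

img = np.random.rand(256, 256, 3)             # stand-in satellite patch
print(polar_transform(img).shape)             # (128, 512, 3)
```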

AAAI Conference 2017 Conference Paper

Face Hallucination with Tiny Unaligned Images by Transformative Discriminative Neural Networks

  • Xin Yu
  • Fatih Porikli

Conventional face hallucination methods rely heavily on accurate alignment of low-resolution (LR) faces before upsampling them. Misalignment often leads to deficient results and unnatural artifacts for large upscaling factors. However, due to the diverse range of poses and facial expressions, aligning an LR input image, in particular when it is tiny, is severely difficult. To overcome this challenge, we present an end-to-end transformative discriminative neural network (TDN) devised for super-resolving unaligned and very small face images with an extreme upscaling factor of 8. Our method employs an upsampling network in which we embed spatial transformation layers to allow local receptive fields to line up with similar spatial supports. Furthermore, we incorporate a class-specific loss in our objective through a successive discriminative network to improve the alignment and upsampling performance with semantic information. Extensive experiments on large face datasets show that the proposed method significantly outperforms the state-of-the-art.
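
The embedded spatial transformation layers follow the familiar spatial-transformer pattern, which can be sketched in PyTorch as a small localization network predicting an affine warp that is applied to the feature map by grid sampling. Layer sizes and the pooling choice are assumptions, not the TDN configuration.

```python
# Sketch of a spatial transformation layer: a localization net predicts a
# per-sample affine warp that re-aligns the feature map. Illustrative sizes.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AffineAlign(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.loc = nn.Sequential(
            nn.AdaptiveAvgPool2d(4), nn.Flatten(),
            nn.Linear(channels * 16, 32), nn.ReLU(), nn.Linear(32, 6))
        # initialize the predicted warp to the identity transform
        self.loc[-1].weight.data.zero_()
        self.loc[-1].bias.data.copy_(torch.tensor([1., 0., 0., 0., 1., 0.]))

    def forward(self, x):
        theta = self.loc(x).view(-1, 2, 3)             # per-sample affine
        grid = F.affine_grid(theta, x.size(), align_corners=False)
        return F.grid_sample(x, grid, align_corners=False)

feat = torch.randn(2, 64, 16, 16)   # tiny LR face features
print(AffineAlign(64)(feat).shape)  # warped features, same shape
```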

YNICL Journal 2015 Journal Article

Prefrontal cortex connectivity dysfunction in performing the Fist–Edge–Palm task in patients with first-episode schizophrenia and non-psychotic first-degree relatives

  • Raymond C.K. Chan
  • Jia Huang
  • Qing Zhao
  • Ya Wang
  • Yun-yao Lai
  • Nan Hong
  • David H.K. Shum
  • Eric F.C. Cheung

Neurological soft signs have been considered one of the promising neurological endophenotypes for schizophrenia. However, most previous studies have employed clinical rating data only. The present study aimed to examine the neurobiological basis of one of the typical motor coordination signs, the Fist-Edge-Palm (FEP) task, in patients with first-episode schizophrenia and their non-psychotic first-degree relatives. Thirteen patients with first-episode schizophrenia, 14 non-psychotic first-degree relatives and 14 healthy controls were recruited. All participants performed the FEP task in a 3T GE MRI scanner. Psychophysiological interaction (PPI) analysis was used to evaluate the functional connectivity between the sensorimotor cortex and frontal regions when participants performed the FEP task compared with simple motor tasks. In the contrast of palm-tapping (PT) vs. rest, activation of the left frontal-parietal region was lowest in the schizophrenia group, intermediate in the relative group and highest in the healthy control group. In the contrast of FEP vs. PT, patients with schizophrenia did not show areas of significant activation, while relatives and healthy controls showed significant activation of the left middle frontal gyrus. Moreover, with increasing task complexity, significant functional connectivity was observed between the sensorimotor cortex and the right frontal gyrus in healthy controls but not in patients with first-episode schizophrenia. These findings suggest that activity of the left frontal-parietal and frontal regions may be a neurofunctional correlate of neurological soft signs, which in turn may be a potential endophenotype of schizophrenia. Moreover, the right frontal gyrus may play a specific role in the execution of the FEP task in schizophrenia spectrum disorders.
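
For readers unfamiliar with PPI, the toy NumPy example below shows the core regression: a target time course is modeled by a task regressor, a seed time course, and their product, and the coefficient on the product term indexes task-dependent coupling. All data are simulated and the design is deliberately minimal.

```python
# Toy psychophysiological interaction (PPI) regression on simulated data:
# the interaction coefficient captures how seed-target coupling changes
# between task blocks and rest.
import numpy as np

rng = np.random.default_rng(1)
n = 200
task = ((np.arange(n) // 20) % 2).astype(float)  # boxcar task regressor
seed = rng.standard_normal(n)                    # seed-region time course
# target couples with the seed more strongly during task blocks
target = 0.2 * seed + 0.6 * task * seed + 0.1 * rng.standard_normal(n)

X = np.column_stack([np.ones(n), task, seed, task * seed])  # PPI design
beta, *_ = np.linalg.lstsq(X, target, rcond=None)
print(f"interaction (PPI) beta: {beta[3]:.2f}")  # ~0.6 here
```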

YNIMG Journal 2014 Journal Article

Lack of dystrophin results in abnormal cerebral diffusion and perfusion in vivo

  • Candida L. Goodnough
  • Ying Gao
  • Xin Li
  • Mohammed Q. Qutaish
  • L. Henry Goodnough
  • Joseph Molter
  • David Wilson
  • Chris A. Flask

Dystrophin, the main component of the dystrophin–glycoprotein complex, plays an important role in maintaining the structural integrity of cells. It is also involved in the formation of the blood–brain barrier (BBB). To elucidate the impact of dystrophin disruption in vivo, we characterized changes in cerebral perfusion and diffusion in dystrophin-deficient mice (mdx) by magnetic resonance imaging (MRI). Arterial spin labeling (ASL) and diffusion-weighted MRI (DWI) studies were performed on 2-month-old and 10-month-old mdx mice and their age-matched wild-type controls (WT). The imaging results were correlated with Evans blue extravasation and vascular density studies. The results show that dystrophin disruption significantly decreased the mean cerebral diffusivity in both 2-month-old (7.38±0.30×10⁻⁴ mm²/s) and 10-month-old (6.93±0.53×10⁻⁴ mm²/s) mdx mice as compared to WT (8.49±0.24×10⁻⁴ and 8.24±0.25×10⁻⁴ mm²/s, respectively). There was also an 18% decrease in cerebral perfusion in 10-month-old mdx mice as compared to WT, which was associated with enhanced arteriogenesis. The reduction in water diffusivity in mdx mice is likely due to an increase in cerebral edema or the presence of large molecules in the extracellular space from a leaky BBB. The observation of decreased perfusion in the setting of enhanced arteriogenesis may be caused by an increase in intracranial pressure from cerebral edema. This study demonstrates defects in water handling at the BBB and, consequently, abnormal perfusion associated with the absence of dystrophin.

YNIMG Journal 2012 Journal Article

Direct imaging of macrovascular and microvascular contributions to BOLD fMRI in layers IV–V of the rat whisker–barrel cortex

  • Xin Yu
  • Daniel Glen
  • Shumin Wang
  • Stephen Dodd
  • Yoshiyuki Hirano
  • Ziad Saad
  • Richard Reynolds
  • Afonso C. Silva

The spatiotemporal characteristics of the hemodynamic response to increased neural activity were investigated at the level of individual intracortical vessels using BOLD-fMRI in a well-established rodent model of somatosensory stimulation at 11.7T. Functional maps of the rat barrel cortex were obtained at 150×150×500μm spatial resolution every 200ms. The high spatial resolution allowed separation of active voxels into those containing intracortical macro vessels, mainly veins/venules (referred to as macrovasculature), and those enriched with arteries/capillaries and small venules (referred to as microvasculature), since macro vessels can be readily mapped due to the fast T2* decay of blood at 11.7T. The earliest BOLD response was observed within layers IV–V by 0.8s following stimulation and encompassed mainly the voxels containing the microvasculature and some confined macrovasculature voxels. By 1.2s, the BOLD signal propagated to the macrovasculature voxels, where the peak BOLD signal was 2–3 times higher than that of the microvasculature voxels. At later times, the BOLD response propagated along individual venules/veins far from neuronal sources. This was also observed in layers IV–V of the barrel cortex after specific stimulation of separated whisker rows. These results directly visualize that the earliest hemodynamic changes in response to increased neural activity occur mainly in the microvasculature and spread toward the macrovasculature. At peak response, however, the BOLD signal is dominated by penetrating venules even in layers IV–V of the cortex.

YNIMG Journal 2011 Journal Article

Morphological and functional midbrain phenotypes in Fibroblast Growth Factor 17 mutant mice detected by Mn-enhanced MRI

  • Xin Yu
  • Brian J. Nieman
  • Anamaria Sudarov
  • Kamila U. Szulc
  • Davood J. Abdollahian
  • Nitin Bhatia
  • Anil K. Lalwani
  • Alexandra L. Joyner

With increasing efforts to develop and utilize mouse models of a variety of neurodevelopmental diseases, there is an urgent need for sensitive neuroimaging methods that enable in vivo analysis of subtle alterations in brain anatomy and function in mice. Previous studies have shown that the brains of Fibroblast Growth Factor 17 null mutants (Fgf17 −/−) have anatomical abnormalities in the inferior colliculus (IC)—the auditory midbrain—and minor foliation defects in the cerebellum. In addition, changes in the expression domains of several cortical patterning genes were detected, without overt changes in forebrain morphology. Recently, it has also been reported that Fgf17 −/− mutants have abnormal vocalization and social behaviors, phenotypes that could reflect molecular changes in the cortex and/or altered auditory processing and perception in these mice. We used manganese (Mn)-enhanced magnetic resonance imaging (MEMRI) to analyze the anatomical phenotype of Fgf17 −/− mutants in more detail than previously achieved, detecting changes in the IC, cerebellum, olfactory bulb, hypothalamus and frontal cortex. We also used MEMRI to characterize sound-evoked activity patterns, demonstrating a significant reduction of the active IC volume in Fgf17 −/− mice. Furthermore, tone-specific (16- and 40-kHz) activity patterns in the IC of Fgf17 −/− mice were observed to be largely overlapping, in contrast to the normal pattern, which is separated along the dorsal–ventral axis. These results demonstrate that Fgf17 plays important roles in both the anatomical and functional development of the auditory midbrain, and show the utility of MEMRI for in vivo analyses of mutant mice with subtle brain defects.

YNIMG Journal 2010 Journal Article

3D mapping of somatotopic reorganization with small animal functional MRI

  • Xin Yu
  • Shumin Wang
  • Der-Yow Chen
  • Stephen Dodd
  • Artem Goloshevsky
  • Alan P. Koretsky

There are few noninvasive in vivo methods to study neuroplasticity in animal brains. Functional MRI (fMRI) has been developed for animal brain mapping, but few fMRI studies have analyzed functional alterations due to plasticity in animal models. One major limitation is that fMRI maps are characterized by statistical parametric mapping, which makes the apparent boundary of a map dependent on the statistical threshold used. Here, we developed a method to characterize the center-of-mass locations of fMRI maps, which is shown not to be sensitive to the statistical threshold. Utilizing centers-of-mass as anchor points to fit the spatial distribution of the BOLD response enabled quantitative group analysis of altered boundaries of functional somatosensory maps. This approach was used to study cortical reorganization in the rat primary somatosensory cortex (S1) after sensory deprivation of the barrel cortex by follicle ablation (FA). fMRI demonstrated an enlarged nose S1 representation in the 3D somatotopic functional maps. This result clearly demonstrates that fMRI enables the spatial mapping of functional changes, characterizing multiple regions of S1 cortex while remaining sensitive to changes due to plasticity.
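
The threshold insensitivity of a center-of-mass summary is easy to demonstrate. In the toy NumPy example below, the statistic-weighted center of mass of a simulated activation blob barely moves as the threshold rises, even though the supra-threshold extent shrinks; everything here is simulated for illustration.

```python
# Toy demonstration: a statistic-weighted center of mass is nearly
# threshold-independent, unlike the apparent activation boundary.
# The "activation map" is a simulated 2D Gaussian blob.
import numpy as np

def center_of_mass(stat_map, threshold):
    w = np.where(stat_map > threshold, stat_map, 0.0)  # supra-threshold weights
    coords = np.indices(stat_map.shape).reshape(2, -1).astype(float)
    return coords @ w.ravel() / w.sum()

yy, xx = np.mgrid[0:64, 0:64]
blob = 8.0 * np.exp(-((yy - 40) ** 2 + (xx - 22) ** 2) / 50.0)
for t in (1.0, 2.0, 4.0):
    com = center_of_mass(blob, t)
    print(f"threshold {t}: center ≈ ({com[0]:.2f}, {com[1]:.2f}), "
          f"extent = {(blob > t).sum()} voxels")
```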