Arrow Research search

Author name cluster

Yimin Chen

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

10 papers
2 author rows

Possible papers

10

YNIMG Journal 2026 Journal Article

Portable and dynamic magnetoencephalography measurement using compact MSR by precise two-stage magnetic field adjustment and control strategy

  • Yimin Chen
  • Xikai Liu
  • Zhenfeng Gao
  • Ya Deng
  • Pengfei Song
  • Bangcheng Han

The non-invasive technique of magnetoencephalography (MEG) has become increasingly vital for neuroscience research as well as for disease diagnosis and treatment. This article presents a portable and dynamic MEG system that utilizes a compact magnetically shielded room (MSR) integrated with a high-performance magnetic field compensation system and an MEG helmet equipped with an array of optically pumped magnetometer (OPM) sensors, achieving both high resolution and cost-effectiveness. The magnetic field compensation system employs two types of custom coils: external Helmholtz-like coils to adjust the distribution of residual magnetic fields, and internal planar coils to control magnetic field bias and disturbances. This configuration, combined with an active disturbance rejection control (ADRC) strategy, establishes an ultra-weak and stable measurement environment across a large uniform region. We then conducted experiments in which subjects wore the MEG helmet and underwent MEG measurements during periodic eye opening and closing and electrical median nerve stimulation, performed in both stationary and natural movement states. The above brain magnetic signals were measured in the occipital lobe and the postcentral gyrus, respectively. The results demonstrate that the MEG system maintains a comparable high detection resolution when the subject moves, while also offering significantly reduced cost and weight. The proposed system promises to deliver significant benefits for clinical practice and neuroscience research.

AAAI Conference 2026 Conference Paper

REACT-LLM: A Benchmark for Evaluating LLM Integration with Causal Features in Clinical Prognostic Tasks

  • Linna Wang
  • Zhixuan You
  • Qihui Zhang
  • Jiunan Wen
  • Ji Shi
  • Yimin Chen
  • Yusen Wang
  • Fanqi Ding

Large Language Models (LLMs) and causal learning each hold strong potential for clinical decision making (CDM). However, their synergy remains poorly understood, largely due to the lack of systematic benchmarks evaluating their integration in clinical risk prediction. In real-world healthcare, identifying features with causal influence on outcomes is crucial for actionable and trustworthy predictions. While recent work highlights LLMs' emerging causal reasoning abilities, there lacks comprehensive benchmarks to assess their causal learning and performance informed by causal features in clinical risk prediction. To address this, we introduce REACT-LLM, a benchmark designed to evaluate whether combining LLMs with causal features can enhance clinical prognostic performance and potentially outperform traditional machine learning (ML) methods. Unlike existing LLM-clinical benchmarks that often focus on a limited set of outcomes, REACT-LLM evaluates 7 clinical outcomes across 2 real-world datasets, comparing 15 prominent LLMs, 6 traditional ML models, and 3 causal discovery (CD) algorithms. Our findings indicate that while LLMs perform reasonably in clinical prognostics, they have not yet outperformed traditional ML models. Integrating causal features derived from CD algorithms into LLMs offers limited performance gains, primarily due to the strict assumptions of many CD methods, which are often violated in complex clinical data. While the direct integration yields limited improvement, our benchmark reveals a more promising synergy: LLMs serve effectively as knowledge-rich collaborators for identifying and optimizing causal features. Additionally, in-context learning improves LLM predictions when prompts are tailored to the task and model. Different LLMs show varying sensitivity to structured data encoding formats, for example, open-source models perform better with JSON, while smaller models benefit from narrative serialization. These findings highlight the need to match prompts and data formats to model architecture and pretraining.

NeurIPS Conference 2025 Conference Paper

ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training

  • Xin Yao
  • Haiyang Zhao
  • Yimin Chen
  • Jiawei Guo
  • Kecheng Huang
  • Ming Zhao

The Contrastive Language-Image Pretraining (CLIP) model has significantly advanced vision-language modeling by aligning image-text pairs from large-scale web data through self-supervised contrastive learning. Yet, its reliance on uncurated Internet-sourced data exposes it to data poisoning and backdoor risks. While existing studies primarily investigate image-based attacks, the text modality, which is equally central to CLIP's training, remains underexplored. In this work, we introduce ToxicTextCLIP, a framework for generating high-quality adversarial texts that target CLIP during the pre-training phase. The framework addresses two key challenges: semantic misalignment caused by background inconsistency with the target class, and the scarcity of background-consistent texts. To this end, ToxicTextCLIP iteratively applies: 1) a background-aware selector that prioritizes texts with background content aligned to the target class, and 2) a background-driven augmenter that generates semantically coherent and diverse poisoned samples. Extensive experiments on classification and retrieval tasks show that ToxicTextCLIP achieves up to 95. 83\% poisoning success and 98. 68% backdoor Hit@1, while bypassing RoCLIP, CleanCLIP and SafeCLIP defenses. The source code can be accessed via https: //github. com/xinyaocse/ToxicTextCLIP/.

YNIMG Journal 2024 Journal Article

Lightweight and wearable magnetoencephalography system based on spatially-grid constrained coils and compact magnetically shielded room

  • Shuai Dou
  • Xikai Liu
  • Ya Deng
  • Yimin Chen
  • Pengfei Song
  • Tong Wen
  • Bangcheng Han

Magnetoencephalography based on optically pumped magnetometers can passively detect the ultra-weak brain magnetic field signals, which has significant clinical application prospects for the diagnosis and treatment of cerebral disorders. This paper proposes a brain magnetic signal measurement method on the basis of the active-passive coupling magnetic shielding strategy and helmet-mounted detection array, which has lower cost and comparable performance over the existing ones. We first utilized the spatially-grid constrained coils and biplanar coils with proportion-integration-differentiation controller with tracking differentiator to ensure a near-zero and stable magnetic field environment with large uniform region. Subsequently, we implemented the brain magnetic signal measurement with the subject randomly moving fingers through tapping a keyboard and with the condition of opening and closing the eyes. Effectively induced brain magnetic signals were detected at the motor functional area and occipital lobe area in the two experiments, respectively. The proposed method will contribute to the development of functional brain imaging.

AAAI Conference 2022 Conference Paper

Uncertainty Estimation via Response Scaling for Pseudo-Mask Noise Mitigation in Weakly-Supervised Semantic Segmentation

  • Yi Li
  • Yiqun Duan
  • Zhanghui Kuang
  • Yimin Chen
  • Wayne Zhang
  • Xiaomeng Li

Weakly-Supervised Semantic Segmentation (WSSS) segments objects without a heavy burden of dense annotation. While as a price, generated pseudo-masks exist obvious noisy pixels, which result in sub-optimal segmentation models trained over these pseudo-masks. But rare studies notice or work on this problem, even these noisy pixels are inevitable after their improvements on pseudo-mask. So we try to improve WSSS in the aspect of noise mitigation. And we observe that many noisy pixels are of high confidence, especially when the response range is too wide or narrow, presenting an uncertain status. Thus, in this paper, we simulate noisy variations of response by scaling the prediction map multiple times for uncertainty estimation. The uncertainty is then used to weight the segmentation loss to mitigate noisy supervision signals. We call this method URN, abbreviated from Uncertainty estimation via Response scaling for Noise mitigation. Experiments validate the benefits of URN, and our method achieves state-of-the-art results at 71. 2% and 41. 5% on PASCAL VOC 2012 and MS COCO 2014 respectively, without extra models like saliency detection. Code is available at https: //github. com/XMed-Lab/URN.

ICML Conference 2021 Conference Paper

Group Fisher Pruning for Practical Network Compression

  • Liyang Liu
  • Shilong Zhang
  • Zhanghui Kuang
  • Aojun Zhou
  • Jing-Hao Xue
  • Xinjiang Wang
  • Yimin Chen
  • Wenming Yang

Network compression has been widely studied since it is able to reduce the memory and computation cost during inference. However, previous methods seldom deal with complicated structures like residual connections, group/depth-wise convolution and feature pyramid network, where channels of multiple layers are coupled and need to be pruned simultaneously. In this paper, we present a general channel pruning approach that can be applied to various complicated structures. Particularly, we propose a layer grouping algorithm to find coupled channels automatically. Then we derive a unified metric based on Fisher information to evaluate the importance of a single channel and coupled channels. Moreover, we find that inference speedup on GPUs is more correlated with the reduction of memory rather than FLOPs, and thus we employ the memory reduction of each channel to normalize the importance. Our method can be used to prune any structures including those with coupled channels. We conduct extensive experiments on various backbones, including the classic ResNet and ResNeXt, mobile-friendly MobileNetV2, and the NAS-based RegNet, both on image classification and object detection which is under-explored. Experimental results validate that our method can effectively prune sophisticated networks, boosting inference speed without sacrificing accuracy.

ICLR Conference 2021 Conference Paper

Towards Impartial Multi-task Learning

  • Liyang Liu
  • Yi Li 0050
  • Zhanghui Kuang
  • Jing-Hao Xue
  • Yimin Chen
  • Wenming Yang
  • Qingmin Liao
  • Wayne Zhang 0001

Multi-task learning (MTL) has been widely used in representation learning. However, naively training all tasks simultaneously may lead to the partial training issue, where specific tasks are trained more adequately than others. In this paper, we propose to learn multiple tasks impartially. Specifically, for the task-shared parameters, we optimize the scaling factors via a closed-form solution, such that the aggregated gradient (sum of raw gradients weighted by the scaling factors) has equal projections onto individual tasks. For the task-specific parameters, we dynamically weigh the task losses so that all of them are kept at a comparable scale. Further, we find the above gradient balance and loss balance are complementary and thus propose a hybrid balance method to further improve the performance. Our impartial multi-task learning (IMTL) can be end-to-end trained without any heuristic hyper-parameter tuning, and is general to be applied on all kinds of losses without any distribution assumption. Moreover, our IMTL can converge to similar results even when the task losses are designed to have different scales, and thus it is scale-invariant. We extensively evaluate our IMTL on the standard MTL benchmarks including Cityscapes, NYUv2 and CelebA. It outperforms existing loss weighting methods under the same experimental settings.

AAAI Conference 2020 Conference Paper

Adaptive Unimodal Cost Volume Filtering for Deep Stereo Matching

  • Youmin Zhang
  • Yimin Chen
  • Xiao Bai
  • Suihanjin Yu
  • Kun Yu
  • Zhiwei Li
  • Kuiyuan Yang

State-of-the-art deep learning based stereo matching approaches treat disparity estimation as a regression problem, where loss function is directly defined on true disparities and their estimated ones. However, disparity is just a byproduct of a matching process modeled by cost volume, while indirectly learning cost volume driven by disparity regression is prone to overfitting since the cost volume is under constrained. In this paper, we propose to directly add constraints to the cost volume by filtering cost volume with unimodal distribution peaked at true disparities. In addition, variances of the unimodal distributions for each pixel are estimated to explicitly model matching uncertainty under different contexts. The proposed architecture achieves state-ofthe-art performance on Scene Flow and two KITTI stereo benchmarks. In particular, our method ranked the 1st place of KITTI 2012 evaluation and the 4th place of KITTI 2015 evaluation (recorded on 2019. 8. 20). The codes of AcfNet are available at: https: //github. com/youmi-zym/AcfNet.

AAAI Conference 2019 Conference Paper

Learning Segmentation Masks with the Independence Prior

  • Songmin Dai
  • Xiaoqiang Li
  • Lu Wang
  • Pin Wu
  • Weiqin Tong
  • Yimin Chen

An instance with a bad mask might make a composite image that uses it look fake. This encourages us to learn segmentation by generating realistic composite images. To achieve this, we propose a novel framework that exploits a new proposed prior called the independence prior based on Generative Adversarial Networks (GANs). The generator produces an image with multiple category-specific instance providers, a layout module and a composition module. Firstly, each provider independently outputs a category-specific instance image with a soft mask. Then the provided instances’ poses are corrected by the layout module. Lastly, the composition module combines these instances into a final image. Training with adversarial loss and penalty for mask area, each provider learns a mask that is as small as possible but enough to cover a complete category-specific instance. Weakly supervised semantic segmentation methods widely use grouping cues modeling the association between image parts, which are either artificially designed or learned with costly segmentation labels or only modeled on local pairs. Unlike them, our method automatically models the dependence between any parts and learns instance segmentation. We apply our framework in two cases: (1) Foreground segmentation on category-specific images with box-level annotation. (2) Unsupervised learning of instance appearances and masks with only one image of homogeneous object cluster (HOC). We get appealing results in both tasks, which shows the independence prior is useful for instance segmentation and it is possible to unsupervisedly learn instance masks with only one image.

YNIMG Journal 2015 Journal Article

3D MR ventricle segmentation in pre-term infants with post-hemorrhagic ventricle dilatation (PHVD) using multi-phase geodesic level-sets

  • Wu Qiu
  • Jing Yuan
  • Martin Rajchl
  • Jessica Kishimoto
  • Yimin Chen
  • Sandrine de Ribaupierre
  • Bernard Chiu
  • Aaron Fenster

Intraventricular hemorrhage (IVH) or bleed within the cerebral ventricles is a common condition among very low birth weight pre-term neonates. The prognosis for these patients is worsened should they develop progressive ventricular dilatation, i. e. , post-hemorrhagic ventricle dilatation (PHVD), which occurs in 10–30% of IVH patients. Accurate measurement of ventricular volume would be valuable information and could be used to predict PHVD and determine whether that specific patient with ventricular dilatation requires treatment. While the monitoring of PHVD in infants is typically done by repeated transfontanell 2D ultrasound (US) and not MRI, once the patient's fontanels have closed around 12–18months of life, the follow-up patient scans are done by MRI. Manual segmentation of ventricles from MR images is still seen as a gold standard. However, it is extremely time- and labor-consuming, and it also has observer variability. This paper proposes an accurate multiphase geodesic level-set segmentation algorithm for the extraction of the cerebral ventricle system of pre-term PHVD neonates from 3D T1 weighted MR images. The proposed segmentation algorithm makes use of multi-region segmentation technique associated with spatial priors built from a multi-atlas registration scheme. The leave-one-out cross validation with 19 patients with mild enlargement of ventricles and 7 hydrocephalus patients shows that the proposed method is accurate, suggesting that the proposed approach could be potentially used for volumetric and morphological analysis of the ventricle system of IVH neonatal brains in clinical practice.