Arrow Research search

Author name cluster

Yi Zhou

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

93 papers
2 author rows

Possible papers

93

AIIM Journal 2026 Journal Article

A Character-level Convolutional Recurrent Interaction Network for joint traditional Chinese medicine clinical named entity recognition and relation extraction

  • Qiang Xu
  • Zhi-hui Zhao
  • Wei-wei Liu
  • Yu Fang
  • Wen-jun Tang
  • Yi Zhou
  • Ke Zhu
  • Hai Xiang

The electronic medical record (EMR) of traditional Chinese medicine (TCM) is a crucial document for recording patients’ clinical data, structured around four main dimensions: inspection, listening and smelling, inquiry, and palpation. Analyzing these records using natural language processing holds promise for further structuring and modeling TCM medical data. Currently, deep learning-based named entity recognition is considered the prevailing method for processing TCM EMRs. However, these state-of-the-art models fail to consider the four diagnostic dimensions of TCM clinical data and their impact on entity type extraction, and do not fully capture the semantic features of ancient Chinese representations in TCM. To address these issues, we introduce a joint clinical named entity recognition and relation extraction method designed to recognize and classify clinical entities – such as location and symptom attributes – along with their associative relationships (the four diagnostic dimensions). In this study, we propose a Character-level Convolutional Recurrent Interaction Network (CCRIN), which treats the four diagnostic dimensions as relationships, locations as head entities, and symptom attributes as tail entities. CCRIN integrates Chinese character embeddings and inter-character contextual convolutional feature vectors to capture the semantic information of the ancient Chinese language, while combining entity and relation extraction with a self-attention mechanism to generate rich feature representations through multi-task dynamic interaction. This approach enables the efficient extraction of TCM entities and relations related to the four diagnostic dimensions. Empirical studies on the NYT and TCM-cases datasets demonstrate the superiority of the proposed model. In summary, the model employs a novel multi-task joint extraction method for entities and relations, grounded in the four diagnostic methods of traditional Chinese medicine; it integrates Chinese character embeddings with inter-character contextual feature vectors, and its effectiveness is validated on both publicly available and self-constructed datasets.

JBHI Journal 2026 Journal Article

A Graph Convolutional Network with Pretrained Features and Iterative Polar Coordinate Attention for Cross-Subject EEG-based Neuropsychiatric Diagnosis

  • Yin Liu
  • Jiaojiao Deng
  • Runyi Xu
  • Yi Zhou

Electroencephalography (EEG)-based diagnosis of neuropsychiatric disorders offers a non-invasive and cost-effective solution for early detection. However, robust cross-subject generalization remains a major challenge due to substantial inter-individual variability in EEG signals. To address this, we propose PreIPCA-GCN, a novel graph convolutional network that integrates pretrained temporal features and Iterative Polar Coordinate Attention (IPCA)-based brain connectivity modeling. Specifically, we utilize a modified version of LaBraM, a large-scale pretrained EEG model, to extract subject-invariant node representations. Functional brain connectivity is then characterized using Pearson correlation and cosine similarity in polar space, capturing both connectivity strength (radius) and phase synchronization (angle). To fuse these complementary cues, we introduce a dual-path IPCA mechanism, refining the adjacency matrix across iterations. PreIPCA-GCN is evaluated on six public EEG datasets covering five neuropsychiatric disorders (e.g., attention-deficit/hyperactivity disorder, Alzheimer's disease, schizophrenia), consistently demonstrating strong cross-subject accuracies under both hold-out (86.77%-95.69%) and leave-one-subject-out cross-validation (88.81%-97.49%). Comprehensive comparative results show that PreIPCA-GCN outperforms several state-of-the-art methods. Ablation studies further confirm the effectiveness of both the pretrained node features and the IPCA-based fused adjacency matrix in improving cross-subject generalization. These findings suggest PreIPCA-GCN as a robust and generalizable framework for cross-subject EEG-based neuropsychiatric diagnosis, offering strong potential for future clinical applications.
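The polar-coordinate connectivity step described in this abstract (pairing Pearson correlation with cosine similarity, then reading off a radius and an angle) can be illustrated with a generic sketch. The pairing below and the name `connectivity_polar` are illustrative assumptions, not the paper's exact formulation:

```python
import numpy as np

def connectivity_polar(signals):
    """Illustrative sketch: for each channel pair, compute Pearson
    correlation and cosine similarity, treat them as a Cartesian
    pair, and convert to polar coordinates (radius = strength,
    angle = a phase-like quantity). Assumed form, not the paper's."""
    n = signals.shape[0]
    radius = np.zeros((n, n))
    angle = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            r = np.corrcoef(signals[i], signals[j])[0, 1]  # Pearson correlation
            c = signals[i] @ signals[j] / (
                np.linalg.norm(signals[i]) * np.linalg.norm(signals[j])
            )  # cosine similarity
            radius[i, j] = np.hypot(r, c)   # connectivity strength
            angle[i, j] = np.arctan2(c, r)  # angle of the (r, c) pair
    return radius, angle

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 256))  # 4 channels, 256 samples
R, A = connectivity_polar(x)
```

On the diagonal both quantities equal 1, so the radius is sqrt(2) and the angle is pi/4; an attention mechanism such as IPCA would then refine a matrix like this across iterations.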

JBHI Journal 2026 Journal Article

Advanced Camera-Based Scoliosis Screening via Deep Learning Detection and Fusion of Trunk, Limb, and Skeleton Features

  • Ziyan Wang
  • Yi Zhou
  • Ninghui Xu
  • Yuqin Zhou
  • Heran Zhao
  • Zhiyong Chang
  • Zhigang Hu
  • Xiao Han

Scoliosis significantly impacts quality of life, highlighting the need for effective early scoliosis screening (SS) and intervention. However, current SS methods often involve physical contact, undressing, or radiation exposure. This study introduces an innovative, non-invasive SS approach utilizing a monocular RGB camera that eliminates the need for undressing, sensor attachment, and radiation exposure. We introduce a novel approach that employs Parameterized Human 3D Reconstruction (PH3DR) to reconstruct 3D human models, thereby effectively eliminating clothing obstructions; this is seamlessly integrated with an ISANet segmentation network, enhanced by our proposed Multi-Scale Fusion Attention (MSFA) module, to facilitate the segmentation of distinct human trunk and limb features (HTLF) and capture body-surface asymmetries related to scoliosis. Additionally, we propose a Swin Transformer-enhanced CMU-Pose to extract human skeleton features (HSF), identifying skeletal asymmetries crucial for SS. Finally, we develop a fusion model that integrates the HTLF and HSF, combining surface morphology and skeletal features to improve the precision of SS. The experiments demonstrated that PH3DR and MSFA significantly improved the segmentation and extraction of HTLF, whereas the Swin Transformer-based CMU-Pose substantially enhanced the extraction of HSF. Our final model achieved an F1 score (0.895 $\pm$ 0.014) comparable to the best-performing baseline model, with only 0.79% of the parameters and 1.64% of the FLOPs, while running at 36 FPS, significantly higher than the best-performing baseline model (10 FPS). Moreover, our model outperformed two spine surgeons, one less experienced and the other moderately experienced. With its patient-friendly, privacy-preserving, and easily deployable solution, this approach is particularly well-suited for early SS and routine monitoring.

AAAI Conference 2026 Conference Paper

Bidirectional Channel-selective Semantic Interaction for Semi-Supervised Medical Segmentation

  • Kaiwen Huang
  • Yizhe Zhang
  • Yi Zhou
  • Tianyang Xu
  • Tao Zhou

Semi-supervised medical image segmentation is an effective method for addressing scenarios with limited labeled data. Existing methods mainly rely on frameworks such as mean teacher and dual-stream consistency learning. These approaches often face issues like error accumulation and model structural complexity, while also neglecting the interaction between labeled and unlabeled data streams. To overcome these challenges, we propose a Bidirectional Channel-selective Semantic Interaction (BCSI) framework for semi-supervised medical image segmentation. First, we propose a Semantic-Spatial Perturbation (SSP) mechanism, which disturbs the data using two strong augmentation operations and leverages unsupervised learning with pseudo-labels from weak augmentations. Additionally, we employ consistency on the predictions from the two strong augmentations to further improve model stability and robustness. Second, to reduce noise during the interaction between labeled and unlabeled data, we propose a Channel-selective Router (CR) component, which dynamically selects the most relevant channels for information exchange. This mechanism ensures that only highly relevant features are activated, minimizing unnecessary interference. Finally, the Bidirectional Channel-wise Interaction (BCI) strategy is employed to supplement additional semantic information and enhance the representation of important channels. Experimental results on multiple benchmark 3D medical datasets demonstrate that the proposed method outperforms existing semi-supervised approaches.

AAAI Conference 2026 Conference Paper

StrokeFusion: Vector Sketch Generation via Joint Stroke-UDF Encoding and Latent Sequence Diffusion

  • Jin Zhou
  • Yi Zhou
  • Hongliang Yang
  • Pengfei Xu
  • Hui Huang

In the field of sketch generation, raster-format trained models often produce non-stroke artifacts, while vector-format trained models typically lack a holistic understanding of sketches, leading to compromised recognizability. Moreover, existing methods struggle to extract common features from similar elements (e.g., eyes of animals) appearing at varying positions across sketches. To address these challenges, we propose StrokeFusion, a two-stage framework for vector sketch generation. It contains a dual-modal sketch feature learning network that maps strokes into a high-quality latent space. This network decomposes sketches into normalized strokes and jointly encodes stroke sequences with Unsigned Distance Function (UDF) maps, representing sketches as sets of stroke feature vectors. Building upon this representation, our framework exploits a stroke-level latent diffusion model that simultaneously adjusts stroke position, scale, and trajectory during generation. This enables high-fidelity stroke generation while supporting stroke interpolation editing. Extensive experiments across multiple sketch datasets demonstrate that our framework outperforms state-of-the-art techniques, validating its effectiveness in preserving structural integrity and semantic features.

IJCAI Conference 2025 Conference Paper

A Novel Local Search Algorithm for the Vertex Bisection Minimization Problem

  • Rui Sun
  • Xinyu Wang
  • Yiyuan Wang
  • Jiangnan Li
  • Yi Zhou

The vertex bisection minimization problem (VBMP) is a fundamental graph partitioning problem with numerous real-world applications. In this study, we propose a (k, l, S)-cluster guided local search algorithm to address this challenge. First, we propose a novel (k, l, S)-cluster enumeration procedure, which is based on two key concepts: the (k, l, S)-cluster and the local cluster core. The (k, l, S)-cluster limits both the connectivity and distinct boundaries of a given vertex set, and the local cluster core represents the most cohesive substructure within a (k, l, S)-cluster. Building upon the above (k, l, S)-cluster enumeration procedure, we present a novel (k, l, S)-cluster guided perturbation mechanism designed to escape from local optima. Next, we propose a two-manner local search procedure that employs two distinct search models to explore the neighboring search space efficiently. Experimental results demonstrate that the proposed algorithm performs best on nearly all instances.

IJCAI Conference 2025 Conference Paper

A Reduction-Based Algorithm for the Clique Interdiction Problem

  • Chenghao Zhu
  • Yi Zhou
  • Haoyu Jiang

The Clique Interdiction Problem (CIP) aims to minimize the size of the largest clique in a given graph by removing a given number of vertices. The CIP models a special Stackelberg game and has important applications in fields such as pandemic control and terrorist identification. However, the CIP is a bilevel graph optimization problem, making it very challenging to solve. Recently, data reduction techniques have been successfully applied in many (single-level) graph optimization problems like vertex cover. Motivated by this, we investigate a set of novel reduction rules and design a reduction-based algorithm, RECIP, for practically solving the CIP. RECIP enjoys an effective preprocessing procedure that systematically reduces the input graph, making the problem much easier to solve. Extensive experiments on 124 large real-world networks demonstrate the superior performance of RECIP and validate the effectiveness of the proposed reduction rules.

TMLR Journal 2025 Journal Article

Adaptive Gradient Normalization and Independent Sampling for (Stochastic) Generalized-Smooth Optimization

  • Yufeng Yang
  • Erin E. Tripp
  • Yifan Sun
  • Shaofeng Zou
  • Yi Zhou

Recent studies have shown that many nonconvex machine learning problems satisfy a generalized-smooth condition that extends beyond traditional smooth nonconvex optimization. However, the existing algorithms are not fully adapted to such generalized-smooth nonconvex geometry and encounter significant technical limitations on their convergence analysis. In this work, we first analyze the convergence of adaptively normalized gradient descent under function geometries characterized by generalized-smoothness and the generalized PL condition, revealing the advantage of adaptive gradient normalization. Our results provide theoretical insights into adaptive normalization across various scenarios. For stochastic generalized-smooth nonconvex optimization, we propose the Independent-Adaptively Normalized Stochastic Gradient Descent algorithm, which leverages adaptive gradient normalization, independent sampling, and gradient clipping to achieve an $\mathcal{O}(\epsilon^{-4})$ sample complexity under relaxed noise assumptions. Experiments on large-scale nonconvex generalized-smooth problems demonstrate the fast convergence of our algorithm.
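In its simplest form, the adaptive gradient normalization studied here divides each step by the gradient norm plus a small shift, which keeps steps bounded on generalized-smooth objectives whose effective smoothness grows with the gradient. The sketch below is a generic illustration; the objective, hyperparameters, and the name `normalized_gd` are assumptions, not the paper's algorithm:

```python
import numpy as np

def normalized_gd(grad, x0, lr=0.1, beta=1e-8, steps=200):
    """Normalized gradient descent (generic sketch): the update is
    x <- x - lr * g / (||g|| + beta), so the step length is at most
    lr regardless of how large the raw gradient gets."""
    x = np.asarray(x0, dtype=float)
    for _ in range(steps):
        g = grad(x)
        x = x - lr * g / (np.linalg.norm(g) + beta)
    return x

# f(x) = ||x||^4 is generalized-smooth but not globally
# Lipschitz-smooth; its gradient is 4 * ||x||^2 * x.
sol = normalized_gd(lambda x: 4 * np.dot(x, x) * x, x0=[3.0, -2.0])
```

Because each step has length at most `lr`, the iterate cannot blow up even though the raw gradient grows cubically in the distance from the origin.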

TMLR Journal 2025 Journal Article

Convergence Guarantees for RMSProp and Adam in Generalized-smooth Non-convex Optimization with Affine Noise Variance

  • Qi Zhang
  • Yi Zhou
  • Shaofeng Zou

This paper provides the first tight convergence analyses for RMSProp and Adam for non-convex optimization under the most relaxed assumptions of coordinate-wise generalized smoothness and affine noise variance. We first analyze RMSProp, a special case of Adam with adaptive learning rates but without first-order momentum. Specifically, to resolve the challenges arising from the dependence among the adaptive update, the unbounded gradient estimate, and the Lipschitz constant, we demonstrate that the first-order term in the descent lemma converges and that its denominator is upper bounded by a function of the gradient norm. Based on this result, we show that RMSProp with proper hyperparameters converges to an $\epsilon$-stationary point with an iteration complexity of $\mathcal O(\epsilon^{-4})$. We then generalize our analysis to Adam, where the additional challenge is the mismatch between the gradient and the first-order momentum. We develop a new upper bound on the first-order term in the descent lemma, which is also a function of the gradient norm. We show that Adam with proper hyperparameters converges to an $\epsilon$-stationary point with an iteration complexity of $\mathcal O(\epsilon^{-4})$. Our complexity results for both RMSProp and Adam match the complexity lower bound established in Arjevani et al. (2023).
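For reference, the RMSProp update analyzed here is the standard one: an exponential average of squared gradients preconditions each step. A minimal sketch with illustrative hyperparameters (not values from the paper):

```python
import numpy as np

def rmsprop(grad, x0, lr=0.01, beta=0.999, eps=1e-8, steps=2000):
    """Plain RMSProp: Adam's adaptive learning rate without
    first-order momentum. v tracks an exponential average of
    squared gradients; the step divides by sqrt(v)."""
    x = np.asarray(x0, dtype=float)
    v = np.zeros_like(x)
    for _ in range(steps):
        g = grad(x)
        v = beta * v + (1 - beta) * g ** 2  # second-moment estimate
        x = x - lr * g / (np.sqrt(v) + eps)
    return x

sol = rmsprop(lambda x: 2 * x, x0=[5.0, -3.0])  # f(x) = ||x||^2
```

Adam adds a first-order momentum term on top of this update; the mismatch between that momentum and the raw gradient is what the paper's second analysis has to control.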

ICLR Conference 2025 Conference Paper

CryoFM: A Flow-based Foundation Model for Cryo-EM Densities

  • Yi Zhou
  • Yilai Li
  • Jing Yuan
  • Quanquan Gu

Cryo-electron microscopy (cryo-EM) is a powerful technique in structural biology and drug discovery, enabling the study of biomolecules at high resolution. Significant advancements by structural biologists using cryo-EM have led to the production of around 40k protein density maps at various resolutions. However, cryo-EM data processing algorithms have yet to fully benefit from our knowledge of biomolecular density maps, with only a few recent models being data-driven but limited to specific tasks. In this study, we present CryoFM, a foundation model designed as a generative model, learning the distribution of high-quality density maps and generalizing effectively to downstream tasks. Built on flow matching, CryoFM is trained to accurately capture the prior distribution of biomolecular density maps. Furthermore, we introduce a flow posterior sampling method that leverages CryoFM as a flexible prior for several downstream tasks in cryo-EM and cryo-electron tomography (cryo-ET) without the need for fine-tuning, achieving state-of-the-art performance on most tasks and demonstrating its potential as a foundational model for broader applications in these fields.

NeurIPS Conference 2025 Conference Paper

E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization

  • Wenpu Li
  • Bangyan Liao
  • Yi Zhou
  • Pian Wan
  • Peidong Liu

The estimation of optical flow and 6-DoF ego-motion—two fundamental tasks in 3-D vision—has typically been addressed independently. For neuromorphic vision (e.g., event cameras), however, the lack of robust data association makes solving the two problems separately an ill-posed challenge, especially in the absence of supervision via ground truth. Existing works mitigate this ill-posedness by either enforcing the smoothness of the flow field via an explicit variational regularizer or leveraging explicit structure-and-motion priors in the parametrization to improve event alignment. The former notably introduces bias in results and computational overhead, while the latter—which parametrizes the optical flow in terms of the scene depth and the camera motion—often converges to suboptimal local minima. To address these issues, we propose an unsupervised pipeline that jointly optimizes egomotion and flow via implicit spatial-temporal and geometric regularization. First, by modeling the camera's egomotion as a continuous spline and optical flow as an implicit neural representation, our method inherently embeds spatial-temporal coherence through inductive biases. Second, we incorporate structure-and-motion priors through differential geometric constraints, bypassing explicit depth estimation while maintaining rigorous geometric consistency. As a result, our framework (called \textbf{E-MoFlow}) unifies egomotion and optical flow estimation via implicit regularization under a fully unsupervised paradigm. Experiments demonstrate its versatility in general 6-DoF motion scenarios, achieving state-of-the-art performance among unsupervised methods and remaining competitive even with supervised approaches. Code will be released upon acceptance.

TMLR Journal 2025 Journal Article

Joint Diffusion for Universal Hand-Object Grasp Generation

  • Jinkun Cao
  • Jingyuan Liu
  • Kris Kitani
  • Yi Zhou

Predicting and generating human hand grasps over objects is critical for animation and robotic tasks. In this work, we focus on generating both the hand and the object in a grasp with a single diffusion model. Our proposed Joint Hand-Object Diffusion (JHOD) models the hand and object in a unified latent representation. It uses hand-object grasping data to learn to accommodate hand and object to form plausible grasps. Also, to enforce generalizability over diverse object shapes, it leverages large-scale object datasets to learn an inclusive object latent embedding. With or without a given object as an optional condition, the diffusion model can generate grasps unconditionally or conditioned on the object. Compared to the usual practice of learning object-conditioned grasp generation from only hand-object grasp data, our method benefits from the more diverse object data used for training and handles grasp generation more universally. According to both qualitative and quantitative experiments, both conditional and unconditional generation of hand grasps achieve good visual plausibility and diversity. With the extra inclusiveness of the object representation learned from large-scale object datasets, the proposed method generalizes well to unseen object shapes.

IJCAI Conference 2025 Conference Paper

Large-Scale Trade-Off Curve Computation for Incentive Allocation with Cardinality and Matroid Constraints

  • Yu Cong
  • Chao Xu
  • Yi Zhou

We consider a large-scale incentive allocation problem where the entire trade-off curve between budget and profit has to be maintained approximately at all times. The application originally comes from assigning coupons to users of ride-sharing apps, where each user can have a limit on the number of coupons assigned. We consider a more general form, where the coupons for each user form a matroid, and the coupons assigned to each user must constitute an independent set. We show that the entire trade-off curve can be maintained approximately in near real time.

AAAI Conference 2025 Conference Paper

Make Domain Shift a Catastrophic Forgetting Alleviator in Class-Incremental Learning

  • Wei Chen
  • Yi Zhou

In the realm of class-incremental learning (CIL), alleviating the catastrophic forgetting problem is a pivotal challenge. This paper discovers a counter-intuitive observation: by incorporating domain shift into CIL tasks, the forgetting rate is significantly reduced. Our comprehensive studies demonstrate that incorporating domain shift leads to a clearer separation in the feature distribution across tasks and helps reduce parameter interference during the learning process. Inspired by this observation, we propose a simple yet effective method named DisCo to deal with CIL tasks. DisCo introduces a lightweight prototype pool that utilizes contrastive learning to promote distinct feature distributions for the current task relative to previous ones, effectively mitigating interference across tasks. DisCo can be easily integrated into existing state-of-the-art class-incremental learning methods. Experimental results show that incorporating our method into various CIL methods achieves substantial performance improvements, validating the benefits of our approach in enhancing class-incremental learning by separating feature representation and reducing interference. These findings illustrate that DisCo can serve as a robust approach for future research in class-incremental learning.

IROS Conference 2025 Conference Paper

MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM

  • Yinlong Bai
  • Hongxin Zhang
  • Sheng Zhong
  • Junkai Niu
  • Hai Li
  • Yijia He
  • Yi Zhou

Recent advancements in 3D Gaussian Splatting (3DGS) have made a significant impact on rendering and reconstruction techniques. Current research predominantly focuses on improving rendering performance and reconstruction quality using high-performance desktop GPUs, largely overlooking applications for embedded platforms like micro air vehicles (MAVs). These devices, with their limited computational resources and memory, often face a trade-off between system performance and reconstruction quality. In this paper, we improve existing methods in terms of GPU memory usage while enhancing rendering quality. Specifically, to address redundant 3D Gaussian primitives in SLAM, we propose merging them in voxel space based on geometric similarity. This reduces GPU memory usage without impacting system runtime performance. Furthermore, rendering quality is improved by initializing 3D Gaussian primitives via Patch-Grid (PG) point sampling, enabling more accurate modeling of the entire scene. Quantitative and qualitative evaluations on publicly available datasets demonstrate the effectiveness of our improvements.

TMLR Journal 2025 Journal Article

On Learning Representations for Tabular Data Distillation

  • Inwon Kang
  • Parikshit Ram
  • Yi Zhou
  • Horst Samulowitz
  • Oshani Seneviratne

Dataset distillation generates a small set of information-rich instances from a large dataset, resulting in reduced storage requirements, privacy or copyright risks, and computational costs for downstream modeling, though much of the research has focused on the image data modality. We study tabular data distillation, which brings in novel challenges such as the inherent feature heterogeneity and the common use of non-differentiable learning models (such as decision tree ensembles and nearest-neighbor predictors). To mitigate these challenges, we present $\texttt{TDColER}$, a tabular data distillation framework via column embeddings-based representation learning. To evaluate this framework, we also present a tabular data distillation benchmark, ${{\sf \small TDBench}}$. Based on an elaborate evaluation on ${{\sf \small TDBench}}$, resulting in 226,200 distilled datasets and 541,980 models trained on them, we demonstrate that $\texttt{TDColER}$ is able to boost the distilled data quality of off-the-shelf distillation schemes by 0.5-143% across 7 different tabular learning models. All of the code used in the experiments can be found at http://github.com/inwonakng/tdbench

TMLR Journal 2025 Journal Article

Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality

  • Shaocong Ma
  • Ziyi Chen
  • Yi Zhou
  • Heng Huang

The goal of robust constrained reinforcement learning (RL) is to optimize an agent's performance under the worst-case model uncertainty while satisfying safety or resource constraints. In this paper, we demonstrate that strong duality does not generally hold in robust constrained RL, indicating that traditional primal-dual methods may fail to find optimal feasible policies. To overcome this limitation, we propose a novel primal-only algorithm called Rectified Robust Policy Optimization (RRPO), which operates directly on the primal problem without relying on dual formulations. We provide theoretical convergence guarantees under mild regularity assumptions, showing convergence to an approximately optimal feasible policy with iteration complexity matching the best-known lower bound when the uncertainty set diameter is controlled at a specific level. Empirical results in a grid-world environment validate the effectiveness of our approach, demonstrating that RRPO achieves robust and safe performance under model uncertainties while the non-robust method can violate the worst-case safety constraints.

AAAI Conference 2024 Conference Paper

A Fast Exact Solver with Theoretical Analysis for the Maximum Edge-Weighted Clique Problem

  • Lu Liu
  • Mingyu Xiao
  • Yi Zhou

The maximum vertex-weighted clique problem (MVWCP) and the maximum edge-weighted clique problem (MEWCP) are two natural extensions of the fundamental maximum clique problem. In this paper, we systematically study MEWCP and make the following major contributions: (1) We show that MEWCP is NP-hard even when the minimum degree of the graph is n-2, in contrast to MVWCP which is polynomial-time solvable when the minimum degree of the graph is at least n-3. This result distinguishes the complexity of the two problems for the first time. (2) To address MEWCP, we develop an efficient branch-and-bound algorithm called MEWCat with both practical and theoretical performance guarantees. In practice, MEWCat utilizes a new upper bound tighter than existing ones, which allows for more efficient pruning of branches. In theory, we prove a running-time bound of O*(1.4423^n) for MEWCat, which breaks the trivial bound of O*(2^n) in the research line of practical exact MEWCP solvers for the first time. (3) Empirically, we evaluate the performance of MEWCat on various benchmark instances. The experiments demonstrate that MEWCat outperforms state-of-the-art exact solvers significantly. For instance, on 16 DIMACS graphs that the state-of-the-art solver BBEWC fails to solve within 7200 seconds, MEWCat solves all of them with an average time of less than 1000 seconds. On real-world graphs, MEWCat achieves an average speedup of over 36x.

UAI Conference 2024 Conference Paper

Approximate Kernel Density Estimation under Metric-based Local Differential Privacy

  • Yi Zhou
  • Yanhao Wang 0001
  • Long Teng
  • Qiang Huang
  • Cen Chen 0001

Kernel Density Estimation (KDE) is a fundamental problem with broad machine learning applications. In this paper, we investigate the KDE problem under Local Differential Privacy (LDP), a setting in which users privatize data on their own devices before sending them to an untrusted server for analytics. To strike a balance between ensuring local privacy and preserving high-utility KDE results, we adopt a relaxed definition of LDP based on metrics (mLDP), which is suitable when data points are represented in a metric space and can be more distinguishable as their distances increase. To the best of our knowledge, approximate KDE under mLDP has not been explored in the existing literature. We propose the mLDP-KDE framework, which augments a locality-sensitive hashing-based sketch method to provide mLDP and answer any KDE query unbiasedly within an additive error with high probability in sublinear time and space. Extensive experimental results demonstrate that the mLDP-KDE framework outperforms several existing KDE methods under LDP and mLDP by achieving significantly better trade-offs between privacy and utility, with particularly remarkable advantages on large, high-dimensional data.
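For context, the quantity that the mLDP-KDE framework approximates is the standard kernel density estimate. A textbook, non-private Gaussian-kernel version in one dimension (the function name and bandwidth choice are illustrative, not from the paper):

```python
import math

def gaussian_kde(data, query, bandwidth=1.0):
    """Exact (non-private) Gaussian kernel density estimate at a
    single query point: the average of Gaussian bumps centered at
    the data points."""
    n = len(data)
    total = sum(
        math.exp(-((query - x) ** 2) / (2 * bandwidth ** 2)) for x in data
    )
    return total / (n * math.sqrt(2 * math.pi) * bandwidth)

density = gaussian_kde([0.0, 1.0, 2.0], query=1.0)
```

A local-privacy mechanism cannot evaluate this sum over raw data; users privatize their contributions first, and the sketch-based mLDP-KDE method recovers an unbiased estimate of this quantity in sublinear time and space.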

NeurIPS Conference 2024 Conference Paper

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

  • Junho Myung
  • Nayeon Lee
  • Yi Zhou
  • Jiho Jin
  • Rifki A. Putri
  • Dimosthenis Antypas
  • Hsuvas Borkakoty
  • Eunsu Kim

Large language models (LLMs) often lack culture-specific everyday knowledge, especially across diverse regions and non-English languages. Existing benchmarks for evaluating LLMs' cultural sensitivities are usually limited to a single language or online sources like Wikipedia, which may not reflect the daily habits, customs, and lifestyles of different regions. That is, information about the food people eat for their birthday celebrations, spices they typically use, musical instruments youngsters play or the sports they practice in school is not always explicitly written online. To address this issue, we introduce BLEnD, a hand-crafted benchmark designed to evaluate LLMs' everyday knowledge across diverse cultures and languages. The benchmark comprises 52.6k question-answer pairs from 16 countries/regions, in 13 different languages, including low-resource ones such as Amharic, Assamese, Azerbaijani, Hausa, and Sundanese. We evaluate LLMs in two formats: short-answer questions, and multiple-choice questions. We show that LLMs perform better in cultures that are more present online, with a maximum 57.34% difference in GPT-4, the best-performing model, in the short-answer format. Furthermore, we find that LLMs perform better in their local languages for mid-to-high-resource languages. Interestingly, for languages deemed to be low-resource, LLMs provide better answers in English. We make our dataset publicly available at: https://github.com/nlee0212/BLEnD.

NeurIPS Conference 2024 Conference Paper

DMesh: A Differentiable Mesh Representation

  • Sanghyun Son
  • Matheus Gadelha
  • Yang Zhou
  • Zexiang Xu
  • Ming C. Lin
  • Yi Zhou

We present DMesh, a differentiable representation for general 3D triangular meshes. DMesh considers both the geometry and connectivity information of a mesh. In our design, we first obtain a set of convex tetrahedra that compactly tessellates the domain based on Weighted Delaunay Triangulation (WDT), and select triangular faces on the tetrahedra to define the final mesh. We formulate the probability that faces exist on the actual surface in a differentiable manner based on the WDT. This enables DMesh to represent meshes of various topologies in a differentiable way, and allows us to reconstruct the mesh under various observations, such as point clouds and multi-view images, using gradient-based optimization. We publicize the source code and supplementary material at our project page (https://sonsang.github.io/dmesh-project).

AAAI Conference 2024 Short Paper

Effective Data Distillation for Tabular Datasets (Student Abstract)

  • Inwon Kang
  • Parikshit Ram
  • Yi Zhou
  • Horst Samulowitz
  • Oshani Seneviratne

Data distillation is a technique for reducing a large dataset into a smaller one, such that a model trained on the smaller dataset performs comparably to a model trained on the full dataset. Past works have examined this approach for image datasets, focusing on neural networks as target models. However, tabular datasets pose new challenges not seen in images: a sample in a tabular dataset is a one-dimensional vector, unlike the two- (or three-) dimensional pixel grid of images, and non-NN models such as XGBoost can often outperform neural network (NN) based models. Our contribution in this work is two-fold: 1) we show that data distillation methods from images do not translate directly to tabular data; 2) we propose a new distillation method that consistently outperforms the baseline for multiple different models, including non-NN models such as XGBoost.

NeurIPS Conference 2024 Conference Paper

Is Your HD Map Constructor Reliable under Sensor Corruptions?

  • Xiaoshuai Hao
  • Mengchuan Wei
  • Yifan Yang
  • Haimei Zhao
  • Hui Zhang
  • Yi Zhou
  • Qiang Wang
  • Weiming Li

Driving systems often rely on high-definition (HD) maps for precise environmental information, which is crucial for planning and navigation. While current HD map constructors perform well under ideal conditions, their resilience to real-world challenges, e.g., adverse weather and sensor failures, is not well understood, raising safety concerns. This work introduces MapBench, the first comprehensive benchmark designed to evaluate the robustness of HD map construction methods against various sensor corruptions. Our benchmark encompasses a total of 29 types of corruptions that occur from cameras and LiDAR sensors. Extensive evaluations across 31 HD map constructors reveal significant performance degradation of existing methods under adverse weather conditions and sensor failures, underscoring critical safety concerns. We identify effective strategies for enhancing robustness, including innovative approaches that leverage multi-modal fusion, advanced data augmentation, and architectural techniques. These insights provide a pathway for developing more reliable HD map construction methods, which are essential for the advancement of autonomous driving technology. The benchmark toolkit and affiliated code and model checkpoints have been made publicly accessible.

AAAI Conference 2024 Conference Paper

Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

  • Qi Zhang
  • Yi Zhou
  • Ashley Prater-Bennette
  • Lixin Shen
  • Shaofeng Zou

Distributionally robust optimization (DRO) is a powerful framework for training models that are robust to data distribution shifts. This paper focuses on constrained DRO, which has an explicit characterization of the robustness level. Existing studies on constrained DRO mostly focus on convex loss functions and exclude the practical and challenging case of non-convex loss functions, e.g., neural networks. This paper develops a stochastic algorithm and its performance analysis for non-convex constrained DRO. The computational complexity of our stochastic algorithm at each iteration is independent of the overall dataset size, and thus is suitable for large-scale applications. We focus on uncertainty sets defined by the general Cressie-Read family of divergences, which includes $\chi^2$-divergences as a special case. We prove that our algorithm finds an $\epsilon$-stationary point with improved computational complexity over existing methods. Our method also applies to the smoothed conditional value at risk (CVaR) DRO.
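As a toy anchor for one of the robustness notions mentioned in the abstract: the (un-smoothed) CVaR objective simply averages the worst $\alpha$-fraction of per-sample losses. A minimal sketch, illustrative only and not the paper's constrained algorithm:

```python
import numpy as np

def cvar(losses, alpha=0.1):
    """Conditional value at risk: mean of the worst alpha-fraction of
    per-sample losses (sorted descending, take the top ceil(alpha*n))."""
    losses = np.sort(np.asarray(losses))[::-1]          # descending
    k = max(1, int(np.ceil(alpha * losses.size)))       # worst alpha-fraction
    return losses[:k].mean()

losses = np.array([0.1, 0.2, 0.3, 4.0, 0.2, 0.1, 0.5, 0.3, 0.2, 0.1])
print(cvar(losses, alpha=0.1))  # -> 4.0 (the single worst sample)
print(cvar(losses, alpha=0.5))  # mean of the 5 largest losses
```

Minimizing this instead of the plain average forces the model to care about its worst-case samples, which is the basic intuition behind DRO-style training.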

NeurIPS Conference 2024 Conference Paper

LRM-Zero: Training Large Reconstruction Models with Synthesized Data

  • Desai Xie
  • Sai Bi
  • Zhixin Shu
  • Kai Zhang
  • Zexiang Xu
  • Yi Zhou
  • Sören Pirk
  • Arie Kaufman

We present LRM-Zero, a Large Reconstruction Model (LRM) trained entirely on synthesized 3D data, achieving high-quality sparse-view 3D reconstruction. The core of LRM-Zero is our procedural 3D dataset, Zeroverse, which is automatically synthesized from simple primitive shapes with random texturing and augmentations (e.g., height fields, boolean differences, and wireframes). Unlike previous 3D datasets (e.g., Objaverse) which are often captured or crafted by humans to approximate real 3D data, Zeroverse completely ignores realistic global semantics but is rich in complex geometric and texture details that are locally similar to or even more intricate than real objects. We demonstrate that our LRM-Zero, trained with our fully synthesized Zeroverse, can achieve high visual quality in the reconstruction of real-world objects, competitive with models trained on Objaverse. We also analyze several critical design choices of Zeroverse that contribute to LRM-Zero's capability and training stability. Our work demonstrates that 3D reconstruction, one of the core tasks in 3D vision, can potentially be addressed without the semantics of real-world objects. The Zeroverse's procedural synthesis code and interactive visualization are available at: https://desaixie.github.io/lrm-zero/.

JBHI Journal 2024 Journal Article

Using Pupil Diameter for Psychological Resilience Assessment in Medical Students Based on SVM and SHAP Model

  • Fayang Xiang
  • Li Zhang
  • Yidan Ye
  • Chuyue Xiong
  • Yanjie Zhang
  • Yan Hu
  • Jiang Du
  • Yi Zhou

Effectively assessing psychological resilience for medical students is vital for identifying at-risk individuals and developing tailored interventions. At present, few studies have combined physiological indexes of the human body and machine learning for psychological resilience assessment. This study presents a novel approach that employs pupil diameter features and machine learning to predict psychological resilience risk objectively. Firstly, we designed a stimulus paradigm (via auditory and visual stimuli) and collected pupil diameter data from participants using eye-tracking technology. Secondly, the pupil data was preprocessed, including linear interpolation, blink detection, and subtractive baseline correction. Thirdly, statistical metrics were extracted and optimal feature subsets were obtained by Recursive Feature Elimination with Cross-Validation (RFECV). Subsequently, the classification models, including Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM), and eXtreme Gradient Boosting (XGBoost), were trained. The experimental results show that the SVM model has the best performance, and its balanced accuracy, recall, and AUC reach 0.906, 0.89, and 0.932, respectively. Finally, we leveraged the Shapley additive explanation (SHAP) model for interpretability analysis. It revealed that auditory stimuli have a more significant effect than visual stimuli in psychological resilience assessment. These findings suggested that pupil diameter could be a vital metric for assessing psychological resilience.
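The modeling stage described in the abstract (RFECV feature selection followed by an SVM) can be sketched with scikit-learn; the features below are synthetic stand-ins, not pupil-diameter measurements:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFECV
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

# Synthetic stand-in data: 200 samples, 20 features, 5 informative.
X, y = make_classification(n_samples=200, n_features=20, n_informative=5,
                           random_state=0)

# RFECV needs an estimator exposing coef_ or feature_importances_,
# so a linear-kernel SVM drives the recursive elimination.
selector = RFECV(SVC(kernel="linear"), step=1, cv=5).fit(X, y)
X_sel = selector.transform(X)

# Train/evaluate the final classifier on the selected feature subset.
scores = cross_val_score(SVC(kernel="rbf"), X_sel, y, cv=5)
print(selector.n_features_, round(scores.mean(), 3))
```

This mirrors the pipeline shape only; the paper's preprocessing (interpolation, blink detection, baseline correction) happens before this stage.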

TMLR Journal 2023 Journal Article

A Cubic Regularization Approach for Finding Local Minimax Points in Nonconvex Minimax Optimization

  • Ziyi Chen
  • Zhengyang Hu
  • Qunwei Li
  • Zhe Wang
  • Yi Zhou

Gradient descent-ascent (GDA) is a widely used algorithm for minimax optimization. However, GDA has been proven to converge to stationary points for nonconvex minimax optimization, which are suboptimal compared with local minimax points. In this work, we develop cubic regularization (CR) type algorithms that globally converge to local minimax points in nonconvex-strongly-concave minimax optimization. We first show that local minimax points are equivalent to second-order stationary points of a certain envelope function. Then, inspired by the classic cubic regularization algorithm, we propose an algorithm named Cubic-LocalMinimax for finding local minimax points, and provide a comprehensive convergence analysis by leveraging its intrinsic potential function. Specifically, we establish the global convergence of Cubic-LocalMinimax to a local minimax point at a sublinear convergence rate and characterize its iteration complexity. Also, we propose a GDA-based solver for solving the cubic subproblem involved in Cubic-LocalMinimax up to certain pre-defined accuracy, and analyze the overall gradient and Hessian-vector product computation complexities of such an inexact Cubic-LocalMinimax algorithm. Moreover, we propose a stochastic variant of Cubic-LocalMinimax for large-scale minimax optimization, and characterize its sample complexity under stochastic sub-sampling. Experimental results demonstrate faster or comparable convergence speed of our stochastic Cubic-LocalMinimax than the state-of-the-art algorithms such as GDA and Minimax Cubic-Newton. In particular, our stochastic Cubic-LocalMinimax was also faster than several other algorithms for minimax optimization on a particular adversarial loss for training a convolutional neural network on MNIST.

IJCAI Conference 2023 Conference Paper

A Fast Maximum k-Plex Algorithm Parameterized by the Degeneracy Gap

  • Zhengren Wang
  • Yi Zhou
  • Chunyu Luo
  • Mingyu Xiao

Given a graph, a k-plex is a vertex set in which each vertex is non-adjacent to at most k-1 other vertices in the set. The maximum k-plex problem, which asks for the largest k-plex in a given graph, is an important but computationally challenging problem in applications like graph search and community detection. So far, there are a number of empirical algorithms without sufficient theoretical explanation of their efficiency. We try to bridge this gap by defining a novel parameter of the input instance, g_k(G), the gap between the degeneracy bound and the size of the maximum k-plex in the given graph, and presenting an exact algorithm parameterized by g_k(G). In other words, we design an algorithm whose running time is polynomial in the size of the input graph and exponential in g_k(G), where k is a constant. Usually, g_k(G) is small and bounded by O(log(|V|)) in real-world graphs, indicating that the algorithm runs in polynomial time. We also carry out massive experiments and show that the algorithm is competitive with the state-of-the-art solvers. Additionally, for large k values such as 15 and 20, our algorithm has superior performance over existing algorithms.
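The k-plex definition above is easy to check directly; a minimal sketch on a toy graph (illustrative, not from the paper):

```python
def is_k_plex(adj, S, k):
    """Check whether vertex set S is a k-plex in the graph given by the
    adjacency-set dict `adj`: every vertex of S may be non-adjacent to
    at most k-1 other vertices of S."""
    S = set(S)
    for v in S:
        non_neighbors = sum(1 for u in S if u != v and u not in adj[v])
        if non_neighbors > k - 1:
            return False
    return True

# Toy graph: the 4-cycle 0-1-2-3-0.
adj = {0: {1, 3}, 1: {0, 2}, 2: {1, 3}, 3: {0, 2}}
print(is_k_plex(adj, {0, 1, 2, 3}, 2))  # each vertex misses one other -> True
print(is_k_plex(adj, {0, 1, 2, 3}, 1))  # a 1-plex is a clique -> False
```

Note that a 1-plex is exactly a clique, so the maximum k-plex problem generalizes maximum clique and inherits its hardness.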

JBHI Journal 2023 Journal Article

A Spatiotemporal Graph Attention Network Based on Synchronization for Epileptic Seizure Prediction

  • Yao Wang
  • Yufei Shi
  • Yinlin Cheng
  • Zhipeng He
  • Xiaoyan Wei
  • Ziyi Chen
  • Yi Zhou

Accurate early prediction of epileptic seizures can provide timely treatment for patients. Previous studies have mainly focused on a single temporal or spatial dimension, making it difficult to take both relationships into account. Therefore, the effective properties of electroencephalograms (EEGs) may not be fully evaluated. To solve this problem, we propose a spatiotemporal graph attention network (STGAT) based on synchronization. The spatial and functional connectivity information between EEG channels was first extracted using phase locking values (PLVs), which allowed multichannel EEG signals to be modeled as graph signals. Afterward, the STGAT model was used to dynamically learn the temporal correlation properties of EEG sequences and explore the spatial topological structure information of multiple channels. Experimental results demonstrated that the STGAT model was able to obtain spatiotemporal correlations and achieve good results on two benchmark datasets. The accuracy, specificity, and sensitivity were 98.74%, 99.21%, and 98.87%, respectively, on the CHB-MIT dataset. Moreover, all evaluation indices on the private dataset reached more than 98.8%, with the area under the curve (AUC) reaching 99.96%. The proposed method is superior or comparable to the state-of-the-art models. Extensive experiments demonstrate that our end-to-end automatic seizure prediction model can be extended to design clinical assistant decision systems.
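The phase locking value mentioned above has a compact formula, $|\mathrm{mean}(e^{i\,\Delta\phi})|$; a minimal sketch on synthetic phase series (in the paper the phases come from EEG channels):

```python
import numpy as np

def plv(phase_x, phase_y):
    """Phase locking value of two phase time series:
    |mean(exp(i*(phase_x - phase_y)))|, a number in [0, 1]."""
    diff = np.asarray(phase_x) - np.asarray(phase_y)
    return np.abs(np.mean(np.exp(1j * diff)))

t = np.linspace(0, 1, 1000, endpoint=False)
locked = 2 * np.pi * 10 * t        # 10 Hz oscillation (instantaneous phase)
lagged = locked + 0.7              # constant phase lag -> perfect locking
rng = np.random.default_rng(0)
random_phase = rng.uniform(0, 2 * np.pi, t.size)

print(plv(locked, lagged))               # -> 1.0 (constant phase difference)
print(plv(locked, random_phase) < 0.2)   # uniform random phases: near 0
```

Computing this PLV for every pair of channels yields the weighted graph over electrodes that the STGAT then operates on.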

TMLR Journal 2023 Journal Article

Assisted Learning for Organizations with Limited Imbalanced Data

  • Cheng Chen
  • Jiaying Zhou
  • Jie Ding
  • Yi Zhou

In the era of big data, many big organizations are integrating machine learning into their work pipelines to facilitate data analysis. However, the performance of their trained models is often restricted by limited and imbalanced data available to them. In this work, we develop an assisted learning framework for assisting organizations to improve their learning performance. The organizations have sufficient computation resources but are subject to stringent data-sharing and collaboration policies. Their limited imbalanced data often cause biased inference and sub-optimal decision-making. In assisted learning, an organizational learner purchases assistance service from an external service provider and aims to enhance its model performance within only a few assistance rounds. We develop effective stochastic training algorithms for both assisted deep learning and assisted reinforcement learning. Different from existing distributed algorithms that need to frequently transmit gradients or models, our framework allows the learner to only occasionally share information with the service provider, but still obtain a model that achieves near-oracle performance as if all the data were centralized.

JBHI Journal 2023 Journal Article

Blind Super-Resolution of 3D MRI via Unsupervised Domain Transformation

  • Hexiang Zhou
  • Yawen Huang
  • Yuexiang Li
  • Yi Zhou
  • Yefeng Zheng

High-resolution medical images can be effectively used for clinical diagnosis. However, the acquisition of high-resolution images is difficult and often limited by medical instruments. Super-resolution (SR) methods provide a solution, where high-resolution (HR) images can be reconstructed from low-resolution (LR) ones. Most existing deep neural networks for 3D SR of medical images are trained in a non-blind process, where LR images are directly degraded from HR data via a pre-determined downscaling method. Such approaches rely heavily on the assumed degradation model, resulting in inevitable deviations in real clinical practice. Blind super-resolution, a more attractive research line for this field, aims to generate HR images from LR inputs containing unknown degradation. Towards generalizing SR models to diverse types of degradation, we propose a robust blind SR method for 3D medical images that operates in an unsupervised manner with domain correction and upscaling treatment. First, a CycleGAN-based architecture is implemented to generate the LR data from the source domain to the target one for domain correction. Then, an upscaling network is learned via pre-determined HR-LR couples for reconstruction. The proposed framework is able to automatically learn noisy and blurry correction kernels for unpaired 3D SR of magnetic resonance images (MRI). Our method achieves better and more robust performance in reconstructing HR images from LR MRI with multiple unknown degradation processes, and shows its superiority over other state-of-the-art supervised models and cycle-consistency based methods, especially in severe distortion cases.

AAAI Conference 2023 Conference Paper

Cross-View Geo-Localization via Learning Disentangled Geometric Layout Correspondence

  • Xiaohan Zhang
  • Xingyu Li
  • Waqas Sultani
  • Yi Zhou
  • Safwan Wshah

Cross-view geo-localization aims to estimate the location of a query ground image by matching it against a reference database of geo-tagged aerial images. As an extremely challenging task, its difficulties are rooted in the drastic view changes and different capture times between the two views. Despite these difficulties, recent works achieve outstanding progress on cross-view geo-localization benchmarks. However, existing methods still suffer from poor performance on the cross-area benchmarks, in which the training and testing data are captured from two different regions. We attribute this deficiency to the lack of ability to extract the spatial configuration of visual feature layouts and models' overfitting on low-level details from the training set. In this paper, we propose GeoDTR which explicitly disentangles geometric information from raw features and learns the spatial correlations among visual features from aerial and ground pairs with a novel geometric layout extractor module. This module generates a set of geometric layout descriptors, modulating the raw features and producing high-quality latent representations. In addition, we elaborate on two categories of data augmentations: (i) layout simulation, which varies the spatial configuration while keeping the low-level details intact, and (ii) semantic augmentation, which alters the low-level details and encourages the model to capture spatial configurations. These augmentations help to improve the performance of cross-view geo-localization models, especially on the cross-area benchmarks. Moreover, we propose a counterfactual-based learning process to benefit the geometric layout extractor in exploring spatial information. Extensive experiments show that GeoDTR not only achieves state-of-the-art results but also significantly boosts the performance on same-area and cross-area benchmarks. Our code can be found at https://gitlab.com/vail-uvm/geodtr.

JMLR Journal 2023 Journal Article

Decentralized Robust V-learning for Solving Markov Games with Model Uncertainty

  • Shaocong Ma
  • Ziyi Chen
  • Shaofeng Zou
  • Yi Zhou

The Markov game is a popular reinforcement learning framework for modeling competitive players in a dynamic environment. However, most of the existing works on Markov games focus on computing a certain equilibrium following uncertain interactions among the players but ignore the uncertainty of the environment model, which is ubiquitous in practical scenarios. In this work, we develop a theoretical solution to Markov games with environment model uncertainty. Specifically, we propose a new and tractable notion of robust correlated equilibria for Markov games with environment model uncertainty. In particular, we prove that the robust correlated equilibrium has a simple modification structure, and its characterization of equilibria critically depends on the environment model uncertainty. Moreover, we propose the first fully-decentralized stochastic algorithm for computing such a robust correlated equilibrium. Our analysis proves that the algorithm achieves the polynomial episode complexity $\widetilde{O}( SA^2 H^5 \epsilon^{-2})$ for computing an approximate robust correlated equilibrium with $\epsilon$ accuracy.

JBHI Journal 2023 Journal Article

Flexible Fusion Network for Multi-Modal Brain Tumor Segmentation

  • Hengyi Yang
  • Tao Zhou
  • Yi Zhou
  • Yizhe Zhang
  • Huazhu Fu

Automated brain tumor segmentation is crucial for aiding brain disease diagnosis and evaluating disease progress. Currently, magnetic resonance imaging (MRI) is a routinely adopted approach in the field of brain tumor segmentation that can provide different modality images. It is critical to leverage multi-modal images to boost brain tumor segmentation performance. Existing works commonly concentrate on generating a shared representation by fusing multi-modal data, while few methods take into account modality-specific characteristics. Besides, how to efficiently fuse arbitrary numbers of modalities is still a difficult task. In this study, we present a flexible fusion network (termed F$^{2}$Net) for multi-modal brain tumor segmentation, which can flexibly fuse arbitrary numbers of multi-modal information to explore complementary information while maintaining the specific characteristics of each modality. Our F$^{2}$Net is based on the encoder-decoder structure, which utilizes two Transformer-based feature learning streams and a cross-modal shared learning network to extract individual and shared feature representations. To effectively integrate the knowledge from the multi-modality data, we propose a cross-modal feature-enhanced module (CFM) and a multi-modal collaboration module (MCM), which aim at fusing the multi-modal features into the shared learning network and incorporating the features from the encoders into the shared decoder, respectively. Extensive experimental results on multiple benchmark datasets demonstrate the effectiveness of our F$^{2}$Net over other state-of-the-art segmentation methods.

JMLR Journal 2023 Journal Article

On Unbalanced Optimal Transport: Gradient Methods, Sparsity and Approximation Error

  • Quang Minh Nguyen
  • Hoang H. Nguyen
  • Yi Zhou
  • Lam M. Nguyen

We study the Unbalanced Optimal Transport (UOT) between two measures of possibly different masses with at most $n$ components, where the marginal constraints of standard Optimal Transport (OT) are relaxed via Kullback-Leibler divergence with regularization factor $\tau$. Although only Sinkhorn-based UOT solvers have been analyzed in the literature with the iteration complexity of ${O}\big(\tfrac{\tau \log(n)}{\varepsilon} \log\big(\tfrac{\log(n)}{{\varepsilon}}\big)\big)$ and per-iteration cost of $O(n^2)$ for achieving the desired error $\varepsilon$, their positively dense output transportation plans strongly hinder the practicality. On the other hand, while being vastly used as heuristics for computing UOT in modern deep learning applications and having shown success in the sparse OT problem, gradient methods applied to UOT have not been formally studied. In this paper, we propose a novel algorithm based on Gradient Extrapolation Method (GEM-UOT) to find an $\varepsilon$-approximate solution to the UOT problem in $O\big( \kappa \log\big(\frac{\tau n}{\varepsilon}\big) \big)$ iterations with $\widetilde{O}(n^2)$ per-iteration cost, where $\kappa$ is the condition number depending on only the two input measures. Our proof technique is based on a novel dual formulation of the squared $\ell_2$-norm UOT objective, which fills a gap in the sparse UOT literature and also leads to a new characterization of approximation error between UOT and OT. To this end, we further present a novel approach of OT retrieval from UOT, which is based on GEM-UOT with fine-tuned $\tau$ and a post-process projection step. Extensive experiments on synthetic and real datasets validate our theories and demonstrate the favorable performance of our methods in practice. We showcase GEM-UOT on the task of color transfer in terms of both the quality of the transfer image and the sparsity of the transportation plan.
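For orientation, the standard KL-relaxed UOT objective that such iteration bounds refer to (a textbook formulation, not quoted from the paper) is

$$\mathrm{UOT}_{\tau}(a, b) = \min_{X \ge 0} \; \langle C, X \rangle + \tau\,\mathrm{KL}(X\mathbf{1}\,\|\,a) + \tau\,\mathrm{KL}(X^{\top}\mathbf{1}\,\|\,b),$$

where $C$ is the ground cost matrix and the two KL penalty terms replace the hard marginal constraints of balanced OT, with $\tau$ controlling how strictly the row and column sums of $X$ must match the input measures $a$ and $b$.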

AAAI Conference 2022 Conference Paper

DarkVisionNet: Low-Light Imaging via RGB-NIR Fusion with Deep Inconsistency Prior

  • Shuangping Jin
  • Bingbing Yu
  • Minhao Jing
  • Yi Zhou
  • Jiajun Liang
  • Renhe Ji

RGB-NIR fusion is a promising method for low-light imaging. However, high-intensity noise in low-light images amplifies the effect of structure inconsistency between RGB-NIR images, which causes existing algorithms to fail. To handle this, we propose a new RGB-NIR fusion algorithm called Dark Vision Net (DVN) with two technical novelties: Deep Structure and Deep Inconsistency Prior (DIP). The Deep Structure extracts clear structure details in deep multiscale feature space rather than raw input space, which is more robust to noisy inputs. Based on the deep structures from both RGB and NIR domains, we introduce the DIP to leverage the structure inconsistency to guide the fusion of RGB-NIR. Benefiting from this, the proposed DVN obtains high-quality low-light images without visual artifacts. We also propose a new dataset called Dark Vision Dataset (DVD), consisting of aligned RGB-NIR image pairs, as the first public RGB-NIR fusion benchmark. Quantitative and qualitative results on the proposed benchmark show that DVN significantly outperforms other comparison algorithms in PSNR and SSIM, especially in extremely low light conditions.

IJCAI Conference 2022 Conference Paper

DDDM: A Brain-Inspired Framework for Robust Classification

  • Xiyuan Chen
  • Xingyu Li
  • Yi Zhou
  • Tianming Yang

Despite their outstanding performance in a broad spectrum of real-world tasks, deep artificial neural networks are sensitive to input noises, particularly adversarial perturbations. On the contrary, human and animal brains are much less vulnerable. In contrast to the one-shot inference performed by most deep neural networks, the brain often solves decision-making with an evidence accumulation mechanism that may trade time for accuracy when facing noisy inputs. The mechanism is well described by the Drift-Diffusion Model (DDM). In the DDM, decision-making is modeled as a process in which noisy evidence is accumulated toward a threshold. Drawing inspiration from the DDM, we propose the Dropout-based Drift-Diffusion Model (DDDM) that combines test-phase dropout and the DDM for improving the robustness of arbitrary neural networks. The dropouts create temporally uncorrelated noises in the network that counter perturbations, while the evidence accumulation mechanism guarantees a reasonable decision accuracy. Neural networks enhanced with the DDDM tested in image, speech, and text classification tasks all significantly outperform their native counterparts, demonstrating the DDDM as a task-agnostic defense against adversarial attacks.
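The accumulate-until-threshold idea can be sketched with a toy accumulator; the stochastic draws below stand in for dropout-perturbed forward passes (illustrative only, not the paper's DDDM implementation):

```python
import numpy as np

def dddm_decide(sample_logits, threshold=5.0, max_steps=100):
    """Toy evidence accumulator in the spirit of the DDDM: repeatedly draw
    noisy class scores and sum them until the leading class beats the
    runner-up by `threshold`, trading time for accuracy."""
    acc = None
    for step in range(1, max_steps + 1):
        s = sample_logits()
        acc = s if acc is None else acc + s
        top2 = np.sort(acc)[-2:]          # two largest accumulated scores
        if top2[1] - top2[0] >= threshold:
            return int(np.argmax(acc)), step
    return int(np.argmax(acc)), max_steps

rng = np.random.default_rng(0)
mean = np.array([0.0, 2.0, 0.0])  # class 1 is correct on average
label, steps = dddm_decide(lambda: mean + rng.normal(0.0, 0.5, 3))
print(label, steps)  # single draws are noisy; accumulation settles on class 1
```

In the paper the per-step noise comes from keeping dropout active at test time rather than from an explicit Gaussian, but the stopping rule plays the same role.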

JBHI Journal 2022 Journal Article

DR-GAN: Conditional Generative Adversarial Network for Fine-Grained Lesion Synthesis on Diabetic Retinopathy Images

  • Yi Zhou
  • Boyang Wang
  • Xiaodong He
  • Shanshan Cui
  • Ling Shao

Diabetic retinopathy (DR) is a complication of diabetes that severely affects the eyes. It can be graded into five levels of severity according to international protocol. However, optimizing a grading model to have strong generalizability requires a large amount of balanced training data, which is difficult to collect, particularly for the high severity levels. Typical data augmentation methods, including random flipping and rotation, cannot generate data with high diversity. In this paper, we propose a diabetic retinopathy generative adversarial network (DR-GAN) to synthesize high-resolution fundus images which can be manipulated with arbitrary grading and lesion information. Thus, large-scale generated data can be used for more meaningful augmentation to train a DR grading and lesion segmentation model. The proposed retina generator is conditioned on the structural and lesion masks, as well as adaptive grading vectors sampled from the latent grading space, which can be adopted to control the synthesized grading severity. Moreover, a multi-scale spatial and channel attention module is devised to improve the generation ability to synthesize small details. Multi-scale discriminators are designed to operate from large to small receptive fields, and joint adversarial losses are adopted to optimize the whole network in an end-to-end manner. With extensive experiments evaluated on the EyePACS dataset connected to Kaggle, as well as the FGADR dataset, we validate the effectiveness of our method, which can both synthesize highly realistic ($1280 \times 1280$) controllable fundus images and contribute to the DR grading task.

NeurIPS Conference 2022 Conference Paper

Finding Correlated Equilibrium of Constrained Markov Game: A Primal-Dual Approach

  • Ziyi Chen
  • Shaocong Ma
  • Yi Zhou

Constrained Markov game is a fundamental problem that covers many applications, where multiple players compete with each other under behavioral constraints. The existing literature has proved the existence of Nash equilibrium for constrained Markov games, which turns out to be PPAD-complete and cannot be computed in polynomial time. In this work, we propose a surrogate notion of correlated equilibrium (CE) for constrained Markov games that can be computed in polynomial time, and study its fundamental properties. We show that the modification structure of CE of constrained Markov games is fundamentally different from that of unconstrained Markov games. Moreover, we prove that the corresponding Lagrangian function has zero duality gap. Based on this result, we develop the first primal-dual algorithm that provably converges to CE of constrained Markov games. In particular, we prove that both the duality gap and the constraint violation of the output policy converge at the rate $\mathcal{O}(\frac{1}{\sqrt{T}})$. Moreover, when adopting the V-learning algorithm as the subroutine in the primal update, our algorithm achieves an approximate CE with $\epsilon$ duality gap with the sample complexity $\mathcal{O}(H^9|\mathcal{S}||\mathcal{A}|^{2} \epsilon^{-4})$.

TMLR Journal 2022 Journal Article

Multi-Agent Off-Policy TDC with Near-Optimal Sample and Communication Complexities

  • Ziyi Chen
  • Yi Zhou
  • Rong-Rong Chen

The finite-time convergence of off-policy temporal difference (TD) learning has been comprehensively studied recently. However, such a type of convergence has not been established for off-policy TD learning in the multi-agent setting, which covers broader reinforcement learning applications and is fundamentally more challenging. This work develops a decentralized TD with correction (TDC) algorithm for multi-agent off-policy TD learning under Markovian sampling. In particular, our algorithm avoids sharing the actions, policies and rewards of the agents, and adopts mini-batch sampling to reduce the sampling variance and communication frequency. Under Markovian sampling and linear function approximation, we proved that the finite-time sample complexity of our algorithm for achieving an $\epsilon$-accurate solution is in the order of $\mathcal{O}\big(\frac{M\ln\epsilon^{-1}}{\epsilon(1-\sigma_2)^2}\big)$, where $M$ denotes the total number of agents and $\sigma_2$ is a network parameter. This matches the sample complexity of the centralized TDC. Moreover, our algorithm achieves the optimal communication complexity $\mathcal{O}\big(\frac{\sqrt{M}\ln\epsilon^{-1}}{1-\sigma_2}\big)$ for synchronizing the value function parameters, which is order-wise lower than the communication complexity of the existing decentralized TD(0). Numerical simulations corroborate our theoretical findings.

NeurIPS Conference 2022 Conference Paper

NeMF: Neural Motion Fields for Kinematic Animation

  • Chengan He
  • Jun Saito
  • James Zachary
  • Holly Rushmeier
  • Yi Zhou

We present an implicit neural representation to learn the spatio-temporal space of kinematic motions. Unlike previous work that represents motion as discrete sequential samples, we propose to express the vast motion space as a continuous function over time, hence the name Neural Motion Fields (NeMF). Specifically, we use a neural network to learn this function for miscellaneous sets of motions, which is designed to be a generative model conditioned on a temporal coordinate $t$ and a random vector $z$ for controlling the style. The model is then trained as a Variational Autoencoder (VAE) with motion encoders to sample the latent space. We train our model with a diverse human motion dataset and quadruped dataset to prove its versatility, and finally deploy it as a generic motion prior to solve task-agnostic problems and show its superiority in different motion generation and editing applications, such as motion interpolation, in-betweening, and re-navigating. More details can be found on our project page: https://cs.yale.edu/homes/che/projects/nemf/.

NeurIPS Conference 2022 Conference Paper

Regularized Molecular Conformation Fields

  • Lihao Wang
  • Yi Zhou
  • Yiqun Wang
  • Xiaoqing Zheng
  • Xuanjing Huang
  • Hao Zhou

Predicting energetically favorable 3-dimensional conformations of organic molecules from a molecular graph plays a fundamental role in computer-aided drug discovery research. However, effectively exploring the high-dimensional conformation space to identify (meta) stable conformers is anything but trivial. In this work, we introduce RMCF, a novel framework to generate a diverse set of low-energy molecular conformations through sampling from a regularized molecular conformation field. We develop a data-driven molecular segmentation algorithm to automatically partition each molecule into several structural building blocks to reduce the modeling degrees of freedom. Then, we employ a Markov Random Field to learn the joint probability distribution of fragment configurations and inter-fragment dihedral angles, which enables us to sample from different low-energy regions of a conformation space. Our model consistently outperforms state-of-the-art models on the conformation generation task on the GEOM-Drugs dataset. We attribute the success of RMCF to modeling in a regularized feature space and learning a global fragment configuration distribution for effective sampling. The proposed method could be generalized to deal with larger biomolecular systems.

IJCAI Conference 2022 Conference Paper

SHAPE: An Unified Approach to Evaluate the Contribution and Cooperation of Individual Modalities

  • Pengbo Hu
  • Xingyu Li
  • Yi Zhou

As deep learning advances, there is an ever-growing demand for models capable of synthesizing information from multi-modal resources to address the complex tasks arising in real-life applications. Recently, many large multi-modal datasets have been collected, on which researchers actively explore different methods of fusing multi-modal information. However, little attention has been paid to quantifying the contribution of different modalities within the proposed models. In this paper, we propose the SHapley vAlue-based PErceptual (SHAPE) scores that measure the marginal contribution of individual modalities and the degree of cooperation across modalities. Using these scores, we systematically evaluate different fusion methods on different multi-modal datasets for different tasks. Our experiments suggest that for some tasks where different modalities are complementary, the multi-modal models still tend to use the dominant modality alone and ignore the cooperation across modalities. On the other hand, models learn to exploit cross-modal cooperation when different modalities are indispensable for the task. In this case, the scores indicate it is better to fuse different modalities at relatively early stages. We hope our scores can help improve the understanding of how the present multi-modal models operate on different modalities and encourage more sophisticated methods of integrating multiple modalities.
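
For a small number of modalities, the Shapley value underlying such scores can be computed exactly by enumerating subsets. The performance table below is a toy lookup with illustrative numbers, not the paper's perceptual score:

```python
from itertools import combinations
from math import factorial

def shapley_values(players, v):
    """Exact Shapley values: each player's marginal contribution averaged over
    all orderings; v maps a frozenset of players to a performance score."""
    n = len(players)
    phi = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for size in range(n):
            for S in combinations(others, size):
                S = frozenset(S)
                weight = factorial(size) * factorial(n - size - 1) / factorial(n)
                total += weight * (v(S | {p}) - v(S))
        phi[p] = total
    return phi

# Toy performance of a two-modality model on subsets of modalities:
scores = {frozenset(): 0.0,
          frozenset({"image"}): 0.7,
          frozenset({"text"}): 0.5,
          frozenset({"image", "text"}): 0.9}
phi = shapley_values(["image", "text"], scores.__getitem__)
```

By the efficiency property, the per-modality shares sum to the full model's score, which is what makes them interpretable as contributions.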

JBHI Journal 2022 Journal Article

Speckle Noise Reduction for OCT Images Based on Image Style Transfer and Conditional GAN

  • Yi Zhou
  • Kai Yu
  • Meng Wang
  • Yuhui Ma
  • Yuanyuan Peng
  • Zhongyue Chen
  • Weifang Zhu
  • Fei Shi

Raw optical coherence tomography (OCT) images typically are of low quality because speckle noise blurs retinal structures, severely compromising visual quality and degrading performances of subsequent image analysis tasks. In our previous study (Ma et al., 2018), we have developed a Conditional Generative Adversarial Network (cGAN) for speckle noise removal in OCT images collected by several commercial OCT scanners, which we collectively refer to as scanner T. In this paper, we improve the cGAN model and apply it to our in-house OCT scanner (scanner B) for speckle noise suppression. The proposed model consists of two steps: 1) We train a Cycle-Consistent GAN (CycleGAN) to learn style transfer between two OCT image datasets collected by different scanners. The purpose of the CycleGAN is to leverage the ground truth dataset created in our previous study. 2) We train a mini-cGAN model based on the PatchGAN mechanism with the ground truth dataset to suppress speckle noise in OCT images. After training, we first apply the CycleGAN model to convert raw images collected by scanner B to match the style of the images from scanner T, and subsequently use the mini-cGAN model to suppress speckle noise in the style transferred images. We evaluate the proposed method on a dataset collected by scanner B. Experimental results show that the improved model outperforms our previous method and other state-of-the-art models in speckle noise removal, retinal structure preservation and contrast enhancement.

AAAI Conference 2022 Conference Paper

UNISON: Unpaired Cross-Lingual Image Captioning

  • Jiahui Gao
  • Yi Zhou
  • Philip L. H. Yu
  • Shafiq Joty
  • Jiuxiang Gu

Image captioning has emerged as an interesting research field in recent years due to its broad application scenarios. The traditional paradigm of image captioning relies on paired image-caption datasets to train the model in a supervised manner. However, creating such paired datasets for every target language is prohibitively expensive, which hinders the extensibility of captioning technology and deprives a large part of the world population of its benefit. In this work, we present a novel unpaired cross-lingual method to generate image captions without relying on any caption corpus in the source or the target language. Specifically, our method consists of two phases: (i) a cross-lingual auto-encoding process, which utilizes a sentence-parallel (bitext) corpus to learn the mapping from the source to the target language in the scene graph encoding space and to decode sentences in the target language, and (ii) a cross-modal unsupervised feature mapping, which seeks to map the encoded scene graph features from the image modality to the language modality. We verify the effectiveness of our proposed method on the Chinese image caption generation task. The comparisons against several existing methods demonstrate the effectiveness of our approach.

NeurIPS Conference 2022 Conference Paper

Zero-Shot 3D Drug Design by Sketching and Generating

  • Siyu Long
  • Yi Zhou
  • Xinyu Dai
  • Hao Zhou

Drug design is a crucial step in the drug discovery cycle. Recently, various deep learning-based methods design drugs by generating novel molecules from scratch, avoiding traversing large-scale drug libraries. However, they depend on scarce experimental data or time-consuming docking simulation, leading to overfitting issues with limited training data and slow generation speed. In this study, we propose the zero-shot drug design method DESERT (Drug dEsign by SkEtching and geneRaTing). Specifically, DESERT splits the design process into two stages: sketching and generating, and bridges them with the molecular shape. The two-stage fashion enables our method to utilize the large-scale molecular database to reduce the need for experimental data and docking simulation. Experiments show that DESERT achieves a new state-of-the-art at a fast speed.

AAAI Conference 2021 Conference Paper

Curse or Redemption? How Data Heterogeneity Affects the Robustness of Federated Learning

  • Syed Zawad
  • Ahsan Ali
  • Pin-Yu Chen
  • Ali Anwar
  • Yi Zhou
  • Nathalie Baracaldo
  • Yuan Tian
  • Feng Yan

Data heterogeneity has been identified as one of the key features of federated learning, but it is often overlooked through the lens of robustness to adversarial attacks. This paper focuses on characterizing and understanding its impact on backdooring attacks in federated learning through comprehensive experiments using synthetic data and the LEAF benchmarks. The initial impression driven by our experimental results suggests that data heterogeneity is the dominant factor in the effectiveness of attacks, and that it may be a redemption for defending against backdooring: it makes the attack less efficient, effective attack strategies more challenging to design, and the attack results less predictable. However, with further investigation, we found data heterogeneity to be more of a curse than a redemption, as the attack effectiveness can be significantly boosted by simply adjusting the client-side backdooring timing. More importantly, data heterogeneity may result in overfitting during the local training of benign clients, which can be exploited by attackers to disguise themselves and fool skewed-feature-based defenses. In addition, effective attack strategies can be made by adjusting the attack data distribution. Finally, we discuss potential directions for defending against the curses brought by data heterogeneity. The results and lessons learned from our extensive experiments and analysis offer new insights for designing robust federated learning methods and systems.

AAAI Conference 2021 Conference Paper

Enhancing Balanced Graph Edge Partition with Effective Local Search

  • Zhenyu Guo
  • Mingyu Xiao
  • Yi Zhou
  • Dongxiang Zhang
  • Kian-Lee Tan

Graph partition is a key component to achieve workload balance and reduce job completion time in parallel graph processing systems. Among the various partition strategies, edge partition has demonstrated more promising performance in power-law graphs than vertex partition and thereby has been more widely adopted as the default partition strategy by existing graph systems. The graph edge partition problem, which is to split the edge set into multiple balanced parts to minimize the total number of copied vertices, has been widely studied from the view of optimization and algorithms. In this paper, we study local search algorithms for this problem to further improve the partition results from existing methods. More specifically, we propose two novel concepts, namely adjustable edges and blocks. Based on these, we develop a greedy heuristic as well as an improved search algorithm utilizing the property of the max-flow model. To evaluate the performance of our algorithms, we first provide adequate theoretical analysis in terms of the approximation quality. We significantly improve the previously known approximation ratio for this problem. Then we conduct extensive experiments on a large number of benchmark datasets and state-of-the-art edge partition strategies. The results show that our proposed local search framework can further improve the quality of graph partition by a wide margin.
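
The objective of edge partition, the total number of copied vertices, is easy to state in code. This counter is an illustrative sketch of the objective only, not the paper's local search:

```python
from collections import defaultdict

def vertex_copies(edge_parts):
    """Given an edge partition (a list of edge lists), count the total number
    of vertex copies: a vertex is replicated once for each part in which one
    of its edges appears. Balanced edge partition minimizes this count."""
    parts_of = defaultdict(set)
    for pid, edges in enumerate(edge_parts):
        for u, v in edges:
            parts_of[u].add(pid)
            parts_of[v].add(pid)
    return sum(len(p) for p in parts_of.values())

# Triangle 1-2-3 plus edge 3-4, split into two parts; vertices 1 and 3 end up
# replicated in both parts, vertices 2 and 4 in one part each.
parts = [[(1, 2), (2, 3)], [(1, 3), (3, 4)]]
```

A local search move that relocates an edge between parts changes this count only through the endpoints of the moved edge, which is what makes such moves cheap to evaluate.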

AAAI Conference 2021 Conference Paper

Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation

  • Xueyi Li
  • Tianfei Zhou
  • Jianwu Li
  • Yi Zhou
  • Zhaoxiang Zhang

Acquiring sufficient ground-truth supervision to train deep visual models has been a bottleneck over the years due to the data-hungry nature of deep learning. This is exacerbated in some structured prediction tasks, such as semantic segmentation, which requires pixel-level annotations. This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation. We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths, which can be used for training more accurate segmentation models. In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes, and the underlying relations between a pair of images are characterized by an efficient co-attention mechanism. Moreover, in order to prevent the model from paying excessive attention to common semantics only, we further propose a graph dropout layer, encouraging the model to learn more accurate and complete object responses. The whole network is end-to-end trainable by iterative message passing, which propagates interaction cues over the images to progressively improve the performance. We conduct experiments on the popular PASCAL VOC 2012 and COCO benchmarks, and our model yields state-of-the-art performance. Our code is available at: https://github.com/Lixy1997/Group-WSSS.

AAAI Conference 2021 Conference Paper

Improving Maximum k-plex Solver via Second-Order Reduction and Graph Color Bounding

  • Yi Zhou
  • Shan Hu
  • Mingyu Xiao
  • Zhang-Hua Fu

In a graph, a k-plex is a vertex set in which each vertex is non-adjacent to at most k vertices of the set. The maximum k-plex problem, which asks for the largest k-plex in a given graph, is a key primitive in a variety of real-world applications such as community detection. In this paper, we develop an exact algorithm, Maplex, for solving this problem practically on real-world graphs. Based on the existing first-order and the novel second-order reduction rules, we design a powerful preprocessing method which efficiently removes redundant vertices and edges for Maplex. Also, the graph color heuristic is widely used for overestimating the size of the maximum clique of a graph. For the first time, we generalize this technique to bound the size of the maximum k-plex in Maplex. Experiments are carried out to compare our algorithm with other state-of-the-art solvers on a wide range of publicly available graphs. Maplex outperforms all other algorithms on large real-world graphs and is competitive with existing solvers on artificial dense graphs. Finally, we shed light on the effectiveness of each key component of Maplex.
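
The k-plex relaxation at the heart of the problem can be pinned down in a few lines. This checker is an illustrative sketch of the definition, not part of the Maplex solver:

```python
def is_k_plex(adj, S, k):
    """Check whether S is a k-plex of the graph `adj` (dict: vertex -> set of
    neighbours): every vertex of S must be adjacent to at least |S| - k
    vertices of S, i.e. non-adjacent to at most k of them (itself included)."""
    S = set(S)
    return all(len(S & adj[v]) >= len(S) - k for v in S)

# 4-cycle 0-1-2-3-0: the whole vertex set is a 2-plex but not a clique (1-plex).
adj = {0: {1, 3}, 1: {0, 2}, 2: {1, 3}, 3: {0, 2}}
```

With k = 1 the definition collapses to a clique, which is why k-plexes are a natural relaxation for noisy community data.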

AAAI Conference 2021 Conference Paper

Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for Thoracic Disease Identification

  • Yi Zhou
  • Lei Huang
  • Tianfei Zhou
  • Ling Shao

Chest X-rays are an important and accessible clinical imaging tool for the detection of many thoracic diseases. Over the past decade, deep learning, with a focus on the convolutional neural network (CNN), has become the most powerful computer-aided diagnosis technology for improving disease identification performance. However, training an effective and robust deep CNN usually requires a large amount of data with high annotation quality. For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming. Thus, existing public chest X-ray datasets usually adopt language pattern based methods to automatically mine labels from reports. However, this results in label uncertainty and inconsistency. In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods from two perspectives to improve a single model’s disease identification performance, rather than focusing on an ensemble of models. MODL integrates multiple models to obtain a soft label distribution for optimizing the single target model, which can reduce the effects of original label uncertainty. Moreover, KNNS aims to enhance the robustness of the target model to provide consistent predictions on images with similar medical findings. Extensive experiments on the public NIH Chest X-ray and CheXpert datasets show that our model achieves consistent improvements over the state-of-the-art methods.
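
The neighbour-smoothing idea, making a sample's soft label agree with its nearest neighbours in feature space, can be sketched as follows. The blending weight `alpha` and the use of raw labels instead of model predictions are simplifying assumptions, not the paper's exact formulation:

```python
import numpy as np

def knn_smooth(features, labels, k=2, alpha=0.5):
    """Hypothetical sketch of K-nearest-neighbour label smoothing: blend each
    sample's soft label with the mean label of its k nearest neighbours in
    feature space, encouraging consistent predictions on similar images."""
    X = np.asarray(features, dtype=float)
    Y = np.asarray(labels, dtype=float)
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    np.fill_diagonal(d, np.inf)            # a sample is not its own neighbour
    nn = np.argsort(d, axis=1)[:, :k]      # indices of the k nearest neighbours
    return (1 - alpha) * Y + alpha * Y[nn].mean(axis=1)

# Two nearby samples with conflicting labels are pulled towards each other,
# while the far-away third sample only borrows from its single nearest point.
smoothed = knn_smooth([[0.0], [0.1], [5.0]], [[1, 0], [0, 1], [1, 1]], k=1)
```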

NeurIPS Conference 2021 Conference Paper

Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation

  • Yue Wang
  • Shaofeng Zou
  • Yi Zhou

Temporal-difference learning with gradient correction (TDC) is a two time-scale algorithm for policy evaluation in reinforcement learning. This algorithm was initially proposed with linear function approximation, and was later extended to the one with general smooth function approximation. The asymptotic convergence for the on-policy setting with general smooth function approximation was established in [Bhatnagar et al., 2009]; however, the non-asymptotic convergence analysis remains unsolved due to challenges in the non-linear and two-time-scale update structure, non-convex objective function and the projection onto a time-varying tangent plane. In this paper, we develop novel techniques to address the above challenges and explicitly characterize the non-asymptotic error bound for the general off-policy setting with i.i.d. or Markovian samples, and show that it converges as fast as $\mathcal O(1/\sqrt T)$ (up to a factor of $\mathcal O(\log T)$). Our approach can be applied to a wide range of value-based reinforcement learning algorithms with general smooth function approximation.

NeurIPS Conference 2020 Conference Paper

A Statistical Mechanics Framework for Task-Agnostic Sample Design in Machine Learning

  • Bhavya Kailkhura
  • Jayaraman Thiagarajan
  • Qunwei Li
  • Jize Zhang
  • Yi Zhou
  • Timo Bremer

In this paper, we present a statistical mechanics framework to understand the effect of sampling properties of training data on the generalization gap of machine learning (ML) algorithms. We connect the generalization gap to the spatial properties of a sample design characterized by the pair correlation function (PCF). In particular, we express generalization gap in terms of the power spectra of the sample design and that of the function to be learned. Using this framework, we show that space-filling sample designs, such as blue noise and Poisson disk sampling, which optimize spectral properties, outperform random designs in terms of the generalization gap and characterize this gain in a closed-form. Our analysis also sheds light on design principles for constructing optimal task-agnostic sample designs that minimize the generalization gap. We corroborate our findings using regression experiments with neural networks on: a) synthetic functions, and b) a complex scientific simulator for inertial confinement fusion (ICF).
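
Poisson disk sampling, one of the space-filling designs the analysis favors, can be generated by naive dart throwing. This sketch ignores the efficient grid-based algorithms used in practice and is only meant to make the design concrete:

```python
import math
import random

def poisson_disk(n, r, seed=0):
    """Naive dart-throwing Poisson disk sampler on the unit square: accept a
    candidate point only if it keeps distance at least r from every point
    accepted so far. Space-filling designs like this suppress low-frequency
    energy in the sample's power spectrum compared with random designs."""
    rng = random.Random(seed)
    pts = []
    trials = 0
    while len(pts) < n and trials < 100_000:
        p = (rng.random(), rng.random())
        if all(math.dist(p, q) >= r for q in pts):
            pts.append(p)
        trials += 1
    return pts
```

The minimum-distance constraint is exactly what shapes the pair correlation function that the framework connects to the generalization gap.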

AAAI Conference 2020 System Paper

Embedding High-Level Knowledge into DQNs to Learn Faster and More Safely

  • Zihang Gao
  • Fangzhen Lin
  • Yi Zhou
  • Hao Zhang
  • Kaishun Wu
  • Haodi Zhang

Deep reinforcement learning has been successfully applied in many decision-making scenarios. However, its slow training process and the difficulty of explaining its decisions limit its application. In this paper, we attempt to address some of these problems by proposing a framework of Rule-interposing Learning (RIL) that embeds knowledge into deep reinforcement learning. In this framework, the rules dynamically affect the training progress and accelerate the learning. The embedded knowledge in the form of rules not only improves learning efficiency, but also prevents unnecessary or disastrous explorations at the early stage of training. Moreover, the modularity of the framework makes it straightforward to transfer high-level knowledge among similar tasks.
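
One simple way to realize rule interposing is as an action filter in front of a greedy policy. The rule and state encoding below are hypothetical illustrations, not the paper's framework:

```python
def rule_interposed_action(q_values, state, rules):
    """Sketch of rule interposing: high-level rules veto actions judged unsafe
    or useless in the current state, and the best-scoring permitted action is
    chosen; with no compliant action we fall back to the plain greedy choice."""
    allowed = [a for a in range(len(q_values))
               if all(rule(state, a) for rule in rules)]
    candidates = allowed or range(len(q_values))
    return max(candidates, key=q_values.__getitem__)

# Hypothetical rule: action 2 ("jump") is forbidden whenever the agent is
# near a cliff, regardless of how high its Q-value is.
no_jump_near_cliff = lambda state, a: not (state.get("near_cliff") and a == 2)
```

Because the rules only prune the action set, the underlying DQN training loop is unchanged, which is what makes the knowledge easy to transfer between similar tasks.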

AAAI Conference 2020 Conference Paper

Enumerating Maximal k-Plexes with Worst-Case Time Guarantee

  • Yi Zhou
  • Jingwei Xu
  • Zhenyu Guo
  • Mingyu Xiao
  • Yan Jin

The problem of enumerating all maximal cliques in a graph is a key primitive in a variety of real-world applications such as community detection. However, in practice, communities are rarely formed as cliques due to data noise. Hence, the k-plex, a subgraph in which any vertex is adjacent to all but at most k vertices, is introduced as a relaxation of the clique. In this paper, we investigate the problem of enumerating all maximal k-plexes and present FaPlexen, an enumeration algorithm which integrates the “pivot” heuristic and new branching schemes. To the best of our knowledge, FaPlexen is the first to list all maximal k-plexes with a provable worst-case running time of $O(n^2\gamma^n)$ in a graph with $n$ vertices, where $\gamma < 2$. Then, we propose another algorithm, CommuPlex, which non-trivially extends FaPlexen to find all maximal k-plexes of prescribed size for community detection in massive real-life networks. We finally carry out experiments on both real and synthetic graphs and demonstrate that our algorithms run much faster than the state-of-the-art algorithms.

NeurIPS Conference 2020 Conference Paper

Fully Convolutional Mesh Autoencoder using Efficient Spatially Varying Kernels

  • Yi Zhou
  • Chenglei Wu
  • Zimo Li
  • Chen Cao
  • Yuting Ye
  • Jason Saragih
  • Hao Li
  • Yaser Sheikh

Learning latent representations of registered meshes is useful for many 3D tasks. Techniques have recently shifted to neural mesh autoencoders. Although they demonstrate higher precision than traditional methods, they remain unable to capture fine-grained deformations. Furthermore, these methods can only be applied to a template-specific surface mesh, and are not applicable to more general meshes, like tetrahedrons and non-manifold meshes. While more general graph convolution methods can be employed, they lack performance in reconstruction precision and require higher memory usage. In this paper, we propose a non-template-specific fully convolutional mesh autoencoder for arbitrary registered mesh data. It is enabled by our novel convolution and (un)pooling operators learned with globally shared weights and locally varying coefficients which can efficiently capture the spatially varying contents presented by irregular mesh connections. Our model outperforms state-of-the-art methods on reconstruction accuracy. In addition, the latent codes of our network are fully localized thanks to the fully convolutional structure, and thus have much higher interpolation capability than many traditional 3D mesh generation models.

AAAI Conference 2020 Conference Paper

Motion-Attentive Transition for Zero-Shot Video Object Segmentation

  • Tianfei Zhou
  • Shunzhou Wang
  • Yi Zhou
  • Yazhou Yao
  • Jianwu Li
  • Ling Shao

In this paper, we present a novel Motion-Attentive Transition Network (MATNet) for zero-shot video object segmentation, which provides a new way of leveraging motion information to reinforce spatio-temporal object representation. An asymmetric attention block, called Motion-Attentive Transition (MAT), is designed within a two-stream encoder, which transforms appearance features into motion-attentive representations at each convolutional stage. In this way, the encoder becomes deeply interleaved, allowing for closely hierarchical interactions between object motion and appearance. This is superior to the typical two-stream architecture, which treats motion and appearance separately in each stream and often suffers from overfitting to appearance information. Additionally, a bridge network is proposed to obtain a compact, discriminative and scale-sensitive representation for multilevel encoder features, which is further fed into a decoder to achieve segmentation results. Extensive experiments on three challenging public benchmarks (i.e., DAVIS-16, FBMS and YouTube-Objects) show that our model achieves compelling performance against state-of-the-art methods. Code is available at: https://github.com/tfzhou/MATNet.

IJCAI Conference 2020 Conference Paper

Proximal Gradient Algorithm with Momentum and Flexible Parameter Restart for Nonconvex Optimization

  • Yi Zhou
  • Zhe Wang
  • Kaiyi Ji
  • Yingbin Liang
  • Vahid Tarokh

Various types of parameter restart schemes have been proposed for the proximal gradient algorithm with momentum to facilitate its convergence in convex optimization. However, under parameter restart, the convergence of the proximal gradient algorithm with momentum remains obscure in nonconvex optimization. In this paper, we propose a novel proximal gradient algorithm with momentum and parameter restart for solving nonconvex and nonsmooth problems. Our algorithm is designed to 1) allow for adopting flexible parameter restart schemes that cover many existing ones; 2) have a global sub-linear convergence rate in nonconvex and nonsmooth optimization; and 3) have guaranteed convergence to a critical point and have various types of asymptotic convergence rates depending on the parameterization of local geometry in nonconvex and nonsmooth optimization. Numerical experiments demonstrate the convergence and effectiveness of our proposed algorithm.
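
A minimal instance of proximal gradient with momentum and a gradient-mapping restart test, applied to a one-dimensional lasso-style problem. This is a generic sketch of the algorithm family with one common restart rule, not the paper's exact schemes:

```python
import numpy as np

def apg_restart(grad, prox, x0, step, iters=200):
    """Accelerated proximal gradient with a gradient-mapping restart: the
    momentum is reset whenever the new update direction disagrees with the
    direction of the previous step."""
    x = y = np.asarray(x0, dtype=float)
    t = 1.0
    for _ in range(iters):
        x_new = prox(y - step * grad(y), step)
        if np.dot(y - x_new, x_new - x) > 0:   # restart test fires
            t, y = 1.0, x
            x_new = prox(y - step * grad(y), step)
        t_new = (1 + np.sqrt(1 + 4 * t * t)) / 2
        y = x_new + ((t - 1) / t_new) * (x_new - x)
        x, t = x_new, t_new
    return x

# Toy nonsmooth problem: minimize 0.5*(x - 3)^2 + 0.5*|x|, minimizer x = 2.5.
# The prox of 0.5*|x| with stepsize s is soft thresholding at level 0.5*s.
soft = lambda z, s: np.sign(z) * np.maximum(np.abs(z) - 0.5 * s, 0.0)
x_star = apg_restart(lambda x: x - 3.0, soft, np.array([0.0]), step=1.0)
```

The restart test needs no function values, which is what makes such schemes cheap to interpose in the momentum loop.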

NeurIPS Conference 2020 Conference Paper

Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis

  • Shaocong Ma
  • Yi Zhou
  • Shaofeng Zou

Variance reduction techniques have been successfully applied to temporal-difference (TD) learning and help to improve the sample complexity in policy evaluation. However, the existing work applied variance reduction to either the less popular one time-scale TD algorithm or the two time-scale GTD algorithm but with a finite number of i.i.d. samples, and both algorithms apply to only the on-policy setting. In this work, we develop a variance reduction scheme for the two time-scale TDC algorithm in the off-policy setting and analyze its non-asymptotic convergence rate over both i.i.d. and Markovian samples. In the i.i.d. setting, our algorithm achieves an improved sample complexity $\mathcal{O}(\epsilon^{-\frac{3}{5}} \log{\epsilon}^{-1})$ over the state-of-the-art result $\mathcal{O}(\epsilon^{-1} \log {\epsilon}^{-1})$. In the Markovian setting, our algorithm achieves the state-of-the-art sample complexity $\mathcal{O}(\epsilon^{-1} \log {\epsilon}^{-1})$ that is near-optimal. Experiments demonstrate that the proposed variance-reduced TDC achieves a smaller asymptotic convergence error than both the conventional TDC and the variance-reduced TD.

NeurIPS Conference 2019 Conference Paper

A unified variance-reduced accelerated gradient method for convex optimization

  • Guanghui Lan
  • Zhize Li
  • Yi Zhou

We propose a novel randomized incremental gradient algorithm, namely, VAriance-Reduced Accelerated Gradient (Varag), for finite-sum optimization. Equipped with a unified step-size policy that adjusts itself to the value of the condition number, Varag exhibits the unified optimal rates of convergence for solving smooth convex finite-sum problems directly, regardless of their strong convexity. Moreover, Varag is the first accelerated randomized incremental gradient method that benefits from the strong convexity of the data-fidelity term to achieve the optimal linear convergence. It also establishes an optimal linear rate of convergence for solving a wide class of problems only satisfying a certain error bound condition rather than strong convexity. Varag can also be extended to solve stochastic finite-sum problems.

IJCAI Conference 2019 Conference Paper

How Well Do Machines Perform on IQ tests: a Comparison Study on a Large-Scale Dataset

  • Yusen Liu
  • Fangyuan He
  • Haodi Zhang
  • Guozheng Rao
  • Zhiyong Feng
  • Yi Zhou

AI benchmarking is becoming an increasingly important task. As suggested by many researchers, Intelligence Quotient (IQ) tests, which are widely regarded as one of the predominant benchmarks for measuring human intelligence, raise an interesting challenge for AI systems. To better solve IQ tests automatically by machines, one needs to use, combine and advance many areas of AI, including knowledge representation and reasoning, machine learning, natural language processing and image understanding. Also, automated IQ testing provides an ideal testbed for integrating symbolic and sub-symbolic approaches, as both are found useful here. Hence, we argue that IQ tests, although not suitable for testing machine intelligence, provide an excellent benchmark for the current development of AI research. Nevertheless, most existing IQ test datasets are not comprehensive enough for this purpose. As a result, the conclusions obtained are not representative. To address this issue, we create IQ10k, a large-scale dataset that contains more than 10,000 IQ test questions. We also conduct a comparison study on IQ10k with a number of state-of-the-art approaches.

NeurIPS Conference 2019 Conference Paper

SpiderBoost and Momentum: Faster Variance Reduction Algorithms

  • Zhe Wang
  • Kaiyi Ji
  • Yi Zhou
  • Yingbin Liang
  • Vahid Tarokh

SARAH and SPIDER are two recently developed stochastic variance-reduced algorithms, and SPIDER has been shown to achieve a near-optimal first-order oracle complexity in smooth nonconvex optimization. However, SPIDER uses an accuracy-dependent stepsize that slows down the convergence in practice, and cannot handle objective functions that involve nonsmooth regularizers. In this paper, we propose SpiderBoost as an improved scheme, which allows a much larger constant-level stepsize while maintaining the same near-optimal oracle complexity, and can be extended with proximal mapping to handle composite optimization (which is nonsmooth and nonconvex) with provable convergence guarantees. In particular, we show that proximal SpiderBoost achieves an oracle complexity of $O(\min\{n^{1/2}\epsilon^{-2}, \epsilon^{-3}\})$ in composite nonconvex optimization, improving the state-of-the-art result by a factor of $O(\min\{n^{1/6}, \epsilon^{-1/3}\})$. We further develop a novel momentum scheme to accelerate SpiderBoost for composite optimization, which achieves the near-optimal oracle complexity in theory and substantial improvement in experiments.
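
The SPIDER-style recursive estimator with a constant stepsize can be sketched on a toy finite sum. This is a generic illustration of the estimator, not the paper's proximal or momentum variants:

```python
import numpy as np

def spiderboost(grads, x0, step=0.5, epochs=5, q=None, rng=None):
    """SpiderBoost sketch: the recursive variance-reduced estimator
    v_k = grad_i(x_k) - grad_i(x_{k-1}) + v_{k-1}, refreshed with a full
    gradient every q steps, driven with a constant-level stepsize."""
    rng = rng or np.random.default_rng(0)
    n = len(grads)
    q = q or n
    x = np.asarray(x0, dtype=float)
    for k in range(epochs * n):
        if k % q == 0:
            v = sum(g(x) for g in grads) / n   # periodic full-gradient refresh
        else:
            i = rng.integers(n)                # single-sample correction step
            v = grads[i](x) - grads[i](x_prev) + v
        x_prev, x = x, x - step * v
    return x

# Toy finite sum: f(x) = mean_i 0.5 * (x - a_i)^2, whose minimizer is mean(a).
a = [1.0, 2.0, 3.0, 6.0]
grads = [lambda x, ai=ai: x - ai for ai in a]
x_min = spiderboost(grads, 0.0)
```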

AAAI Conference 2018 Conference Paper

Attention-based Belief or Disbelief Feature Extraction for Dependency Parsing

  • Haoyuan Peng
  • Lu Liu
  • Yi Zhou
  • Junying Zhou
  • Xiaoqing Zheng

Existing neural dependency parsers usually encode each word in a sentence with bi-directional LSTMs, and estimate the score of an arc from the LSTM representations of the head and the modifier, possibly missing relevant context information for the arc being considered. In this study, we propose a neural feature extraction method that learns to extract arc-specific features. We apply a neural network-based attention method to collect evidence for and against each possible head-modifier pair, with which our model computes certainty scores of belief and disbelief, and determines the final arc score by subtracting the score of disbelief from that of belief. By explicitly introducing two kinds of evidence, the arc candidates can compete against each other based on more relevant information, especially for the cases where they share the same head or modifier. This makes it possible to better discriminate between two or more competing arcs by presenting their rivals (disbelief evidence). Experiments on various datasets show that our arc-specific feature extraction mechanism significantly improves the performance of bi-directional LSTM-based models by explicitly modeling long-distance dependencies. For both English and Chinese, the proposed model achieves higher accuracy on the dependency parsing task than most existing neural attention-based models.

NeurIPS Conference 2018 Conference Paper

Convergence of Cubic Regularization for Nonconvex Optimization under KL Property

  • Yi Zhou
  • Zhe Wang
  • Yingbin Liang

Cubic-regularized Newton's method (CR) is a popular algorithm that guarantees to produce a second-order stationary solution for solving nonconvex optimization problems. However, existing understandings of the convergence rate of CR are conditioned on special types of geometrical properties of the objective function. In this paper, we explore the asymptotic convergence rate of CR by exploiting the ubiquitous Kurdyka-Łojasiewicz (KL) property of nonconvex objective functions. Specifically, we characterize the asymptotic convergence rate of various types of optimality measures for CR, including function value gap, variable distance gap, gradient norm and least eigenvalue of the Hessian matrix. Our results fully characterize the diverse convergence behaviors of these optimality measures in the full parameter regime of the KL property. Moreover, we show that the obtained asymptotic convergence rates of CR are order-wise faster than those of first-order gradient descent algorithms under the KL property.

JMLR Journal 2018 Journal Article

Distributed Proximal Gradient Algorithm for Partially Asynchronous Computer Clusters

  • Yi Zhou
  • Yingbin Liang
  • Yaoliang Yu
  • Wei Dai
  • Eric P. Xing

With ever growing data volume and model size, an error-tolerant, communication efficient, yet versatile distributed algorithm has become vital for the success of many large-scale machine learning applications. In this work we propose m-PAPG, an implementation of the flexible proximal gradient algorithm in model parallel systems equipped with the partially asynchronous communication protocol. The worker machines communicate asynchronously with a controlled staleness bound $s$ and operate at different frequencies. We characterize various convergence properties of m-PAPG: 1) Under a general non-smooth and non-convex setting, we prove that every limit point of the sequence generated by m-PAPG is a critical point of the objective function; 2) Under an error bound condition of convex objective functions, we prove that the optimality gap decays linearly for every $s$ steps; 3) Under the Kurdyka-Łojasiewicz inequality and a sufficient decrease assumption, we prove that the sequences generated by m-PAPG converge to the same critical point, provided that a proximal Lipschitz condition is satisfied.

AAAI Conference 2018 Conference Paper

RNN-Based Sequence-Preserved Attention for Dependency Parsing

  • Yi Zhou
  • Junying Zhou
  • Lu Liu
  • Jiangtao Feng
  • Haoyuan Peng
  • Xiaoqing Zheng

Recurrent neural networks (RNNs) combined with an attention mechanism have proved to be useful for various NLP tasks including machine translation, sequence labeling and syntactic parsing. The attention mechanism is usually applied by estimating the weights (or importance) of inputs and taking the weighted sum of inputs as derived features. Although such features have demonstrated their effectiveness, they may fail to capture the sequence information due to the simple weighted sum being used to produce them. The order of the words does matter to the meaning or the structure of the sentences, especially for syntactic parsing, which aims to recover the structure from a sequence of words. In this study, we propose an RNN-based attention to capture the relevant and sequence-preserved features from a sentence, and use the derived features to perform the dependency parsing. We evaluated the graph-based and transition-based parsing models enhanced with the RNN-based sequence-preserved attention on both the English PTB and Chinese CTB datasets. The experimental results show that the enhanced systems achieved a significant increase in parsing accuracy.

IJCAI Conference 2017 Conference Paper

Integrating Answer Set Programming with Semantic Dictionaries for Robot Task Planning

  • Dongcai Lu
  • Yi Zhou
  • Feng Wu
  • Zhao Zhang
  • Xiaoping Chen

In this paper, we propose a novel integrated task planning system for service robots in domestic domains. Given open-ended high-level user instructions in natural language, robots need to generate a plan, i.e., a sequence of low-level executable actions, to complete the required tasks. To address this, we exploit the knowledge of the semantic roles of common verbs defined in semantic dictionaries such as FrameNet and integrate it with Answer Set Programming --- a task planning framework with both a representation language and solvers. In the experiments, we evaluated our approach using common benchmarks on service tasks and showed that it can successfully handle many more tasks than the state-of-the-art solution. Notably, we deployed the proposed planning system on our service robot for the annual RoboCup@Home competitions and achieved very encouraging results.

NeurIPS Conference 2015 Conference Paper

Analysis of Robust PCA via Local Incoherence

  • Huishuai Zhang
  • Yi Zhou
  • Yingbin Liang

We investigate the robust PCA problem of decomposing an observed matrix into the sum of a low-rank matrix and a sparse error matrix via the convex program Principal Component Pursuit (PCP). In contrast to previous studies that assume the support of the error matrix is generated by uniform Bernoulli sampling, we allow non-uniform sampling, i.e., entries of the low-rank matrix are corrupted by errors with unequal probabilities. We characterize conditions on the error corruption of each individual entry, based on the local incoherence of the low-rank matrix, under which correct matrix decomposition by PCP is guaranteed. Such a refined analysis of robust PCA captures how robustly each entry of the low-rank matrix combats error corruption. In order to deal with non-uniform error corruption, our technical proof introduces a new weighted norm and develops/exploits the concentration properties that such a norm satisfies.
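
For reference, the PCP program named in the abstract is the standard convex relaxation that trades a nuclear-norm term against an entrywise $\ell_1$ term. The formulation and the weight $\lambda$ below are the usual textbook choices, stated here as background rather than taken from this abstract:

```latex
\min_{L,\,S} \; \|L\|_{*} + \lambda \|S\|_{1}
\quad \text{subject to} \quad L + S = M,
```

where $M \in \mathbb{R}^{n_1 \times n_2}$ is the observed matrix, $\|L\|_{*}$ is the nuclear norm (sum of singular values), $\|S\|_{1}$ is the entrywise $\ell_1$ norm, and a common choice is $\lambda = 1/\sqrt{\max(n_1, n_2)}$. The paper's contribution is to relax the uniform-sampling assumption on the support of $S$ under which this program provably recovers $(L, S)$.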

IJCAI Conference 2015 Conference Paper

First-Order Disjunctive Logic Programming vs Normal Logic Programming

  • Yi Zhou

In this paper, we study the expressive power of first-order disjunctive logic programming (DLP) and normal logic programming (NLP) under the stable model semantics. We show that, unlike in the propositional case, first-order DLP is strictly more expressive than NLP. This result still holds even if auxiliary predicates are allowed, assuming NP ≠ coNP. On the other side, we propose a partial translation from first-order DLP to NLP via unfolding and shifting, which suggests a sound yet incomplete approach to implementing DLP via NLP solvers. We also identify some NLP-definable subclasses, and conjecture that unfolding and shifting exactly capture NLP definability.

KR Conference 2014 Short Paper

First-Order Default Logic Revisited

  • Yi Zhou

As pointed out by Reiter himself (1980), his original proposal for default logic is unsatisfactory for open default theories because of Skolemization and grounding; indeed, "the genuinely interesting cases involve open defaults." In this paper, we reconsider this long-standing semantic problem and propose a new world view semantics for first-order default logic. Instead of first-order theories, we use collections of structures sharing the same domain and function interpretations (called views) as the candidate solution concept. Roughly speaking, a world view of a first-order default theory is a maximal view (collection of structures) satisfying the default theory, where the default part is fixed by the world view itself. Although related, this semantics is essentially different from Reiter's extension semantics, even for closed default theories; however, they coincide on some restricted classes. We show that the world view semantics generalizes both classical first-order logic and first-order answer set programming, and we show how default logic under this semantics can handle both monotonic and nonmonotonic reasoning and can flexibly switch between open world and closed world reasoning. Our work is also strongly motivated by the attempt to combine classical logic based formalisms and rule based formalisms, particularly the recent development in ontology engineering of adding rules on top of the description logics layer (Baader and Hollunder 1995; Motik et al. 2006; Eiter et al. 2008; Motik and Rosati 2010; Lukasiewicz 2010; Zhou and Zhang 2012). We argue that first-order default logic under the world view semantics provides a rich framework for this task, and we believe that research in this direction will further deepen our understanding and shed new insight on integrating classical logic based and rule based formalisms in the first-order case.

ECAI Conference 2014 Conference Paper

From Disjunctive to Normal Logic Programs via Unfolding and Shifting

  • Yi Zhou

We show that every propositional disjunctive logic program under the answer set semantics can be equivalently transformed into a normal one via unfolding and shifting. More precisely, after iteratively applying the unfolding operator for some rules in a disjunctive program, its shifted program, which is a normal program, must have the same answer sets as the original disjunctive program.
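
The shifting operator used above has a simple effect on a single rule: shifting a ∨ b ← c yields the two normal rules a ← c, not b and b ← c, not a. A minimal sketch, under an assumed tuple representation of rules (this is illustrative, not the paper's code):

```python
def shift(rule):
    # rule = (head_atoms, positive_body, negative_body), all tuples.
    # Shifting keeps one head atom per resulting rule and moves every
    # other head atom, negated, into the body:
    #   a | b :- c   becomes   a :- c, not b   and   b :- c, not a.
    head, pos, neg = rule
    return [((a,), pos, neg + tuple(b for b in head if b != a))
            for a in head]

# The disjunctive rule  a | b :- c.
shifted = shift((("a", "b"), ("c",), ()))
```

Shifting alone does not preserve answer sets for every disjunctive program; the paper's result is that it does after the unfolding operator has first been applied to suitable rules.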

KR Conference 2012 Short Paper

Forgetting in Logic Programs under Strong Equivalence

  • Yisong Wang
  • Yan Zhang
  • Yi Zhou
  • Mingyi Zhang

Wang (2008) proposed a semantic forgetting for consistent disjunctive logic programs, which preserves equivalence but not strong equivalence; they specifically indicated the importance of preserving strong equivalence for forgetting in logic programming and raised this issue as future work. Wong (2009) proposed two forgetting operators for disjunctive logic programs. Although Wong's forgetting indeed preserves strong equivalence, it may lose the intuition of weakening under various circumstances (see Related Work for details). In addition to preserving strong equivalence, expressibility is another desired criterion for forgetting in logic programming: ideally, the result of forgetting some atoms from a logic program should still be expressible by a logic program. Finally, we believe that, as a way of weakening, forgetting in logic programs should obey some common intuitions shared by forgetting in classical logics. For instance, forgetting something from a logic program should lead to a weaker program in a certain sense; on the other hand, such weakening should only be associated with the relevant information that has been forgotten. For this purpose, Zhang and Zhou (2009) proposed four forgetting postulates to formalize these common intuitions and showed that forgetting in classical propositional logic and modal logic S5 can be precisely captured by these postulates. Interestingly, none of the previous notions of forgetting in logic programs actually satisfies Zhang-Zhou's postulates. In summary, we consider the following criteria that a notion of forgetting in logic programs should meet:

  • Expressibility. The result of forgetting in an arbitrary logic program should also be expressible by a logic program.
  • Preserving strong equivalence. Two strongly equivalent programs should remain strongly equivalent after forgetting the same set of atoms.
  • Satisfying common intuitions of forgetting. Preferably, forgetting in logic programs should be semantically characterized by Zhang-Zhou's four forgetting postulates.

In this paper we present a comprehensive study of forgetting in the context of arbitrary logic programs (propositional theories) under the answer set semantics. In our approach, a program Π is viewed as a theory of the logic of here-and-there (HT logic), and we propose a semantic forgetting for arbitrary logic programs under the answer set semantics, called HT-forgetting. HT-forgetting preserves strong equivalence in the sense that strongly equivalent logic programs remain strongly equivalent after forgetting the same set of atoms. The result of an HT-forgetting is always expressible by a logic program; in particular, the result of an HT-forgetting in a Horn program is expressible as a Horn program. A representation theorem shows that HT-forgetting can be precisely characterized by Zhang-Zhou's four forgetting postulates under the logic of here-and-there. We also reveal underlying connections between HT-forgetting and classical forgetting, and provide complexity results for the related decision problems.

AAAI Conference 2012 Conference Paper

Ordered Completion for Logic Programs with Aggregates

  • Vernon Asuncion
  • Yan Zhang
  • Yi Zhou

In this paper, we show that first-order logic programs with monotone aggregates under the stable model semantics can be captured in classical first-order logic. More precisely, we extend the notion of ordered completion for logic programs with a large variety of aggregates so that every stable model of a program with aggregates corresponds to a classical model of its enhanced ordered completion, and vice versa.

AAAI Conference 2011 Conference Paper

Bounded Forgetting

  • Yi Zhou
  • Yan Zhang

The result of forgetting some predicates in a first-order sentence may not exist in the sense that it might not be captured by any first-order sentences. This, indeed, severely restricts the usage of forgetting in applications. To address this issue, we propose a notion called k-forgetting, also called bounded forgetting in general, for any fixed number k. We present several equivalent characterizations of bounded forgetting and show that the result of bounded forgetting, on one hand, can always be captured by a single first-order sentence, and on the other hand, preserves the information that we are concerned with.

AAAI Conference 2011 Conference Paper

Progression Semantics for Disjunctive Logic Programs

  • Yi Zhou
  • Yan Zhang

In this paper, we extend the progression semantics to first-order disjunctive logic programs and show that it coincides with the stable model semantics. Based on it, we further show how disjunctive answer set programming is related to Satisfiability Modulo Theories.

IJCAI Conference 2011 Conference Paper

Translating First-Order Theories into Logic Programs

  • Heng Zhang
  • Yan Zhang
  • Mingsheng Ying
  • Yi Zhou

This paper focuses on computing first-order theories under either stable model semantics or circumscription. A reduction from first-order theories to logic programs under stable model semantics over finite structures is proposed, and an embedding of circumscription into stable model semantics is also given. Having such reduction and embedding, reasoning problems represented by first-order theories under these two semantics can then be handled by using existing answer set solvers. The effectiveness of this approach in computing hard problems beyond NP is demonstrated by some experiments.

AAAI Conference 2010 Conference Paper

First-Order Indefinability of Answer Set Programs on Finite Structures

  • Yin Chen
  • Yan Zhang
  • Yi Zhou

An answer set program with variables is first-order definable on finite structures if the set of its finite answer sets can be captured by a first-order sentence; otherwise this program is first-order indefinable on finite structures. In this paper, we study the problem of first-order indefinability of answer set programs. We provide an Ehrenfeucht-Fraïssé game-theoretic characterization of the first-order indefinability of answer set programs on finite structures. As an application of this approach, we show that the well-known program for finding Hamiltonian cycles is not first-order definable on finite structures. We then define two notions, named the 0-1 property and unbounded cycles or paths, under the answer set semantics, from which we develop two sufficient conditions that may be effectively used in proving a program's first-order indefinability on finite structures under certain circumstances.

KR Conference 2010 Conference Paper

Forgetting Revisited

  • Yi Zhou
  • Yan Zhang

In this paper, we propose an alternative notion, called weak forgetting, of forgetting a set of predicates in a first-order theory. One important feature of this new notion is that the result of weak forgetting is always first-order expressible. In contrast, this is not the case for the traditional notion of forgetting, called strong forgetting, introduced by Lin and Reiter. As a consequence, these two notions are not exactly the same. Interestingly, we prove that they coincide when the result of strong forgetting is first-order expressible. We also present a representation theorem to characterize weak forgetting from different aspects.

KR Conference 2010 Conference Paper

On the Progression Semantics and Boundedness of Answer Set Programs

  • Yan Zhang
  • Yi Zhou

In this paper, we propose a progression semantics for first-order answer set programs. Based on this new semantics, we are able to define the notion of boundedness for answer set programming. We prove that boundedness coincides with the notions of recursion-free and loop-free under program equivalence, and is also equivalent to first-order definability of answer set programs on arbitrary structures.

AAAI Conference 2010 Conference Paper

Ordered Completion for First-Order Logic Programs on Finite Structures

  • Vernon Asuncion
  • Fangzhen Lin
  • Yan Zhang
  • Yi Zhou

In this paper, we propose a translation from normal first-order logic programs under the answer set semantics to first-order theories on finite structures. Specifically, we introduce ordered completions which are modifications of Clark’s completions with some extra predicates added to keep track of the derivation order, and show that on finite structures, classical models of the ordered-completion of a normal logic program correspond exactly to the answer sets (stable models) of the logic program.

AAMAS Conference 2008 Conference Paper

Partial Goal Satisfaction and Goal Change

  • Yi Zhou
  • Leon van der Torre
  • Yan Zhang

Partial implication semantics with respect to a background theory has been introduced to formalize partial goal satisfaction in the context of beliefs. In this paper, we introduce strong partial implication, which prohibits redundancies, and weak partial implication, which allows side effects; we study their semantic and complexity properties, and we apply the three notions of partial implication to goal change in the context of beliefs.

IJCAI Conference 2007 Conference Paper

  • Fangzhen Lin
  • Yi Zhou

We first provide a mapping from Pearce's equilibrium logic and Ferraris and Lifschitz's general logic programs to Lin and Shoham's logic of knowledge and justified assumptions, a nonmonotonic modal logic that has been shown to include as special cases both Reiter's default logic in the propositional case and Moore's autoepistemic logic. From this mapping, we obtain a mapping from general logic programs to circumscription, both in the propositional and first-order case. Furthermore, we show that this mapping can be used to check the strong equivalence between two propositional logic programs in classical logic.

KR Conference 2004 Conference Paper

Partial Implication Semantics for Desirable Propositions

  • Xiaoping Chen
  • Yi Zhou

Motivational attitudes play an important role in investigations into intelligent agents. One of the key problems in representing and reasoning about motivational attitudes is determining which propositions are the desirable ones. The answer based on classical logic is that the propositions that logically imply the goal are desirable and the others are not. We argue that this criterion is inadequate when an agent has incomplete knowledge about its environment. In this paper, we present a simple and intuitive semantics---partial implication---for the characterization of desirable propositions. In this semantics, a proposition P is desirable with respect to a given goal Q if and only if P is "useful" and "harmless" to Q in any situation. Partial implication is an extension of classical implication. We investigate some fundamental properties of partial implication and discuss some of its potential applications.