Arrow Research search

Author name cluster

Yu Zhao

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

36 papers
2 author rows

Possible papers

36

AAAI 2026 Conference Paper

Grow-on-Demand: Sparse and Adaptive Expert Expansion for Continual Instruction Tuning

  • Ying Zhang
  • Xingyue Guo
  • Yu Zhao
  • Xuhui Sui
  • Baohang Zhou
  • Xinying Qian
  • Xiaojie Yuan

Continual instruction tuning aims to incrementally adapt large language models to new tasks without forgetting previously acquired knowledge. Existing approaches often struggle to balance plasticity and stability: replay-based methods retrain on historical data, which raises privacy concerns, while architecture-based methods allocate task-specific components, resulting in significant parameter growth. To address this, we consider a structure-sharing strategy that enables parameter reuse across similar tasks and expands only when necessary, avoiding any data replay. Specifically, we introduce Grow-on-Demand (GoD-MoE), a parameter-efficient framework based on sparse and adaptive expert-module expansion for continual instruction tuning. GoD-MoE inserts multiple LoRA-based experts into attention layers and dynamically activates a small subset of experts for each task. To avoid redundant parameter growth, we develop an Expert Demand Detector that determines whether new experts should be added, facilitating adaptive structural sharing and minimizing parameter overhead. We conduct comprehensive experiments on the TRACE benchmark, demonstrating that GoD-MoE achieves state-of-the-art performance. Furthermore, it effectively mitigates catastrophic forgetting and even outperforms several advanced replay-based baselines.

AAAI 2026 Conference Paper

Knowledge Graph Guided Heterogeneity-Informed Diffusion Model for Spatio-Temporal Generation

  • Zi'ang Wang
  • Lei Chen
  • Yuanchang Jin
  • Pan Deng
  • Shuangshuang Pang
  • Junting Liu
  • Yu Zhao

Spatio-temporal data generation aims to synthesize realistic urban data across graph nodes by learning spatial and temporal dependencies. This task plays a crucial role in urban planning by enabling the simulation of unobserved nodes. However, existing approaches face critical limitations: time-series generation methods fail to generalize to unseen nodes, while spatio-temporal generative models are either restricted to the trajectory generation task or dependent on auxiliary data inputs. To bridge these gaps, we propose a Knowledge Graph Guided Heterogeneity-Informed Diffusion Model (KGDiff) in this paper through the following key innovations. First, we design a geometry-aware mixture of experts integrating Euclidean, hyperbolic, and hyperspherical representations to comprehensively encode urban structural knowledge. Next, we present a learnable meta spatio-temporal pattern module that normalizes node-specific heterogeneity before the generation process, and a conditional denoising process that progressively transforms random noise into realistic samples under structural guidance. Finally, extensive experiments across real-world urban datasets demonstrate that KGDiff achieves state-of-the-art performance in generating realistic urban spatio-temporal data.

AAAI 2026 Conference Paper

Transferable Graph Condensation from the Causal Perspective

  • Huaming Du
  • Yijie Huang
  • Su Yao
  • Yiying Wang
  • Yueyang Zhou
  • Jingwen Yang
  • Jinshi Zhang
  • Han Ji

The increasing scale of graph datasets has significantly improved the performance of graph representation learning methods, but it has also introduced substantial training challenges. Graph dataset condensation techniques have emerged to compress large datasets into smaller yet information-rich datasets, while maintaining similar test performance. However, these methods strictly require downstream applications to match the original dataset and task, which often fails in cross-task and cross-domain scenarios. To address these challenges, we propose a novel causal-invariance-based and transferable graph dataset condensation method, named TGCC, providing effective and transferable condensed datasets. Specifically, to preserve domain-invariant knowledge, we first extract domain causal-invariant features from the spatial domain of the graph using causal interventions. Then, to fully capture the structural and feature information of the original graph, we perform enhanced condensation operations. Finally, through spectral-domain enhanced contrastive learning, we inject the causal-invariant features into the condensed graph, ensuring that the compressed graph retains the causal information of the original graph. Experimental results on five public datasets and our novel FinReport dataset demonstrate that TGCC achieves up to a 13.41% improvement in cross-task and cross-domain complex scenarios compared to existing methods, and achieves state-of-the-art performance on 5 out of 6 datasets in the single-dataset, single-task scenario.

ICRA 2025 Conference Paper

Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V

  • Peiyuan Zhi
  • Zhiyuan Zhang
  • Yu Zhao
  • Muzhi Han
  • Zeyu Zhang 0001
  • Zhitian Li
  • Ziyuan Jiao
  • Baoxiong Jia

Autonomous robot navigation and manipulation in open environments require reasoning and replanning with closed-loop feedback. In this work, we present COME-robot, the first closed-loop robotic system utilizing the GPT-4V vision-language foundation model for open-ended reasoning and adaptive planning in real-world scenarios. COME-robot incorporates two key innovative modules: (i) a multi-level open-vocabulary perception and situated reasoning module that enables effective exploration of the 3D environment and target object identification using commonsense knowledge and situated information, and (ii) an iterative closed-loop feedback and restoration mechanism that verifies task feasibility, monitors execution success, and traces failure causes across different modules for robust failure recovery. Through comprehensive experiments involving 8 challenging real-world mobile and tabletop manipulation tasks, COME-robot demonstrates a significant improvement in task success rate (∼35%) compared to state-of-the-art methods. We further conduct comprehensive analyses to elucidate how COME-robot's design facilitates failure recovery, free-form instruction following, and long-horizon task planning.

JBHI 2025 Journal Article

Estimation of Ankle Joint Moment From Plantar Pressure Through an Optimized Sensor Layout Using Genetic Algorithm and Deep Forest Regression

  • Mingxia Gong
  • Wenxuan Chen
  • Yih-Kuen Jan
  • Yu Zhao
  • Jie Yao
  • Yan Wang
  • Weiyan Ren
  • Fang Pu

Objective: Ankle joint moments are critical in gait analysis, with accurate assessments typically necessitating complex inverse dynamics modeling. Pressure insoles are widely used wearable devices that have shown feasibility in estimating joint angles. However, achieving cost-effective, high-precision estimation of ankle joint moment remains challenging. This study combines a genetic algorithm (GA) with deep forest regression (DFR) to optimize the number and layout of plantar pressure sensors, and to estimate ankle joint moment from plantar pressure. Methods: 26 healthy young participants were recruited to collect motion trajectories, ground reaction forces, and plantar pressure data while walking at fast, medium, and slow speeds. Ten gait cycles per speed per participant were analyzed for ankle joint moments using inverse dynamics, constituting the dataset. An optimization algorithm was constructed by combining GA with DFR, using the fitness function as the objective for sensor number and layout optimization. Leave-one-out cross-validation was employed to evaluate the precision of the model. Results: The highest fitness was achieved with an optimized layout using 9 sensors. The Pearson correlation coefficients for the sagittal, coronal, and transverse plane moments were 0.967 ± 0.014, 0.918 ± 0.027, and 0.894 ± 0.073, respectively. The optimized layout showed no significant difference in estimation accuracy across walking speeds (P > 0.05). Conclusion: The proposed GA-DFR algorithm is capable of accurately estimating ankle joint moment and optimizing the number and layout of sensors. Significance: The algorithm and optimized sensor layout enable accurate and rapid estimation of ankle joint moment from plantar pressure insoles with a favorable trade-off between precision and sensor cost.
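
As a loose illustration of this kind of search loop (not the authors' GA-DFR pipeline: the fitness function below is a hypothetical stand-in, whereas the real method scores layouts by DFR estimation accuracy), a genetic algorithm over fixed-size sensor subsets can be sketched as:

```python
import random

def ga_select(n_sensors, k, fitness, pop_size=20, generations=50, seed=0):
    """Toy elitist GA choosing k of n_sensors positions, maximizing `fitness`,
    which maps a frozenset of sensor indices to a score."""
    rng = random.Random(seed)
    pop = [frozenset(rng.sample(range(n_sensors), k)) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        survivors = pop[: pop_size // 2]          # elitism: keep the best half
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            pool = list(a | b)                    # crossover: union of two parents
            child = set(rng.sample(pool, min(k, len(pool))))
            if rng.random() < 0.2:                # mutation: swap out one sensor
                child.discard(rng.choice(list(child)))
            while len(child) < k:                 # top up to exactly k sensors
                child.add(rng.randrange(n_sensors))
            children.append(frozenset(child))
        pop = survivors + children
    return max(pop, key=fitness)

# Stand-in fitness: overlap with a hypothetical set of "informative" positions.
target = {1, 4, 7, 10, 13, 16, 19, 22, 25}
best = ga_select(n_sensors=30, k=9, fitness=lambda s: len(s & target))
print(sorted(best))
```

In the paper's setting the fitness call would wrap a full DFR training-and-validation run, which is why limiting the population and generation counts matters.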

IROS 2025 Conference Paper

GIPD: Global Intent Prediction and Decomposition of Cooperative Multi-Robot System in Non-Communication Environments

  • Yu Zhao
  • Zhe Liu
  • Haoyu Wei
  • Kai Wang
  • Haitao Wang
  • Duwen Zhai
  • Kefan Jin
  • Haibin Shao

In complex multi-robot application scenarios, particularly in dynamically adversarial, hazardous, or disaster environments, traditional cooperation paradigms face significant challenges due to unreliable or absent communication links. Achieving efficient cooperation in the absence of communication has become a key bottleneck limiting the performance of multi-robot systems. In this paper, we propose a Global Intent Prediction and Decomposition (GIPD) framework that enables robots to perform cooperative behavior without relying on communication. Each robot independently infers a globally consistent intent based solely on its local observations, ensuring implicit alignment across the system. Given the inferred global intent, robots autonomously determine their responsibilities and select the most appropriate tasks. They then base their local decision-making on the global intent, selected tasks, and individual observations, thereby facilitating effective execution and cooperation. We validate our approach on the MPE and SMAC benchmarks. Additionally, real-world experiments involving multiple ships demonstrate the effectiveness and practical applicability of the proposed GIPD method.

AIJ 2025 Journal Article

Grammar induction from visual, speech and text

  • Yu Zhao
  • Hao Fei
  • Shengqiong Wu
  • Meishan Zhang
  • Min Zhang
  • Tat-Seng Chua

Grammar Induction (GI) seeks to uncover the underlying grammatical rules and linguistic patterns of a language, positioning it as a pivotal research topic within Artificial Intelligence (AI). Although extensive research in GI has predominantly focused on text or other singular modalities, we reveal that GI can significantly benefit from rich heterogeneous signals, such as text, vision, and acoustics, in which features from distinct modalities essentially serve complementary roles to each other. With this intuition, this work introduces a novel unsupervised visual-audio-text grammar induction task (named VAT-GI) to induce constituent grammar trees from parallel image, text, and speech inputs. Inspired by the fact that language grammar natively exists beyond text, we argue that text need not be the predominant modality in grammar induction. Thus, we further introduce a textless setting of VAT-GI, wherein the task relies solely on visual and auditory inputs. To approach the task, we propose a visual-audio-text inside-outside recursive autoencoder (VaTiora) framework, which leverages rich modality-specific and complementary features for effective grammar parsing. In addition, a more challenging benchmark dataset is constructed to assess the generalization ability of the VAT-GI system. Experiments on two benchmark datasets demonstrate that our proposed VaTiora system is more effective in incorporating the various multimodal signals and presents new state-of-the-art performance on VAT-GI. Further in-depth analyses are provided to gain a deeper understanding of the VAT-GI task and how our VaTiora system advances it. Our code and data: https://github.com/LLLogen/VAT-GI/

NeurIPS 2025 Conference Paper

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

  • Zhaowei Wang
  • Wenhao Yu
  • Xiyu REN
  • Jipeng Zhang
  • Yu Zhao
  • Rohit Saxena
  • Liang Cheng
  • Ginny Wong

The rapid extension of context windows in large vision-language models has given rise to long-context vision-language models (LCVLMs), which are capable of handling hundreds of images with interleaved text tokens in a single forward pass. In this work, we introduce MMLongBench, the first benchmark covering a diverse set of long-context vision-language tasks, to evaluate LCVLMs effectively and thoroughly. MMLongBench is composed of 13,331 examples spanning five different categories of downstream tasks, such as Visual RAG and Many-Shot ICL. It also provides broad coverage of image types, including various natural and synthetic images. To assess the robustness of the models to different input lengths, all examples are delivered at five standardized input lengths (8K-128K tokens) via a cross-modal tokenization scheme that combines vision patches and text tokens. Through a thorough benchmarking of 46 closed-source and open-source LCVLMs, we provide a comprehensive analysis of the current models' vision-language long-context ability. Our results show that: i) performance on a single task is a weak proxy for overall long-context capability; ii) both closed-source and open-source models face challenges in long-context vision-language tasks, indicating substantial room for future improvement; iii) models with stronger reasoning ability tend to exhibit better long-context performance. By offering wide task coverage, various image types, and rigorous length control, MMLongBench provides the missing foundation for diagnosing and advancing the next generation of LCVLMs.

AAAI 2025 Conference Paper

Structured Packing in LLM Training Improves Long Context Utilization

  • Konrad Staniszewski
  • Szymon Tworkowski
  • Sebastian Jaszczur
  • Yu Zhao
  • Henryk Michalewski
  • Łukasz Kuciński
  • Piotr Miłoś

Recent advancements in long-context language modeling have attracted significant attention, yet their practical applications often suffer from suboptimal context utilization. To efficiently address this issue, we introduce the Structured Packing for Long Context, SPLiCe, a method that uses retrieval to collate mutually relevant documents into long training samples. We demonstrate that SPLiCe improves performance on long-context tasks, particularly by achieving perfect accuracy on the synthetic Needle in the Haystack benchmark, and effectively mitigating the ‘lost-in-the-middle’ phenomenon often observed in large language models. Notably, these long-context capabilities also extend to realistic downstream tasks, such as Qasper, across multiple model sizes—3B, 7B, and 13B—and are achieved with only brief fine-tuning on 2-6 billion tokens. We supplement these results with a detailed analysis of SPLiCe, examining the impact of hyperparameter choices, the different mixtures and proportions of SPLiCe-generated training data, and the choice of the retriever. We also study the transfer of long-context utilization skills between the modalities. An intriguing finding from our analysis is that training on a corpus of code can enhance performance on natural language tasks.
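
The core idea, collating retrieved documents into one long training sample, can be sketched minimally. This is an illustration under stated assumptions, not the paper's implementation: Jaccard similarity over token sets stands in for the real retriever, and the budget counts whitespace-split tokens.

```python
def jaccard(a, b):
    """Similarity between two token sets."""
    return len(a & b) / len(a | b) if a | b else 0.0

def splice_pack(docs, budget):
    """Greedily chain mutually relevant documents into one long training
    sample, in the spirit of SPLiCe: always append the document most
    similar to the last one added, until the token budget is exceeded."""
    token_sets = [set(d.split()) for d in docs]
    remaining = set(range(1, len(docs)))
    sample, used, current = [docs[0]], len(docs[0].split()), 0
    while remaining and used < budget:
        nxt = max(remaining, key=lambda i: jaccard(token_sets[current], token_sets[i]))
        remaining.remove(nxt)
        sample.append(docs[nxt])
        used += len(docs[nxt].split())
        current = nxt
    return sample

docs = [
    "gradient descent optimizer learning rate",
    "cats purr and sleep all day",
    "learning rate schedules for gradient descent",
    "dogs bark at the mailman",
]
packed = splice_pack(docs, budget=12)
print(packed)  # the two related optimization documents end up adjacent
```

Placing related documents adjacently in a sample is what gives the model long-range dependencies worth attending to, which is the effect the paper measures.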

ICLR 2025 Conference Paper

Training-free LLM-generated Text Detection by Mining Token Probability Sequences

  • Yihuai Xu
  • Yongwei Wang
  • Yifei Bi
  • Huangsen Cao
  • Zhouhan Lin
  • Yu Zhao
  • Fei Wu 0001

Large language models (LLMs) have demonstrated remarkable capabilities in generating high-quality texts across diverse domains. However, the potential misuse of LLMs has raised significant concerns, underscoring the urgent need for reliable detection of LLM-generated texts. Conventional training-based detectors often struggle with generalization, particularly in cross-domain and cross-model scenarios. In contrast, training-free methods, which focus on inherent discrepancies through carefully designed statistical features, offer improved generalization and interpretability. Despite this, existing training-free detection methods typically rely on global text sequence statistics, neglecting the modeling of local discriminative features, thereby limiting their detection efficacy. In this work, we introduce a novel training-free detector, termed Lastde, that synergizes local and global statistics for enhanced detection (the code and data are released at https://github.com/TrustMedia-zju/Lastde_Detector). For the first time, we introduce time series analysis to LLM-generated text detection, capturing the temporal dynamics of token probability sequences. By integrating these local statistics with global ones, our detector reveals significant disparities between human- and LLM-generated texts. We also propose an efficient alternative, Lastde++, to enable real-time detection. Extensive experiments on six datasets involving cross-domain, cross-model, and cross-lingual detection scenarios, under both white-box and black-box settings, demonstrate that our method consistently achieves state-of-the-art performance. Furthermore, our approach exhibits greater robustness against paraphrasing attacks compared to existing baseline methods.
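
To make the local-vs-global distinction concrete, here is a toy statistic (purely illustrative, not the actual Lastde detector, which uses a more sophisticated time-series descriptor): compare the mean token log-probability (global) with the average sliding-window standard deviation (local), on the intuition that LLM text tends to have flatter, less bursty token-probability dynamics than human text.

```python
import statistics

def toy_detector_stats(token_logprobs, window=5):
    """Return (global, local) statistics for a token log-probability sequence:
    global = mean log-prob of the whole sequence;
    local  = mean standard deviation over sliding windows."""
    global_stat = statistics.mean(token_logprobs)
    local_stds = [statistics.pstdev(token_logprobs[i:i + window])
                  for i in range(len(token_logprobs) - window + 1)]
    local_stat = statistics.mean(local_stds)
    return global_stat, local_stat

# Hypothetical sequences: LLM-like (uniformly probable) vs human-like (bursty).
llm_like = [-1.0, -1.2, -0.9, -1.1, -1.0, -1.3, -1.0, -1.1]
human_like = [-0.2, -4.0, -0.5, -3.5, -0.3, -5.0, -0.4, -2.8]
print(toy_detector_stats(llm_like))
print(toy_detector_stats(human_like))
```

A purely global detector sees only the first number; the second captures the local dynamics the paper argues are discriminative.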

AAAI 2024 Conference Paper

A Label Disambiguation-Based Multimodal Massive Multiple Instance Learning Approach for Immune Repertoire Classification

  • Fan Xu
  • Yu Zhao
  • Bingzhe Wu
  • Yueshan Huang
  • Qin Ren
  • Yang Xiao
  • Bing He
  • Jie Zheng

An individual human's immune repertoire consists of a huge set of adaptive immune receptors at a given time point, representing that individual's adaptive immune state. Immune repertoire classification and associated receptor identification have the potential to make a transformative contribution to the development of novel vaccines and therapies. The vast number of instances and exceedingly low witness rate pose a great challenge to immune repertoire classification, which can be formulated as a Massive Multiple Instance Learning (MMIL) problem. Traditional MIL methods, at both bag level and instance level, confront substantial computational burden or supervision ambiguity when handling massive instances. To address these issues, we propose a novel label disambiguation-based multimodal massive multiple instance learning approach (LaDM³IL) for immune repertoire classification. LaDM³IL adapts the instance-level MIL paradigm to deal with the high computational cost and employs a specially designed label disambiguation module for label correction, mitigating the impact of misleading supervision. To achieve a more comprehensive representation of each receptor, LaDM³IL leverages a multimodal fusion module with gating-based attention and tensor fusion to integrate the information from gene segments and amino acid (AA) sequences of each immune receptor. Extensive experiments on the Cytomegalovirus (CMV) and Cancer datasets demonstrate the superior performance of the proposed LaDM³IL for both immune repertoire classification and associated receptor identification tasks. The code is publicly available at https://github.com/Josie-xufan/LaDM3IL.

NeurIPS 2024 Conference Paper

DePLM: Denoising Protein Language Models for Property Optimization

  • Zeyuan Wang
  • Keyan Ding
  • Ming Qin
  • Xiaotong Li
  • Xiang Zhuang
  • Yu Zhao
  • Jianhua Yao
  • Qiang Zhang

Protein optimization is a fundamental biological task aimed at enhancing the performance of proteins by modifying their sequences. Computational methods primarily rely on evolutionary information (EI) encoded by protein language models (PLMs) to predict the fitness landscape for optimization. However, these methods suffer from a few limitations. (1) Evolutionary processes involve the simultaneous consideration of multiple functional properties, often overshadowing the specific property of interest. (2) Measurements of these properties tend to be tailored to experimental conditions, leading to reduced generalizability of trained models to novel proteins. To address these limitations, we introduce Denoising Protein Language Models (DePLM), a novel approach that refines the evolutionary information embodied in PLMs for improved protein optimization. Specifically, we conceptualize EI as comprising both property-relevant and irrelevant information, with the latter acting as “noise” for the optimization task at hand. Our approach involves denoising this EI in PLMs through a diffusion process conducted in the rank space of property values, thereby enhancing model generalization and ensuring dataset-agnostic learning. Extensive experimental results have demonstrated that DePLM not only surpasses the state-of-the-art in mutation effect prediction but also exhibits strong generalization capabilities for novel proteins.

EAAI 2024 Journal Article

Differentiable sampling based efficient architecture search for automatic fault diagnosis

  • Xingwu Zhang
  • Rui Ma
  • Yu Zhao
  • Chenxi Wang
  • Zhibin Zhao
  • Xuefeng Chen

Intelligent diagnosis of rotating machinery has developed rapidly, but different methods show fluctuating performance and require fussy design, limiting their effectiveness in practical applications. It would therefore be valuable to automatically generate the optimal method for a given diagnosis task, as differentiable neural architecture search (DNAS) does. However, three challenges severely restrict DNAS methods in industrial scenarios: 1) vibration signals are multi-scale and non-stationary; 2) the huge memory cost of supernet-based search is unsuitable for practical diagnosis; 3) manual architecture derivation causes a performance collapse between architecture search and practical diagnosis. Thus, we propose Differentiable Sampling based Efficient Architecture Search (DS-EAS), which generates architectures by differentiable sampling. First, the involution operator is introduced to adaptively extract critical features from noisy signals. Second, Gumbel Max-Softmax is adopted to forward-sample and backward-propagate the gradient on a single sub-architecture at each iteration, alleviating the huge memory cost. Third, progressive pruning is proposed to eliminate manual discretization error, leading to the final architecture with zero operators. Based on the searched architecture, a deeper one is built to test its real performance. A traction motor experiment is performed to assess the performance of DS-EAS on three different sample cases. Compared with other state-of-the-art methods, the superior performance of DS-EAS is verified.
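
The Gumbel-max trick at the heart of such single-path sampling is easy to sketch (an illustrative fragment with made-up logits, not the DS-EAS code; in the actual method a soft Gumbel-Softmax relaxation carries gradients back to the architecture parameters):

```python
import math
import random

def gumbel_noise(rng):
    """Draw standard Gumbel(0, 1) noise via inverse transform sampling."""
    u = max(rng.random(), 1e-300)        # guard against log(0)
    return -math.log(-math.log(u))

def gumbel_max_sample(logits, rng):
    """Sample index i with probability softmax(logits)[i] without computing
    softmax, by taking argmax_i (logits[i] + Gumbel noise)."""
    noisy = [l + gumbel_noise(rng) for l in logits]
    return max(range(len(logits)), key=noisy.__getitem__)

rng = random.Random(0)
logits = [2.0, 0.5, 0.0]                 # architecture weights for 3 candidate ops
counts = [0, 0, 0]
for _ in range(10000):
    counts[gumbel_max_sample(logits, rng)] += 1
print([c / 10000 for c in counts])       # empirical frequencies ≈ softmax(logits)
```

Sampling exactly one candidate operation per iteration is what keeps only a single sub-architecture in memory, rather than the whole supernet.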

AAAI 2024 Conference Paper

Harnessing Holistic Discourse Features and Triadic Interaction for Sentiment Quadruple Extraction in Dialogues

  • Bobo Li
  • Hao Fei
  • Lizi Liao
  • Yu Zhao
  • Fangfang Su
  • Fei Li
  • Donghong Ji

Dialogue Aspect-based Sentiment Quadruple (DiaASQ) is a newly emergent task aiming to extract sentiment quadruples (i.e., targets, aspects, opinions, and sentiments) from conversations. While showing promising performance, the prior DiaASQ approach unfortunately falls short on the key challenges of the task, including insufficient modeling of discourse features and a lack of cohesive quadruple extraction, which hinders further improvement. To this end, we introduce a novel framework that not only capitalizes on comprehensive discourse-feature modeling but also captures intrinsic interactions for optimal quadruple extraction. On the one hand, drawing upon multiple discourse features, our approach constructs a token-level heterogeneous graph and enhances token interactions through a heterogeneous attention network. On the other hand, we propose a novel triadic scorer that strengthens weak token relations within a quadruple, thereby enhancing the cohesion of quadruple extraction. Experimental results on the DiaASQ benchmark show that our model significantly outperforms existing baselines on both the English and Chinese datasets. Our code is available at https://bit.ly/3v27pqA.

NeurIPS 2024 Conference Paper

Synergistic Dual Spatial-aware Generation of Image-to-text and Text-to-image

  • Yu Zhao
  • Hao Fei
  • Xiangtai Li
  • Libo Qin
  • Jiayi Ji
  • Hongyuan Zhu
  • Meishan Zhang
  • Min Zhang

In the visual spatial understanding (VSU) field, spatial image-to-text (SI2T) and spatial text-to-image (ST2I) are two fundamental tasks that appear in dual form. Existing methods for standalone SI2T or ST2I perform imperfectly in spatial understanding, due to the difficulty of 3D-wise spatial feature modeling. In this work, we consider modeling SI2T and ST2I together under a dual learning framework. Within this dual framework, we propose to represent the 3D spatial scene features with a novel 3D scene graph (3DSG) representation that can be shared between and is beneficial to both tasks. Further, inspired by the intuition that the easier 3D→image and 3D→text processes exist symmetrically in ST2I and SI2T, respectively, we propose the Spatial Dual Discrete Diffusion (SD³) framework, which utilizes the intermediate features of the 3D→X processes to guide the hard X→3D processes, such that ST2I and SI2T benefit each other overall. On the visual spatial understanding dataset VSD, our system significantly outperforms mainstream T2I and I2T methods. Further in-depth analysis reveals how our dual learning strategy advances performance.

IJCAI 2023 Conference Paper

A Noisy-Label-Learning Formulation for Immune Repertoire Classification and Disease-Associated Immune Receptor Sequence Identification

  • MingCai Chen
  • Yu Zhao
  • Zhonghuang Wang
  • Bing He
  • Jianhua Yao

Immune repertoire classification, a typical multiple instance learning (MIL) problem, is a frontier research topic in computational biology that makes transformative contributions to new vaccines and immune therapies. However, traditional instance-space MIL, which directly assigns bag-level labels to instances, suffers from a massive amount of noisy labels and an extremely low witness rate. In this work, we propose a noisy-label-learning formulation to solve the immune repertoire classification task. To remedy the inaccurate supervision of repertoire-level labels for a sequence-level classifier, we design a robust training strategy: the initial labels are smoothed to be asymmetric and are progressively corrected using the model's predictions throughout the training process. Furthermore, two models with the same architecture but different parameter initialization are co-trained simultaneously to remedy the known “confirmation bias” problem in the self-training-like schema. As a result, we obtain accurate sequence-level classification and, subsequently, repertoire-level classification. Experiments on the Cytomegalovirus (CMV) and Cancer datasets demonstrate our method's effectiveness and superior performance on sequence-level and repertoire-level tasks. Code is available at https://github.com/TencentAILabHealthcare/NLL-IRC.
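
The smoothing-then-progressive-correction recipe can be illustrated in a few lines (a schematic sketch only, with made-up numbers and a plain exponential-moving-average update standing in for the paper's correction rule):

```python
def smooth(hard_label, n_classes, eps=0.2):
    """Soften a bag-level hard label assigned to an instance (a simple
    stand-in for the paper's asymmetric smoothing)."""
    return [1 - eps if c == hard_label else eps / (n_classes - 1)
            for c in range(n_classes)]

def correct_labels(soft_labels, predictions, momentum=0.9):
    """One round of progressive correction: move each soft label a small
    step toward the model's current prediction."""
    return [[momentum * y + (1 - momentum) * p for y, p in zip(ys, ps)]
            for ys, ps in zip(soft_labels, predictions)]

labels = [smooth(1, 2), smooth(1, 2)]      # both instances inherit bag label 1
preds = [[0.95, 0.05], [0.10, 0.90]]       # model is confident instance 0 is class 0
for _ in range(20):                        # labels drift toward confident predictions
    labels = correct_labels(labels, preds)
print([[round(v, 2) for v in ys] for ys in labels])  # → [[0.86, 0.14], [0.11, 0.89]]
```

With fixed predictions the update has the closed form m^n·y₀ + (1 − m^n)·p, so the first instance's label is gradually "corrected" away from the noisy bag label, which is the behavior the training strategy relies on.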

AAAI 2023 Conference Paper

Causal Conditional Hidden Markov Model for Multimodal Traffic Prediction

  • Yu Zhao
  • Pan Deng
  • Junting Liu
  • Xiaofeng Jia
  • Mulan Wang

Multimodal traffic flow can reflect the health of the transportation system, and its prediction is crucial to urban traffic management. Recent works overemphasize spatio-temporal correlations of traffic flow, ignoring the physical concepts that lead to the generation of observations and their causal relationships. Spatio-temporal correlations are unstable under different conditions, and spurious correlations may exist in observations. In this paper, we analyze the physical concepts affecting the generation of multimodal traffic flow from the perspective of the observation generation principle and propose a Causal Conditional Hidden Markov Model (CCHMM) to predict multimodal traffic flow. In the latent variable inference stage, a posterior network disentangles the causal representations of the concepts of interest from conditional information and observations, and a causal propagation module mines their causal relationships. In the data generation stage, a prior network samples the causal latent variables from the prior distribution and feeds them into the generator to generate multimodal traffic flow. We use a mutually supervised training method for the prior and posterior networks to enhance the identifiability of the model. Experiments on real-world datasets show that CCHMM can effectively disentangle the causal representations of concepts of interest, identify causality, and accurately predict multimodal traffic flow.

AIIM 2023 Journal Article

DDI-GCN: Drug-drug interaction prediction via explainable graph convolutional networks

  • Yi Zhong
  • Houbing Zheng
  • Xiaoming Chen
  • Yu Zhao
  • Tingfang Gao
  • Huiqun Dong
  • Heng Luo
  • Zuquan Weng

Drug-drug interactions (DDI) may lead to unexpected side effects, which is a growing concern in both academia and industry. Many DDIs have been reported, but the underlying mechanisms are not well understood. Predicting and understanding DDIs can help researchers improve drug safety and protect patient health. Here, we introduce DDI-GCN, a method that utilizes graph convolutional networks (GCN) to predict DDIs based on chemical structures. We demonstrate that this method achieves state-of-the-art prediction performance on an independent hold-out set. It can also provide visualizations of structural features associated with DDIs, which can help us study the underlying mechanisms. To make it easy and accessible to use, we developed a web server for DDI-GCN, which is freely available at http://wengzq-lab.cn/ddi/.

YNIMG 2023 Journal Article

Functional alterations in bipartite network of white and grey matters during aging

  • Yurui Gao
  • Yu Zhao
  • Muwei Li
  • Richard D. Lawless
  • Kurt G. Schilling
  • Lyuan Xu
  • Andrea T. Shafer
  • Lori L. Beason-Held

The effects of normal aging on functional connectivity (FC) within various gray matter (GM) brain networks have been well documented. However, the age effects on networks of FC between white matter (WM) and GM, namely WM-GM FC, remain unclear. Evaluating crucial properties, such as global efficiency (GE), for a WM-GM FC network poses a challenge due to the absence of the closed triangle paths that are essential for assessing network properties in traditional graph models. In this study, we propose a bipartite graph model to characterize the WM-GM FC network and quantify these challenging network properties. Leveraging this model, we assessed WM-GM FC network properties at multiple scales across 1,462 cognitively normal subjects aged 22-96 years from three repositories (ADNI, BLSA and OASIS-3) and investigated the age effects on these properties throughout adulthood and during late adulthood (age ≥ 70 years). Our findings reveal that (1) heterogeneous alterations occurred in region-specific WM-GM FC over adulthood, with decline predominating during late adulthood; (2) the FC density of WM bundles engaged in memory, executive function and processing speed declined with age over adulthood, particularly in later years; and (3) the GE of the attention, default, somatomotor, frontoparietal and limbic networks reduced with age over adulthood, and the GE of the visual network declined during late adulthood. These findings provide unprecedented insights into multi-scale alterations in networks of WM-GM functional synchronization during normal aging. Furthermore, our bipartite graph model offers an extendable framework for quantifying WM-engaged networks, which may contribute to a wide range of neuroscience research.
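
For intuition, global efficiency on a bipartite graph can be computed from breadth-first-search shortest paths (an illustrative sketch using the generic definition of GE, the mean inverse shortest-path length over node pairs, not the authors' specific quantification):

```python
from collections import deque

def global_efficiency(wm, gm, edges):
    """Global efficiency of a bipartite graph whose edges connect WM nodes
    to GM nodes only, so every path alternates between the two parts."""
    nodes = list(wm) + list(gm)
    adj = {n: set() for n in nodes}
    for w, g in edges:
        adj[w].add(g)
        adj[g].add(w)
    total, pairs = 0.0, 0
    for src in nodes:
        dist = {src: 0}                      # BFS from src (unweighted graph)
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        for dst in nodes:
            if dst != src:
                pairs += 1
                if dst in dist:              # unreachable pairs contribute 0
                    total += 1.0 / dist[dst]
    return total / pairs

# Toy network: two WM bundles, three GM regions.
ge = global_efficiency(["w1", "w2"], ["g1", "g2", "g3"],
                       [("w1", "g1"), ("w1", "g2"), ("w2", "g2"), ("w2", "g3")])
print(round(ge, 3))  # → 0.642
```

Note that no triangle-based measure (e.g. clustering coefficient) appears here: path-based properties like GE remain well defined on bipartite graphs, which is what makes this framing workable.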

EAAI 2023 Journal Article

Inference and analysis of a new evidential reasoning rule-based performance evaluation model

  • Jie Wang
  • Zhi-Jie Zhou
  • Peng-Yun Ning
  • Shuai-Tong Liu
  • Xiang-Yi Zhou
  • Yu Zhao

In current studies on the evidential reasoning (ER) rule, a performance evaluation model that utilizes the ER rule to combine evidence constituted by a single observation indicator (single indicator-based ER rule, SER) has been developed with excellent scalability. However, the belief distribution (BD) of the SER-based evaluation model is composed of a set of single grades that describe the system performance and the corresponding belief degrees. As a result, common uncertain judgments involving local ignorance cannot be considered in the inference process, which reduces the accuracy and validity of the evaluation results. In this paper, a new SER-based performance evaluation model with an extended BD is proposed. Firstly, on the basis of the existing BD in SER, new elements capable of describing local ignorance are introduced. Secondly, by calculating and optimizing the relevant parameters, the new evaluation model is formed. Thirdly, according to the Stone–Weierstrass theorem and information entropy theory, the approximation ability and uncertainty of the evaluation model are discussed respectively. Finally, a practical example is given to illustrate the potential applications of the proposed model in engineering practice.

JBHI Journal 2023 Journal Article

MSHT: Multi-Stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer

  • Tianyi Zhang
  • Yunlu Feng
  • Yu Zhao
  • Guangda Fan
  • Aiming Yang
  • Shangqing Lyu
  • Peng Zhang
  • Fan Song

Pancreatic cancer is one of the most malignant cancers with high mortality. The rapid on-site evaluation (ROSE) technique can significantly accelerate the diagnostic workflow of pancreatic cancer by immediately analyzing the fast-stained cytopathological images with on-site pathologists. However, the broader expansion of ROSE diagnosis has been hindered by the shortage of experienced pathologists. Deep learning has great potential for the automatic classification of ROSE images in diagnosis. But it is challenging to model the complicated local and global image features. The traditional convolutional neural network (CNN) structure can effectively extract spatial features, while it tends to ignore global features when the prominent local features are misleading. In contrast, the Transformer structure has excellent advantages in capturing global features and long-range relations, while it has limited ability in utilizing local features. We propose a multi-stage hybrid Transformer (MSHT) to combine the strengths of both, where a CNN backbone robustly extracts multi-stage local features at different scales as the attention guidance, and a Transformer encodes them for sophisticated global modeling. Going beyond the strength of each single method, the MSHT can simultaneously enhance the Transformer global modeling ability with the local guidance from CNN features. To evaluate the method in this unexplored field, a dataset of 4240 ROSE images is collected where MSHT achieves 95.68% in classification accuracy with more accurate attention regions. The distinctively superior results compared to the state-of-the-art models make MSHT extremely promising for cytopathological image analysis.

IJCAI Conference 2023 Conference Paper

Multi-Scale Subgraph Contrastive Learning

  • Yanbei Liu
  • Yu Zhao
  • Xiao Wang
  • Lei Geng
  • Zhitao Xiao

Graph-level contrastive learning, which aims to learn representations for each graph by contrasting two augmented graphs, has attracted considerable attention. Previous studies usually simply treat a graph and its augmented graph as a positive pair, and otherwise as a negative pair. However, it is well known that graph structure is always complex and multi-scale, which gives rise to a fundamental question: after graph augmentation, will the previous assumption still hold in reality? Through an experimental analysis, we discover that the semantic information of an augmented graph structure may not be consistent with the original graph structure, and that whether two augmented graphs form a positive or negative pair is highly related to the multi-scale structures. Based on this finding, we propose a multi-scale subgraph contrastive learning architecture which is able to characterize the fine-grained semantic information. Specifically, we generate global and local views at different scales based on subgraph sampling, and construct multiple contrastive relationships according to their semantic associations to provide richer self-supervised signals. Extensive experiments and parameter analyses on eight real-world graph classification datasets demonstrate the effectiveness of the proposed method.
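
As an illustration of the contrastive objective underlying such architectures (a generic InfoNCE-style loss, not the paper's exact formulation), the following sketch pulls an anchor view toward its positive views and away from negatives; the function name and temperature value are assumptions.

```python
import math

def info_nce(anchor, positives, negatives, tau=0.5):
    """InfoNCE-style contrastive loss: the anchor view should be more
    similar (cosine) to positive views than to negative ones.
    Lower loss = positives dominate the softmax over similarities."""
    def cos(a, b):
        na = sum(x * x for x in a) ** 0.5
        nb = sum(x * x for x in b) ** 0.5
        return sum(x * y for x, y in zip(a, b)) / (na * nb)
    pos = [math.exp(cos(anchor, p) / tau) for p in positives]
    neg = [math.exp(cos(anchor, n) / tau) for n in negatives]
    denom = sum(pos) + sum(neg)
    # average negative log-probability of each positive pair
    return -sum(math.log(p / denom) for p in pos) / len(pos)
```

A multi-scale variant would evaluate such a loss per scale (global vs. local subgraph views), with the positive/negative assignment decided by the semantic associations the paper describes.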

EAAI Journal 2023 Journal Article

Single-image HDR reconstruction by dual learning the camera imaging process

  • Lei She
  • Mao Ye
  • Shuai Li
  • Yu Zhao
  • Ce Zhu
  • Hu Wang

It is a very challenging problem to reconstruct a high dynamic range (HDR) image from a single exposure image. Three problems arise: the many-to-many mapping between low dynamic range (LDR) images and HDR images, the image quality problem caused by the change of dynamic range, and the problem of unpaired LDR–HDR training images. These problems can be solved to some extent through a dual learning framework that simultaneously learns the forward and reverse camera imaging processes. This framework is divided into a primary module, which reconstructs HDR from LDR, and a secondary module, which reversely maps the HDR back to LDR. The secondary module guides the learning of the primary module by constraining the outputs of the primary module. An attention mechanism is then used to solve the problem of unnatural perception caused by the change of dynamic range. Finally, taking advantage of our dual learning framework, unpaired data is further explored to train our model, which enriches the training samples. Compared with the state-of-the-art methods, a large number of quantitative and qualitative experiments confirm that our method achieves better performance.

AAAI Conference 2023 Conference Paper

Spatio-Temporal Neural Structural Causal Models for Bike Flow Prediction

  • Pan Deng
  • Yu Zhao
  • Junting Liu
  • Xiaofeng Jia
  • Mulan Wang

Bike flow prediction is the fundamental issue of managing bike-sharing systems, a representative form of public transportation. Recent methods overemphasize the spatio-temporal correlations in the data, ignoring the effects of contextual conditions on the transportation system and the inter-regional time-varying causality. In addition, due to the disturbance of incomplete observations in the data, random contextual conditions lead to spurious correlations between data and features, making the prediction of the model ineffective in special scenarios. To overcome this issue, we propose a Spatio-temporal Neural Structural Causal Model (STNSCM) from the perspective of causality. First, we build a causal graph to describe the traffic prediction, and further analyze the causal relationship between the input data, contextual conditions, spatio-temporal states, and prediction results. Second, we propose to apply the frontdoor criterion to eliminate confounding biases in the feature extraction process. Finally, we propose a counterfactual representation reasoning module to extrapolate the spatio-temporal state under the factual scenario to future counterfactual scenarios to improve the prediction performance. Experiments on real-world datasets demonstrate the superior performance of our model, especially its resistance to fluctuations caused by the external environment. The source code and data will be released.

NeurIPS Conference 2023 Conference Paper

WildfireSpreadTS: A dataset of multi-modal time series for wildfire spread prediction

  • Sebastian Gerard
  • Yu Zhao
  • Josephine Sullivan

We present a multi-temporal, multi-modal remote-sensing dataset for predicting how active wildfires will spread at a resolution of 24 hours. The dataset consists of 13607 images across 607 fire events in the United States from January 2018 to October 2021. For each fire event, the dataset contains a full time series of daily observations, containing detected active fires and variables related to fuel, topography and weather conditions. The dataset is challenging due to: a) its inputs being multi-temporal, b) the high number of 23 multi-modal input channels, c) highly imbalanced labels and d) noisy labels, due to smoke, clouds, and inaccuracies in the active fire detection. The underlying complexity of the physical processes adds to these challenges. Compared to existing public datasets in this area, WildfireSpreadTS allows for multi-temporal modeling of spreading wildfires, due to its time series structure. Furthermore, we provide additional input modalities and a high spatial resolution of 375m for the active fire maps. We publish this dataset to encourage further research on this important task with multi-temporal, noise-resistant or generative methods, uncertainty estimation or advanced optimization techniques that deal with the high-dimensional input space.

YNIMG Journal 2022 Journal Article

Detection of functional activity in brain white matter using fiber architecture informed synchrony mapping

  • Yu Zhao
  • Yurui Gao
  • Zhongliang Zu
  • Muwei Li
  • Kurt G. Schilling
  • Adam W. Anderson
  • Zhaohua Ding
  • John C. Gore

A general linear model is widely used for analyzing fMRI data, in which the blood oxygenation-level dependent (BOLD) signals in gray matter (GM) evoked in response to neural stimulation are modeled by convolving the time course of the expected neural activity with a canonical hemodynamic response function (HRF) obtained a priori. The maps of brain activity produced reflect the magnitude of local BOLD responses. However, detecting BOLD signals in white matter (WM) is more challenging as the BOLD signals are weaker and the HRF is different, and may vary more across the brain. Here we propose a model-free approach to detect changes in BOLD signals in WM by measuring task-evoked increases of BOLD signal synchrony in WM fibers. The proposed approach relies on a simple assumption that, in response to a functional task, BOLD signals in relevant fibers are modulated by stimulus-evoked neural activity and thereby show greater synchrony than when measured in a resting state, even if their magnitudes do not change substantially. This approach is implemented in two technical stages. First, for each voxel a fiber-architecture-informed spatial window is created with orientation distribution functions constructed from diffusion imaging data. This provides the basis for defining neighborhoods in WM that share similar local fiber architectures. Second, a modified principal component analysis (PCA) is used to estimate the synchrony of BOLD signals in each spatial window. The proposed approach is validated using a 3T fMRI dataset from the Human Connectome Project (HCP) at a group level. The results demonstrate that neural activity can be reliably detected as increases in fMRI signal synchrony within WM fibers that are engaged in a task with high sensitivities and reproducibility.
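
The second stage's synchrony measure can be illustrated with a small stdlib sketch: the fraction of total variance captured by the first principal component of the signals in a spatial window, estimated here with power iteration rather than a full PCA. This is a simplified stand-in for the modified PCA described above, and the function name is an assumption.

```python
import random

def synchrony(signals, iters=200):
    """Synchrony of a set of time series: lambda_1 / trace(C), the share
    of total variance explained by the dominant eigenvector of the
    covariance matrix C. Power iteration estimates lambda_1."""
    n = len(signals)            # voxels in the spatial window
    t = len(signals[0])         # time points
    means = [sum(s) / t for s in signals]
    centered = [[x - m for x in s] for s, m in zip(signals, means)]
    # covariance matrix C (n x n) across voxels
    C = [[sum(a * b for a, b in zip(centered[i], centered[j])) / (t - 1)
          for j in range(n)] for i in range(n)]
    v = [random.random() + 0.1 for _ in range(n)]
    for _ in range(iters):      # power iteration for dominant eigenvector
        w = [sum(C[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    lam1 = sum(v[i] * sum(C[i][j] * v[j] for j in range(n)) for i in range(n))
    trace = sum(C[i][i] for i in range(n))
    return lam1 / trace
```

Identical (perfectly synchronous) signals give a value of 1, while uncorrelated signals spread variance across components and score lower, matching the intuition that task-evoked modulation raises synchrony within engaged fibers.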

AAAI Conference 2022 Conference Paper

Improving Human-Object Interaction Detection via Phrase Learning and Label Composition

  • Zhimin Li
  • Cheng Zou
  • Yu Zhao
  • Boxun Li
  • Sheng Zhong

Human-Object Interaction (HOI) detection is a fundamental task in high-level human-centric scene understanding. We propose PhraseHOI, containing an HOI branch and a novel phrase branch, to leverage language priors and improve relation expression. Specifically, the phrase branch is supervised by semantic embeddings, whose ground truths are automatically converted from the original HOI annotations without extra human effort. Meanwhile, a novel label composition method is proposed to deal with the long-tailed problem in HOI, which composites novel phrase labels from semantic neighbors. Further, to optimize the phrase branch, a loss composed of a distilling loss and a balanced triplet loss is proposed. Extensive experiments are conducted to prove the effectiveness of the proposed PhraseHOI, which achieves significant improvement over the baseline and surpasses previous state-of-the-art methods on the Full and NonRare settings of the challenging HICO-DET benchmark.

JBHI Journal 2020 Journal Article

Coarse-to-Fine Adversarial Networks and Zone-Based Uncertainty Analysis for NK/T-Cell Lymphoma Segmentation in CT/PET Images

  • Xiaobin Hu
  • Rui Guo
  • Jieneng Chen
  • Hongwei Li
  • Diana Waldmannstetter
  • Yu Zhao
  • Biao Li
  • Kuangyu Shi

Extranodal natural killer/T cell lymphoma (ENKL), nasal type, is a rare disease with a low survival rate that primarily affects Asian and South American populations. Segmentation of ENKL lesions is crucial for clinical decision support and treatment planning. This paper is the first study on computer-aided diagnosis systems for the ENKL segmentation problem. We propose an automatic, coarse-to-fine approach for ENKL segmentation using adversarial networks. In the coarse stage, we extract the region of interest bounding the lesions utilizing a segmentation neural network. In the fine stage, we use an adversarial segmentation network and further introduce a multi-scale L1 loss function to drive the network to learn both global and local features. The generator and discriminator are alternately trained by backpropagation in an adversarial fashion in a min-max game. Furthermore, we present the first exploration of zone-based uncertainty estimates based on the Monte Carlo dropout technique in the context of deep networks for medical image segmentation. Specifically, we propose uncertainty criteria based on the lesion and the background, and then linearly normalize them to a specific interval. This not only provides a crucial criterion for evaluating the superiority of the algorithm, but also permits subsequent optimization by engineers and revision by clinicians after quantitatively understanding the main source of uncertainty, whether from the background or the lesion zone. Experimental results demonstrate that the proposed method is more effective and lesion-zone stable than state-of-the-art deep-learning-based segmentation models.
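
The Monte Carlo dropout idea behind such zone-based uncertainty estimates can be sketched as follows: repeated stochastic forward passes with dropout active at inference yield a per-output variance that serves as an uncertainty map. The `predict` callable and its dropout behavior below are hypothetical stand-ins for the actual segmentation network.

```python
import random

def mc_dropout_uncertainty(predict, x, n_samples=50, seed=0):
    """Monte Carlo dropout sketch: run n_samples stochastic forward
    passes of `predict(x, rng)` (a model whose dropout mask is drawn
    from `rng` at inference time) and return the per-output mean and
    variance; the variance acts as a voxel-wise uncertainty estimate."""
    rng = random.Random(seed)
    samples = [predict(x, rng) for _ in range(n_samples)]
    n_out = len(samples[0])
    mean = [sum(s[k] for s in samples) / n_samples for k in range(n_out)]
    var = [sum((s[k] - mean[k]) ** 2 for s in samples) / n_samples
           for k in range(n_out)]
    return mean, var
```

Zone-based criteria would then aggregate this variance separately over lesion and background voxels before linearly normalizing each to a fixed interval.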

JBHI Journal 2019 Journal Article

Knowledge-Aided Convolutional Neural Network for Small Organ Segmentation

  • Yu Zhao
  • Hongwei Li
  • Shaohua Wan
  • Anjany Sekuboyina
  • Xiaobin Hu
  • Giles Tetteh
  • Marie Piraud
  • Bjoern Menze

Accurate and automatic organ segmentation is critical for computer-aided analysis towards clinical decision support and treatment planning. State-of-the-art approaches have achieved remarkable segmentation accuracy on large organs, such as the liver and kidneys. However, most of these methods do not perform well on small organs, such as the pancreas, gallbladder, and adrenal glands, especially when lacking sufficient training data. This paper presents an automatic approach for small organ segmentation with limited training data using two cascaded steps: localization and segmentation. The localization stage involves the extraction of the region of interest after registering the images to a common template; during the segmentation stage, a voxel-wise label map of the extracted region of interest is obtained and then transformed back to the original space. In the localization step, we propose to utilize a graph-based groupwise image registration method to build the template for registration so as to minimize the potential bias and avoid producing a fuzzy template. More importantly, a novel knowledge-aided convolutional neural network is proposed to improve segmentation accuracy in the second stage. This proposed network is flexible and can combine the efforts of both deep learning and traditional methods, consequently achieving better segmentation than either individual method. The ISBI 2015 VISCERAL challenge dataset is used to evaluate the presented approach. Experimental results demonstrate that the proposed method outperforms cutting-edge deep learning approaches, traditional forest-based approaches, and multi-atlas approaches in the segmentation of small organs.

JBHI Journal 2018 Journal Article

Size-Scalable Content-Based Histopathological Image Retrieval From Database That Consists of WSIs

  • Yushan Zheng
  • Zhiguo Jiang
  • Haopeng Zhang
  • Fengying Xie
  • Yibing Ma
  • Huaqiang Shi
  • Yu Zhao

Content-based image retrieval (CBIR) has been widely researched for histopathological images. It is challenging to retrieve regions with similar content from histopathological whole slide images (WSIs) for regions of interest (ROIs) of different sizes. In this paper, we propose a novel CBIR framework for a database that consists of WSIs and size-scalable query ROIs. Each WSI in the database is encoded into a matrix of binary codes. When retrieving, a group of region proposals that have a similar size to the query ROI are first located in the database through an efficient table-lookup approach. Then, these regions are ranked by a designed multi-binary-code-based similarity measurement. Finally, the top relevant regions and their locations in the WSIs, as well as the corresponding diagnostic information, are returned to assist pathologists. The effectiveness of the proposed framework is evaluated on a fine-annotated WSI database of epithelial breast tumors. The experimental results prove that the proposed framework is effective for retrieval from a database that consists of WSIs. Specifically, for query ROIs of 4096 × 4096 pixels, the retrieval precision of the top 20 returns reaches 96% and the retrieval time is less than 1.5 s.
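
A minimal sketch of the binary-code ranking step (an illustration of the general idea, not the paper's similarity measurement): each region is represented by a list of binary codes stored as integers, and candidate regions are ranked by summed Hamming distance to the query's codes. The function names are assumptions.

```python
def hamming(a, b):
    """Hamming distance between two binary codes stored as ints."""
    return bin(a ^ b).count("1")

def rank_regions(query_codes, regions):
    """Rank candidate regions by summed Hamming distance between the
    query ROI's binary codes and each region's codes (smaller distance
    = more similar). `regions` maps a region id to its code list."""
    def dist(codes):
        return sum(hamming(q, c) for q, c in zip(query_codes, codes))
    return sorted(regions, key=lambda rid: dist(regions[rid]))
```

XOR plus popcount makes each comparison a few machine operations, which is what keeps ranking thousands of table-lookup candidates fast.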

YNIMG Journal 2018 Journal Article

Spatio-temporal modeling of connectome-scale brain network interactions via time-evolving graphs

  • Jing Yuan
  • Xiang Li
  • Jinhe Zhang
  • Liao Luo
  • Qinglin Dong
  • Jinglei Lv
  • Yu Zhao
  • Xi Jiang

Many recent studies have revealed interesting dynamics patterns of functional brain networks derived from fMRI data. However, it has been rarely explored how functional networks spatially overlap (or interact) and how such connectome-scale network interactions temporally evolve. To explore these unanswered questions, this paper presents a novel framework for spatio-temporal modeling of connectome-scale functional brain network interactions via two main effective computational methodologies. First, to integrate, pool and compare brain networks across individuals and their cognitive states under task performances, we designed a novel group-wise dictionary learning scheme to derive connectome-scale consistent brain network templates that can be used to define the common reference space of brain network interactions. Second, the temporal dynamics of spatial network interactions is modeled by a weighted time-evolving graph, and then a data-driven unsupervised learning algorithm based on the dynamic behavioral mixed-membership model (DBMM) is adopted to identify behavioral patterns of brain networks during the temporal evolution process of spatial overlaps/interactions. Experimental results on the Human Connectome Project (HCP) task fMRI data showed that our methods can reveal meaningful, diverse behavior patterns of connectome-scale network interactions. In particular, those networks' behavior patterns are distinct across HCP tasks such as motor, working memory, language and social tasks, and their dynamics well correspond to the temporal changes of specific task designs. In general, our framework offers a new approach to characterizing human brain function by quantitative description for the temporal evolution of spatial overlaps/interactions of connectome-scale brain networks in a standard reference space.

JBHI Journal 2017 Journal Article

Breast Histopathological Image Retrieval Based on Latent Dirichlet Allocation

  • Yibing Ma
  • Zhiguo Jiang
  • Haopeng Zhang
  • Fengying Xie
  • Yushan Zheng
  • Huaqiang Shi
  • Yu Zhao

In the field of pathology, the whole slide image (WSI) has become the major carrier of visual and diagnostic information. Content-based image retrieval among WSIs can aid the diagnosis of an unknown pathological image by finding its similar regions in WSIs with diagnostic information. However, the huge size and complex content of WSIs pose several challenges for retrieval. In this paper, we propose an unsupervised, accurate, and fast retrieval method for breast histopathological images. Specifically, the method presents a local statistical feature capturing the morphology and distribution of nuclei, and employs the Gabor feature to describe the texture information. The latent Dirichlet allocation model is utilized for high-level semantic mining. Locality-sensitive hashing is used to speed up the search. Experiments on a WSI database with more than 8000 images from 15 types of breast histopathology demonstrate that our method achieves about 0.9 retrieval precision as well as promising efficiency. Based on the proposed framework, we are developing a search engine for an online digital slide browsing and retrieval platform, which can be applied in computer-aided diagnosis, pathology education, and WSI archiving and management.
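
The locality-sensitive hashing step can be illustrated with the classic random-hyperplane scheme for cosine similarity (a common LSH construction, assumed here rather than taken from the paper): each random hyperplane contributes one signature bit, so similar feature vectors tend to collide on the same signature and land in the same hash bucket.

```python
import random

def make_planes(n_bits, dim, seed=0):
    """Draw n_bits random Gaussian hyperplanes in a dim-D feature space."""
    rng = random.Random(seed)
    return [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n_bits)]

def lsh_signature(vec, planes):
    """LSH for cosine similarity: one bit per hyperplane, set when the
    feature vector lies on its positive side. The resulting integer
    can index a hash table of candidate images."""
    bits = 0
    for p in planes:
        dot = sum(a * b for a, b in zip(vec, p))
        bits = (bits << 1) | (1 if dot >= 0 else 0)
    return bits
```

Vectors differing only in magnitude hash identically, which suits cosine-style comparisons of topic or texture feature vectors.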

YNICL Journal 2016 Journal Article

Connectome-scale group-wise consistent resting-state network analysis in autism spectrum disorder

  • Yu Zhao
  • Hanbo Chen
  • Yujie Li
  • Jinglei Lv
  • Xi Jiang
  • Fangfei Ge
  • Tuo Zhang
  • Shu Zhang

Understanding the organizational architecture of human brain function and its alteration patterns in diseased brains, such as those of Autism Spectrum Disorder (ASD) patients, is of great interest. In-vivo functional magnetic resonance imaging (fMRI) offers a unique window to investigate the mechanism of brain function and to identify functional network components of the human brain. Previously, we have shown that multiple concurrent functional networks can be derived from fMRI signals using whole-brain sparse representation. Yet it is still an open question how to derive group-wise consistent networks featured in ASD patients and controls. Here we proposed an effective volumetric network descriptor, named the connectivity map, to compactly describe the spatial patterns of brain network maps, and implemented a fast framework in the Apache Spark environment that can effectively identify group-wise consistent networks in big fMRI datasets. Our experimental results identified 144 group-wise common intrinsic connectivity networks (ICNs) shared between ASD patients and healthy control subjects, where some ICNs are substantially different between the two groups. Moreover, further analysis of the functional connectivity and spatial overlap between these 144 common ICNs reveals connectomics signatures characterizing ASD patients and controls. In particular, the computing time of our Spark-enabled functional connectomics framework is significantly reduced from 240 hours (C++ code, single core) to 20 hours, exhibiting great potential to handle fMRI big data in the future.

YNIMG Journal 2015 Journal Article

Optimization of large-scale mouse brain connectome via joint evaluation of DTI and neuron tracing data

  • Hanbo Chen
  • Tao Liu
  • Yu Zhao
  • Tuo Zhang
  • Yujie Li
  • Meng Li
  • Hongmiao Zhang
  • Hui Kuang

Tractography based on diffusion tensor imaging (DTI) data has been used by a large number of recent studies as a tool to investigate the structural connectome. Despite its great success in offering unique 3D neuroanatomy information, DTI is an indirect observation with limited resolution and accuracy, and its reliability is still unclear. Thus, it is essential to answer this fundamental question: how reliable is DTI tractography in constructing a large-scale connectome? To answer this question, we employed neuron tracing data of 1772 experiments on the mouse brain released by the Allen Mouse Brain Connectivity Atlas (AMCA) as the ground truth to assess the performance of DTI tractography in inferring white matter fiber pathways and inter-regional connections. For the first time in the neuroimaging field, the performance of whole-brain DTI tractography in constructing a large-scale connectome has been evaluated by comparison with tracing data. Our results suggest that only with optimized tractography parameters and an appropriate scale of brain parcellation scheme can DTI produce relatively reliable fiber pathways and a large-scale connectome. Meanwhile, a considerable number of errors were also identified in the optimized DTI tractography results, which we believe could potentially be alleviated by efforts to develop better DTI tractography approaches. In this scenario, our framework could serve as a reliable and quantitative test bed to identify errors in tractography results, which will facilitate the development of such novel tractography algorithms and the selection of optimal parameters.

AAAI Conference 2015 Conference Paper

Phrase Type Sensitive Tensor Indexing Model for Semantic Composition

  • Yu Zhao
  • Zhiyuan Liu
  • Maosong Sun

Compositional semantics aims at constructing the meaning of phrases or sentences according to the compositionality of word meanings. In this paper, we propose to synchronously learn the representations of individual words and extracted high-frequency phrases. Representations of extracted phrases are treated as a gold standard for constructing more general operations to compose the representations of unseen phrases. We propose a grammatical-type-specific model that improves composition flexibility by adopting vector-tensor-vector operations. Our model embodies the compositional characteristics of the traditional additive and multiplicative models. Empirical results show that our model outperforms state-of-the-art composition methods in the task of computing phrase similarities.
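
A minimal sketch of a vector-tensor-vector composition (illustrative only; the `compose` helper and the tensor layout are assumptions, not the paper's model): each output dimension k is the bilinear form u^T T[k] v, so with identity-like tensor slices the operation reduces to the traditional multiplicative (elementwise) composition.

```python
def compose(u, v, T):
    """Phrase representation via a vector-tensor-vector operation:
    p_k = u^T T[k] v, i.e. each output dimension is a bilinear form of
    the two word vectors through one d x d slice of a 3rd-order tensor."""
    d = len(u)
    out = []
    for k in range(len(T)):       # one d x d slice per output dimension
        slice_k = T[k]
        out.append(sum(u[i] * slice_k[i][j] * v[j]
                       for i in range(d) for j in range(d)))
    return out
```

Learning a different tensor per grammatical phrase type (e.g. adjective-noun vs. verb-object) is what gives the composition its type-specific flexibility.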

IJCAI Conference 2015 Conference Paper

Representation Learning for Measuring Entity Relatedness with Rich Information

  • Yu Zhao
  • Zhiyuan Liu
  • Maosong Sun

Incorporating multiple types of relational information from heterogeneous networks has been proved effective in data mining. Although Wikipedia is one of the most famous heterogeneous networks, previous work on semantic analysis of Wikipedia is mostly limited to a single type of relation. In this paper, we aim at incorporating multiple types of relations to measure the semantic relatedness between Wikipedia entities. We propose a framework of coordinate matrix factorization to construct low-dimensional continuous representations for entities, categories and words in the same semantic space. We formulate this task as the completion of a sparse entity-entity association matrix, in which each entry quantifies the strength of relatedness between the corresponding entities. We evaluate our model on the task of judging pair-wise word similarity. Experimental results show that our model outperforms both traditional entity relatedness algorithms and other representation learning models.
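
The matrix-completion view can be sketched with a plain SGD factorization over the observed entries (a generic illustration, not the paper's coordinate matrix factorization, which jointly embeds entities, categories and words): learn embeddings P and Q such that P[i] · Q[j] approximates each observed association strength, then read unobserved entries off the dot products.

```python
import random

def factorize(entries, n_rows, n_cols, dim=4, lr=0.05, reg=0.01,
              epochs=500, seed=0):
    """SGD matrix factorization: complete a sparse association matrix
    from observed (i, j, strength) triples by fitting low-dimensional
    embeddings P, Q with score(i, j) = P[i] . Q[j], plus L2 shrinkage."""
    rng = random.Random(seed)
    P = [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(n_rows)]
    Q = [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(n_cols)]
    for _ in range(epochs):
        for i, j, r in entries:
            pred = sum(a * b for a, b in zip(P[i], Q[j]))
            err = r - pred
            for k in range(dim):  # gradient step on both factors
                pi, qj = P[i][k], Q[j][k]
                P[i][k] += lr * (err * qj - reg * pi)
                Q[j][k] += lr * (err * pi - reg * qj)
    return P, Q

def score(P, Q, i, j):
    """Predicted relatedness between row entity i and column entity j."""
    return sum(a * b for a, b in zip(P[i], Q[j]))
```

A coordinate version would share the entity factors across several such matrices (entity-entity, entity-category, entity-word), which is how multiple relation types land in one semantic space.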