Author name cluster

Jian Chen

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

62 papers

2 author rows

AAAI Conference 2026 Conference Paper

A Better Start: Sensitivity-Aware Warm-Up for Robust and Efficient Fine-Tuning

Yile Chen
Zeyi Wen
Jian Chen
Jin Huang

As an essential component of fine-tuning, warm-up plays a crucial role in promoting stability and generalization. Many studies have examined its underlying mechanisms from different aspects. However, most of the studies focus on incorporating these insights into optimizers to reduce the reliance on warm-up. Little attention has been paid to addressing the inherent limitations of the warm-up itself, which restricts its effectiveness. In this work, we revisit warm-up from a loss landscape perspective and identify several limitations with existing warm-up, including: (1) susceptibility to nearby suboptimal traps, (2) sensitivity to hyperparameters and random seeds, and (3) inefficiency during the early stages of training. To overcome these limitations, we propose Sensitivity-Aware Warm-Up (SAWU), a lightweight and adaptive strategy that dynamically leverages learning sensitivity during warm-up to guide updates toward better and more stable basins. In addition, SAWU also introduces an adaptive scheduling mechanism and phase transition strategy across warm-up, stable, and decay phases to further enhance robustness and efficiency. Extensive experiments on various downstream tasks show that SAWU significantly outperforms the vanilla method (e.g., average 3.43% improvement on RoBerta). Moreover, SAWU can be easily combined with various optimizers and remains effective even when warm-up-based methods fail (e.g, it lifts RAdam from 49.46% to 91.78% on qnli. Thanks to its lightweight nature, SAWU introduces minimal overhead and even reduces training time by over 5% compared to other methods.

PDF Details DOI

EAAI Journal 2026 Journal Article

A deep learning framework for on-street parking demand prediction: Integrating spatio-temporal dynamics and policy impacts

Keliang Liu
Jian Chen

Urban on-street short-term parking demand prediction is fundamental for smart parking guidance systems. However, current prediction methods often rely on a single data source and fail to account for the dynamic impacts of environmental factors outside the parking system. This limitation constrains the accuracy of model prediction and does not allow for an assessment of how parking demand is affected by the dynamics of policy changes. To address this issue, this study proposes a comprehensive deep learning forecasting framework. Utilizing data from 123 on-street parking facilities over seven months, totaling more than 6. 48 million parking order data. Recognizing the varied spatial and temporal influences that built environment and parking regulations exert on demand patterns, we used the Multi-scale Geographically and Temporally Weighted Regression model (MGTWR) to quantify these relationships. We then incorporated the spatio-temporal coefficients derived from the MGTWR model, alongside additional influential variables, as inputs for a novel deep learning architecture that combines MGTWR, Graph Attention Networks (GAT), and Attention-based Long Short-Term Memory (ALSTM), which we designate as “MGTWR-GAT-ALSTM. ” Our model was benchmarked against traditional baseline methods, and the results indicate that MGTWR-GAT-ALSTM yields superior predictive performance, with Mean Absolute Error, Root Mean Squared Error, and Coefficient of Determination metrics of 0. 01, 0. 04, and 0. 92, respectively. Additionally, we performed ablation experiments to confirm that the model design does not introduce redundancy. The proposed prediction model aims to enhance the construction of smart parking systems, providing a dynamic assessment tool for parking policies.