Arrow Research

Author name cluster

Song Han

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

25 papers
1 author row

Possible papers

25

EAAI Journal 2026 Journal Article

Cross-layer feature consistency and dual-transformer residual framework for underwater image enhancement

  • Xinbin Li
  • Lei Cheng
  • Song Han
  • Jing Yang
  • Hui Dang
  • Muge Li

Underwater imaging suffers from complex degradations (e.g., color casts, blur, and haze) due to light scattering in water, limiting its utility in engineering applications such as marine exploration and underwater robotics. To address this, we propose the Cross-layer Feature Consistency-guided Dual-Transformer Reconstruction Framework (CFC-DTRF). In terms of artificial intelligence contribution, this work introduces a novel multi-stage framework that leverages feature-consistency supervision to jointly constrain feature and pixel domains, effectively disentangling content and color degradations through dedicated transformers. The framework integrates two innovative modules: a Sliding-Window Content-Attention Transformer (SWCA-Transformer) for detail preservation and a Multi-Scale Color-Attention Transformer (MSCA-Transformer) for color correction, enhancing restoration fidelity with computational efficiency. For engineering applications, this method significantly improves underwater image quality for practical tasks like environmental monitoring and robotic navigation. Extensive experiments show that CFC-DTRF outperforms state-of-the-art methods in content preservation and color accuracy. The code of the proposed CFC-DTRF is available at https://github.com/ChengLeiYSU/CFC-DTRF.
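
For illustration, a minimal sketch of the joint feature- and pixel-domain supervision the abstract describes, in PyTorch; the shared `feat_extractor` and the weighting `lam` are assumptions standing in for the paper's cross-layer features, not the authors' implementation:

```python
import torch.nn.functional as F

def consistency_loss(pred, target, feat_extractor, lam=0.5):
    # Pixel-domain term: match the restored image to the reference directly.
    pixel_term = F.l1_loss(pred, target)
    # Feature-domain term: match the two images in an encoder's feature space
    # (a stand-in for the paper's cross-layer feature consistency).
    feat_term = F.l1_loss(feat_extractor(pred), feat_extractor(target))
    return pixel_term + lam * feat_term
```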

EAAI Journal 2025 Journal Article

A novel ensemble method based on residual convolutional neural network with attention module for transient stability assessment considering operational variability

  • Wensheng Liu
  • Song Han
  • Na Rong

Data-driven methods have been extensively applied in the field of power system transient stability assessment (TSA) owing to their robust capabilities to excavate valuable features. However, TSA methods still face significant challenges in predictive accuracy and generalization ability under variable operating conditions with fluctuating loads or power generation. To address this, a data-driven ensemble TSA method that integrates the convolutional block attention module (CBAM) with a residual network (ResNet) is proposed to enhance prediction accuracy. Meanwhile, the traditional cross-entropy loss function is replaced by the focal loss function, aiming to reduce the misclassification of unstable samples. Moreover, a rapid updating strategy integrating active learning and fine-tuning techniques is suggested. It can renew the classifier quickly with limited labeled samples and less time when the network topology changes substantially and makes the pre-trained TSA model unavailable, thus ensuring optimal performance on the new topology. Finally, case studies conducted on the New England 10-machine 39-bus system and the Western Electricity Coordinating Council (WECC) 29-machine 179-bus system validate the effectiveness and robustness of the proposed TSA method. The proposed method achieves accuracies of 99.56% on the 10-machine system and 99.47% on the 29-machine system, respectively, demonstrating its superiority.
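
The focal-loss substitution mentioned above is straightforward to reproduce; a minimal PyTorch sketch (the gamma and alpha values are common defaults, not necessarily the paper's settings):

```python
import torch
import torch.nn.functional as F

def focal_loss(logits, targets, gamma=2.0, alpha=0.25):
    # Standard cross-entropy, kept per-sample.
    ce = F.cross_entropy(logits, targets, reduction="none")
    pt = torch.exp(-ce)              # model's probability for the true class
    # (1 - pt)^gamma down-weights easy samples, focusing training on the
    # hard (e.g., misclassified unstable) cases.
    return (alpha * (1.0 - pt) ** gamma * ce).mean()

# Drop-in replacement for nn.CrossEntropyLoss() in the training loop:
# loss = focal_loss(model(batch), labels)   # logits: (N, 2) stable/unstable
```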

EAAI Journal 2025 Journal Article

Adversarial black-box attack and defense for convolutional neural network-based power quality disturbance classification

  • Xiudong Zhang
  • Congmei Jiang
  • Mingbiao Yu
  • Xiankui Wen
  • Jing Zhang
  • Na Rong
  • Song Han

Correctly identifying power quality disturbance (PQD) is crucial for the proper functioning of power systems. Deep learning (DL) techniques have been widely used for PQD classification due to their excellent performance. However, DL models are susceptible to adversarial attacks, posing a serious security threat to DL-based PQD classification systems. This issue has received limited attention in current research. In this study, we first utilize a convolutional neural network (CNN) to recognize various types of PQD signals. To evaluate model robustness, we introduce a black-box attack method for PQD classification based on the variance-tuning momentum iterative fast gradient sign method (VMI-FGSM). VMI-FGSM integrates a variance tuning method into the iterative process of the momentum iterative fast gradient sign method (MI-FGSM), thereby producing more transferable adversarial PQD signals. To defend against such attacks, we propose a perturbation removal defense based on a generative adversarial network (PRD-GAN). This approach is capable of removing perturbations from adversarial PQD signals before they are recognized by the target classification model. Experiments demonstrate that VMI-FGSM produces adversarial perturbations that are nearly identical to those of the advanced MI-FGSM, but its adversarial examples are significantly more effective at misleading the target CNN model. Furthermore, the proposed PRD-GAN effectively reconstructs adversarial PQD signals into clean forms under various black-box attack intensities and outperforms the multi-level denoising autoencoder (ML-DAE) in defense performance due to its superior reconstruction capability.
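
A simplified sketch of the variance-tuned momentum iteration that VMI-FGSM adds on top of MI-FGSM, assuming a differentiable PyTorch classifier `model` over PQD signals; the step sizes and sampling radius are illustrative, not the paper's configuration:

```python
import torch

def vmi_fgsm(model, x, y, eps=0.03, steps=10, mu=1.0, beta=1.5, n_samples=5):
    loss_fn = torch.nn.CrossEntropyLoss()
    alpha = eps / steps
    x_adv = x.clone().detach()
    g = torch.zeros_like(x)   # momentum accumulator
    v = torch.zeros_like(x)   # variance-tuning term
    for _ in range(steps):
        x_adv.requires_grad_(True)
        grad = torch.autograd.grad(loss_fn(model(x_adv), y), x_adv)[0]
        # Momentum update on the variance-corrected gradient (L1-normalized).
        g = mu * g + (grad + v) / (grad + v).abs().mean()
        # Variance tuning: average gradients at random neighbors, minus grad.
        acc = torch.zeros_like(x)
        for _ in range(n_samples):
            x_near = (x_adv + torch.empty_like(x).uniform_(-beta * eps, beta * eps)).detach()
            x_near.requires_grad_(True)
            acc += torch.autograd.grad(loss_fn(model(x_near), y), x_near)[0]
        v = acc / n_samples - grad
        # Signed step, then project back into the L_inf ball around x.
        x_adv = (x_adv + alpha * g.sign()).detach()
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps)
    return x_adv
```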

NeurIPS Conference 2025 Conference Paper

Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search

  • Yuxian Gu
  • Qinghao Hu
  • Haocheng Xi
  • Junyu Chen
  • Shang Yang
  • Song Han
  • Han Cai

We present Jet-Nemotron, a new family of hybrid-architecture language models, which matches or exceeds the accuracy of leading full-attention models while significantly improving generation throughput. Jet-Nemotron is developed using Post Neural Architecture Search (PostNAS), a novel neural architecture exploration pipeline that enables efficient model design. Unlike prior approaches, PostNAS begins with a pre-trained full-attention model and freezes its MLP weights, allowing efficient exploration of attention block designs. The pipeline includes four key components: (1) learning optimal full-attention layer placement and elimination, (2) linear attention block selection, (3) designing new attention blocks, and (4) performing hardware-aware hyperparameter search. Our Jet-Nemotron-2B model achieves comparable or superior accuracy to Qwen3, Qwen2.5, Gemma3, and Llama3.2 across a comprehensive suite of benchmarks while delivering up to 53.6× generation throughput speedup and 6.1× prefilling speedup. It also achieves higher accuracy on MMLU and MMLU-Pro than recent advanced MoE full-attention models, such as DeepSeek-V3-Small and Moonlight, despite their larger scale with 15B total and 2.2B activated parameters.
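
The core PostNAS trick of freezing the pre-trained MLPs while leaving attention blocks trainable reduces, in code, to toggling requires_grad; a sketch assuming PyTorch-style parameter names containing "mlp" (an assumption about the checkpoint's naming, to be adapted per model):

```python
import torch.nn as nn

def freeze_mlps(model: nn.Module, mlp_keyword: str = "mlp"):
    # Freeze MLP weights; only attention-side parameters keep gradients,
    # so architecture exploration trains attention blocks alone.
    for name, param in model.named_parameters():
        if mlp_keyword in name:
            param.requires_grad = False
    return [n for n, p in model.named_parameters() if p.requires_grad]
```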

NeurIPS Conference 2025 Conference Paper

Radial Attention: $\mathcal{O}(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation

  • Xingyang Li
  • Muyang Li
  • Tianle Cai
  • Haocheng Xi
  • Shuo Yang
  • Yujun Lin
  • Lvmin Zhang
  • Songlin Yang

Recent advances in diffusion models have enabled high-quality video generation, but the additional temporal dimension significantly increases computational costs, making training and inference on long videos prohibitively expensive. In this paper, we identify a phenomenon we term Spatiotemporal Energy Decay in video diffusion models: post-softmax attention scores diminish as the spatial and temporal distance between tokens increases, akin to the physical decay of signals or waves over space and time in nature. Motivated by this, we propose Radial Attention, a scalable sparse attention mechanism with $\mathcal{O}(n \log n)$ complexity that translates energy decay into exponentially decaying compute density, which is significantly more efficient than standard $\mathcal{O}(n^2)$ dense attention and more expressive than linear attention. Specifically, Radial Attention employs a simple, static attention mask where each token attends to spatially nearby tokens, with the attention window size shrinking with temporal distance. Moreover, it allows pre-trained video diffusion models to extend their generation length with efficient LoRA-based fine-tuning. Extensive experiments show that Radial Attention maintains video quality across Wan2.1-14B, HunyuanVideo, and Mochi 1, achieving up to a 1.9× speedup over the original dense attention. With minimal tuning, it enables video generation up to 4× longer while reducing training costs by up to 4.4× compared to direct fine-tuning and accelerating inference by up to 3.7× compared to dense attention inference. Code is released at https://github.com/mit-han-lab/radial-attention.
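
A hedged sketch of a static mask in this spirit: a full spatial window within a frame, halved as temporal distance grows. The exact decay schedule below is an assumption for illustration, not the paper's mask:

```python
import torch

def radial_mask(n_frames, tokens_per_frame, base_window=64, decay=1):
    # Frame index and within-frame spatial index for every token.
    n = n_frames * tokens_per_frame
    frame = torch.arange(n) // tokens_per_frame
    pos = torch.arange(n) % tokens_per_frame
    dt = (frame[:, None] - frame[None, :]).abs()   # temporal distance
    dx = (pos[:, None] - pos[None, :]).abs()       # spatial distance
    # Window halves for every `decay` frames of temporal distance
    # (illustrative schedule; the clamp avoids 2**k overflow for long videos).
    window = base_window // (2 ** (dt // decay).clamp(max=30))
    return dx <= window                            # True = may attend
```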

NeurIPS Conference 2025 Conference Paper

Scaling RL to Long Videos

  • Yukang Chen
  • Wei Huang
  • Baifeng Shi
  • Qinghao Hu
  • Hanrong Ye
  • Ligeng Zhu
  • Zhijian Liu
  • Pavlo Molchanov

We introduce a full-stack framework that scales up reasoning in vision-language models (VLMs) to long videos, leveraging reinforcement learning. We address the unique challenges of long video reasoning by integrating three critical components: (1) a large-scale dataset, LongVideo-Reason, comprising 104K long video QA pairs with high-quality reasoning annotations across diverse domains such as sports, games, and vlogs; (2) a two-stage training pipeline that extends VLMs with chain-of-thought supervised fine-tuning (CoT-SFT) and reinforcement learning (RL); and (3) a training infrastructure for long video RL, named Multi-modal Reinforcement Sequence Parallelism (MR-SP), which incorporates sequence parallelism and a vLLM-based engine tailored for long video, using cached video embeddings for efficient rollout and prefilling. In our experiments, LongVILA-R1-7B achieves strong performance on video benchmarks, reaching 65.1% and 71.1% accuracy on VideoMME without and with subtitles, respectively, and consistently outperforming LongVILA-7B across multiple benchmarks. Moreover, LongVILA-R1-7B supports processing up to 8,192 video frames per video and configurable FPS settings. Notably, our MR-SP system achieves up to 2.1x speedup on long video RL training. In addition, we publicly release our training system, which supports RL training on various modalities (video, text, and audio), various models (VILA and Qwen series), and even image and video generation models. On a single A100 node (8 GPUs), it supports RL training on hour-long videos (e.g., 3,600 frames). Code and models are available at https://github.com/NVlabs/Long-RL.

NeurIPS Conference 2025 Conference Paper

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

  • Shuo Yang
  • Haocheng Xi
  • Yilong Zhao
  • Muyang Li
  • Jintao Zhang
  • Han Cai
  • Yujun Lin
  • Xiuyu Li

Diffusion Transformers (DiTs) are essential for video generation but suffer from significant latency due to the quadratic complexity of attention. By computing only critical tokens, sparse attention reduces computational costs and offers a promising acceleration approach. However, we identify that existing methods fail to approach optimal generation quality under the same computation budget for two reasons: (1) Inaccurate critical token identification: current methods cluster tokens based on position rather than semantics, leading to imprecise aggregated representations. (2) Excessive computation waste: critical tokens are scattered among non-critical ones, leading to wasted computation on GPUs, which are optimized for processing contiguous tokens. In this paper, we propose SVG2, a training-free framework that maximizes identification accuracy and minimizes computation waste, achieving a Pareto frontier trade-off between generation quality and efficiency. The core of SVG2 is semantic-aware permutation, which clusters and reorders tokens based on semantic similarity using k-means. This approach ensures both a precise cluster representation, improving identification accuracy, and a densified layout of critical tokens, enabling efficient computation without padding. Additionally, SVG2 integrates Top-p dynamic budget control and customized kernel implementations, achieving up to $2.30\times$ and $1.89\times$ speedup while maintaining a PSNR of up to $30$ and $26$ on HunyuanVideo and Wan 2.1, respectively. Our code is open-sourced at https://github.com/svg-project/Sparse-VideoGen.
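
The clustering-and-reordering step can be sketched with off-the-shelf k-means; this is a simplified view (sklearn standing in for whatever SVG2 actually uses, with an illustrative cluster count):

```python
import numpy as np
from sklearn.cluster import KMeans

def semantic_permutation(tokens, n_clusters=32):
    # tokens: (n_tokens, dim) array of token features.
    labels = KMeans(n_clusters=n_clusters).fit_predict(tokens)
    perm = np.argsort(labels, kind="stable")   # lay clusters out contiguously
    inverse = np.argsort(perm)                 # undo the reorder afterwards
    return perm, inverse

# x_perm = tokens[perm]          # critical tokens now form dense blocks
# ... run block-sparse attention on x_perm ...
# out = y_perm[inverse]          # restore the original token order
```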

NeurIPS Conference 2025 Conference Paper

Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning

  • Chaofan Lin
  • Jiaming Tang
  • Shuo Yang
  • Hanshuo Wang
  • Tian Tang
  • Boyu Tian
  • Ion Stoica
  • Song Han

Leveraging attention sparsity to accelerate long-context large language models (LLMs) has recently become important. However, most existing sparse attention algorithms use a fixed token budget in their computations. This static decision raises critical issues in real-world deployment because it fails to account for the dynamic nature of real-world scenarios, where the optimal balance between accuracy and efficiency can vary greatly. In this paper, we reveal the key insight that applying the idea of top-$p$ sampling (a.k.a. nucleus sampling) to sparse attention enables efficient and adaptive budget decisions. Based on this, we propose Twilight, a framework that endows any existing sparse attention algorithm with adaptive budget decisions without sacrificing accuracy. Empirical results show that Twilight can adaptively prune up to 98% of tokens with nearly no accuracy loss in both mid- and long-context scenarios, leading to a $1.4\times$ speedup over state-of-the-art sparse attention mechanisms.
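
Top-$p$ selection over post-softmax attention weights is the mechanism in question; a minimal PyTorch sketch (not Twilight's actual kernel):

```python
import torch

def top_p_key_mask(attn_probs, p=0.95):
    # attn_probs: (..., n_keys) post-softmax attention weights per query.
    sorted_probs, idx = attn_probs.sort(dim=-1, descending=True)
    cum = sorted_probs.cumsum(dim=-1)
    # Keep keys until cumulative mass reaches p (the crossing key included),
    # so the budget adapts per query instead of being a fixed count.
    keep_sorted = (cum - sorted_probs) < p
    mask = torch.zeros_like(attn_probs, dtype=torch.bool)
    mask.scatter_(-1, idx, keep_sorted)
    return mask   # True = this key survives pruning for that query
```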

TMLR Journal 2025 Journal Article

Wolf: Dense Video Captioning with a World Summarization Framework

  • Boyi Li
  • Ligeng Zhu
  • Ran Tian
  • Shuhan Tan
  • Yuxiao Chen
  • Yao Lu
  • Yin Cui
  • Sushant Veer

We propose Wolf, a WOrLd summarization Framework for accurate video captioning. Wolf is an automated captioning framework that adopts a mixture-of-experts approach, leveraging complementary strengths of Vision Language Models (VLMs). By utilizing both image and video models, our framework captures different levels of information and summarizes them efficiently. Our approach can be applied to enhance video understanding, auto-labeling, and captioning. To evaluate caption quality, we introduce CapScore, an LLM-based metric to assess the similarity and quality of generated captions compared to the ground truth captions. We further build four human-annotated datasets in three domains: autonomous driving, general scenes, and robotics, to facilitate comprehensive comparisons. We show that Wolf achieves superior captioning performance compared to state-of-the-art approaches from the research community (VILA1.5, CogAgent) and commercial solutions (Gemini-Pro-1.5, GPT-4V). For instance, in comparison with GPT-4V, Wolf improves CapScore (caption quality) by 55.6% and CapScore (caption similarity) by 77.4% on challenging driving videos. Finally, we establish a benchmark for video captioning and introduce a leaderboard, aiming to accelerate advancements in video understanding, captioning, and data alignment.

NeurIPS Conference 2025 Conference Paper

WorldModelBench: Judging Video Generation Models As World Models

  • Dacheng Li
  • Yunhao Fang
  • Yukang Chen
  • Shuo Yang
  • Shiyi Cao
  • Justin Wong
  • Michael Luo
  • Xiaolong Wang

Video generation models have rapidly progressed, positioning themselves as video world models capable of supporting decision-making applications like robotics and autonomous driving. However, current benchmarks fail to rigorously evaluate these claims, focusing only on general video quality and ignoring factors important to world models such as physics adherence. To bridge this gap, we propose WorldModelBench, a benchmark designed to evaluate the world modeling capabilities of video generation models in application-driven domains. WorldModelBench offers two key advantages: (1) Sensitivity to nuanced world modeling violations: by incorporating instruction-following and physics-adherence dimensions, WorldModelBench detects subtle violations, such as irregular changes in object size that breach the mass conservation law, issues overlooked by prior benchmarks. (2) Alignment with large-scale human preferences: we crowd-source 67K human labels to accurately measure 14 frontier models. Using our high-quality human labels, we further fine-tune an accurate judger to automate the evaluation procedure; with only 2B parameters, it achieves 9.9% lower error in predicting world modeling violations than GPT-4o. In addition, we demonstrate that training to align with human annotations by maximizing the rewards from the judger noticeably improves world modeling capability. The dataset is hosted on HuggingFace at https://huggingface.co/datasets/Efficient-Large-Model/worldmodelbench. The code to run evaluation is available at https://github.com/WorldModelBench-Team/WorldModelBench.

NeurIPS Conference 2024 Conference Paper

BitDelta: Your Fine-Tune May Only Be Worth One Bit

  • James Liu
  • Guangxuan Xiao
  • Kai Li
  • Jason D. Lee
  • Song Han
  • Tri Dao
  • Tianle Cai

Large Language Models (LLMs) are typically trained in two phases: pre-training on large internet-scale datasets, and fine-tuning for downstream tasks. Given the higher computational demand of pre-training, it is intuitive to assume that fine-tuning adds less new information to the model, and is thus more compressible. We explore this assumption by decomposing the weights of fine-tuned models into their pre-trained components and an additional delta. We introduce a simple method, BitDelta, which successfully quantizes this delta down to 1 bit without compromising performance. This interesting finding not only highlights the potential redundancy of information added during fine-tuning, but also has significant implications for the multi-tenant serving and multi-tenant storage of fine-tuned models. By enabling the use of a single high-precision base model accompanied by multiple 1-bit deltas, BitDelta dramatically reduces GPU memory requirements by more than 10x, thus reducing per-user generation latency by more than 10x in multi-tenant settings. We validate BitDelta through experiments across Llama-2, Mistral and MPT model families, and on models up to 70B parameters, showcasing minimal performance degradation in all tested settings.
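
The core compression step reduces to a sign tensor plus one scale per weight tensor; a minimal sketch using the mean-|delta| initialization (BitDelta additionally calibrates the scales by distillation against the fine-tuned model, omitted here):

```python
import torch

def bitdelta_compress(w_base, w_finetuned):
    delta = w_finetuned - w_base
    scale = delta.abs().mean()      # one floating-point scale per weight tensor
    sign = delta.sign()             # storable as 1 bit per element
    return sign, scale

def bitdelta_decompress(w_base, sign, scale):
    # One shared base plus a per-tenant 1-bit delta reconstructs each model.
    return w_base + scale * sign
```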

EAAI Journal 2023 Journal Article

Frequency stability prediction of renewable energy penetrated power systems using CoAtNet and SHAP values

  • Peili Liu
  • Song Han
  • Na Rong

As the complexity of power systems increases, traditional model-driven methods for online frequency stability prediction (FSP) encounter constraints in both accuracy and efficiency. To enhance the accuracy and efficiency of FSP, a data-driven method using CoAtNet and SHAP values is proposed. By leveraging the combination of convolution and attention mechanisms, CoAtNet addresses the limitation of traditional deep learning approaches, which may not extract data features comprehensively. Moreover, selecting all features as input to a deep-learning model may cause a substantial computational burden, making it impractical for CoAtNet to perform FSP on large-scale power systems. To address this, this paper develops a SHAP values-based feature selection method to select the most effective features as input. This process greatly reduces the numerical complexity while maintaining high prediction performance. Additionally, the marginally stable situation of the system frequency is ignored by most researchers. A frequency security index identifying marginally stable situations is therefore employed to generate the data labels, which are classified as “absolute security”, “relative security”, and “insecurity”. Finally, in comparison simulations, the proposed model outperforms other models with accuracies of 98.80% on the modified IEEE 39-bus system and 99.04% on the modified ACTIVSg500 system.
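
The SHAP-based feature selection step can be sketched with the shap library; the model type, background-sample choice, and top_k here are assumptions, and the returned array layout varies across shap versions:

```python
import numpy as np
import shap  # pip install shap

def select_features_by_shap(model, background, X, top_k=20):
    # Explain predictions on X against a small background sample.
    explainer = shap.DeepExplainer(model, background)
    sv = explainer.shap_values(X)              # per-class arrays (or one array)
    sv = np.stack(sv) if isinstance(sv, list) else sv[np.newaxis]
    importance = np.abs(sv).mean(axis=(0, 1))  # average over classes and samples
    return np.argsort(importance)[::-1][:top_k]  # indices of the top features
```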

NeurIPS Conference 2022 Conference Paper

Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models

  • Muyang Li
  • Ji Lin
  • Chenlin Meng
  • Stefano Ermon
  • Song Han
  • Jun-Yan Zhu

During image editing, existing deep generative models tend to re-synthesize the entire output from scratch, including the unedited regions. This leads to a significant waste of computation, especially for minor editing operations. In this work, we present Spatially Sparse Inference (SSI), a general-purpose technique that selectively performs computation for edited regions and accelerates various generative models, including both conditional GANs and diffusion models. Our key observation is that users tend to make gradual changes to the input image. This motivates us to cache and reuse the feature maps of the original image. Given an edited image, we sparsely apply the convolutional filters to the edited regions while reusing the cached features for the unedited regions. Based on our algorithm, we further propose Sparse Incremental Generative Engine (SIGE) to convert the computation reduction to latency reduction on off-the-shelf hardware. With 1.2%-area edited regions, our method reduces the computation of DDIM by $7.5\times$ and GauGAN by $18\times$ while preserving the visual fidelity. With SIGE, we accelerate the inference time of DDIM by $3.0\times$ on RTX 3090 and $6.6\times$ on Apple M1 Pro CPU, and GauGAN by $4.2\times$ on RTX 3090 and $14\times$ on Apple M1 Pro CPU.
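
A simplified sketch of the cache-and-reuse idea for a single stride-1, size-preserving convolution: recompute only the edited bounding box and splice it into the cached output. Exact equivalence needs halo pixels at the crop border, which SIGE handles and this sketch deliberately ignores:

```python
import torch

def sparse_conv_forward(conv, x_edited, cached_out, edit_mask):
    # Bounding box of the edited pixels (edit_mask: (H, W) boolean).
    ys, xs = edit_mask.nonzero(as_tuple=True)
    y0, y1 = ys.min().item(), ys.max().item() + 1
    x0, x1 = xs.min().item(), xs.max().item() + 1
    # Recompute the convolution only inside the box (conv must be
    # stride-1 and size-preserving for the shapes to line up).
    new_patch = conv(x_edited[:, :, y0:y1, x0:x1])
    out = cached_out.clone()          # reuse cached features everywhere else
    out[:, :, y0:y1, x0:x1] = new_patch
    return out
```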

NeurIPS Conference 2022 Conference Paper

On-Device Training Under 256KB Memory

  • Ji Lin
  • Ligeng Zhu
  • Wei-Ming Chen
  • Wei-Chen Wang
  • Chuang Gan
  • Song Han

On-device training enables a model to adapt to new data collected from sensors by fine-tuning a pre-trained model. Users can benefit from customized AI models without having to transfer the data to the cloud, protecting privacy. However, the training memory consumption is prohibitive for IoT devices with tiny memory resources. We propose an algorithm-system co-design framework to make on-device training possible with only 256KB of memory. On-device training faces two unique challenges: (1) the quantized graphs of neural networks are hard to optimize due to low bit-precision and the lack of normalization; (2) the limited hardware resources (memory and computation) do not allow full backpropagation. To cope with the optimization difficulty, we propose Quantization-Aware Scaling to calibrate the gradient scales and stabilize 8-bit quantized training. To reduce the memory footprint, we propose Sparse Update to skip the gradient computation of less important layers and sub-tensors. The algorithm innovation is implemented by a lightweight training system, Tiny Training Engine, which prunes the backward computation graph to support sparse updates and offloads the runtime auto-differentiation to compile time. Our framework is the first practical solution for on-device transfer learning of visual recognition on tiny IoT devices (e.g., a microcontroller with only 256KB SRAM), using less than 1/1000 of the memory of PyTorch and TensorFlow while matching the accuracy. Our study enables IoT devices not only to perform inference but also to continuously adapt to new data for on-device lifelong learning. A video demo can be found here: https://youtu.be/XaDCO8YtmBw.
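
Sparse Update can be approximated in a few lines by freezing everything and re-enabling gradients only for biases and the last few layers; the actual method selects layers and sub-tensors by measured importance, so the depth heuristic here is an assumption:

```python
import torch.nn as nn

def sparse_update(model: nn.Module, n_trainable_layers: int = 2):
    for p in model.parameters():
        p.requires_grad = False
    layers = [m for m in model.modules() if isinstance(m, (nn.Conv2d, nn.Linear))]
    for m in layers:
        if m.bias is not None:
            m.bias.requires_grad = True        # bias updates are nearly free
    for m in layers[-n_trainable_layers:]:
        m.weight.requires_grad = True          # full update for final layers
```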

TCS Journal 2021 Journal Article

A fast algorithm for source-wise round-trip spanners

  • Chun Jiang Zhu
  • Song Han
  • Kam-Yiu Lam

In this paper, we study fast constructions of source-wise round-trip spanners in weighted directed graphs. For a source vertex set $S \subseteq V$ in a graph $G(V, E)$, an $S$-sourcewise round-trip spanner of $G$ of stretch $k$ is a subgraph $H$ of $G$ such that for every pair of vertices $(u, v) \in S \times V$, their round-trip distance in $H$ is at most $k$ times their round-trip distance in $G$. We show that for a graph $G(V, E)$ with $n$ vertices and $m$ edges, an $s$-sized source vertex set $S \subseteq V$, and an integer $k > 1$, there exists an algorithm that in time $O(ms^{1/k}\log^5 n)$ constructs an $S$-sourcewise round-trip spanner of stretch $O(k \log n)$ with $O(ns^{1/k}\log^2 n)$ edges with high probability. Compared to the fast algorithms for constructing all-pairs round-trip spanners [26, 12], our algorithm improves the running time and the number of edges in the spanner when $k$ is super-constant. Compared with the existing algorithm for constructing source-wise round-trip spanners [36], our algorithm significantly improves their construction time $\Omega(\min\{ms, n^{\omega}\})$ (where $\omega \in [2, 2.373)$ is the matrix multiplication exponent) to nearly linear $O(ms^{1/k}\log^5 n)$, at the expense of an extra $O(\log n)$ factor in the stretch. As an important building block of the algorithm, we develop a graph partitioning algorithm to partition $G$ into clusters of bounded radius, and prove that for every $(u, v) \in S \times V$ at small round-trip distance, the probability of separating them into different clusters is small. The algorithm takes the size of $S$ as input and does not need knowledge of $S$ itself. With this algorithm and a reachability vertex-size estimation algorithm, we show that the recursive algorithm for constructing standard round-trip spanners [26] can be adapted to the source-wise setting. We rigorously prove the correctness and computational complexity of the adapted algorithms. Finally, we show how to remove the dependence on the edge weight in the source-wise case.
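
For concreteness, the round-trip metric the spanner must preserve, plus a brute-force stretch check, with networkx (a verification aid, not the paper's construction algorithm):

```python
import networkx as nx

def round_trip_distance(G, u, v):
    # d(u <-> v) = d(u -> v) + d(v -> u) in a weighted digraph.
    return (nx.dijkstra_path_length(G, u, v, weight="weight")
            + nx.dijkstra_path_length(G, v, u, weight="weight"))

def check_stretch(G, H, S, k):
    # An S-sourcewise round-trip spanner H of stretch k must satisfy this
    # for every pair in S x V (assumes all pairs are mutually reachable).
    return all(round_trip_distance(H, u, v) <= k * round_trip_distance(G, u, v)
               for u in S for v in G.nodes if v != u)
```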

NeurIPS Conference 2021 Conference Paper

Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning

  • Ligeng Zhu
  • Hongzhou Lin
  • Yao Lu
  • Yujun Lin
  • Song Han

Federated Learning is an emerging direction in distributed machine learning that enables jointly training a model without sharing the data. Since the data is distributed across many edge devices through wireless / long-distance connections, federated learning suffers from inevitably high communication latency. However, latency issues are underexamined in the current literature [15], and existing approaches such as FedAvg [27] become less efficient as the latency increases. To overcome this problem, we propose Delayed Gradient Averaging (DGA), which delays the averaging step to improve efficiency and allows local computation in parallel with communication. We theoretically prove that DGA attains a similar convergence rate to FedAvg, and empirically show that our algorithm can tolerate high network latency without compromising accuracy. Specifically, we benchmark the training speed on various vision (CIFAR, ImageNet) and language tasks (Shakespeare), with both IID and non-IID partitions, and show that DGA brings a 2.55$\times$ to 4.07$\times$ speedup. Moreover, we built a 16-node Raspberry Pi cluster and show that DGA can consistently speed up real-world federated learning applications.
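
A sketch of the delayed-averaging rule as the abstract describes it: keep stepping on local gradients, and when the average from `delay` steps ago arrives, swap the stale local contribution for the global one. The `all_reduce_avg` callable stands in for an asynchronous all-reduce, and the exact correction rule here is an illustrative reading of delayed averaging, not the paper's pseudocode:

```python
from collections import deque

def dga_step(params, local_grads, inflight, all_reduce_avg, lr=0.1, delay=4):
    # Apply the local gradient immediately; never wait on the network.
    for p, g in zip(params, local_grads):
        p.data -= lr * g
    inflight.append(local_grads)            # this gradient is now "in transit"
    # Once the average of the gradient sent `delay` steps ago arrives,
    # replace our stale local contribution with the global average.
    if len(inflight) > delay:
        stale = inflight.popleft()
        avg = all_reduce_avg(stale)         # stand-in for an async all-reduce
        for p, g_loc, g_avg in zip(params, stale, avg):
            p.data += lr * g_loc - lr * g_avg

# inflight = deque()   # one queue per worker, reused across steps
```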

NeurIPS Conference 2021 Conference Paper

Memory-efficient Patch-based Inference for Tiny Deep Learning

  • Ji Lin
  • Wei-Ming Chen
  • Han Cai
  • Chuang Gan
  • Song Han

Tiny deep learning on microcontroller units (MCUs) is challenging due to the limited memory size. We find that the memory bottleneck is due to the imbalanced memory distribution in convolutional neural network (CNN) designs: the first several blocks have an order of magnitude larger memory usage than the rest of the network. To alleviate this issue, we propose a generic patch-by-patch inference scheduling, which operates only on a small spatial region of the feature map and significantly cuts down the peak memory. However, naive implementation brings overlapping patches and computation overhead. We further propose receptive field redistribution to shift the receptive field and FLOPs to the later stage and reduce the computation overhead. Manually redistributing the receptive field is difficult. We automate the process with neural architecture search to jointly optimize the neural architecture and inference scheduling, leading to MCUNetV2. Patch-based inference effectively reduces the peak memory usage of existing networks by 4-8×. Co-designed with neural networks, MCUNetV2 sets a record ImageNet accuracy on MCU (71.8%) and achieves >90% accuracy on the visual wake words dataset under only 32kB SRAM. MCUNetV2 also unblocks object detection on tiny devices, achieving 16.9% higher mAP on Pascal VOC compared to the state-of-the-art result. Our study largely addressed the memory bottleneck in tinyML and paved the way for various vision applications beyond image classification.
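
Patch-by-patch execution of the memory-heavy initial stage can be sketched directly; this assumes `stage` preserves spatial size and channel count and that the input divides evenly into patches, all simplifications:

```python
import torch

def patch_based_stage(stage, x, n_patches=2, overlap=8):
    _, _, H, W = x.shape
    hs, ws = H // n_patches, W // n_patches
    out = torch.empty_like(x)
    for i in range(n_patches):
        for j in range(n_patches):
            y0, x0 = i * hs, j * ws
            cy0, cx0 = max(y0 - overlap, 0), max(x0 - overlap, 0)
            # Crop with a halo so each patch sees its receptive field.
            crop = x[:, :, cy0:y0 + hs + overlap, cx0:x0 + ws + overlap]
            res = stage(crop)   # peak memory scales with the patch, not the image
            out[:, :, y0:y0 + hs, x0:x0 + ws] = \
                res[:, :, y0 - cy0:y0 - cy0 + hs, x0 - cx0:x0 - cx0 + ws]
    return out   # feed into the later (memory-light) blocks as one tensor
```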

NeurIPS Conference 2020 Conference Paper

Differentiable Augmentation for Data-Efficient GAN Training

  • Shengyu Zhao
  • Zhijian Liu
  • Ji Lin
  • Jun-Yan Zhu
  • Song Han

The performance of generative adversarial networks (GANs) heavily deteriorates given a limited amount of training data. This is mainly because the discriminator is memorizing the exact training set. To combat it, we propose Differentiable Augmentation (DiffAugment), a simple method that improves the data efficiency of GANs by imposing various types of differentiable augmentations on both real and fake samples. Previous attempts to directly augment the training data manipulate the distribution of real images, yielding little benefit; DiffAugment enables us to adopt the differentiable augmentation for the generated samples, effectively stabilizes training, and leads to better convergence. Experiments demonstrate consistent gains of our method over a variety of GAN architectures and loss functions for both unconditional and class-conditional generation. With DiffAugment, we achieve a state-of-the-art FID of 6.80 with an IS of 100.8 on ImageNet 128×128 and 2-4× reductions of FID given 1,000 images on FFHQ and LSUN. Furthermore, with only 20% training data, we can match the top performance on CIFAR-10 and CIFAR-100. Finally, our method can generate high-fidelity images using only 100 images without pre-training, while being on par with existing transfer learning algorithms. Code is available at https://github.com/mit-han-lab/data-efficient-gans.
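
The mechanism is simply that the same differentiable transform is applied to real and fake samples inside both losses, so generator gradients flow through the augmentation; a sketch with two illustrative augmentations (not DiffAugment's exact policy set):

```python
import torch

def diff_augment(x, brightness=0.3, shift_frac=0.125):
    # Per-sample brightness jitter (differentiable: plain arithmetic).
    x = x + (torch.rand(x.size(0), 1, 1, 1, device=x.device) - 0.5) * brightness
    # Random wrap-around translation (differentiable: pure indexing).
    pad = max(int(x.size(2) * shift_frac), 1)
    dx, dy = torch.randint(-pad, pad + 1, (2,))
    return torch.roll(x, shifts=(int(dx), int(dy)), dims=(2, 3))

# The same transform is applied to real and fake samples in BOTH losses:
# d_loss = d_criterion(D(diff_augment(real)), D(diff_augment(G(z)).detach()))
# g_loss = g_criterion(D(diff_augment(G(z))))
```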

NeurIPS Conference 2020 Conference Paper

MCUNet: Tiny Deep Learning on IoT Devices

  • Ji Lin
  • Wei-Ming Chen
  • Yujun Lin
  • John Cohn
  • Chuang Gan
  • Song Han

Machine learning on tiny IoT devices based on microcontroller units (MCUs) is appealing but challenging: the memory of microcontrollers is 2-3 orders of magnitude smaller even than mobile phones. We propose MCUNet, a framework that jointly designs the efficient neural architecture (TinyNAS) and the lightweight inference engine (TinyEngine), enabling ImageNet-scale inference on microcontrollers. TinyNAS adopts a two-stage neural architecture search approach that first optimizes the search space to fit the resource constraints, then specializes the network architecture in the optimized search space. TinyNAS can automatically handle diverse constraints (i.e., device, latency, energy, memory) under low search costs. TinyNAS is co-designed with TinyEngine, a memory-efficient inference library to expand the search space and fit a larger model. TinyEngine adapts the memory scheduling according to the overall network topology rather than layer-wise optimization, reducing the memory usage by 3.4× and accelerating the inference by 1.7-3.3× compared to TF-Lite Micro [3] and CMSIS-NN [28]. MCUNet is the first to achieve >70% ImageNet top-1 accuracy on an off-the-shelf commercial microcontroller, using 3.5× less SRAM and 5.7× less Flash compared to quantized MobileNetV2 and ResNet-18. On visual & audio wake words tasks, MCUNet achieves state-of-the-art accuracy and runs 2.4-3.4× faster than MobileNetV2 and ProxylessNAS-based solutions with 3.7-4.1× smaller peak SRAM. Our study suggests that the era of always-on tiny machine learning on IoT devices has arrived.

NeurIPS Conference 2020 Conference Paper

TinyTL: Reduce Memory, Not Parameters for Efficient On-Device Learning

  • Han Cai
  • Chuang Gan
  • Ligeng Zhu
  • Song Han

Efficient on-device learning requires a small memory footprint at training time to fit the tight memory constraint. Existing work solves this problem by reducing the number of trainable parameters. However, this does not directly translate to memory saving, since the major bottleneck is the activations, not the parameters. In this work, we present Tiny-Transfer-Learning (TinyTL) for memory-efficient on-device learning. TinyTL freezes the weights while learning only the memory-efficient bias modules, so there is no need to store the intermediate activations. To maintain the adaptation capacity, we introduce a new memory-efficient bias module, the lite residual module, to refine the feature extractor by learning small residual feature maps, adding only 3.8% memory overhead. Extensive experiments show that TinyTL significantly saves memory (up to 6.5×) with little accuracy loss compared to fine-tuning the full network. Compared to fine-tuning the last layer, TinyTL provides significant accuracy improvements (up to 33.8%) with little memory overhead. Furthermore, combined with feature extractor adaptation, TinyTL provides 7.5-12.9× memory saving without sacrificing accuracy compared to fine-tuning the full Inception-V3. Code is released at https://github.com/mit-han-lab/tinyML/tree/master/tinyTL.
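
The bias-only update at TinyTL's core is a one-liner in PyTorch: frozen weights mean the activations needed for weight gradients never have to be stored. This sketch omits the lite residual modules and assumes standard ".bias" parameter naming:

```python
import torch.nn as nn

def tinytl_freeze(model: nn.Module):
    # Weight gradients need the layer inputs (activations); bias gradients
    # do not. Training biases only is what removes the activation storage.
    for name, param in model.named_parameters():
        param.requires_grad = name.endswith(".bias")
```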

AAAI Conference 2019 Conference Paper

Communication-Optimal Distributed Dynamic Graph Clustering

  • Chun Jiang Zhu
  • Tan Zhu
  • Kam-Yiu Lam
  • Song Han
  • Jinbo Bi

We consider the problem of clustering graph nodes over large-scale dynamic graphs, such as citation networks, images and web networks, when graph updates such as node/edge insertions/deletions are observed distributively. We propose communication-efficient algorithms for two well-established communication models namely the message passing and the blackboard models. Given a graph with n nodes that is observed at s remote sites over time [1, t], the two proposed algorithms have communication costs Õ(ns) and Õ(n + s) (Õ hides a polylogarithmic factor), almost matching their lower bounds, Ω(ns) and Ω(n + s), respectively, in the message passing and the blackboard models. More importantly, we prove that at each time point in [1, t] our algorithms generate clustering quality nearly as good as that of centralizing all updates up to that time and then applying a standard centralized clustering algorithm. We conducted extensive experiments on both synthetic and real-life datasets which confirmed the communication efficiency of our approach over baseline algorithms while achieving comparable clustering results.

NeurIPS Conference 2019 Conference Paper

Deep Leakage from Gradients

  • Ligeng Zhu
  • Zhijian Liu
  • Song Han

Passing gradients is a widely used scheme in modern multi-node learning systems (e.g., distributed training, collaborative learning). For a long time, people believed that gradients are safe to share, i.e., that the training data will not be leaked by gradient sharing. However, in this paper we show that private training data can be obtained from the publicly shared gradients. The leakage takes only a few gradient steps to process and recovers the original training data rather than look-alike alternatives. We name this leakage deep leakage from gradients and practically validate the effectiveness of our algorithm on both computer vision and natural language processing tasks. We empirically show that our attack is much stronger than previous approaches, and thereby raise awareness of the need to rethink gradient safety. We also discuss several possible strategies to defend against this deep leakage.
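
The attack itself is a small optimization loop: fit dummy data and labels so their gradients match the shared ones; a sketch close to the described recipe (L-BFGS and the iteration count are typical choices, but illustrative here):

```python
import torch

def deep_leakage(model, shared_grads, x_shape, n_classes, iters=100):
    dummy_x = torch.randn(x_shape, requires_grad=True)
    dummy_y = torch.randn(x_shape[0], n_classes, requires_grad=True)
    opt = torch.optim.LBFGS([dummy_x, dummy_y])

    def closure():
        opt.zero_grad()
        log_probs = model(dummy_x).log_softmax(dim=-1)
        loss = -(dummy_y.softmax(dim=-1) * log_probs).sum(dim=-1).mean()
        grads = torch.autograd.grad(loss, model.parameters(), create_graph=True)
        # Match the dummy gradients to the victim's shared gradients.
        match = sum(((g - s) ** 2).sum() for g, s in zip(grads, shared_grads))
        match.backward()
        return match

    for _ in range(iters):
        opt.step(closure)
    return dummy_x.detach(), dummy_y.softmax(dim=-1).detach()
```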

NeurIPS Conference 2019 Conference Paper

Park: An Open Platform for Learning-Augmented Computer Systems

  • Hongzi Mao
  • Parimarjan Negi
  • Akshay Narayan
  • Hanrui Wang
  • Jiacheng Yang
  • Haonan Wang
  • Ryan Marcus
  • ravichandra addanki

We present Park, a platform for researchers to experiment with Reinforcement Learning (RL) for computer systems. Using RL to improve the performance of systems has great potential, but it also differs in many ways from, for example, using RL for games. Thus, in this work we first discuss the unique challenges of RL for systems, and then propose Park, an open, extensible platform that makes it easier for ML researchers to work on systems problems. Currently, Park consists of 12 real-world system-centric optimization problems with one common, easy-to-use interface. Finally, we present the performance of existing RL approaches over these 12 problems and outline potential areas of future work.

NeurIPS Conference 2019 Conference Paper

Point-Voxel CNN for Efficient 3D Deep Learning

  • Zhijian Liu
  • Haotian Tang
  • Yujun Lin
  • Song Han

We present Point-Voxel CNN (PVCNN) for efficient, fast 3D deep learning. Previous work processes 3D data using either voxel-based or point-based NN models. However, both approaches are computationally inefficient. The computation cost and memory footprint of voxel-based models grow cubically with the input resolution, making it memory-prohibitive to scale up the resolution. As for point-based networks, up to 80% of the time is wasted on dealing with the sparse data, which have rather poor memory locality, not on the actual feature extraction. In this paper, we propose PVCNN, which represents the 3D input data in points to reduce the memory consumption, while performing the convolutions in voxels to reduce the irregular, sparse data access and improve locality. Our PVCNN model is both memory- and computation-efficient. Evaluated on semantic and part segmentation datasets, it achieves much higher accuracy than the voxel-based baseline with 10× GPU memory reduction; it also outperforms the state-of-the-art point-based models with a 7× measured speedup on average. Remarkably, the narrower version of PVCNN achieves a 2× speedup over PointNet (an extremely efficient model) on part and scene segmentation benchmarks with much higher accuracy. We validate the general effectiveness of PVCNN on 3D object detection: by replacing the primitives in Frustum PointNet with PVConv, it outperforms Frustum PointNet++ by 2.4% mAP on average with a 1.5× measured speedup and GPU memory reduction.
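
A toy version of the point-voxel idea: a cheap per-point MLP branch plus a regular 3D convolution on a coarse grid, fused by addition. Sum-pooling voxelization and nearest-voxel devoxelization here are simplifications of the paper's averaging and trilinear interpolation:

```python
import torch
import torch.nn as nn

class SimplePVConv(nn.Module):
    def __init__(self, c_in, c_out, r=16):
        super().__init__()
        self.r = r
        self.point_mlp = nn.Conv1d(c_in, c_out, 1)           # per-point MLP
        self.voxel_conv = nn.Conv3d(c_in, c_out, 3, padding=1)

    def forward(self, feats, coords):
        # feats: (B, C, N); coords: (B, N, 3), normalized to [0, 1).
        B, C, N = feats.shape
        idx = (coords * self.r).long().clamp(0, self.r - 1)  # voxel per point
        flat = idx[..., 0] * self.r * self.r + idx[..., 1] * self.r + idx[..., 2]
        grid = feats.new_zeros(B, C, self.r ** 3)
        grid.scatter_add_(2, flat.unsqueeze(1).expand(-1, C, -1), feats)
        grid = grid.view(B, C, self.r, self.r, self.r)       # voxelize (sum)
        vox = self.voxel_conv(grid).view(B, -1, self.r ** 3) # regular 3D conv
        devox = vox.gather(2, flat.unsqueeze(1).expand(-1, vox.size(1), -1))
        return devox + self.point_mlp(feats)                 # fuse both branches
```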

NeurIPS Conference 2015 Conference Paper

Learning both Weights and Connections for Efficient Neural Network

  • Song Han
  • Jeff Pool
  • John Tran
  • William Dally

Neural networks are both computationally intensive and memory intensive, making them difficult to deploy on embedded systems. Also, conventional networks fix the architecture before training starts; as a result, training cannot improve the architecture. To address these limitations, we describe a method to reduce the storage and computation required by neural networks by an order of magnitude without affecting their accuracy, by learning only the important connections. Our method prunes redundant connections using a three-step method. First, we train the network to learn which connections are important. Next, we prune the unimportant connections. Finally, we retrain the network to fine-tune the weights of the remaining connections. On the ImageNet dataset, our method reduced the number of parameters of AlexNet by a factor of 9×, from 61 million to 6.7 million, without incurring accuracy loss. Similar experiments with VGG-16 found that the total number of parameters can be reduced by 13×, from 138 million to 10.3 million, again with no loss of accuracy.
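
The prune step of the train-prune-retrain pipeline is a magnitude threshold plus a mask that keeps pruned weights at zero during retraining; a minimal sketch (per-tensor sparsity is an illustrative choice):

```python
import torch

def magnitude_prune(model, sparsity=0.9):
    masks = {}
    for name, p in model.named_parameters():
        if p.dim() > 1:                          # prune weights, keep biases
            k = max(int(p.numel() * sparsity), 1)
            threshold = p.abs().flatten().kthvalue(k).values
            masks[name] = (p.abs() > threshold).float()
            p.data *= masks[name]                # step 2: prune
    return masks

# Step 3 (retrain): after each optimizer step, keep pruned weights at zero:
# for name, p in model.named_parameters():
#     if name in masks:
#         p.data *= masks[name]
```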