Arrow Research search

Author name cluster

Fan Wu

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

56 papers
2 author rows

Possible papers

56

AAAI Conference 2026 Conference Paper

EcoAgent: An Efficient Device-Cloud Collaborative Multi-Agent Framework for Mobile Automation

  • Biao Yi
  • Xueyu Hu
  • Yurun Chen
  • Shengyu Zhang
  • Hongxia Yang
  • Fan Wu

To tackle increasingly complex tasks, recent research on mobile agents has shifted towards multi-agent collaboration. Current mobile multi-agent systems are primarily deployed in the cloud, leading to high latency and operational costs. A straightforward idea is to deploy a device–cloud collaborative multi-agent system, which is nontrivial, as directly extending existing systems introduces new challenges: (1) reliance on cloud-side verification requires uploading mobile screenshots, compromising user privacy; and (2) open-loop cooperation without device-to-cloud feedback underutilizes device resources and increases latency. To overcome these limitations, we propose EcoAgent, a closed-loop device-cloud collaborative multi-agent framework designed for privacy-aware, efficient, and responsive mobile automation. EcoAgent integrates a novel reasoning approach, Dual-ReACT, into the cloud-based Planning Agent, fully exploiting cloud reasoning to compensate for limited on-device capacity, thereby enabling device-side verification and lightweight feedback. Furthermore, the device-based Observation Agent leverages a Pre-understanding Module to summarize screen content into concise textual descriptions, significantly reducing token usage and device-cloud communication overhead while preserving privacy. Experiments on AndroidWorld demonstrate that EcoAgent matches the task success rates of fully cloud-based agents, while reducing resource consumption and response latency.

JBHI Journal 2026 Journal Article

Privacy-Preserving Data Augmentation for Digital Pathology Using Improved DCGAN

  • Fengjun Hu
  • Fan Wu
  • Dongping Zhang
  • Hanjie Gu

The intelligent analysis of Whole Slide Images (WSI) in digital pathology is critical for advancing precision medicine, particularly in oncology. However, the availability of WSI datasets is often limited by privacy regulations, which constrains the performance and generalizability of deep learning models. To address this challenge, this paper proposes an improved data augmentation method based on Deep Convolutional Generative Adversarial Network (DCGAN). Our approach leverages self-supervised pretraining with the CTransPath model to extract diverse and representationally rich WSI features, which guide the generation of high-quality synthetic images. We further enhance the model by introducing a least-squares adversarial loss and a frequency domain loss to improve pixel-level accuracy and structural fidelity, while incorporating residual blocks and skip connections to increase network depth, mitigate gradient vanishing, and improve training stability. Experimental results on the PatchCamelyon dataset demonstrate that our improved DCGAN achieves superior SSIM and FID scores compared to traditional models. The augmented datasets significantly enhance the performance of downstream classification tasks, improving accuracy, AUC, and F1 scores.

NeurIPS Conference 2025 Conference Paper

A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning

  • Yuzheng Hu
  • Fan Wu
  • Haotian Ye
  • David Forsyth
  • James Zou
  • Nan Jiang
  • Jiaqi Ma
  • Han Zhao

Online reinforcement learning (RL) excels in complex, safety-critical domains but suffers from sample inefficiency, training instability, and limited interpretability. Data attribution provides a principled way to trace model behavior back to training samples, yet existing methods assume fixed datasets, an assumption violated in online RL, where each experience both updates the policy and shapes future data collection. In this paper, we initiate the study of data attribution for online RL, focusing on the widely used Proximal Policy Optimization (PPO) algorithm. We start by establishing a local attribution framework, interpreting model checkpoints with respect to the records in the recent training buffer. We design two target functions, capturing agent action and cumulative return respectively, and measure each record's contribution through gradient similarity between its training loss and these targets. We demonstrate the power of this framework through three concrete applications: diagnosis of learning, temporal analysis of behavior formation, and targeted intervention during training. Leveraging this framework, we further propose an algorithm, iterative influence-based filtering (IIF), for online RL training that iteratively performs experience filtering to refine policy updates. From standard RL benchmarks (classic control, navigation, locomotion) to RLHF for large language models, IIF reduces sample complexity, speeds up training, and achieves higher returns. Together, these results open a new direction for making online RL more interpretable, efficient, and effective.
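The gradient-similarity scoring and iterative filtering described in the abstract can be sketched in a few lines; `influence_scores` and `iif_filter` below are hypothetical names, and this is a minimal NumPy sketch of the idea under stated assumptions, not the paper's implementation.

```python
import numpy as np

def influence_scores(sample_grads, target_grad):
    """Score each buffer record by the cosine similarity between its
    per-sample training gradient and the gradient of a target function
    (e.g., agent action likelihood or cumulative return)."""
    target = target_grad / (np.linalg.norm(target_grad) + 1e-12)
    scores = []
    for g in sample_grads:
        g_norm = np.linalg.norm(g) + 1e-12
        scores.append(float(np.dot(g, target) / g_norm))
    return np.array(scores)

def iif_filter(sample_grads, target_grad, keep_ratio=0.8):
    """One illustrative filtering round: keep the records whose
    influence on the target is highest, dropping the least helpful
    fraction before the next policy update."""
    scores = influence_scores(sample_grads, target_grad)
    k = max(1, int(keep_ratio * len(scores)))
    keep_idx = np.argsort(scores)[::-1][:k]
    return np.sort(keep_idx)
```

A record whose loss gradient points in the same direction as the target gradient scores near +1 and is retained; one that opposes the target scores near −1 and is filtered first.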

AAAI Conference 2025 Conference Paper

AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference

  • Zhuomin He
  • Yizhen Yao
  • Pengfei Zuo
  • Bin Gao
  • Qinya Li
  • Zhenzhe Zheng
  • Fan Wu

Long-context inference for large language models (LLMs) is increasingly critical, motivating a number of studies devoted to alleviating the substantial storage and computational costs in such scenarios. Layer-wise skipping methods are promising optimizations but rarely explored in long-context inference. We observe that existing layer-wise skipping strategies have several limitations when applied in long-context inference, including the inability to adapt to model and context variability, disregard for sublayer significance, and inapplicability to the prefilling phase. This paper proposes AdaSkip, an adaptive sublayer skipping method specifically designed for long-context inference. AdaSkip adaptively identifies less important layers by leveraging on-the-fly similarity information, enables sublayer-wise skipping, and accelerates both the prefilling and decoding phases. The effectiveness of AdaSkip is demonstrated through extensive experiments on various long-context benchmarks and models, showcasing its superior inference performance over existing baselines.
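A minimal sketch of similarity-based sublayer importance, assuming redundancy is measured as cosine similarity between a sublayer's input and output hidden states (the paper's exact on-the-fly criterion may differ); `sublayer_redundancy` and `pick_skippable` are hypothetical helpers.

```python
import numpy as np

def sublayer_redundancy(h_in, h_out):
    """Cosine similarity between a sublayer's input and output hidden
    states; values near 1 suggest the sublayer barely transforms the
    representation, making it a skip candidate."""
    a, b = h_in.ravel(), h_out.ravel()
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

def pick_skippable(hidden_pairs, n_skip):
    """Given (input, output) hidden-state pairs per sublayer, return the
    indices of the n_skip most redundant sublayers."""
    sims = [sublayer_redundancy(hi, ho) for hi, ho in hidden_pairs]
    return sorted(np.argsort(sims)[::-1][:n_skip].tolist())
```

Because the similarity is computed from activations observed during the current request, the skip set can adapt per model and per context rather than being fixed offline.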

JBHI Journal 2025 Journal Article

Comparative Efficacy of Commercial Wearables for Circadian Rhythm Home Monitoring From Activity, Heart Rate, and Core Body Temperature

  • Fan Wu
  • Patrick Langer
  • Jinjoo Shim
  • Elgar Fleisch
  • Filipe Barata

Circadian rhythms govern biological patterns that follow a 24-hour cycle. Dysfunctions in circadian rhythms can contribute to various health problems, such as sleep disorders. Current circadian rhythm assessment methods, often invasive or subjective, limit circadian rhythm monitoring to laboratories. Hence, this study aims to investigate scalable consumer-centric wearables for circadian rhythm monitoring outside traditional laboratories. In a two-week longitudinal study conducted in real-world settings, 36 participants wore an Actigraph, a smartwatch, and a core body temperature sensor to collect activity, temperature, and heart rate data. We evaluated circadian rhythms calculated from commercial wearables by comparing them with circadian rhythm reference measures, i.e., Actigraph activities and chronotype questionnaire scores. The circadian rhythm metric acrophases, determined from commercial wearables using activity, heart rate, and temperature data, significantly correlated with the acrophase derived from Actigraph activities (r = 0.96, r = 0.87, r = 0.79; all p $<$ 0.001) and chronotype questionnaire (r = −0.66, r = −0.73, r = −0.61; all p $<$ 0.001). The acrophases obtained concurrently from consumer sensors significantly predicted the chronotype ($R^{2}$ = 0.64; p $<$ 0.001). Our study validates commercial sensors for circadian rhythm assessment, highlighting their potential to support maintaining healthy rhythms and provide scalable and timely health monitoring in real-life scenarios.
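The acrophase metric above (the clock time at which the fitted rhythm peaks) is conventionally obtained from a cosinor, i.e. cosine-fit, regression; the sketch below assumes that standard formulation and is not necessarily the study's exact pipeline.

```python
import numpy as np

def cosinor_acrophase(t_hours, y, period=24.0):
    """Least-squares cosinor fit y ≈ M + a·cos(ωt) + b·sin(ωt) with
    ω = 2π/period; the acrophase is the clock time (in hours) at which
    the fitted rhythm peaks."""
    w = 2.0 * np.pi / period
    X = np.column_stack([np.ones_like(t_hours),
                         np.cos(w * t_hours),
                         np.sin(w * t_hours)])
    _, a, b = np.linalg.lstsq(X, y, rcond=None)[0]
    phi = np.arctan2(b, a)      # a·cos(wt) + b·sin(wt) = A·cos(wt − phi)
    return (phi / w) % period   # peak time in hours
```

Applied to activity, heart-rate, or temperature time series, the same fit yields per-signal acrophases that can then be correlated across sensors, as in the study.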

NeurIPS Conference 2025 Conference Paper

CORE: Reducing UI Exposure in Mobile Agents via Collaboration Between Cloud and Local LLMs

  • Gucongcong Fan
  • Chaoyue Niu
  • Chengfei Lyu
  • Fan Wu
  • Guihai Chen

Mobile agents rely on Large Language Models (LLMs) to plan and execute tasks on smartphone user interfaces (UIs). While cloud-based LLMs achieve high task accuracy, they require uploading the full UI state at every step, exposing unnecessary and often irrelevant information. In contrast, local LLMs avoid UI uploads but suffer from limited capacity, resulting in lower task success rates. We propose $\textbf{CORE}$, a $\textbf{CO}$llaborative framework that combines the strengths of cloud and local LLMs to $\textbf{R}$educe UI $\textbf{E}$xposure, while maintaining task accuracy for mobile agents. CORE comprises three key components: (1) $\textbf{Layout-aware block partitioning}$, which groups semantically related UI elements based on the XML screen hierarchy; (2) $\textbf{Co-planning}$, where local and cloud LLMs collaboratively identify the current sub-task; and (3) $\textbf{Co-decision-making}$, where the local LLM ranks relevant UI blocks, and the cloud LLM selects specific UI elements within the top-ranked block. CORE further introduces a multi-round accumulation mechanism to mitigate local misjudgment or limited context. Experiments across diverse mobile apps and tasks show that CORE reduces UI exposure by up to 55.6\% while maintaining task success rates slightly below cloud-only agents, effectively mitigating unnecessary privacy exposure to the cloud. The code is available at https://github.com/Entropy-Fighter/CORE.
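Under a simplifying assumption, layout-aware block partitioning can be approximated by grouping leaf elements under their parent container in the XML screen hierarchy; `partition_blocks` is a hypothetical helper for an Android-style UI dump, not the released implementation.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

def partition_blocks(ui_xml):
    """Group leaf UI elements by their parent container: elements that
    share a container in the XML hierarchy form one block, a rough
    proxy for semantic relatedness in the screen layout."""
    root = ET.fromstring(ui_xml)
    blocks = defaultdict(list)
    for parent in root.iter():
        for child in parent:
            if len(child) == 0:  # leaf element with no nested children
                label = child.get("text") or child.get("content-desc") or child.tag
                blocks[id(parent)].append(label)
    return [members for members in blocks.values() if members]
```

In the framework described above, only the top-ranked block (rather than the whole screen) would then be forwarded to the cloud LLM, which is what limits UI exposure.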

IJCAI Conference 2025 Conference Paper

Device-Cloud Collaborative Correction for On-Device Recommendation

  • Tianyu Zhan
  • Shengyu Zhang
  • Zheqi Lv
  • Jieming Zhu
  • Jiwei Li
  • Fan Wu
  • Fei Wu

With the rapid development of recommendation models and device computing power, device-based recommendation has become an important research area due to its better real-time performance and privacy protection. Previously, Transformer-based sequential recommendation models have been widely applied in this field because they outperform Recurrent Neural Network (RNN)-based recommendation models in terms of performance. However, as the length of interaction sequences increases, Transformer-based models introduce significantly more space and computational overhead compared to RNN-based models, posing challenges for device-based recommendation. To balance real-time performance and high performance on devices, we propose Device-Cloud Collaborative Correction Framework for On-Device Recommendation (CoCorrRec). CoCorrRec uses a self-correction network (SCN) to correct parameters with extremely low time cost. By updating model parameters during testing based on the input token, it achieves performance comparable to current optimal but more complex Transformer-based models. Furthermore, to prevent SCN from overfitting, we design a global correction network (GCN) that processes hidden states uploaded from devices and provides a global correction solution. Extensive experiments on multiple datasets show that CoCorrRec outperforms existing Transformer-based and RNN-based device recommendation models in terms of performance, with fewer parameters and lower FLOPs, thereby achieving a balance between real-time performance and high efficiency. Code is available at https://github.com/Yuzt-zju/CoCorrRec.

NeurIPS Conference 2025 Conference Paper

FedRAM: Federated Reweighting and Aggregation for Multi-Task Learning

  • Fan Wu
  • Xinyu Yan
  • Jiabei Liu
  • Wei Yang Bryan Lim

Federated Multi-Task Learning (FL-MTL) enables clients with heterogeneous data to collaboratively train models capable of handling multiple downstream tasks. However, FL-MTL faces key challenges, including statistical heterogeneity, task interference, and the need to balance local learning with global knowledge sharing. Traditional methods like FedAvg struggle in such settings due to the lack of explicit mechanisms to address these issues. In this paper, we propose FedRAM, a three-step framework that progressively updates two scalar hyperparameters: the task importance weight and the client aggregation coefficient. FedRAM introduces a reference-proxy-agent strategy, where the proxy model serves as an intermediate between the local reference model and the global agent model. This design reduces the need for repeated local training while preserving local performance. Extensive experiments on six real-world FL-MTL benchmarks show that FedRAM improves performance by at least 3$\%$ over the best baseline on both in-domain and out-of-domain tasks, while reducing computational cost by 15$\times$. These results make FedRAM a robust and practical solution for large-scale FL-MTL applications. The code is available at \url{https://github.com/wwffvv/FedRAM}.

ICLR Conference 2025 Conference Paper

Generative Monoculture in Large Language Models

  • Fan Wu
  • Emily Black
  • Varun Chandrasekaran

We introduce {\em generative monoculture}, a behavior observed in large language models (LLMs) characterized by a significant narrowing of model output diversity relative to available training data for a given task: for example, generating only positive book reviews for books with a mixed reception. While in some cases, generative monoculture enhances performance (e.g., LLMs more often produce efficient code), the dangers are exacerbated in others (e.g., LLMs refuse to share diverse opinions). As LLMs are increasingly used in high-impact settings such as education and web search, careful maintenance of LLM output diversity is essential to ensure a variety of facts and perspectives are preserved over time. We experimentally demonstrate the prevalence of generative monoculture through analysis of book review and code generation tasks, and find that simple countermeasures such as altering sampling or prompting strategies are insufficient to mitigate the behavior. Moreover, our results suggest that the root causes of generative monoculture are likely embedded within the LLM's alignment processes, suggesting a need for developing fine-tuning paradigms that preserve or promote diversity.

AAAI Conference 2025 Conference Paper

How Does the Smoothness Approximation Method Facilitate Generalization for Federated Adversarial Learning?

  • Wenjun Ding
  • Ying An
  • Lixing Chen
  • Shichao Kan
  • Fan Wu
  • Zhe Qu

Federated Adversarial Learning (FAL) is a robust framework for resisting adversarial attacks on federated learning. Although some FAL studies have developed efficient algorithms, they primarily focus on convergence performance and overlook generalization. Generalization is crucial for evaluating algorithm performance on unseen data. However, generalization analysis is more challenging due to non-smooth adversarial loss functions. A common approach to addressing this issue is to leverage smoothness approximation. In this paper, we develop algorithm stability measures to evaluate the generalization performance of two popular FAL algorithms: Vanilla FAL (VFAL) and Slack FAL (SFAL), using three different smooth approximation methods: (1) Surrogate Smoothness Approximation (SSA), (2) Randomized Smoothness Approximation (RSA), and (3) Over-Parameterized Smoothness Approximation (OPSA). Based on our in-depth analysis, we answer how to properly set the smoothness approximation method to mitigate generalization error in FAL. Moreover, we identify RSA as the most effective generalization error reduction method. In highly data-heterogeneous scenarios, we also recommend employing SFAL to mitigate the deterioration of generalization performance caused by heterogeneity. Based on our theoretical results, we provide insights to help develop more efficient FAL algorithms, such as designing new metrics and dynamic aggregation rules to mitigate heterogeneity.

ICML Conference 2025 Conference Paper

It's My Data Too: Private ML for Datasets with Multi-User Training Examples

  • Arun Ganesh
  • Ryan McKenna
  • H. Brendan McMahan
  • Adam Smith 0006
  • Fan Wu

We initiate a study of algorithms for model training with user-level differential privacy (DP), where each example may be attributed to multiple users, which we call the multi-attribution model. We first provide a carefully chosen definition of user-level DP under the multi-attribution model. Training in the multi-attribution model is facilitated by solving the contribution bounding problem, i.e., the problem of selecting a subset of the dataset for which each user is associated with a limited number of examples. We propose a greedy baseline algorithm for the contribution bounding problem. We then empirically study this algorithm for a synthetic logistic regression task and a transformer training task, including studying variants of this baseline algorithm that optimize the subset chosen using different techniques and criteria. We find that the baseline algorithm remains competitive with its variants in most settings, and build a better understanding of the practical importance of a bias-variance tradeoff inherent in solutions to the contribution bounding problem.
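The greedy baseline for contribution bounding can be read as: scan the examples in order and keep one only if every user attributed to it is still under a per-user budget `k`. The sketch below is an illustrative reading of the abstract, not the paper's exact algorithm.

```python
from collections import Counter

def bound_contributions(examples, k):
    """Greedy contribution bounding: each element of `examples` is the
    set of users attributed to that example; keep an example only if
    every one of its users has fewer than k kept examples so far."""
    used = Counter()
    kept = []
    for idx, users in enumerate(examples):
        if all(used[u] < k for u in users):
            for u in users:
                used[u] += 1
            kept.append(idx)
    return kept
```

Because dropping an example affects the budgets of all its users at once, scan order matters, which is one reason the paper studies variants that optimize the chosen subset.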

AAAI Conference 2025 Conference Paper

Learning Optimal Auctions with Correlated Value Distributions

  • Da Huo
  • Zhenzhe Zheng
  • Fan Wu

The correlation of values commonly exists in auctions, which can be further exploited to improve revenue. However, the complex correlation structure makes it hard to manually design the optimal auction mechanism. Data-driven auction mechanisms, powered by machine learning, make it possible to design auctions directly from historical auction data, without relying on specific value distributions. In this work, we synthesize the learning-based auction and the characteristics of strategy-proofness in the correlated value setting, and propose a new auction mechanism, namely Conditional Auction Net (CAN). The CAN can encode the correlation of values into the rank score of each bidder, and further adjust the allocation rule to approach the optimal revenue. The property of strategy-proofness is guaranteed by encoding the game theoretical condition into the neural network structure. Furthermore, all operations in the designed auctions are differentiable to enable an end-to-end training paradigm. We also show that CAN can provide a large solution space to adequately encode the correlation of values. Experimental results demonstrate that the proposed auction mechanism can represent almost any strategy-proof auction mechanism, and outperforms the auction mechanisms widely used in the correlated value settings.

NeurIPS Conference 2025 Conference Paper

Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance

  • Meng Wang
  • Fan Wu
  • Ruihui Li
  • Qin Yunchuan
  • Zhuo Tang
  • Li Ken Li

3D Semantic Scene Completion (SSC) provides comprehensive scene geometry and semantics for autonomous driving perception, which is crucial for enabling accurate and reliable decision-making. However, existing SSC methods are limited to capturing sparse information from the current frame or naively stacking multi-frame temporal features, thereby failing to acquire effective scene context. These approaches ignore critical motion dynamics and struggle to achieve temporal consistency. To address the above challenges, we propose a novel temporal SSC method FlowScene: Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance. By leveraging optical flow, FlowScene can integrate motion, different viewpoints, occlusions, and other contextual cues, thereby significantly improving the accuracy of 3D scene completion. Specifically, our framework introduces two key components: (1) a Flow-Guided Temporal Aggregation module that aligns and aggregates temporal features using optical flow, capturing motion-aware context and deformable structures; and (2) an Occlusion-Guided Voxel Refinement module that injects occlusion masks and temporally aggregated features into 3D voxel space, adaptively refining voxel representations for explicit geometric modeling. Experimental results demonstrate that FlowScene achieves state-of-the-art performance, with mIoU of 17.70 and 20.81 on the SemanticKITTI and SSCBench-KITTI-360 benchmarks.

AAAI Conference 2025 Conference Paper

MergeNet: Knowledge Migration Across Heterogeneous Models, Tasks, and Modalities

  • Kunxi Li
  • Tianyu Zhan
  • Kairui Fu
  • Shengyu Zhang
  • Kun Kuang
  • Jiwei Li
  • Zhou Zhao
  • Fan Wu

In this study, we focus on heterogeneous knowledge transfer across entirely different model architectures, tasks, and modalities. Existing knowledge transfer methods (e.g., backbone sharing, knowledge distillation) often hinge on shared elements within model structures or task-specific features/labels, limiting transfers to complex model types or tasks. To overcome these challenges, we present MergeNet, which learns to bridge the gap of parameter spaces of heterogeneous models, facilitating the direct interaction, extraction, and application of knowledge within these parameter spaces. The core mechanism of MergeNet lies in the parameter adapter, which operates by querying the source model's low-rank parameters and adeptly learning to identify and map parameters into the target model. MergeNet is learned alongside both models, allowing our framework to dynamically transfer and adapt knowledge relevant to the current stage, including the training trajectory knowledge of the source model. Extensive experiments on heterogeneous knowledge transfer demonstrate significant improvements in challenging settings, where representative approaches may falter or prove less applicable.

NeurIPS Conference 2025 Conference Paper

On the Stability and Generalization of Meta-Learning: the Impact of Inner-Levels

  • Wenjun Ding
  • Jingling Liu
  • Lixing Chen
  • Xiu Su
  • Tao Sun
  • Fan Wu
  • Zhe Qu

Meta-learning has achieved significant advancements, with generalization emerging as a key metric for evaluating meta-learning algorithms. While recent studies have mainly focused on training strategies, data-split methods, and tightening generalization bounds, they often ignore the impact of inner-levels on generalization. To bridge this gap, this paper focuses on several prominent meta-learning algorithms and establishes two generalization analytical frameworks for them based on their inner-level processes: the Gradient Descent Framework (GDF) and the Proximal Descent Framework (PDF). Within these frameworks, we introduce two novel algorithmic stability definitions and derive the corresponding generalization bounds. Our findings reveal a trade-off of inner-levels under GDF, whereas PDF exhibits a beneficial relationship. Moreover, we highlight the critical role of the meta-objective function in minimizing generalization error. Inspired by this, we propose a new, simplified meta-objective function definition to enhance generalization performance. Extensive real-world experiments support our findings and demonstrate the improvement brought by the new meta-objective function.

IJCAI Conference 2025 Conference Paper

Optimizing the Battery-Swapping Problem in Urban E-Bike Systems with Reinforcement Learning

  • Wenjing Li
  • Zhao Li
  • Xuanwu Liu
  • Ruihao Zhu
  • Zhenzhe Zheng
  • Fan Wu

E-bikes (EBs) are a key transportation mode in urban areas, especially for couriers of delivery platforms, but underdeveloped EB systems can hinder couriers' productivity due to limited battery capacity. Battery-swapping stations address this issue by enabling riders to exchange depleted batteries for fully charged ones. However, managing supply and demand (SnD) imbalances at these stations has become increasingly complex. To address this, we introduce a new approach that formulates the Battery-Swapping Problem (BSP) as a discrete-time Markov Decision Process (MDP) to capture the dynamics of SnD imbalances. Building on it, we propose a Wasserstein-enhanced Proximal Policy Optimization (W-PPO) algorithm, which integrates Wasserstein distance with reinforcement learning to improve the robustness against uncertainty in forecasting SnD. W-PPO provides a BSP-specific, accurate loss function that reflects reward variations between two policies under real-world simulation. The algorithm's effectiveness is assessed using key metrics: Shared Battery Utilization Ratio (SBUR) and Battery Supply Ratio (BSR). Simulations on real-world datasets show that W-PPO achieves a 30.59% improvement in SBUR and a 16.09% increase in BSR, demonstrating practical applicability. By optimizing battery utilization and improving EB delivery systems, this work highlights the potential of AI for creating efficient and sustainable urban transportation solutions.

NeurIPS Conference 2025 Conference Paper

PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation

  • Wei Zhou
  • Guoliang Li
  • Haoyu Wang
  • Yuxing Han
  • Xufei Wu
  • Fan Wu
  • Xuanhe Zhou

Large language models (LLMs) have shown increasing effectiveness in Text-to-SQL tasks. However, another closely related problem, Cross-System SQL Translation (a.k.a. SQL-to-SQL), which adapts a query written for one database system (e.g., MySQL) into its equivalent one for another system (e.g., ClickHouse), is of great practical importance but remains underexplored. Existing SQL benchmarks are not well-suited for SQL-to-SQL evaluation, as they (1) focus on a limited set of database systems (often just SQLite) and (2) cannot capture many system-specific SQL dialects (e.g., customized functions, data types, and syntax rules). Thus, in this paper, we introduce PARROT, a Practical And Realistic BenchmaRk for CrOss-System SQL Translation. PARROT comprises 598 translation pairs from 38 open-source benchmarks and real-world business services, specifically prepared to challenge system-specific SQL understanding (e.g., LLMs achieve lower than 38.53% accuracy on average). We also provide multiple benchmark variants, including PARROT-Diverse with 28,003 translation pairs (for extensive syntax testing) and PARROT-Simple with 5,306 representative samples (for focused stress testing), covering 22 production-grade database systems. To promote future research, we release a public leaderboard and source code at: https://code4db.github.io/parrot-bench/.

NeurIPS Conference 2025 Conference Paper

RAGRouter: Learning to Route Queries to Multiple Retrieval-Augmented Language Models

  • Jiarui Zhang
  • Xiangyu Liu
  • Yong Hu
  • Chaoyue Niu
  • Fan Wu
  • Guihai Chen

Retrieval-Augmented Generation (RAG) significantly improves the performance of Large Language Models (LLMs) on knowledge-intensive tasks. However, varying response quality across LLMs under RAG necessitates intelligent routing mechanisms, which select the most suitable model for each query from multiple retrieval-augmented LLMs via a dedicated router model. We observe that external documents dynamically affect LLMs' ability to answer queries, while existing routing methods, which rely on static parametric knowledge representations, exhibit suboptimal performance in RAG scenarios. To address this, we formally define the new retrieval-augmented LLM routing problem, incorporating the influence of retrieved documents into the routing framework. We propose RAGRouter, a RAG-aware routing design, which leverages document embeddings and RAG capability embeddings with contrastive learning to capture knowledge representation shifts and enable informed routing decisions. Extensive experiments on diverse knowledge-intensive tasks and retrieval settings, covering open and closed-source LLMs, show that RAGRouter outperforms the best individual LLM and existing routing methods. With an extended score-threshold-based mechanism, it also achieves strong performance-efficiency trade-offs under low-latency constraints. The code and data are available at https://github.com/OwwO99/RAGRouter.

AAAI Conference 2024 Conference Paper

G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection

  • Fan Wu
  • Jinling Gao
  • Lanqing Hong
  • Xinbing Wang
  • Chenghu Zhou
  • Nanyang Ye

In this paper, we focus on a realistic yet challenging task, Single Domain Generalization Object Detection (S-DGOD), where object detectors are trained on data from only one source domain but must generalize to multiple distinct target domains. In S-DGOD, both high-capacity fitting and generalization abilities are needed due to the task's complexity. Differentiable Neural Architecture Search (NAS) is known for its high capacity for complex data fitting and we propose to leverage Differentiable NAS to solve S-DGOD. However, it may confront severe over-fitting issues due to the feature imbalance phenomenon, where parameters optimized by gradient descent are biased to learn from the easy-to-learn features, which are usually non-causal and spuriously correlated to ground truth labels, such as the features of background in object detection data. Consequently, this leads to serious performance degradation, especially in generalizing to unseen target domains with huge domain gaps between the source domain and target domains. To address this issue, we propose the Generalizable loss (G-loss), which is an OoD-aware objective, preventing NAS from over-fitting by using gradient descent to optimize parameters not only on a subset of easy-to-learn features but also the remaining predictive features for generalization, and the overall framework is named G-NAS. Experimental results on the S-DGOD urban-scene datasets demonstrate that the proposed G-NAS achieves SOTA performance compared to baseline methods. Codes are available at https://github.com/wufan-cse/G-NAS.

AAAI Conference 2024 Conference Paper

OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning

  • Fan Wu
  • Rui Zhang
  • Qi Yi
  • Yunkai Gao
  • Jiaming Guo
  • Shaohui Peng
  • Siming Lan
  • Husheng Han

Model-based offline reinforcement learning (RL) algorithms have emerged as a promising paradigm for offline RL. These algorithms usually learn a dynamics model from a static dataset of transitions, use the model to generate synthetic trajectories, and perform conservative policy optimization within these trajectories. However, our observations indicate that policy optimization methods used in these model-based offline RL algorithms are not effective at exploring the learned model and induce biased exploration, which ultimately impairs the performance of the algorithm. To address this issue, we propose Offline Conservative ExplorAtioN (OCEAN), a novel rollout approach to model-based offline RL. In our method, we incorporate additional exploration techniques and introduce three conservative constraints based on uncertainty estimation to mitigate the potential impact of significant dynamic errors resulting from exploratory transitions. Our work is a plug-in method and can be combined with classical model-based RL algorithms, such as MOPO, COMBO, and RAMBO. Experimental results on the D4RL MuJoCo benchmark show that OCEAN significantly improves the performance of existing algorithms.

ICLR Conference 2024 Conference Paper

Privately Aligning Language Models with Reinforcement Learning

  • Fan Wu
  • Huseyin A. Inan
  • Arturs Backurs
  • Varun Chandrasekaran
  • Janardhan Kulkarni
  • Robert Sim

Positioned between pre-training and user deployment, aligning large language models (LLMs) through reinforcement learning (RL) has emerged as a prevailing strategy for training instruction-following models such as ChatGPT. In this work, we initiate the study of privacy-preserving alignment of LLMs through Differential Privacy (DP) in conjunction with RL. Following the influential work of Ziegler et al. (2020), we study two dominant paradigms: (i) alignment via RL without human in the loop (e.g., positive review generation) and (ii) alignment via RL from human feedback (RLHF) (e.g., summarization in a human-preferred way). We give a new DP framework to achieve alignment via RL, and prove its correctness. Our experimental results validate the effectiveness of our approach, offering competitive utility while ensuring strong privacy protections.

IROS Conference 2024 Conference Paper

Revolutionizing Battery Disassembly: The Design and Implementation of a Battery Disassembly Autonomous Mobile Manipulator Robot (BEAM-1)

  • Yanlong Peng
  • Zhigang Wang
  • Yisheng Zhang
  • Shengmin Zhang
  • Nan Cai
  • Fan Wu
  • Ming Chen

The efficient disassembly of end-of-life electric vehicle batteries (EOL-EVBs) is crucial for green manufacturing and sustainable development. The current pre-programmed disassembly conducted by the Autonomous Mobile Manipulator Robot (AMMR) struggles to meet disassembly requirements in dynamic environments, complex scenarios, and unstructured processes. In this paper, we propose a Battery Disassembly AMMR (BEAM-1) system based on Neuro-Symbolic AI. It detects the environmental state by leveraging a combination of multi-sensors and neural predicates and then translates this information into a quasi-symbolic space. In real time, it identifies the optimal sequence of action primitives through LLM-heuristic tree search, ensuring high-precision execution of these primitives. Additionally, it employs positional speculative sampling using intuitive networks and achieves the disassembly of various bolt types with a meticulously designed end-effector. Importantly, BEAM-1 is a continuously learning embodied intelligence system capable of human-like subjective reasoning and intuition. A large number of real-scene experiments have shown that it can autonomously perceive, decide, and execute to complete the continuous disassembly of bolts in multiple, multi-category, and complex situations, with a success rate of 98.78%. This research attempts to use Neuro-Symbolic AI to give robots real autonomous reasoning, planning, and learning capabilities. BEAM-1 realizes the revolution of battery disassembly. Its framework can be easily ported to any robotic system to realize different application scenarios, which provides a ground-breaking idea for the design and implementation of future embodied intelligent robotic systems.

AAMAS Conference 2024 Conference Paper

Towards Efficient Auction Design with ROI Constraints

  • Xinyu Tang
  • Hongtao Lv
  • Yingjie Gao
  • Fan Wu
  • Lei Liu
  • Lizhen Cui

Online advertising stands as a significant revenue source of the Internet. Recently, the trend among advertisers tilting towards the use of auto-bidding tools has heralded the emergence of a new model of bidders operating with constraints related to return on investment (ROI). However, most current research on ROI-constrained bidders in auction design assumes that only the ROI constraints or only the values of bidders are private, while in reality it is more practical to keep both private. Designing a truthful mechanism for bidders with both private values and private ROI constraints introduces complexities because it is a multi-parameter mechanism design problem. To remedy this, we divide bidders into two classes: the traditional utility maximizers (UMs), who can be viewed as having an ROI constraint of 1, and the ROI-constrained bidders (RBs), who share a fixed ROI constraint denoted by γ. This framework retains the essence of multi-parameter mechanism design while transitioning it into a more tractable form. We then introduce a novel auction mechanism, cleverly combining the conventional VCG mechanism with an existing mechanism for public ROI-constrained bidders known as Cavallo's mechanism. Our mechanism achieves an approximation ratio of 3/2 on social welfare. Additionally, we unearth new insights into the limitations posed by ROI constraints: when the ROI constraint γ exceeds 2, the lower bound on social welfare is 5/4; when it falls below 2, the lower bound becomes (3 + γ)/(2 + 3γ − γ²).
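The RB constraint and the guarantees the abstract states can be written compactly as follows; the symbols here are our transcription, not necessarily the paper's notation:

```latex
% ROI constraint for an RB with value v_i and payment p_i:
\frac{v_i}{p_i} \ge \gamma
\quad\Longleftrightarrow\quad
p_i \le \frac{v_i}{\gamma}, \qquad \gamma \ge 1.
% Guarantees stated in the abstract:
%   approximation ratio on social welfare: 3/2;
%   lower bounds: 5/4 for \gamma > 2, and
%   (3+\gamma)/(2+3\gamma-\gamma^2) for \gamma < 2.
```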

AAAI Conference 2024 Conference Paper

VIGC: Visual Instruction Generation and Correction

  • Bin Wang
  • Fan Wu
  • Xiao Han
  • Jiahui Peng
  • Huaping Zhong
  • Pan Zhang
  • Xiaoyi Dong
  • Weijia Li

The integration of visual encoders and large language models (LLMs) has driven recent progress in multimodal large language models (MLLMs). However, the scarcity of high-quality instruction-tuning data for vision-language tasks remains a challenge. The current leading paradigm, exemplified by LLaVA, relies on language-only GPT-4 to generate data, which requires pre-annotated image captions and detection bounding boxes and thus struggles to capture image details. A practical solution to this problem would be to utilize available multimodal large language models to generate instruction data for vision-language tasks. However, the currently accessible MLLMs are not as powerful as their LLM counterparts, as they tend to produce inadequate responses and generate false information. To address this issue, this paper proposes the Visual Instruction Generation and Correction (VIGC) framework, which enables multimodal large language models to generate instruction-tuning data and progressively enhance their quality on the fly. Specifically, Visual Instruction Generation (VIG) guides the vision-language model to generate diverse instruction-tuning data. To ensure generation quality, Visual Instruction Correction (VIC) adopts an iterative update mechanism to correct any inaccuracies in data produced by VIG, effectively reducing the risk of hallucination. Leveraging the diverse, high-quality data generated by VIGC, we finetune mainstream models and validate data quality based on various evaluations. Experimental results demonstrate that VIGC not only compensates for the shortcomings of language-only data generation methods, but also effectively enhances benchmark performance. The models, datasets, and code are available at https://opendatalab.github.io/VIGC

NeurIPS Conference 2023 Conference Paper

Context Shift Reduction for Offline Meta-Reinforcement Learning

  • Yunkai Gao
  • Rui Zhang
  • Jiaming Guo
  • Fan Wu
  • Qi Yi
  • Shaohui Peng
  • Siming Lan
  • Ruizhi Chen

Offline meta-reinforcement learning (OMRL) utilizes pre-collected offline datasets to enhance the agent's generalization ability on unseen tasks. However, the context shift problem arises due to the distribution discrepancy between the contexts used for training (from the behavior policy) and testing (from the exploration policy). The context shift problem leads to incorrect task inference and further deteriorates the generalization ability of the meta-policy. Existing OMRL methods either overlook this problem or attempt to mitigate it with additional information. In this paper, we propose a novel approach called Context Shift Reduction for OMRL (CSRO) to address the context shift problem with only offline datasets. The key insight of CSRO is to minimize the influence of policy in context during both the meta-training and meta-test phases. During meta-training, we design a max-min mutual information representation learning mechanism to diminish the impact of the behavior policy on task representation. In the meta-test phase, we introduce the non-prior context collection strategy to reduce the effect of the exploration policy. Experimental results demonstrate that CSRO significantly reduces the context shift and improves the generalization ability, surpassing previous methods across various challenging domains.

NeurIPS Conference 2023 Conference Paper

Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning

  • Siming Lan
  • Rui Zhang
  • Qi Yi
  • Jiaming Guo
  • Shaohui Peng
  • Yunkai Gao
  • Fan Wu
  • Ruizhi Chen

In the field of multi-task reinforcement learning, the modular principle, which involves specializing functionalities into different modules and combining them appropriately, has been widely adopted as a promising approach to prevent the negative transfer problem, i.e., performance degradation due to conflicts between tasks. However, most existing multi-task RL methods only combine shared modules at the task level, ignoring that there may be conflicts within a task. In addition, these methods do not take into account that, without constraints, some modules may learn similar functions, restricting the expressiveness and generalization capability of modular methods. In this paper, we propose the Contrastive Modules with Temporal Attention (CMTA) method to address these limitations. CMTA constrains the modules to be different from each other via contrastive learning and combines shared modules at a finer granularity than the task level with temporal attention, alleviating negative transfer within the task and improving the generalization ability and performance of multi-task RL. We conducted experiments on Meta-World, a multi-task RL benchmark containing various robotic manipulation tasks. Experimental results show that CMTA outperforms learning each task individually for the first time and achieves substantial performance improvements over the baselines.

NeurIPS Conference 2023 Conference Paper

Decompose a Task into Generalizable Subtasks in Multi-Agent Reinforcement Learning

  • Zikang Tian
  • Ruizhi Chen
  • Xing Hu
  • Ling Li
  • Rui Zhang
  • Fan Wu
  • Shaohui Peng
  • Jiaming Guo

In recent years, Multi-Agent Reinforcement Learning (MARL) techniques have made significant strides in achieving high asymptotic performance on single tasks. However, there has been limited exploration of model transferability across tasks. Training a model from scratch for each task can be time-consuming and expensive, especially for large-scale Multi-Agent Systems. Therefore, it is crucial to develop methods for generalizing the model across tasks. Considering that there exist task-independent subtasks across MARL tasks, a model that can decompose such subtasks from the source task could generalize to target tasks. However, ensuring true task-independence of subtasks poses a challenge. In this paper, we propose to decompose a task into a series of generalizable subtasks (DT2GS), a novel framework that addresses this challenge by utilizing a scalable subtask encoder and an adaptive subtask semantic module. We show that these components endow subtasks with two properties critical for task-independence: avoiding overfitting to the source task and maintaining consistent yet scalable semantics across tasks. Empirical results demonstrate that DT2GS possesses sound zero-shot generalization capability across tasks, exhibits sufficient transferability, and outperforms existing methods in both multi-task and single-task problems.

TIST Journal 2023 Journal Article

Enough Waiting for the Couriers: Learning to Estimate Package Pick-up Arrival Time from Couriers’ Spatial-Temporal Behaviors

  • Haomin Wen
  • Youfang Lin
  • Fan Wu
  • Huaiyu Wan
  • Zhongxiang Sun
  • Tianyue Cai
  • Hongyu Liu
  • Shengnan Guo

In intelligent logistics systems, predicting the Estimated Time of Pick-up Arrival (ETPA) of packages is a crucial task, which aims to predict the courier's arrival time for all unpicked-up packages at any time. Accurate prediction of ETPA can help systems alleviate customers' waiting anxiety and improve their experience. We identify three main challenges of this problem. First, unlike the travel time estimation problem in other fields like ride-hailing, the ETPA task is distinctively a multi-destination and path-free prediction problem. Second, an intuitive idea for solving ETPA is to predict the pick-up route and then the time in two stages. However, it is difficult to accurately and efficiently predict couriers' future routes in the route prediction step, since their behaviors are affected by multiple complex factors. Third, in the time prediction step, the requirement to provide the ETPA of all of a courier's unpicked-up packages at once in real time makes the problem even more challenging. To tackle the preceding challenges, we propose RankETPA, which integrates route inference into ETPA prediction. First, a learning-based pick-up route predictor is designed to learn the route-ranking strategies of couriers from their massive spatial-temporal behaviors. Then, a spatial-temporal attention-based arrival time predictor is designed for real-time ETPA inference via capturing the spatial-temporal correlations between the unpicked-up packages. Extensive experiments on two real-world datasets and a synthetic dataset demonstrate that RankETPA achieves significant performance improvement against the baseline models.

JMLR Journal 2023 Journal Article

Factor Graph Neural Networks

  • Zhen Zhang
  • Mohammed Haroon Dupty
  • Fan Wu
  • Javen Qinfeng Shi
  • Wee Sun Lee

In recent years, we have witnessed a surge of Graph Neural Networks (GNNs), most of which can learn powerful representations in an end-to-end fashion with great success in many real-world applications. They resemble Probabilistic Graphical Models (PGMs), but break free from some limitations of PGMs. By aiming to provide expressive methods for representation learning instead of computing marginals or most likely configurations, GNNs provide flexibility in the choice of information-flow rules while maintaining good performance. Despite their success and inspirations, they lack efficient ways to represent and learn higher-order relations among variables/nodes. More expressive higher-order GNNs which operate on k-tuples of nodes need increased computational resources in order to process higher-order tensors. We propose Factor Graph Neural Networks (FGNNs) to effectively capture higher-order relations for inference and learning. To do so, we first derive an efficient approximate Sum-Product loopy belief propagation inference algorithm for discrete higher-order PGMs. We then neuralize the novel message passing scheme into a Factor Graph Neural Network (FGNN) module by allowing richer representations of the message update rules; this facilitates both efficient inference and powerful end-to-end learning. We further show that with a suitable choice of message aggregation operators, our FGNN is also able to represent Max-Product belief propagation, providing a single family of architecture that can represent both Max and Sum-Product loopy belief propagation. Our extensive experimental evaluation on synthetic as well as real datasets demonstrates the potential of the proposed model.

JBHI Journal 2023 Journal Article

Fetal Ultrasound Standard Plane Detection With Coarse-to-Fine Multi-Task Learning

  • Juncheng Guo
  • Guanghua Tan
  • Fan Wu
  • Huaxuan Wen
  • Kenli Li

The ultrasound standard plane plays an important role in prenatal fetal growth parameter measurement and disease diagnosis in prenatal screening. However, obtaining standard planes in a fetal ultrasound video is not only laborious and time-consuming but also depends on the clinical experience of sonographers to a certain extent. To improve the acquisition efficiency and accuracy of the ultrasound standard plane, we propose a novel detection framework that utilizes both the coarse-to-fine detection strategy and multi-task learning mechanism for feature-fused images. First, traditional manually-designed features and deep learning-based features are fused to obtain low-level shared features, which can enhance the model's feature expression ability. Inspired by the process of human recognition, ultrasound standard plane detection is divided into a coarse process of plane type classification and a fine process of standard-or-not detection, which is implemented via an end-to-end multi-task learning network. The region-of-interest area is also recognised in our detection framework to suppress the influence of a variable maternal background. Extensive experiments are conducted on three ultrasound planes of the first-class fetal examination, i.e., the femur, thalamus, and abdomen ultrasound images. The experiment results show that our method outperforms competing methods in terms of accuracy, which demonstrates the efficacy of the proposed method and can reduce the workload of sonographers in prenatal screening.

AAAI Conference 2023 Conference Paper

GMDNet: A Graph-Based Mixture Density Network for Estimating Packages’ Multimodal Travel Time Distribution

  • Xiaowei Mao
  • Huaiyu Wan
  • Haomin Wen
  • Fan Wu
  • Jianbin Zheng
  • Yuting Qiang
  • Shengnan Guo
  • Lixia Wu

In the logistics network, accurately estimating packages' Travel Time Distribution (TTD) given the routes greatly benefits both consumers and platforms. Although recent works perform well in predicting an expected time or a time distribution in a road network, they could not be well applied to estimate TTD in logistics networks. Because TTD prediction in the logistics network requires modeling packages' multimodal TTD (MTTD, i.e., there can be more than one likely output with a given input) while leveraging the complex correlations in the logistics network. To this end, this work opens appealing research opportunities in studying MTTD learning conditioned on graph-structure data by investigating packages' travel time distribution in the logistics network. We propose a Graph-based Mixture Density Network, named GMDNet, which takes the benefits of both graph neural network and mixture density network for estimating MTTD conditioned on graph-structure data (i.e., the logistics network). Furthermore, we adopt the Expectation-Maximization (EM) framework in the training process to guarantee local convergence and thus obtain more stable results than gradient descent. Extensive experiments on two real-world datasets demonstrate the superiority of our proposed model.

Corrigendum Notice: In the initial publication of this article, the authors (Mao et al. 2023) acknowledged that although it referred to an earlier paper already presented and published in ICML-21 (Errica et al. 2021), it insufficiently acknowledged the extent to which it incorporated and made extensive use of techniques therein. We are providing a Corrigendum Note, "PDF (2024-09-25)," alongside the original published version. The Corrigendum Note summarizes the main novel contributions of this paper.

Errica, F.; Bacciu, D.; and Micheli, A. 2021. Graph Mixture Density Networks. In Proceedings of the 38th International Conference on Machine Learning (PMLR-28), 3025–3035. PMLR.
Mao, X.; Wan, H.; Wen, H.; Wu, F.; Zheng, J.; Qiang, Y.; Guo, S.; Wu, L.; Hu, H.; and Lin, Y. 2023. GMDNet: A Graph-Based Mixture Density Network for Estimating Packages' Multimodal Travel Time Distribution. In Proceedings of the 37th AAAI Conference on Artificial Intelligence.
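To make the "multimodal TTD" idea concrete, here is a minimal sketch of the Gaussian-mixture density that a mixture-density-network head outputs; GMDNet itself conditions these parameters on the logistics graph, and the function names and 1-D setup below are our illustrative assumptions:

```python
import numpy as np

def mixture_density(t, weights, means, stds):
    """Density of a 1-D Gaussian mixture at travel time(s) t: the kind of
    multimodal output an MDN head produces (illustrative notation)."""
    t = np.atleast_1d(t)[:, None]                              # (n, 1)
    comp = np.exp(-0.5 * ((t - means) / stds) ** 2) / (stds * np.sqrt(2 * np.pi))
    return comp @ weights                                      # (n,)

def mdn_nll(t, weights, means, stds):
    """Negative log-likelihood of observed times, the usual MDN training loss."""
    return -np.log(mixture_density(t, weights, means, stds)).mean()
```

A mixture with two well-separated components assigns high density near each mode and low density between them, which a single-Gaussian (expected-time) predictor cannot express.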

IJCAI Conference 2023 Conference Paper

Truthful Auctions for Automated Bidding in Online Advertising

  • Yidan Xing
  • Zhilin Zhang
  • Zhenzhe Zheng
  • Chuan Yu
  • Jian Xu
  • Fan Wu
  • Guihai Chen

Automated bidding, an emerging intelligent decision-making paradigm powered by machine learning, has become popular in online advertising. Advertisers in automated bidding evaluate cumulative utilities and have private financial constraints over multiple ad auctions in a long-term period. Based on these distinct features, we consider a new ad auction model for automated bidding: the values of advertisers are public while the financial constraints, such as budget and return on investment (ROI) rate, are private types. We derive the truthfulness conditions with respect to private constraints for this multi-dimensional setting, and demonstrate that any feasible allocation rule can be equivalently reduced to a series of non-decreasing functions on budget. However, the resulting allocation mapped from these non-decreasing functions generally follows an irregular shape, making it difficult to obtain a closed-form expression for the auction objective. To overcome this design difficulty, we propose a family of truthful automated bidding auctions with personalized rank scores, similar to the Generalized Second-Price (GSP) auction. The intuition behind our design is to leverage personalized rank scores as the criteria to allocate items, and to compute a critical ROI that transforms the budget constraint into the same dimension as ROI. The experimental results demonstrate that the proposed auction mechanism outperforms widely used ad auctions, such as the first-price auction and second-price auction, in various automated bidding environments.

AAAI Conference 2023 Conference Paper

Utility Maximizer or Value Maximizer: Mechanism Design for Mixed Bidders in Online Advertising

  • Hongtao Lv
  • Zhilin Zhang
  • Zhenzhe Zheng
  • Jinghan Liu
  • Chuan Yu
  • Lei Liu
  • Lizhen Cui
  • Fan Wu

Digital advertising constitutes one of the main revenue sources for online platforms. In recent years, some advertisers tend to adopt auto-bidding tools to facilitate advertising performance optimization, so that the classical utility-maximizer model in auction theory no longer fits well. Some recent studies proposed a new model, called value maximizer, for auto-bidding advertisers with return-on-investment (ROI) constraints. However, the model of either utility maximizer or value maximizer could only characterize partial advertisers in real-world advertising platforms. In a mixed environment where utility maximizers and value maximizers coexist, the truthful ad auction design would be challenging, since bidders could manipulate both their values and affiliated classes, leading to a multi-parameter mechanism design problem. In this work, we address this issue by proposing a payment rule which combines the corresponding ones in classical VCG and GSP mechanisms in a novel way. Based on this payment rule, we propose a truthful auction mechanism with an approximation ratio of 2 on social welfare, which is close to the lower bound of at least 5/4 that we also prove. The designed auction mechanism is a generalization of VCG for utility maximizers and GSP for value maximizers.

TIST Journal 2022 Journal Article

DeepRoute+: Modeling Couriers’ Spatial-temporal Behaviors and Decision Preferences for Package Pick-up Route Prediction

  • Haomin Wen
  • Youfang Lin
  • Huaiyu Wan
  • Shengnan Guo
  • Fan Wu
  • Lixia Wu
  • Chao Song
  • Yinghui Xu

Over 10 billion packages are picked up every day in China. A fundamental task raised in the emerging intelligent logistics systems is the couriers’ package pick-up route prediction, which is beneficial for package dispatching, arrival-time estimation and overdue-risk evaluation, by leveraging the predicted routes to improve those downstream tasks. In the package pick-up scene, the decision-making of a courier is affected by strict spatial-temporal constraints (e.g., package location, promised pick-up time, current time, and courier’s current location). Furthermore, couriers have different decision preferences on various factors (e.g., time factor, distance factor, and balance of both), based on their own perception of the environments and work experience. In this article, we propose a novel model, named DeepRoute+, to predict couriers’ future package pick-up routes according to the couriers’ decision experience and preference learned from the historical behaviors. Specifically, DeepRoute+ consists of three layers: (1) The representation layer produces experience- and preference-aware representations for the unpicked-up packages, in which a decision preference module can dynamically adjust the importance of factors that affects the courier’s decision under the current situation. (2) The transformer encoder layer encodes the representations of packages while considering the spatial-temporal correlations among them. (3) The attention-based decoder layer uses the attention mechanism to generate the whole pick-up route recurrently. Experiments on a real-world logistics dataset demonstrate the state-of-the-art performance of our model.

NeurIPS Conference 2022 Conference Paper

Federated Submodel Optimization for Hot and Cold Data Features

  • Yucheng Ding
  • Chaoyue Niu
  • Fan Wu
  • Shaojie Tang
  • Chengfei Lyu
  • Yanghe Feng
  • Guihai Chen

We focus on federated learning in practical recommender systems and natural language processing scenarios. The global model for federated optimization typically contains a large and sparse embedding layer, while each client's local data tend to interact with part of features, updating only a small submodel with the feature-related embedding vectors. We identify a new and important issue that distinct data features normally involve different numbers of clients, generating the differentiation of hot and cold features. We further reveal that the classical federated averaging algorithm (FedAvg) or its variants, which randomly selects clients to participate and uniformly averages their submodel updates, will be severely slowed down, because different parameters of the global model are optimized at different speeds. More specifically, the model parameters related to hot (resp., cold) features will be updated quickly (resp., slowly). We thus propose federated submodel averaging (FedSubAvg), which introduces the number of feature-related clients as the metric of feature heat to correct the aggregation of submodel updates. We prove that due to the dispersion of feature heat, the global objective is ill-conditioned, and FedSubAvg works as a suitable diagonal preconditioner. We also rigorously analyze FedSubAvg's convergence rate to stationary points. We finally evaluate FedSubAvg over several public and industrial datasets. The evaluation results demonstrate that FedSubAvg significantly outperforms FedAvg and its variants.
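A minimal sketch of the aggregation correction, assuming a flat parameter vector and a per-feature count of feature-related clients as the "heat" metric; the names and shapes here are our illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def fed_sub_avg(submodel_updates, feature_client_counts):
    """Aggregate sparse submodel updates, dividing each coordinate by the
    number of clients whose data involve that feature (its 'heat') rather
    than by a uniform participant count, so hot and cold parameters move
    at comparable effective rates."""
    dim = len(feature_client_counts)
    agg = np.zeros(dim)
    for indices, update in submodel_updates:   # each client touches a subset
        agg[indices] += update
    heat = np.maximum(feature_client_counts, 1)  # guard untouched features
    return agg / heat                            # per-coordinate correction
```

Dividing coordinate-wise by feature heat is exactly a diagonal rescaling of the averaged update, which is the preconditioning view described in the abstract.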

AAMAS Conference 2022 Conference Paper

The Spoofing Resistance of Frequent Call Markets

  • Buhong Liu
  • Maria Polukarov
  • Carmine Ventre
  • Lingbo Li
  • Leslie Kanthan
  • Fan Wu
  • Michail Basios

We study the effects of spoofing attacks on frequent call markets (FCMs). Spoofing—a strategic manipulation to mislead market participants by creating spurious limit orders—could harm the market efficiency and has been declared illegal in many countries. However, this practice is still very common, which inspired extensive research on measuring, detecting and curbing such manipulation in the popular market model of continuous double auctions (CDAs). In this paper, we extend this research to frequent call markets and use agent-based modelling to simulate spoofing in this context. Specifically, we apply empirical game-theoretic analysis (EGTA) to compute equilibria of agent-based markets, and demonstrate that while spoofing may be profitable in both market models, it has less impact on FCMs as opposed to CDAs. Finally, we explore several FCM mechanism designs to help to curb this type of market manipulation even further.

IJCAI Conference 2021 Conference Paper

Heterogeneous Graph Information Bottleneck

  • Liang Yang
  • Fan Wu
  • Zichen Zheng
  • Bingxin Niu
  • Junhua Gu
  • Chuan Wang
  • Xiaochun Cao
  • Yuanfang Guo

Most attempts on extending Graph Neural Networks (GNNs) to Heterogeneous Information Networks (HINs) implicitly take the direct assumption that the multiple homogeneous attributed networks induced by different meta-paths are complementary. Doubts about this complementarity hypothesis motivate an alternative assumption of consensus. That is, the aggregated node attributes shared by multiple homogeneous attributed networks are essential for node representations, while the specific ones in each homogeneous attributed network should be discarded. In this paper, a novel Heterogeneous Graph Information Bottleneck (HGIB) is proposed to implement the consensus hypothesis in an unsupervised manner. To this end, the information bottleneck (IB) is extended to unsupervised representation learning by leveraging a self-supervision strategy. Specifically, HGIB simultaneously maximizes the mutual information between one homogeneous network and the representation learned from another homogeneous network, while minimizing the mutual information between the specific information contained in one homogeneous network and the representation learned from this homogeneous network. Model analysis reveals that the two extreme cases of HGIB correspond to the supervised heterogeneous GNN and the infomax on homogeneous graph, respectively. Extensive experiments on real datasets demonstrate that the consensus-based unsupervised HGIB significantly outperforms most semi-supervised SOTA methods based on the complementary assumption.

NeurIPS Conference 2021 Conference Paper

Implicit Regularization in Matrix Sensing via Mirror Descent

  • Fan Wu
  • Patrick Rebeschini

We study discrete-time mirror descent applied to the unregularized empirical risk in matrix sensing. In both the general case of rectangular matrices and the particular case of positive semidefinite matrices, a simple potential-based analysis in terms of the Bregman divergence allows us to establish convergence of mirror descent---with different choices of the mirror maps---to a matrix that, among all global minimizers of the empirical risk, minimizes a quantity explicitly related to the nuclear norm, the Frobenius norm, and the von Neumann entropy. In both cases, this characterization implies that mirror descent, a first-order algorithm minimizing the unregularized empirical risk, recovers low-rank matrices under the same set of assumptions that are sufficient to guarantee recovery for nuclear-norm minimization. When the sensing matrices are symmetric and commute, we show that gradient descent with full-rank factorized parametrization is a first-order approximation to mirror descent, in which case we obtain an explicit characterization of the implicit bias of gradient flow as a by-product.
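The discrete-time update analyzed here can be transcribed as follows; this is the standard mirror descent formulation in our own notation (the paper studies several choices of the mirror map Φ, which this sketch leaves abstract):

```latex
% Mirror descent on the unregularized empirical risk L with mirror map \Phi:
\nabla\Phi(X_{t+1}) = \nabla\Phi(X_t) - \eta\,\nabla L(X_t),
% equivalently, the proximal form via the Bregman divergence
X_{t+1} = \operatorname*{arg\,min}_{X}\;
  \eta\,\langle \nabla L(X_t),\, X \rangle + D_\Phi(X, X_t),
\qquad
D_\Phi(X, Y) = \Phi(X) - \Phi(Y) - \langle \nabla\Phi(Y),\, X - Y \rangle .
```

The potential-based analysis mentioned in the abstract tracks a Bregman divergence of this form between the iterate and a reference minimizer.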

AAAI Conference 2021 Conference Paper

Toward Understanding the Influence of Individual Clients in Federated Learning

  • Yihao Xue
  • Chaoyue Niu
  • Zhenzhe Zheng
  • Shaojie Tang
  • Chengfei Lyu
  • Fan Wu
  • Guihai Chen

Federated learning allows mobile clients to jointly train a global model without sending their private data to a central server. Extensive works have studied the performance guarantee of the global model; however, it is still unclear how each individual client influences the collaborative training process. In this work, we define a new notion, called Fed-Influence, to quantify this influence over the model parameters, and propose an effective and efficient algorithm to estimate this metric. In particular, our design satisfies several desirable properties: (1) it requires neither retraining nor retracing, adding only linear computational overhead to clients and the server; (2) it strictly maintains the tenets of federated learning, without revealing any client's local private data; and (3) it works well on both convex and non-convex loss functions, and does not require the final model to be optimal. Empirical results on a synthetic dataset and the FEMNIST dataset demonstrate that our estimation method can approximate Fed-Influence with small bias. Further, we show an application of Fed-Influence in model debugging.

NeurIPS Conference 2020 Conference Paper

A Continuous-Time Mirror Descent Approach to Sparse Phase Retrieval

  • Fan Wu
  • Patrick Rebeschini

We analyze continuous-time mirror descent applied to sparse phase retrieval, which is the problem of recovering sparse signals from a set of magnitude-only measurements. We apply mirror descent to the unconstrained empirical risk minimization problem (batch setting), using the square loss and square measurements. We provide a full convergence analysis of the algorithm in this non-convex setting and prove that, with the hypentropy mirror map, mirror descent recovers any $k$-sparse vector $\mathbf{x}^\star\in\mathbb{R}^n$ with minimum (in modulus) non-zero entry on the order of $\| \mathbf{x}^\star \|_2/\sqrt{k}$ from $k^2$ Gaussian measurements, modulo logarithmic terms. This yields a simple algorithm which, unlike most existing approaches to sparse phase retrieval, adapts to the sparsity level, without including thresholding steps or adding regularization terms. Our results also provide a principled theoretical understanding for Hadamard Wirtinger flow [54], as Euclidean gradient descent applied to the empirical risk problem with Hadamard parametrization can be recovered as a first-order approximation to mirror descent in discrete time.

NeurIPS Conference 2020 Conference Paper

Factor Graph Neural Networks

  • Zhen Zhang
  • Fan Wu
  • Wee Sun Lee

Most of the successful deep neural network architectures are structured, often consisting of elements like convolutional neural networks and gated recurrent neural networks. Recently, graph neural networks (GNNs) have been successfully applied to graph-structured data such as point clouds and molecular data. These networks often only consider pairwise dependencies, as they operate on a graph structure. We generalize the GNN into a factor graph neural network (FGNN), providing a simple way to incorporate dependencies among multiple variables. We show that FGNN is able to represent Max-Product belief propagation, an approximate inference method on probabilistic graphical models, providing a theoretical understanding of the capabilities of FGNN and related GNNs. Experiments on synthetic and real datasets demonstrate the potential of the proposed architecture.

AAAI Conference 2020 Conference Paper

Mechanism Design with Predicted Task Revenue for Bike Sharing Systems

  • Hongtao Lv
  • Chaoli Zhang
  • Zhenzhe Zheng
  • Tie Luo
  • Fan Wu
  • Guihai Chen

Bike sharing systems have been widely deployed around the world in recent years. A core problem in such systems is to reposition the bikes so that the distribution of bike supply is reshaped to better match the dynamic bike demand. When the bike-sharing company or platform is able to predict the revenue of each reposition task based on historic data, an additional constraint is to cap the payment for each task below its predicted revenue. In this paper, we propose an incentive mechanism called TruPreTar to incentivize users to park bicycles at locations desired by the platform toward rebalancing supply and demand. TruPreTar possesses four important economic and computational properties such as truthfulness and budget feasibility. Furthermore, we prove that even when the payment budget is tight, the total revenue still exceeds or equals the budget. Otherwise, TruPreTar achieves 2-approximation as compared to the optimal (revenue-maximizing) solution, which is close to the lower bound of at least √2 that we also prove. Using an industrial dataset obtained from a large bike-sharing company, our experiments show that TruPreTar is effective in rebalancing bike supply and demand and, as a result, generates high revenue that outperforms several benchmark mechanisms.

AAAI Conference 2019 Conference Paper

DeepETA: A Spatial-Temporal Sequential Neural Network Model for Estimating Time of Arrival in Package Delivery System

  • Fan Wu
  • Lixia Wu

Over 100 million packages are delivered every day in China due to the fast development of e-commerce. Precisely estimating the time of packages’ arrival (ETA) is of significant importance to improving customers’ experience and raising the efficiency of package dispatching. Existing methods mainly focus on predicting the time from an origin to a destination. However, in the package delivery problem, one trip contains multiple destinations and the delivery time of all destinations should be predicted at any time. Furthermore, the ETA is affected by many factors, especially the sequence of the latest route, the regularity of the delivery pattern, and the sequence of packages to be delivered, which are difficult to learn by traditional models. This paper proposes a novel spatial-temporal sequential neural network model (DeepETA) to take full advantage of the above factors. DeepETA is an end-to-end network that mainly consists of three parts. First, the spatial encoding and the recurrent cells are proposed to capture the spatial-temporal and sequential features of the latest delivery route. Then, two attention-based layers are designed to indicate the most possible ETA from historical frequent and relative delivery routes based on the similarity of the latest route and the future destinations. Finally, a fully connected layer is utilized to jointly learn the delivery time. Experiments on a real logistics dataset demonstrate that the proposed approach outperforms existing methods.

IJCAI Conference 2019 Conference Paper

Masked Graph Convolutional Network

  • Liang Yang
  • Fan Wu
  • Yingkui Wang
  • Junhua Gu
  • Yuanfang Guo

Semi-supervised classification is a fundamental technology to process structured and unstructured data in the field of machine learning. The traditional attribute-graph based semi-supervised classification methods propagate labels over the graph which is usually constructed from the data features, while the graph convolutional neural networks smooth the node attributes, i.e., propagate the attributes, over the real graph topology. In this paper, they are interpreted from the perspective of propagation, and accordingly categorized into symmetric and asymmetric propagation based methods. From the perspective of propagation, both the traditional and network based methods are propagating certain objects over the graph. However, different from label propagation, the intuition “the connected data samples tend to be similar in terms of the attributes” behind attribute propagation is only partially valid. Therefore, a masked graph convolution network (Masked GCN) is proposed by only propagating a certain portion of the attributes to the neighbours according to a masking indicator, which is learned for each node by jointly considering the attribute distributions in local neighbourhoods and the impact on the classification results. Extensive experiments on transductive and inductive node classification tasks have demonstrated the superiority of the proposed method.

AAAI Conference 2019 Conference Paper

Unsupervised Fake News Detection on Social Media: A Generative Approach

  • Shuo Yang
  • Kai Shu
  • Suhang Wang
  • Renjie Gu
  • Fan Wu
  • Huan Liu

Social media has become one of the main channels for people to access and consume news, due to the rapidness and low cost of news dissemination on it. However, such properties of social media also make it a hotbed of fake news dissemination, bringing negative impacts on both individuals and society. Therefore, detecting fake news has become a crucial problem attracting tremendous research effort. Most existing methods of fake news detection are supervised, which require an extensive amount of time and labor to build a reliably annotated dataset. In search of an alternative, in this paper, we investigate if we could detect fake news in an unsupervised manner. We treat the truths of news and users’ credibility as latent random variables, and exploit users’ engagements on social media to identify their opinions towards the authenticity of news. We leverage a Bayesian network model to capture the conditional dependencies among the truths of news, the users’ opinions, and the users’ credibility. To solve the inference problem, we propose an efficient collapsed Gibbs sampling approach to infer the truths of news and the users’ credibility without any labelled data. Experimental results on two datasets show that the proposed method significantly outperforms the compared unsupervised methods.

AAMAS Conference 2018 Conference Paper

Efficient Auctions with Identity-Dependent Negative Externalities

  • Chaoli Zhang
  • Xiang Wang
  • Fan Wu
  • Xiaohui Bei

We investigate a class of single-item multi-supply auctions (including digital goods auctions with unlimited supply) with bidders who have identity-based negative externalities. In such an auction, each bidder has a set of competitors. Her private valuation from winning an item decreases with the number of her winning competitors. Negative externalities are prevalent in many applications, in which the auctioned goods play a role in future interactions among the auction’s participants, such as patent licensing and sponsored search auctions. However, the development of auctions with such externalities is stymied by the computational difficulty of the underlying welfare maximization allocation problem; even without consideration of truthfulness, the problem of social welfare maximization with general competition relations is NP-hard and even hard to approximate within a constant factor (unless P=NP). In this work, we design polynomial time and strategy-proof mechanisms under different restrictions on the underlying competition graph structure. Our results can be summarized as follows. (1) When each bidder has only one competitor, we propose a truthful and welfare maximizing mechanism. (2) We design a truthful and (1 + ϵ)-approximation mechanism when the underlying competition graph is planar. (3) We give two truthful mechanisms when bidders have arbitrary competition relations, with welfare approximation ratios n/log n and ⌈(d + 1)/3⌉, respectively, where d is the maximum degree of the “undirected” competition graph.

AAMAS Conference 2018 Conference Paper

On Designing Optimal Data Purchasing Strategies for Online Ad Auctions

  • Zun Li
  • Zhenzhe Zheng
  • Fan Wu
  • Guihai Chen

In online advertising, advertisers can purchase consumer relevant data from data marketplaces with a certain expenditure, and exploit the purchased data to guide the bidding process in ad auctions. One of the pressing problems faced by advertisers is to design the optimal data purchasing strategy (how much data to purchase to be competitive in the bidding process) in online ad auctions. In this paper, we model the data purchasing strategy design as a convex optimization problem, jointly considering the expenditure paid during data purchasing and the benefits obtained from ad auctions. Using techniques from Bayesian game theory and convex analysis, we derive the optimal purchasing strategies for advertisers in different market scenarios. We also theoretically prove that the resulting strategy profile is the unique one that achieves Nash Equilibrium. Our analysis shows that the proposed data purchasing strategy can handle diverse ad auctions and valuation learning models. Our numerical results empirically reveal how the equilibrium state changes with variation of the strategic environment.

IJCAI Conference 2018 Conference Paper

Online Pricing for Revenue Maximization with Unknown Time Discounting Valuations

  • Weichao Mao
  • Zhenzhe Zheng
  • Fan Wu
  • Guihai Chen

Online pricing mechanisms have been widely applied to resource allocation in multi-agent systems. However, most of the existing online pricing mechanisms assume buyers have fixed valuations over the time horizon, which cannot capture the dynamic nature of valuation in emerging applications. In this paper, we study the problem of revenue maximization in online auctions with unknown time discounting valuations, and model it as non-stationary multi-armed bandit optimization. We design an online pricing mechanism, namely Biased-UCB, based on unique features of the discounting valuations. We use competitive analysis to theoretically evaluate the performance guarantee of our pricing mechanism, and derive the competitive ratio. Numerical results show that our design achieves good performance in terms of revenue maximization on a real-world bidding dataset.

AAMAS Conference 2016 Conference Paper

Strategy-Proof Data Auctions with Negative Externalities (Extended Abstract)

  • Xiang Wang
  • Zhenzhe Zheng
  • Fan Wu
  • Xiaoju Dong
  • Shaojie Tang
  • Guihai Chen

Data has appeared to be a new kind of commodity with distinctive characteristics, which make it fundamentally different from physical goods as well as traditional digital goods. Therefore, new trading mechanisms for data need to be designed. In this paper, we model the data market as an auction with negative externalities, and design practical mechanisms for data trading. Specifically, we propose a family of Data auctIons in CompetiTive mArkets, namely DICTA. DICTA contains two mechanisms, DICTA-FUL and DICTA-PAR. DICTA-FUL is a direct revelation auction mechanism in full competition markets, achieving strategy-proofness and optimal social welfare. In partial competition markets, we show that the allocation problem is NP-hard. Therefore, we present an approximation algorithm for winner determination. By carefully integrating this approximation allocation algorithm and a charging scheme, DICTA-PAR achieves both strategy-proofness and d-approximation, where d is the maximum degree of the underlying undirected competition graph.

AAAI Conference 2014 Conference Paper

A Strategy-Proof Online Auction with Time Discounting Values

  • Fan Wu
  • Junming Liu
  • Zhenzhe Zheng
  • Guihai Chen

Online mechanism design has been widely applied to various practical applications. However, designing a strategy-proof online mechanism is much more challenging than in a static scenario, due to the lack of knowledge of future information. In this paper, we investigate online auctions with time discounting values, in contrast to the flat values studied in most existing work. We present a strategy-proof 2-competitive online auction mechanism in the presence of time discounting values. We also implement our design and compare it with the offline optimal solution. Our numerical results show that our design achieves good performance in terms of social welfare, revenue, average winning delay, and average valuation loss.