Author name cluster

Wufan Wang

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

7 papers

2 author rows

AAAI Conference 2026 Conference Paper

Learning to LEAP: Efficient Dense Point Tracking by Focusing Where It Matters

Chenzhi Zhao
Wufan Wang
Bo Zhang
Wendong Wang

Tracking Any Point (TAP) is a foundational task in computer vision with broad applicability. The state-of-the-art self-supervised TAP method leverages a global matching transformer and contrastive random walks to learn point correspondences. However, its dense all-pairs attention and correlation volume computation tend to introduce irrelevant features and produce less informative training signals, degrading both learning efficiency and tracking accuracy. To address these limitations, we introduce LEAP-Track, a self-supervised TAP approach that computes the attention matrices and correlation volume over adaptively selected sparse pairs. It consists of two core designs: (1) Curriculum-based Sparse Attention (CSA), which dynamically focuses on the most relevant keys, promoting the learning of discriminative features; and (2) Progressive k-NN Transition (PkT), which reformulates the contrastive random walk to operate on an increasingly sparse k-NN affinity graph to reinforce the learning of the most informative correspondences. By integrating the above two designs into a two-stage training paradigm, LEAP-Track is shown both theoretically and empirically to effectively boost learning efficiency, achieving superior tracking accuracy over existing self-supervised TAP methods.

PDF Details DOI

AAAI Conference 2023 Conference Paper

Revisiting Unsupervised Local Descriptor Learning

Wufan Wang
Lei Zhang
Hua Huang

Constructing accurate training tuples is crucial for unsupervised local descriptor learning, yet challenging due to the absence of patch labels. The state-of-the-art approach constructs tuples with heuristic rules, which struggle to precisely depict real-world patch transformations, in spite of enabling fast model convergence. A possible solution to alleviate the problem is the clustering-based approach, which can capture realistic patch variations and learn more accurate class decision boundaries, but suffers from slow model convergence. This paper presents HybridDesc, an unsupervised approach that learns powerful local descriptor models with fast convergence speed by combining the rule-based and clustering-based approaches to construct training tuples. In addition, HybridDesc also contributes two concrete enhancing mechanisms: (1) a Differentiable Hyperparameter Search (DHS) strategy to find the optimal hyperparameter setting of the rule-based approach so as to provide accurate prior for the clustering-based approach, (2) an On-Demand Clustering (ODC) method to reduce the clustering overhead of the clustering-based approach without eroding its advantage. Extensive experimental results show that HybridDesc can efficiently learn local descriptors that surpass existing unsupervised local descriptors and even rival competitive supervised ones.

PDF Details DOI

ICRA Conference 2020 Conference Paper

VALID: A Comprehensive Virtual Aerial Image Dataset

Lyujie Chen
Feng Liu
Yan Zhao
Wufan Wang
Xiaming Yuan
Jihong Zhu 0001

Aerial imagery plays an important role in land-use planning, population analysis, precision agriculture, and unmanned aerial vehicle tasks. However, existing aerial image datasets generally suffer from the problem of inaccurate labeling, single ground truth type, and few category numbers. In this work, we implement a simulator that can simultaneously acquire diverse visual ground truth data in the virtual environment. Based on that, we collect a comprehensive Virtual AeriaL Image Dataset named VALID, consisting of 6690 high-resolution images, all annotated with panoptic segmentation on 30 categories, object detection with oriented bounding box, and binocular depth maps, collected in 6 different virtual scenes and 5 various ambient conditions (sunny, dusk, night, snow and fog). To our knowledge, VALID is the first aerial image dataset that can provide panoptic level segmentation and complete dense depth maps. We analyze the characteristics of VALID and evaluate state-of-the-art methods for multiple tasks to provide reference baselines. The experiment results demonstrate that VALID is well presented and challenging. The dataset is available at https://sites.google.com/view/valid-dataset/.

Details

ICRA Conference 2018 Conference Paper

Adaptive Attitude Control for a Tail-Sitter UAV with Single Thrust-Vectored Propeller

Wufan Wang
Jihong Zhu 0001
Minchi Kuang
Xufei Zhu

Tail-sitter unmanned aerial vehicles (UAVs) have gained extensive popularity in recent years due to their inherent advantages of both fixed wing and rotary wing UAVs. However, these advantages are accompanied with control challenges because of two different flight regimes and drastically changing dynamics during transition flights. This paper focuses on the design of a unified controller free from cumbersome controller switchings and applicable in all attitude range for a tail-sitter with single thrust-vectored propeller. To achieve this, both thrust vectoring model and full-regime aerodynamics model are built first, after which a complete attitude dynamics model of the tail-sitter is established utilizing the quaternion attitude description to avoid the singularity problem. An adaptive controller is then derived based on a simplified model using the Lyapunov stability theory with unknown system parameters identified online by forgetting factor recursive least square (FF-RLS) method. Flight experiments are conducted to demonstrate the feasibility and effectiveness of the proposed control scheme.

Details

IJCAI Conference 2018 Conference Paper

Learning Transferable UAV for Forest Visual Perception

Lyujie Chen
Wufan Wang
Jihong Zhu

In this paper, we propose a new pipeline of training a monocular UAV to fly a collision-free trajectory along the dense forest trail. As gathering high-precision images in the real world is expensive and the off-the-shelf dataset has some deficiencies, we collect a new dense forest trail dataset in a variety of simulated environment in Unreal Engine. Then we formulate visual perception of forests as a classification problem. A ResNet-18 model is trained to decide the moving direction frame by frame. To transfer the learned strategy to the real world, we construct a ResNet-18 adaptation model via multi-kernel maximum mean discrepancies to leverage the relevant labelled data and alleviate the discrepancy between simulated and real environment. Simulation and real-world flight with a variety of appearance and environment changes are both tested. The ResNet-18 adaptation and its variant model achieve the best result of 84. 08% accuracy in reality.

PDF Details

IROS Conference 2017 Conference Paper

Design, modelling and hovering control of a tail-sitter with single thrust-vectored propeller

Wufan Wang
Jihong Zhu 0001
Minchi Kuang

This paper focuses on the design, modelling and hovering control of a tail-sitter with single thrust-vectored propeller which possesses the inherent advantages of both fixed wing and rotary wing unmanned aerial vehicles (UAVs). The developed tail-sitter requires only the same number of actuators as a normal fixed wing aircraft and achieves attitude control through deflections of the thrust-vectored propeller and ailerons. Thrust vectoring is realized by mounting a simple gimbal mechanism beneath the propeller motor. Both the thrust vector model and aerodynamics model are established, which leads to a complete nonlinear model of the tail-sitter in hovering state. Quaternion is applied for attitude description to avoid the singularity problem and improve computation efficiency. Through reasonable assumptions, a simplified model of the tail-sitter is obtained, based on which a backstepping controller is designed using the Lyapunov stability theory. Experimental results are presented to demonstrate the effectiveness of the proposed control scheme.

Details

ICRA Conference 2017 Conference Paper

Flight controller design and demonstration of a thrust-vectored tailsitter

Minchi Kuang
Jihong Zhu 0001
Wufan Wang
Yunfei Tang

This paper discusses the design and control methods of a thrust-vectored tailsitter that combines the advantages of both fixed wing and rotary wing systems. Separable takeoff bracket and controllable forward landing are implemented to reduce the flight weight and mitigate the effects of crosswinds. A six-degrees-of-freedom model especially for this tailsitter is then proposed to describe the dynamics of the whole system. Attitude representation based on horizontal /vertical Euler angles is presented to avoid the problem of singularity. Attitude and altitude controllers that switch between horizontal and vertical modes are used. In these controllers linear/constant acceleration approximation and filtered feed-forward acceleration algorithm are implemented. Effectiveness and reliability of the proposed control methods are demonstrated and evaluated by experimental results of the whole flight envelope.

Details