Author name cluster

Pong Yuen

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

4 papers

1 author row

AAAI Conference 2018 Conference Paper

Domain-Shared Group-Sparse Dictionary Learning for Unsupervised Domain Adaptation

Baoyao Yang
Andy Ma
Pong Yuen

Unsupervised domain adaptation has been proved to be a promising approach to solve the problem of dataset bias. To employ source labels in the target domain, it is required to align the joint distributions of source and target data. To do this, the key research problem is to align conditional distributions across domains without target labels. In this paper, we propose a new criterion of domain-shared groupsparsity that is an equivalent condition for conditional distribution alignment. To solve the problem in joint distribution alignment, a domain-shared group-sparse dictionary learning method is developed towards joint alignment of conditional and marginal distributions. A classiﬁer for target domain is trained using the domain-shared group-sparse coef- ﬁcients and the target-speciﬁc information from the target data. Experimental results on cross-domain face and object recognition show that the proposed method outperforms eight state-of-the-art unsupervised domain adaptation algorithms.

PDF Details

AAAI Conference 2018 Conference Paper

Hierarchical Discriminative Learning for Visible Thermal Person Re-Identification

Mang Ye
Xiangyuan Lan
Jiawei Li
Pong Yuen

Person re-identiﬁcation is widely studied in visible spectrum, where all the person images are captured by visible cameras. However, visible cameras may not capture valid appearance information under poor illumination conditions, e. g, at night. In this case, thermal camera is superior since it is less dependent on the lighting by using infrared light to capture the human body. To this end, this paper investigates a cross-modal re-identiﬁcation problem, namely visible-thermal person reidentiﬁcation (VT-REID). Existing cross-modal matching methods mainly focus on modeling the cross-modality discrepancy, while VT-REID also suffers from cross-view variations caused by different camera views. Therefore, we propose a hierarchical cross-modality matching model by jointly optimizing the modality-speciﬁc and modality-shared metrics. The modality-speciﬁc metrics transform two heterogenous modalities into a consistent space that modality-shared metric can be subsequently learnt. Meanwhile, the modalityspeciﬁc metric compacts features of the same person within each modality to handle the large intra-modality intra-person variations (e. g. viewpoints, pose). Additionally, an improved two-stream CNN network is presented to learn the multimodality sharable feature representations. Identity loss and contrastive loss are integrated to enhance the discriminability and modality-invariance with partially shared layer parameters. Extensive experiments illustrate the effectiveness and robustness of the proposed method.

PDF Details

AAAI Conference 2018 Conference Paper

Robust Collaborative Discriminative Learning for RGB-Infrared Tracking

Xiangyuan Lan
Mang Ye
Shengping Zhang
Pong Yuen

Tracking target of interests is an important step for motion perception in intelligent video surveillance systems. While most recently developed tracking algorithms are grounded in RGB image sequences, it should be noted that information from RGB modality is not always reliable (e. g. in a dark environment with poor lighting condition), which urges the need to integrate information from infrared modality for effective tracking because of the insensitivity to illumination condition of infrared thermal camera. However, several issues encountered during the tracking process limit the fusing performance of these heterogeneous modalities: 1) the crossmodality discrepancy of visual and motion characteristics, 2) the uncertainty of degree of reliability in different modalities, and 3) large target appearance variations and background distractions within each modality. To address these issues, this paper proposes a novel and optimal discriminative learning framework for multi-modality tracking. In particular, the proposed discriminative learning framework is able to: 1) jointly eliminate outlier samples caused by large variations and learn discriminability-consistent features from heterogeneous modalities, and 2) collaboratively perform modality reliability measurement and target-background separation. Extensive experiments on RGB-infrared image sequences demonstrate the effectiveness of the proposed method.

PDF Details

AAAI Conference 2015 Conference Paper

Online Dictionary Learning on Symmetric Positive Definite Manifolds with Vision Applications

Shengping Zhang
Shiva Kasiviswanathan
Pong Yuen
Mehrtash Harandi

Symmetric Positive Deﬁnite (SPD) matrices in the form of region covariances are considered rich descriptors for images and videos. Recent studies suggest that exploiting the Riemannian geometry of the SPD manifolds could lead to improved performances for vision applications. For tasks involving processing large-scale and dynamic data in computer vision, the underlying model is required to progressively and efﬁciently adapt itself to the new and unseen observations. Motivated by these requirements, this paper studies the problem of online dictionary learning on the SPD manifolds. We make use of the Stein divergence to recast the problem of online dictionary learning on the manifolds to a problem in Reproducing Kernel Hilbert Spaces, for which, we develop efﬁcient algorithms by taking into account the geometric structure of the SPD manifolds. To our best knowledge, our work is the ﬁrst study that provides a solution for online dictionary learning on the SPD manifolds. Empirical results on both large-scale image classiﬁcation task and dynamic video processing tasks validate the superior performance of our approach as compared to several state-of-the-art algorithms.

PDF Details