Arrow Research search

Author name cluster

Man Lin

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

4 papers
1 author row

Possible papers

4

AAAI Conference 2025 Conference Paper

Zero-Shot Image Captioning with Multi-type Entity Representations

  • Delong Zeng
  • Ying Shen
  • Man Lin
  • Zihao Yi
  • Jiarui Ouyang

As data and computational resources continue to expand, incorporating a variety of knowledge during the pre-training phase enhances large models, providing them with strong zero-shot capabilities. Due to the alignment of modal features by visual language models, zero-shot image captioning no longer necessitates pre-training on paired image-text labeled data, enabling accurate text description generation for images not encountered before. While recent research focuses on methods utilizing entity retrieval as anchors to bridge the gap between different modalities, these approaches often fall short of thoroughly analyzing the impact of entity retrieval recall on the zero-shot generation capabilities. To address this issue, we propose MERCap, a zero-shot image captioning method employing Multi-type Entity representation Retrieval. More specifically, we first approximate image representation using the CLIP representation of text and Gaussian noise to address the modality gap. Then, we train a GPT-2 decoder to reconstruct text using entities as hard prompts and CLIP representations as soft prompts. Additionally, we construct a domain-specific entity set, assigning multiple representations to each entity and refining their representation vectors through contrastive learning. During inference, we retrieve entities and input them into the decoder to generate corresponding captions. Extensive experiments validate that our approach is efficient, achieving a new state-of-the-art level in cross-domain captioning and demonstrating strong competitiveness in in-domain captioning compared to existing methods.

EAAI Journal 2023 Journal Article

Black-box attack against GAN-generated image detector with contrastive perturbation

  • Zijie Lou
  • Gang Cao
  • Man Lin

The emergence of visually realistic GAN-generated facial images has raised concerns regarding potential misuse. In response, effective forensic algorithms have been developed to detect such synthetic images in recent years. However, the vulnerability of such forensic detectors to adversarial attacks remains an important issue that requires further investigation. In this paper, we propose a new black-box attack method against GAN-generated image detectors. It involves contrastive learning strategy to train an encoder–decoder anti-forensic network with a contrastive loss function. GAN-generated and corresponding simulated real images are constructed as positive and negative samples, respectively. By leveraging the trained attack model, we can apply imperceptible perturbation to input synthetic images for removing GAN fingerprint to some extent. GAN-generated image detectors may be deceived consequently. Extensive experimental results demonstrate that the proposed attack effectively reduces the accuracy of three state-of-the-art detectors on six popular GANs, while also achieving high visual quality of the attacked images. The source code will be available at https: //github. com/ZXMMD/BAttGAND.

IS Journal 2014 Journal Article

Pervasive Service Bus: Smart SOA Infrastructure for Ambient Intelligence

  • Gang Pan
  • Li Zhang
  • Zhaohui Wu
  • Shijian Li
  • Laurence Yang
  • Man Lin
  • Yuanchun Shi

Ambient intelligence (AmI) aims to make our everyday environments intelligent--that is, sensitive, adaptive, and responsive to the presence of people--in a transparent manner. Several challenges exist to building an efficient infrastructure for AmI, including interoperation of heterogeneous systems, intelligence for anticipatory user assistance, adaptability to dynamic environments for good user experience, and scalability to additional users and spaces. Here, the authors propose Pervasive Service Bus (PSB), a smart service-oriented architecture (SOA) framework for AmI spaces that models all computing activities as unified pervasive services. They present an online planning algorithm to adapt service flows to contexts and user tasks. PSB employs a sub-bus-based layout to maintain efficiency in large-scale service interactions. They also discuss their results in evaluating PSB's performance in a Smart Home testbed.

IS Journal 2011 Journal Article

TaskShadow: Toward Seamless Task Migration across Smart Environments

  • Gang Pan
  • Yuqiong Xu
  • Zhaohui Wu
  • Shijian Li
  • Laurence Yang
  • Man Lin
  • Zhong Liu

The OSGi-based platform TaskShadow supports seamless task migration across smart environments using a task-to-service mapping algorithm to semantically search for suitable low-level services that achieve high-level tasks.