Author name cluster

Ji Qi

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

3 papers

2 author rows

ICRA Conference 2025 Conference Paper

LamPro: Multi-Prototype Representation Learning for Enhanced Visual Pattern Recognition

Ji Qi
Wei Sun
Qihe Huang
Zhengyang Zhou
Yang Wang 0015

Visual pattern recognition usually plays important roles in robotics and automation society where the pattern recognition relies on representation learning. Existing representation learning often neglects two important issues, the diversity of intra-class representation and under-exploited label utilization, especially the negative feedback during training process. Fortunately, prototype learning potentially raises label utilization and encourages intra-class diversity. In this paper, we investigate the intra-class diversity and effective updates in prototype learning for enhanced visual pattern recognition. Specifically, we propose a Label-aware multi-Prototype learning, LamPro, by incorporating the label awareness into both prototype formation and update to improve the representation quality. Firstly, we design a supervised contrastive learning to achieve class-discriminative representations. Secondly, we randomly initialize multiple prototypes and update the nearest prototype upon the arrival of instance, to preserve intra-class diversity. Thirdly, we propose a novel Label-guided Adaptive Updating. We separate the prototype updates from the representation optimization and exploit the label indexes to directly implement the prediction feedback. To correct the model optimization directions, we identify the negative feedback, and correct the prototype updates via queries of labels. Finally, we design a memory-based counter to alternately update these deviated prototypes. Experiments verify the effectiveness of our label-aware and joint multi-prototype updating strategies.

Details

NeurIPS Conference 2024 Conference Paper

CogVLM: Visual Expert for Pretrained Language Models

Weihan Wang
Qingsong Lv
Wenmeng Yu
Wenyi Hong
Ji Qi
Yan Wang
Junhui Ji
Zhuoyi Yang

We introduce CogVLM, a powerful open-source visual language foundation model. Different from the popular \emph{shallow alignment} method which maps image features into the input space of language model, CogVLM bridges the gap between the frozen pretrained language model and image encoder by a trainable visual expert module in the attention and FFN layers. As a result, CogVLM enables a deep fusion of vision language features without sacrificing any performance on NLP tasks. CogVLM-17B achieves state-of-the-art performance on 17 classic cross-modal benchmarks, including 1) image captioning datasets: NoCaps, Flicker30k, 2) VQA datasets: OKVQA, TextVQA, OCRVQA, ScienceQA, 3) LVLM benchmarks: MM-Vet, MMBench, SEED-Bench, LLaVABench, POPE, MMMU, MathVista, 4) visual grounding datasets: RefCOCO, RefCOCO+, RefCOCOg, Visual7W. Codes and checkpoints are available at Github.

PDF Details DOI

AAMAS Conference 2019 Conference Paper

How to Get the Most from Goods Donated to Charities

Christopher Culley
Ji Qi
Carmine Ventre

The charity sector is assuming a central role in many countries, due to a generalized increase in wealth inequalities and the restructuring of the welfare state. This market, however, exhibits inefficiencies. In this work, we empirically test the adoption of a centralized truthful allocation mechanism without money to charities bidding for donations. Our results show that it is indeed possible to improve the income of the sector by at least 50% on average. We further show how the application of proxy bidding allows to maintain a significant portion of the welfare improvements without the need of many bids. Our results pave the way for a novel and more profitable model of distribution of donated goods.

PDF