Arrow Research search

Author name cluster

Shizhan Chen

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

2 papers
2 author rows

Possible papers

2

AAAI Conference 2023 Conference Paper

Reducing Sentiment Bias in Pre-trained Sentiment Classification via Adaptive Gumbel Attack

  • Jiachen Tian
  • Shizhan Chen
  • Xiaowang Zhang
  • Xin Wang
  • Zhiyong Feng

Pre-trained language models (PLMs) have recently enabled rapid progress on sentiment classification under the pre-train and fine-tune paradigm, where the fine-tuning phase aims to transfer the factual knowledge learned by PLMs to sentiment classification. However, current fine-tuning methods ignore the risk that PLMs cause the problem of sentiment bias, that is, PLMs tend to inject positive or negative sentiment from the contextual information of certain entities (or aspects) into their word embeddings, leading them to establish spurious correlations with labels. In this paper, we propose an adaptive Gumbel-attacked classifier that immunes sentiment bias from an adversarial-attack perspective. Due to the complexity and diversity of sentiment bias, we construct multiple Gumbel-attack expert networks to generate various noises from mixed Gumbel distribution constrained by mutual information minimization, and design an adaptive training framework to synthesize complex noise by confidence-guided controlling the number of expert networks. Finally, we capture these noises that effectively simulate sentiment bias based on the feedback of the classifier, and then propose a multi-channel parameter updating algorithm to strengthen the classifier to recognize these noises by fusing the parameters between the classifier and each expert network. Experimental results illustrate that our method significantly reduced sentiment bias and improved the performance of sentiment classification.

ECAI Conference 2020 Conference Paper

A Graph-Based Measurement for Text Imbalance Classification

  • Jiachen Tian
  • Shizhan Chen
  • Xiaowang Zhang
  • Zhiyong Feng 0002

Imbalanced text classification, as practical and essential text classification, is the task to learn labels or categories for imbalanced text data. Existing imbalanced text classification approaches are mostly based on the Imbalance Ratio (i. e. ratio of sizes between categories). Recently, some researchers verified that the imbalance ratio severely affects the performance of classifiers when intrinsic characteristics of data such as class overlapping and small disjuncts occur. However, since the distribution of real-world data is unknown, it is difficult to describe above intrinsic characteristics directly. In this paper, we transform the unknown distribution of data into a graph model and present a graph-based imbalance index named G IR to predict the impact of imbalanced text data on classification performance. Firstly, we introduce an environmental factor that makes the imbalance index sensitive to the intrinsic characteristics of data. Secondly, we propose a graph-based method to calculate this environmental factor. Finally, we use the imbalance index to analyze the performances of imbalanced learning methods and the impact of imbalanced data on text classifiers. The experimental results evaluated on both synthetic data sets and real-world data sets demonstrate the effectiveness of our approach.