Arrow Research search
Back to IJCAI

IJCAI 2021

Towards Generating Summaries for Lexically Confusing Code through Code Erosion

Conference Paper Multidisciplinary Topics and Applications Artificial Intelligence

Abstract

Code summarization aims to summarize code functionality as high-level nature language descriptions to assist in code comprehension. Recent approaches in this field mainly focus on generating summaries for code with precise identifier names, in which meaningful words can be found indicating code functionality. When faced with lexically confusing code, current approaches are likely to fail since the correlation between code lexical tokens and summaries is scarce. To tackle this problem, we propose a novel summarization framework named VECOS. VECOS introduces an erosion mechanism to conquer the model's reliance on precisely defined lexical information. To facilitate learning the eroded code's functionality, we force the representation of the eroded code to align with the representation of its original counterpart via variational inference. Experimental results show that our approach outperforms the state-of-the-art approaches to generate coherent and reliable summaries for various lexically confusing code.

Authors

Keywords

  • Data Mining: Mining Codebase and Software Repository
  • Multidisciplinary Topics and Applications: Knowledge-based Software Engineering

Context

Venue
International Joint Conference on Artificial Intelligence
Archive span
1969-2025
Indexed papers
14525
Paper id
893418888849026507