Arrow Research search
Back to IS

IS 2007

Cost-Sensitive-Data Preprocessing for Mining Customer Relationship Management Databases

Journal Article journal-article Artificial Intelligence ยท Intelligent Systems

Abstract

A staged-framework for data preprocessing has been developed to support data mining and help service providers identify customers who might switch to a competitor. The framework pushes the cost sensitivity and data imbalance of customer retention data into the data preprocessing itself. Tests using data set from the ACM KDD Cup 1998 showed that the framework outperformed the winner of that data mining and knowledge discovery competition. The framework has also been incorporated into a software system, called ED-Money. To demonstrate the framework's ability to predict customer attrition with high accuracy, it was applied to some benchmark data and to a real customer attrition data set from a large Chinese mobile telecommunications company

Authors

Keywords

  • Customer relationship management
  • Databases
  • Costs
  • Data preprocessing
  • Data mining
  • Switches
  • Sun
  • Communication industry
  • Industrial relations
  • Mining industry
  • Data Pre-processing
  • Mobile Communication
  • Cost Information
  • Customer Value
  • Direct Marketing
  • Neural Network
  • Training Data
  • Decision Tree
  • Response Ratio
  • Training Subsets
  • Test Subset
  • Net Profit
  • Linearly Separable
  • Ensemble Of Networks
  • Validation Subset
  • Cost Matrix
  • Earn Money
  • Back Propagation Neural Network
  • Back-propagation Network
  • cost-sensitive-data preprocessing
  • ensemble of classifiers

Context

Venue
IEEE Intelligent Systems
Archive span
2001-2026
Indexed papers
2921
Paper id
585065505352035651