EAAI Journal 2026 Journal Article
Semantic-driven seasonal data classification: An artificial intelligence-enabled cost-effective storage system
- Zhu Yuan
- Xueqiang Lv
- Yunchao Gong
- Ping Xie
- Xiao Qin
- Xindong You
Modern storage systems face significant challenges in balancing energy costs and performance, especially for text-rich workloads (e. g. , e-commerce logs, social media archives). Traditional frequency-based classification methods fail to exploit semantic patterns in textual metadata, leading to suboptimal resource allocation. In this paper, we propose a novel deep learning-method based on Bidirectional Encoder Representations from Transformers-Recurrent Convolutional Neural Networks (BERT-RCNN) to extract seasonal features from data for classification, enabling cost-effective storage management. Our approach addresses the limitations of traditional frequency-based classification by incorporating semantic features from data text. Additionally, we explore the long-period seasonal features embedded within the text, offering a new approach for multi-feature classification. To evaluate the practical impact of our method, we developed a cost module within CloudSimDisk to simulate storage system operating costs. By constructing models for energy consumption and operating costs and generating real-world workloads from Baidu Index data, we implement and test our solution. Experimental results show that the BERT-RCNN model outperforms traditional K-means and other deep learning methods, reducing energy consumption by 3. 90%–5. 69%, saving operational costs by 3. 62%–6. 13%, and shortening response time by 15. 12%–26. 09%.