EAAI Journal 2026 Journal Article
Forecast-enhanced bilevel real-time pricing for microgrids via hybrid-action reinforcement learning
- Jingqi Wang
- Yan Gao
- Youmeng He
The integration of distributed energy resources into microgrids faces many complex challenges, including renewable intermittency, hybrid decision-making, and hierarchical coordination. This paper presents a forecast-enhanced bilevel real-time pricing framework using a hybrid-action deep reinforcement learning (DRL) algorithm with Gumbel-Softmax reparameterization. The framework manages both discrete generator commitment and continuous pricing decisions through integrated optimization. Our approach integrates Long Short-Term Memory (LSTM) forecasting to enhance proactive scheduling, while coordinating microgrid agents through a bilevel optimization architecture. The main innovations include: a hybrid-action DRL algorithm integrating Gumbel-Softmax reparameterization for joint discrete–continuous optimization; LSTM-based renewable forecasting integrated into state representation. Our DRL approach shows enhanced system performance with improved constraint satisfaction and operational efficiency, offering a practical solution for complex hybrid-action energy optimization problems.