AI-powered predictive control for hybrid renewable microgrids: Integrating photovoltaic generation and battery storage in smart homes and industrial applications

López Flórez, Sebastián; Hernández González, Guillermo; Prieto Tejedor, Javier; Prieta Pintado, Fernando de la

doi:10.1016/j.mlwa.2026.100864

Título

AI-powered predictive control for hybrid renewable microgrids: Integrating photovoltaic generation and battery storage in smart homes and industrial applications

Autor(es)

López Flórez, Sebastián

Hernández González, Guillermo

Prieto Tejedor, Javier

Prieta Pintado, Fernando de la

Palabras clave

Energy management

Energy efficiency optimization

Risk planning

Reinforcement learning

Fecha de publicación

2026-06

Editor

Elsevier B.V.

Citación

Flórez, S. L., Hernández, G., Prieto, J., & de la Prieta, F. (2026). AI-powered predictive control for hybrid renewable microgrids: Integrating photovoltaic generation and battery storage in smart homes and industrial applications. Machine Learning with Applications, 24, 100864. https://doi.org/10.1016/j.mlwa.2026.100864

Resumen

[EN]The integration of renewable energy sources into microgrids remains challenging due to generation intermittency, storage inefficiencies, and progressive battery degradation. In this work, our main contribution is a standardized empirical benchmark of deep reinforcement learning for microgrid energy management across multiple operating regimes under a consistent evaluation protocol. To make these evaluations rigorous and comparable, we define a common state/action/reward specification that explicitly encodes battery-health proxies, dynamic pricing, and operational constraints, and we adapt a representative set of RL families accordingly: value-based methods (DoubleDQN, NoisyNet-DQN, PER-DQN, C51), policy-gradient approaches (PG), and actor–critic algorithms (PPO, A2C, SAC, DDPG, A3C-Energy). We benchmark the resulting agents on real-world driven datasets and environments (GEFCom2014, StoreNet, and a CityLearn-inspired setting), assessing performance under a consistent evaluation pipeline. Empirically, the reported results indicate that the adapted agents tend to reduce daily operating cost, smooth charge–discharge cycling, and improve battery-health proxies under realistic operating conditions, with gains that vary by dataset and operating regime. In particular, DQN-based variants exhibit consistent gains over standard exploration schemes, while methods such as PPO and actor–critic configurations maintain competitive and more stable behavior under highly volatile price signals and renewable fluctuations. Overall, the study provides scenario-conditional evidence on which RL families tend to perform well or poorly under the tested conditions, rather than a universal ranking across datasets.

URI

https://hdl.handle.net/10366/170167

ISSN

2666-8270

DOI

10.1016/j.mlwa.2026.100864

Versión del editor

https://doi.org/10.1016/j.mlwa.2026.100864

Collections