Journal of Modern Power Systems and Clean Energy

ISSN 2196-5625 CN 32-1884/TK

Mixed Deep Reinforcement Learning Considering Discrete-continuous Hybrid Action Space for Smart Home Energy Management
Author:
Affiliation:

1.State Key Laboratory of Internet of Things for Smart City, University of Macau, Macao S.A.R., China
2.School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
3.Shunde Graduate School, University of Science and Technology Beijing, Foshan 528399, China

Fund Project:

This work was supported by the National Natural Science Foundation of China (No. 62002016), the Science and Technology Development Fund, Macau S.A.R. (No. 0137/2019/A3), the Beijing Natural Science Foundation (No. 9204028), and the Guangdong Basic and Applied Basic Research Foundation (No. 2019A1515111165).

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
    Abstract:

    This paper develops deep reinforcement learning (DRL) algorithms for optimizing the operation of home energy system which consists of photovoltaic (PV) panels, battery energy storage system, and household appliances. Model-free DRL algorithms can efficiently handle the difficulty of energy system modeling and uncertainty of PV generation. However, discrete-continuous hybrid action space of the considered home energy system challenges existing DRL algorithms for either discrete actions or continuous actions. Thus, a mixed deep reinforcement learning (MDRL) algorithm is proposed, which integrates deep Q-learning (DQL) algorithm and deep deterministic policy gradient (DDPG) algorithm. The DQL algorithm deals with discrete actions, while the DDPG algorithm handles continuous actions. The MDRL algorithm learns optimal strategy by trial-and-error interactions with the environment. However, unsafe actions, which violate system constraints, can give rise to great cost. To handle such problem, a safe-MDRL algorithm is further proposed. Simulation studies demonstrate that the proposed MDRL algorithm can efficiently handle the challenge from discrete-continuous hybrid action space for home energy management. The proposed MDRL algorithm reduces the operation cost while maintaining the human thermal comfort by comparing with benchmark algorithms on the test dataset. Moreover, the safe-MDRL algorithm greatly reduces the loss of thermal comfort in the learning stage by the proposed MDRL algorithm.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:June 27,2021
  • Revised:October 08,2021
  • Adopted:
  • Online: May 12,2022
  • Published: