Mixed Deep Reinforcement Learning Considering Discrete-continuous Hybrid Action Space for Smart Home Energy Management

doi:10.35833/MPCE.2021.000394

Home > Archive>Volume 10, Issue 3, 2022 >743-754. DOI:10.35833/MPCE.2021.000394

PDF HTML Export

Mixed Deep Reinforcement Learning Considering Discrete-continuous Hybrid Action Space for Smart Home Energy Management

DOI:

10.35833/MPCE.2021.000394

Author:

Affiliation:

1.State Key Laboratory of Internet of Things for Smart City, University of Macau, Macao S.A.R., China
2.School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
3.Shunde Graduate School, University of Science and Technology Beijing, Foshan 528399, China

Fund Project:

This work was supported by the National Natural Science Foundation of China (No. 62002016), the Science and Technology Development Fund, Macau S.A.R. (No. 0137/2019/A3), the Beijing Natural Science Foundation (No. 9204028), and the Guangdong Basic and Applied Basic Research Foundation (No. 2019A1515111165).

Article

Figures

Metrics

Reference

Cited by

Materials

Abstract:

This paper develops deep reinforcement learning (DRL) algorithms for optimizing the operation of home energy system which consists of photovoltaic (PV) panels, battery energy storage system, and household appliances. Model-free DRL algorithms can efficiently handle the difficulty of energy system modeling and uncertainty of PV generation. However, discrete-continuous hybrid action space of the considered home energy system challenges existing DRL algorithms for either discrete actions or continuous actions. Thus, a mixed deep reinforcement learning (MDRL) algorithm is proposed, which integrates deep Q-learning (DQL) algorithm and deep deterministic policy gradient (DDPG) algorithm. The DQL algorithm deals with discrete actions, while the DDPG algorithm handles continuous actions. The MDRL algorithm learns optimal strategy by trial-and-error interactions with the environment. However, unsafe actions, which violate system constraints, can give rise to great cost. To handle such problem, a safe-MDRL algorithm is further proposed. Simulation studies demonstrate that the proposed MDRL algorithm can efficiently handle the challenge from discrete-continuous hybrid action space for home energy management. The proposed MDRL algorithm reduces the operation cost while maintaining the human thermal comfort by comparing with benchmark algorithms on the test dataset. Moreover, the safe-MDRL algorithm greatly reduces the loss of thermal comfort in the learning stage by the proposed MDRL algorithm.

Reference

Cited by

Get Citation

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:June 27,2021
Revised:October 08,2021
Adopted:
Online: May 12,2022
Published:

Home

Introduction

Editorial Board

For Author

Call For Papers

APC

Sponsor & Publisher

Get Citation

Share

Article Metrics

History