Journal of Modern Power Systems and Clean Energy

ISSN 2196-5625 CN 32-1884/TK

Power System Flow Adjustment and Sample Generation Based on Deep Reinforcement Learning
Author:
Affiliation:

1.Department of Electrical Engineering, Tsinghua University, Beijing, China;2.State Grid Ningxia Electric Power Co. Ltd., Yinchuan, China

Fund Project:

This work was supported by the Science and Technology Project of the State Grid Corporation of China (No. 5400-201935258A-0-0-00), and the National Natural Science Foundation of China (No. 51777104).

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
    Abstract:

    With the increasing complexity of power system structures and the increasing penetration of renewable energy, the number of possible power system operation modes increases dramatically. It is difficult to make manual power flow adjustments to establish an initial convergent power flow that is suitable for operation mode analysis. At present, problems of low efficiency and long time consumption are encountered in the formulation of operation modes, resulting in a very limited number of generated operation modes. In this paper, we propose an intelligent power flow adjustment and generation model based on a deep network and reinforcement learning. First, a discriminator is trained to judge the power flow convergence, and the output of this discriminator is used to construct a value function. Then, the reinforcement learning method is adopted to learn a strategy for power flow convergence adjustment. Finally, a large number of convergent power flow samples are generated using the learned adjustment strategy. Compared with the traditional flow adjustment method, the proposed method has significant advantages that the learning of the power flow adjustment strategy does not depend on the parameters of the power system model. Therefore, this strategy can be automatically learned without manual intervention, which allows a large number of different operation modes to be efficiently formulated. The verification results of a case study show that the proposed method can independently learn a power flow adjustment strategy and generate various convergent power flows.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:April 17,2020
  • Revised:
  • Adopted:
  • Online: December 03,2020
  • Published: