期刊文献+

Reinforcement Learning and Its Applications in Modern Power and Energy Systems: A Review 被引量:33

原文传递
导出
摘要 With the growing integration of distributed energy resources(DERs),flexible loads,and other emerging technologies,there are increasing complexities and uncertainties for modern power and energy systems.This brings great challenges to the operation and control.Besides,with the deployment of advanced sensor and smart meters,a large number of data are generated,which brings opportunities for novel data-driven methods to deal with complicated operation and control issues.Among them,reinforcement learning(RL)is one of the most widely promoted methods for control and optimization problems.This paper provides a comprehensive literature review of RL in terms of basic ideas,various types of algorithms,and their applications in power and energy systems.The challenges and further works are also discussed.
出处 《Journal of Modern Power Systems and Clean Energy》 SCIE EI CSCD 2020年第6期1029-1042,共14页 现代电力系统与清洁能源学报(英文)
基金 supported by the Sichuan Science and Technology Program(Sichuan Distinguished Young Scholars)(No.2020JDJQ0037).
作者简介 Di Cao is currently pursuing the Ph.D.degree in control science and engineering with University of Electronic Science and Technology of China,Chengdu,China.His research interests include optimization of distribution network and application of machine learning algorithms in power system.e-mail:caodi@std.uestc.edu.cn;Corresponding author:Weihao Hu received the B.Eng.and M.Sc.degrees in electrical engineering from Xi’an Jiaotong University,Xi’an,China,in 2004 and 2007,respectively,and the Ph.D.degree from Aalborg University,Aalborg,Denmark,in 2012.He is currently a Full Professor and the Director of the Institute of Smart Power and Energy Systems,University of Electronics Science and Technology of China,Chengdu,China.His research interests include artificial intelligence in modern power system and renewable power generation.e-mail:whu@uestc.edu.cn;Junbo Zhao received the Ph.D.degree from the Bradley Department of Electrical and Computer Engineering,Virginia Polytechnic Institute and State University,Falls Church,USA,in 2018.Now he is an Assistant Professor with Mississippi State University,Starkville,USA.His research interests include power system modeling,real-time monitoring,dynamics and cyber security,big data analytic,and robust statistical signal processing.e-mail:junbo@ece.msstate.edu;Guozhou Zhang received the B.S.degree from Chongqing University of Technology,Chongqing,China,in 2016,the M.S.degree from the University of Electronic Science and Technology of China,Chengdu,China,in 2019,where he is currently pursuing the Ph.D.degree in control science and engineering.His research interest includes power system analysis and control.e-mail:zgz@std.uestc.edu.cn;Bin Zhang received the B.S.degree from Hohai University,Nanjing,China,in 2017.He is currently pursuing the M.S.degree in University of Electronic Science and Technology of China,Chengdu,China.His research interest is optimization of hybrid energy system.e-mail:sven@uestc.edu.cn;Zhou Liu received the B.Eng.and M.Sc.degrees in electrical engineering from Huazhong University of Science and Technology(HUST),Wuhan,China,in 2004 and 2007,respectively,and the Ph.D.degree in energy technology from the Department of Energy Technology,Aalborg University,Aalborg,Denmark,in 2013.He is currently with the Department of Energy Technology,Aalborg University as an Assistant Professor.His research interests include power system analysis and digital simulation,wide-area protection and control,wind power integration and power substation automation,high-voltage direct current(HVDC)circuit breaker and protection.e-mail:zli@et.aau.dk;Zhe Chen received the B.Eng.and M.Sc.degrees from the Northeast China Institute of Electric Power Engineering,Jilin,China,and the Ph.D.degree from the University of Durham,Durham,UK.He is a Full Professor with the Department of Energy Technology,Aalborg University,Aalborg,Denmark.His research areas include power systems,power electronics and electric machines,and his main current research interests include wind energy and modern power systems.e-mail:zch@et.aau.dk;Frede Blaabjerg received the Ph.D.degree in electrical engineering from Aalborg University,Aalborg,Denmark,in 1995.He was with ABB-Scandia,Randers,Denmark,from 1987 to 1988.He became an Assistant Professor,in 1992,an Associate Professor in 1996,and a Full Professor of power electronics and drives in 1998.In 2017,he became a Villum Investigator.He is Honoris Causa at University Politehnica Timisoara,Timisoara,Romania,and Tallinn Technical University,Tallinn,Estonia.His current research interests include power electronics and its applications such as in wind turbines,PV systems,reliability,harmonics and adjustable speed drives.e-mail:fbl@et.aau.dk
  • 相关文献

参考文献1

二级参考文献14

  • 1Puterman M L.Markov Decision Process:Discrete Dynamic Dtochastic Programming.New-York:Wiley,1994
  • 2Kaya M,Alhajj R.Fuzzy olap association rules mining based modular reinforcement learning approach for multiagent systems.IEEE Transactions on Systems,Man and Cybernetics part B:Cybernetics,2005,35(2):326-338
  • 3Singh S,Bertsekas D.Reinforcement learning for dynamic channel allocation in cellular telephone systems//Mozer M C,Jordan M L,Petsche T.Proceedings of the NIPS-9.Cambridge MA:MIT Press,1997:974
  • 4Vengerov D N,Berenji H R.A fuzzy reinforcement learning approach to power control in wireless transmitters.IEEE Transactions on Systems,Man,and Cybernetics part B:Cybernetics,2005,35(4):768-778
  • 5Critesl R H,Barto A G.Elevator group control using multiple reinforcement learning Agents.Machine Learning,1998,33(2/3):235-262
  • 6Kaelbling L P,Littman M L,Moore A P.Reinforcement learning:A survey.Journal of Artificial Intelligence Research,1996,4:237-285
  • 7Sutton R S,Barto A G.Reinforcement Learning:An Introduction.Cambridge MA:MIT Press,1998
  • 8Schwartz A.A reinforcement learning method for maximizing undiscounted rewards//Huns M N,Singh M P eds.Proceedings of the 10th Annual Conference on Machine Learning.San Francisco:Morgan Kaufmann,1993:298-305
  • 9Tadepalli P,Ok D.Model-based average reward reinforcement learning.Artificial Intelligence,1998,100(1/2):177-224
  • 10Gosavi A.Reinforcement learning for long run average cost.European Journal of Operational Research,2004,155 (3):654-674

共引文献37

同被引文献371

引证文献33

二级引证文献261

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部