Study and application of reinforcement learning based on DAI in cooperative strategy of robot soccer
GUO Qi1,2, ZHANG Da-zhi1, YANG Yong-tian2
1.Dept.of Mathematics,Harbin Institute of Technology,Harbin 150001,China;2.Dept.of Computer Science and Engineering,Harbin Engineering University,Harbin 150001,China
Abstract:
A dynamic cooperation model of multi-agent is established by combining reinforcement learning with distributed artificial intelligence(DAI),in which the concept of individual optimization loses its meaning because of the dependence of repayment on each agent itself and the choice of other agents.Utilizing the idea of DAI,the intellectual unit of each robot and the change of task and environment,each agent can make decisions independently and finish various complicated tasks by communication and reciprocation between each other.The method is superior to other reinforcement learning methods commonly used in the multi-agent system.It can improve the convergence velocity of reinforcement learning,decrease requirements of computer memory,and enhance the capability of computing and logical ratiocinating for agent.The result of a simulated robot soccer match proves that the proposed cooperative strategy is valid.
Key words: robot soccer reinforcement learning cooperative strategy distributed artificial intelligence
DOI:10.11916/j.issn.1005-9113.2009.04.014
Clc Number:TP242.6
Fund:
删除或更新信息,请邮件至freekaoyan#163.com(#换成@)
Study and application of reinforcement learning based on DAI in cooperative strategy of robot soccer
本站小编 哈尔滨工业大学/2019-10-23
相关话题/Study and application reinforcement learning
Influence of Si Contents on the Microstructure Evolution and Mechanical Properties of Al-Mg-Si-Cu-Zn
Influence of Si Contents on the Microstructure Evolution and Mechanical Properties of Al-Mg-Si-Cu-Zn Alloys Author NameAffiliationLiang ZhuState Key Laboratory for Advanced Metals and Materials, University of Science and Technology Beijing, Beijing 100083, ChinaMingxing GuoS ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05GEO Satellite Thruster Configuration and Optimization
GEO Satellite Thruster Configuration and Optimization Author NameAffiliationJiajia FengBeijing Institute of Control Engineering, Beijing 100190, ChinaScience and Technology on Space Intelligent Control Laboratory,Beijing 100190,ChinaZuowei WangBeijing Institute of Control En ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05Hidden Attractors in a Delayed Memristive Differential System with Fractional Order and Chaos Synchr
Hidden Attractors in a Delayed Memristive Differential System with Fractional Order and Chaos Synchronization Author NameAffiliationDawei DingSchool of Electronics and Information Engineering, Anhui University, Hefei 230601, ChinaKey Laboratory of Intelligent Computing and S ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05Vibration Absorption Efficiency and Higher Branches Elimination of Variable-Stiffness Nonlinear Ener
Vibration Absorption Efficiency and Higher Branches Elimination of Variable-Stiffness Nonlinear Energy Sink Author NameAffiliationRui ZhongTianjin Key Laboratory of the Design and Intelligent Control of the Advanced Mechatronical System, Tianjin University of Technology,Tian ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05Biodegradation of Ammonia Nitrogen Using a Novel Candida sp. Strain N6 Immobilization
Biodegradation of Ammonia Nitrogen Using a Novel Candida sp. Strain N6 Immobilization Author NameAffiliationKai WangSchool of Marine Science and Technology, Harbin Institute of Technology, Weihai,Weihai 264209, Shandong, ChinaSchool of Municipal and Environmental Engineering ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05Aging Behavior and Creep Characteristics of CACB
Aging Behavior and Creep Characteristics of CACB Author NameAffiliationYunliang LiSchool of Transportation Science and Engineering, Harbin Institute of Technology, Harbin 150090, ChinaYuze LiuSchool of Transportation Science and Engineering, Harbin Institute of Technology, H ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05Loop Closure Detection of Visual SLAM Based on Point and Line Features
Loop Closure Detection of Visual SLAM Based on Point and Line Features Author NameAffiliationChang’an LiuSchool of Control and Computer Engineering,North China Electric Power University,Beijing 102206, ChinaRuiying ChengSchool of Control and Computer Engineering,North China ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05Dynamics Analysis of Fractional-Order Memristive Time-Delay Chaotic System and Circuit Implementatio
Dynamics Analysis of Fractional-Order Memristive Time-Delay Chaotic System and Circuit Implementation Author NameAffiliationDawei DingSchool of Electronics and Information Engineering, Anhui University, Hefei 230601, ChinaHui LiuSchool of Electronics and Information Engineer ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05Static and Dynamic Analyses of Composite Beam Bonded with MFC Actuator
Static and Dynamic Analyses of Composite Beam Bonded with MFC Actuator Author NameAffiliationKe WuXi’an Institution of Space Radio Technology, Xi’an 710100, ChinaHoufei FangShanghai YS Information Technology Co., Ltd., Shanghai 200240, ChinaLan LanShanghai YS Information Tec ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05Review: Recent Progress on the Application of REBCO Superconductor Bulks
Review: Recent Progress on the Application of REBCO Superconductor Bulks Author NameAffiliationZili ZhangInstitute of Electrical Engineering, Chinese Academy of Sciences, Beijing 100190, ChinaYinming DaiInstitute of Electrical Engineering, Chinese Academy of Sciences, Beijin ...哈尔滨工业大学科研学术 本站小编 哈尔滨工业大学 2020-12-05