Study and application of reinforcement learning based on DAI in cooperative strategy of robot soccer

删除或更新信息，请邮件至freekaoyan#163.com(#换成@)

本站小编哈尔滨工业大学/2019-10-23

Study and application of reinforcement learning based on DAI in cooperative strategy of robot soccer

GUO Qi^1,2, ZHANG Da-zhi¹, YANG Yong-tian²

1.Dept.of Mathematics,Harbin Institute of Technology,Harbin 150001,China;2.Dept.of Computer Science and Engineering,Harbin Engineering University,Harbin 150001,China

Abstract:

A dynamic cooperation model of multi-agent is established by combining reinforcement learning with distributed artificial intelligence(DAI),in which the concept of individual optimization loses its meaning because of the dependence of repayment on each agent itself and the choice of other agents.Utilizing the idea of DAI,the intellectual unit of each robot and the change of task and environment,each agent can make decisions independently and finish various complicated tasks by communication and reciprocation between each other.The method is superior to other reinforcement learning methods commonly used in the multi-agent system.It can improve the convergence velocity of reinforcement learning,decrease requirements of computer memory,and enhance the capability of computing and logical ratiocinating for agent.The result of a simulated robot soccer match proves that the proposed cooperative strategy is valid.

Key words: robot soccer reinforcement learning cooperative strategy distributed artificial intelligence

DOI：10.11916/j.issn.1005-9113.2009.04.014

Clc Number:TP242.6

Fund:

相关话题/Study and application reinforcement learning

领限时大额优惠券,享本站正版考研考试资料!
优惠券领取后72小时内有效，10万种最新考研考试考证类电子打印资料任你选。涵盖全国500余所院校考研专业课、200多种职业资格考试、1100多种经典教材，产品类型包含电子书、题库、全套资料以及视频，无论您是考研复习、考证刷题，还是考前冲刺等，不同类型的产品可满足您学习上的不同需求。 ...
考试优惠券本站小编 Free壹佰分学习网 2022-09-19
Influence of Si Contents on the Microstructure Evolution and Mechanical Properties of Al-Mg-Si-Cu-Zn
Influence of Si Contents on the Microstructure Evolution and Mechanical Properties of Al-Mg-Si-Cu-Zn Alloys Author NameAffiliationLiang ZhuState Key Laboratory for Advanced Metals and Materials, University of Science and Technology Beijing, Beijing 100083, ChinaMingxing GuoS ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05
GEO Satellite Thruster Configuration and Optimization
GEO Satellite Thruster Configuration and Optimization Author NameAffiliationJiajia FengBeijing Institute of Control Engineering, Beijing 100190, ChinaScience and Technology on Space Intelligent Control Laboratory,Beijing 100190,ChinaZuowei WangBeijing Institute of Control En ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05
Hidden Attractors in a Delayed Memristive Differential System with Fractional Order and Chaos Synchr
Hidden Attractors in a Delayed Memristive Differential System with Fractional Order and Chaos Synchronization Author NameAffiliationDawei DingSchool of Electronics and Information Engineering, Anhui University, Hefei 230601, ChinaKey Laboratory of Intelligent Computing and S ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05
Vibration Absorption Efficiency and Higher Branches Elimination of Variable-Stiffness Nonlinear Ener
Vibration Absorption Efficiency and Higher Branches Elimination of Variable-Stiffness Nonlinear Energy Sink Author NameAffiliationRui ZhongTianjin Key Laboratory of the Design and Intelligent Control of the Advanced Mechatronical System, Tianjin University of Technology,Tian ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05
Biodegradation of Ammonia Nitrogen Using a Novel Candida sp. Strain N6 Immobilization
Biodegradation of Ammonia Nitrogen Using a Novel Candida sp. Strain N6 Immobilization Author NameAffiliationKai WangSchool of Marine Science and Technology, Harbin Institute of Technology, Weihai,Weihai 264209, Shandong, ChinaSchool of Municipal and Environmental Engineering ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05
Aging Behavior and Creep Characteristics of CACB
Aging Behavior and Creep Characteristics of CACB Author NameAffiliationYunliang LiSchool of Transportation Science and Engineering, Harbin Institute of Technology, Harbin 150090, ChinaYuze LiuSchool of Transportation Science and Engineering, Harbin Institute of Technology, H ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05
Loop Closure Detection of Visual SLAM Based on Point and Line Features
Loop Closure Detection of Visual SLAM Based on Point and Line Features Author NameAffiliationChang’an LiuSchool of Control and Computer Engineering,North China Electric Power University,Beijing 102206, ChinaRuiying ChengSchool of Control and Computer Engineering,North China ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05
Dynamics Analysis of Fractional-Order Memristive Time-Delay Chaotic System and Circuit Implementatio
Dynamics Analysis of Fractional-Order Memristive Time-Delay Chaotic System and Circuit Implementation Author NameAffiliationDawei DingSchool of Electronics and Information Engineering, Anhui University, Hefei 230601, ChinaHui LiuSchool of Electronics and Information Engineer ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05
Static and Dynamic Analyses of Composite Beam Bonded with MFC Actuator
Static and Dynamic Analyses of Composite Beam Bonded with MFC Actuator Author NameAffiliationKe WuXi’an Institution of Space Radio Technology, Xi’an 710100, ChinaHoufei FangShanghai YS Information Technology Co., Ltd., Shanghai 200240, ChinaLan LanShanghai YS Information Tec ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05
Review: Recent Progress on the Application of REBCO Superconductor Bulks
Review: Recent Progress on the Application of REBCO Superconductor Bulks Author NameAffiliationZili ZhangInstitute of Electrical Engineering, Chinese Academy of Sciences, Beijing 100190, ChinaYinming DaiInstitute of Electrical Engineering, Chinese Academy of Sciences, Beijin ...
哈尔滨工业大学科研学术本站小编哈尔滨工业大学 2020-12-05