Adaptive Design of Role Differentiation by Division of Reward Function in Multi-Agent Reinforcement Learning

JCMSI : SICE journal of control, measurement, and system integration (SICE JCMSI)

TANIGUCHI Tadahiro
TABUCHI Kazuma
SAWARAGI Tetsuo

巻: 3
号: 1
開始ページ: 26
終了ページ: 34
記述言語: 英語
掲載種別
DOI: 10.9746/jcmsi.3.26
出版者・発行元: 公益社団法人計測自動制御学会

There are several problems which discourage an organization from achieving tasks, e.g., partial observation, credit assignment, and concurrent learning in multi-agent reinforcement learning. In many conventional approaches, each agent estimates hidden states, e.g., sensor inputs, positions, and policies of other agents, and reduces the uncertainty in the partially-observable Markov decision process (POMDP), which partially solve the multiagent reinforcement learning problem. In contrast, people reduce uncertainty in human organizations in the real world by autonomously dividing the roles played by individual agents. In a framework of reinforcement learning, roles are mainly represented by goals for individual agents. This paper presents a method for generating internal rewards from manager agents to worker agents. It also explicitly divides the roles, which enables a POMDP task for each agent to be transformed into a simple MDP task under certain conditions. Several situational experiments are also described and the validity of the proposed method is evaluated.

リンク情報

DOI: https://doi.org/10.9746/jcmsi.3.26
J-GLOBAL: https://jglobal.jst.go.jp/detail?JGLOBAL_ID=201002202728386260
CiNii Articles: http://ci.nii.ac.jp/naid/10031139983
CiNii Books: http://ci.nii.ac.jp/ncid/AA12293218
URL: https://jlc.jst.go.jp/DN/JALC/00354246970?from=CiNii

ID情報

DOI : 10.9746/jcmsi.3.26
ISSN : 1882-4889
J-Global ID : 201002202728386260
CiNii Articles ID : 10031139983
CiNii Books ID : AA12293218

エクスポート: BibTeX RIS

椹木哲夫

論文

Adaptive Design of Role Differentiation by Division of Reward Function in Multi-Agent Reinforcement Learning

メニュー

共著者の一覧