論文

査読有り
2010年1月31日

Adaptive Design of Role Differentiation by Division of Reward Function in Multi-Agent Reinforcement Learning

JCMSI : SICE journal of control, measurement, and system integration (SICE JCMSI)
  • TANIGUCHI Tadahiro
  • ,
  • TABUCHI Kazuma
  • ,
  • SAWARAGI Tetsuo

3
1
開始ページ
26
終了ページ
34
記述言語
英語
掲載種別
DOI
10.9746/jcmsi.3.26
出版者・発行元
公益社団法人 計測自動制御学会

There are several problems which discourage an organization from achieving tasks, e.g., partial observation, credit assignment, and concurrent learning in multi-agent reinforcement learning. In many conventional approaches, each agent estimates hidden states, e.g., sensor inputs, positions, and policies of other agents, and reduces the uncertainty in the partially-observable Markov decision process (POMDP), which partially solve the multiagent reinforcement learning problem. In contrast, people reduce uncertainty in human organizations in the real world by autonomously dividing the roles played by individual agents. In a framework of reinforcement learning, roles are mainly represented by goals for individual agents. This paper presents a method for generating internal rewards from manager agents to worker agents. It also explicitly divides the roles, which enables a POMDP task for each agent to be transformed into a simple MDP task under certain conditions. Several situational experiments are also described and the validity of the proposed method is evaluated.

リンク情報
DOI
https://doi.org/10.9746/jcmsi.3.26
J-GLOBAL
https://jglobal.jst.go.jp/detail?JGLOBAL_ID=201002202728386260
CiNii Articles
http://ci.nii.ac.jp/naid/10031139983
CiNii Books
http://ci.nii.ac.jp/ncid/AA12293218
URL
https://jlc.jst.go.jp/DN/JALC/00354246970?from=CiNii
ID情報
  • DOI : 10.9746/jcmsi.3.26
  • ISSN : 1882-4889
  • J-Global ID : 201002202728386260
  • CiNii Articles ID : 10031139983
  • CiNii Books ID : AA12293218

エクスポート
BibTeX RIS