Papers

Peer-reviewed
May 24, 2016

Reinforcement learning with internal reward for multi-Agent cooperation: A theoretical approach

BICT 2015 - 9th EAI International Conference on Bio-Inspired Information and Communications Technologies
  • Fumito Uwano
  • ,
  • Naoki Tatebe
  • ,
  • Masaya Nakata
  • ,
  • Keiki Takadama
  • ,
  • Tim Kovacs

Volume
2
Number
8
First page
e2
Last page
Language
English
Publishing type
Research paper (international conference proceedings)
DOI
10.4108/eai.3-12-2015.2262878
Publisher
Association for Computing Machinery, Inc

This paper focuses on a multi-Agent cooperation which is generally difficult to be achieved without sufficient information of other agents, and proposes the reinforcement learning method that introduces an internal reward for a multi-Agent cooperation without sufficient information. To guarantee to achieve such a cooperation, this paper theoretically derives the condition of selecting appropriate actions by changing internal rewards given to the agents, and extends the reinforcement learning methods (Q-learning and Profit Sharing) to enable the agents to acquire the appropriate Q-values up- dated according to the derived condition. Concretely, the internal rewards change when the agents can only find better solution than the current one. The intensive simulations on the maze problems as one of test beds have revealed the following implications:(1) our proposed method successfully enables the agents to select their own appropriate cooperating actions which contribute to acquiring the minimum steps towards to their goals, while the conventional methods (i.e., Q-learning and Profit Sharing) cannot always acquire the minimum steps
and (2) the proposed method based on Profit Sharing provides the same good performance as the proposed method based on Q-learning.

Link information
DOI
https://doi.org/10.4108/eai.3-12-2015.2262878
DBLP
https://dblp.uni-trier.de/rec/journals/eetcc/UwanoTNTK16
URL
http://dblp.uni-trier.de/db/journals/eetcc/eetcc2.html#journals/eetcc/UwanoTNTK16
ID information
  • DOI : 10.4108/eai.3-12-2015.2262878
  • DBLP ID : journals/eetcc/UwanoTNTK16
  • SCOPUS ID : 85034614186

Export
BibTeX RIS