論文

2018年

Neuron as an agent

6th International Conference on Learning Representations, ICLR 2018 - Workshop Track Proceedings
  • Shohei Ohsawa
  • ,
  • Kei Akuzawa
  • ,
  • Tatsuya Matsushima
  • ,
  • Gustavo Bezerra
  • ,
  • Yusuke Iwasawa
  • ,
  • Hiroshi Kajino
  • ,
  • Seiya Takenaka
  • ,
  • Yutaka Matsuo

記述言語
掲載種別
研究論文(国際会議プロシーディングス)

Existing multi-agent reinforcement learning (MARL) communication methods have relied on a trusted third party (TTP) to distribute reward to agents, leaving them inapplicable in peer-to-peer environments. This paper proposes reward distribution using Neuron as an Agent (NaaA) in MARL without a TTP with two key ideas: (i) inter-agent reward distribution and (ii) auction theory. Auction theory is introduced because inter-agent reward distribution is insufficient for optimization. Agents in NaaA maximize their profits (the difference between reward and cost) and, as a theoretical result, the auction mechanism is shown to have agents autonomously evaluate counterfactual returns as the values of other agents. NaaA enables representation trades in peer-to-peer environments, ultimately regarding unit in neural networks as agents. Finally, numerical experiments (a single-agent environment from OpenAI Gym and a multi-agent environment from ViZDoom) confirm that NaaA framework optimization leads to better performance in reinforcement learning.

リンク情報
Scopus
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85083951632&origin=inward
Scopus Citedby
https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=85083951632&origin=inward
ID情報
  • SCOPUS ID : 85083951632

エクスポート
BibTeX RIS