LSTM-based Abstraction of Hetero Observation and Transition in Non-Communicative Multi-Agent Reinforcement Learning

Fumito Uwano

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

LSTM-based Abstraction of Hetero Observation and Transition in Non-Communicative Multi-Agent Reinforcement Learning

Topics: Machine Learning; Multi-Agent Systems

In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, 172-179, 2022

Author: Fumito Uwano

Affiliation: Department of Computer Science, Okayama University, 3-1-1, Tsushima-naka, Kita-ku, Okayama, Japan

Keyword(s): Multiagent System, Reinforcement Learning, LSTM, Hetero-information, Hetero-transition.

Abstract: This study focuses on noncommunicative multiagent learning with hetero-information where agents observe each other in different resolutions of information. A new method is proposed for adapting the time dimension of the hetero-information from the observation by expanding the Asynchronous Advantage Actor–Critic (A3C) algorithm. The profit minimizing reinforcement learning with oblivion of memory mechanism was the previously used noncommunicative and cooperative learning method in multiagent reinforcement learning. We then insert an long short-term memory (LSTM) module into the A3C neural network to adapt to the time dimension influence of the hetero-information. The experiments investigate the performance of the proposed method on the hetero-information environment in terms of the effectiveness of LSTM. The experimental results show that: (1) the proposed method performs better than A3C. Without the LSTM module, the proposed method enabled the agents’ learning to converge. (2) LSTM c an adapt the time dimension of the input information. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.140.198.43

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Uwano, F. (2022). LSTM-based Abstraction of Hetero Observation and Transition in Non-Communicative Multi-Agent Reinforcement Learning. In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART; ISBN 978-989-758-547-0; ISSN 2184-433X, SciTePress, pages 172-179. DOI: 10.5220/0010795700003116

@conference{icaart22,
author={Fumito Uwano.},
title={LSTM-based Abstraction of Hetero Observation and Transition in Non-Communicative Multi-Agent Reinforcement Learning},
booktitle={Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2022},
pages={172-179},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010795700003116},
isbn={978-989-758-547-0},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - LSTM-based Abstraction of Hetero Observation and Transition in Non-Communicative Multi-Agent Reinforcement Learning
SN - 978-989-758-547-0
IS - 2184-433X
AU - Uwano, F.
PY - 2022
SP - 172
EP - 179
DO - 10.5220/0010795700003116
PB - SciTePress