Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication

Adrian Redder; Arunselvan Ramaswamy; Holger Karl

Research.Publish.Connect.

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication

Topics: Agent Communication and Languages; Distributed Problem Solving; Machine Learning; Multi-Agent Systems

In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, 282-289, 2022

Authors: Adrian Redder ¹ ; Arunselvan Ramaswamy ¹ and Holger Karl ²

Affiliations: ¹ Department of Computer Science, Paderborn University, Germany ; ² Hasso-Plattner-Institute, Potsdam University, Germany

Keyword(s): Policy Gradient Algorithms, Multi-agent Learning, Communication Networks, Distributed Optimisation, Age of Information, Continuous Control.

Abstract: Distributed online learning over delaying communication networks is a fundamental problem in multi-agent learning, since the convergence behaviour of interacting agents is distorted by their delayed communication. It is a priori unclear, how much communication delay can be allowed, such that the joint policies of multiple agents can still converge to a solution of a multi-agent learning problem. In this work, we present the decentralization of the well known deep deterministic policy gradient algorithm using a communication network. We illustrate the convergence of the algorithm and the effect of lossy communication on the rate of convergence for a two-agent flow control problem, where the agents exchange their local information over a delaying wireless network. Finally, we discuss theoretical implications for this algorithm using recent advances in the theory of age of information and deep reinforcement learning.

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 13.59.36.203

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Redder, A.; Ramaswamy, A. and Karl, H. (2022). Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication. In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART; ISBN 978-989-758-547-0; ISSN 2184-433X, SciTePress, pages 282-289. DOI: 10.5220/0010845400003116

@conference{icaart22,
author={Adrian Redder. and Arunselvan Ramaswamy. and Holger Karl.},
title={Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication},
booktitle={Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2022},
pages={282-289},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010845400003116},
isbn={978-989-758-547-0},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication
SN - 978-989-758-547-0
IS - 2184-433X
AU - Redder, A.
AU - Ramaswamy, A.
AU - Karl, H.
PY - 2022
SP - 282
EP - 289
DO - 10.5220/0010845400003116
PB - SciTePress