Swarm Behavior Cloning

Jonas Nüßlein; Maximilian Zorn; Philipp Altmann; Claudia Linnhoff-Popien

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Swarm Behavior Cloning

Topics: Agent Models and Architectures

In Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, 23-32, 2025 , Porto, Portugal

Authors: Jonas Nüßlein ; Maximilian Zorn ; Philipp Altmann and Claudia Linnhoff-Popien

Affiliation: Institute of Computer Science, LMU Munich, Germany

Keyword(s): Reinforcement Learning, Imitation Learning, Ensemble, Robustness.

Abstract: In sequential decision-making environments, the primary approaches for training agents are Reinforcement Learning (RL) and Imitation Learning (IL). Unlike RL, which relies on modeling a reward function, IL leverages expert demonstrations, where an expert policy πe (e.g., a human) provides the desired behavior. Formally, a dataset D of state-action pairs is provided: D = (s,a = πe(s)). A common technique within IL is Behavior Cloning (BC), where a policy π(s) = a is learned through supervised learning on D. Further improvements can be achieved by using an ensemble of N individually trained BC policies, denoted as E = {πi(s)}1≤i≤N. The ensemble’s action a for a given state s is the aggregated output of the N actions: a = 1 N ∑i πi(s). This paper addresses the issue of increasing action differences—the observation that discrepancies between the N predicted actions grow in states that are underrepresented in the training data. Large action differences can result in suboptimal aggregated actions. To address this, we propose a method that fosters greater alignment among the policies while preserving the diversity of their computations. This approach reduces action differences and ensures that the ensemble retains its inherent strengths, such as robustness and varied decision-making. We evaluate our approach across eight diverse environments, demonstrating a notable decrease in action differences and significant improvements in overall performance, as measured by mean episode returns. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 18.219.32.237

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Nüßlein, J., Zorn, M., Altmann, P. and Linnhoff-Popien, C. (2025). Swarm Behavior Cloning. In Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART; ISBN 978-989-758-737-5; ISSN 2184-433X, SciTePress, pages 23-32. DOI: 10.5220/0013086600003890

@conference{icaart25,
author={Jonas Nüßlein and Maximilian Zorn and Philipp Altmann and Claudia Linnhoff{-}Popien},
title={Swarm Behavior Cloning},
booktitle={Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2025},
pages={23-32},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013086600003890},
isbn={978-989-758-737-5},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - Swarm Behavior Cloning
SN - 978-989-758-737-5
IS - 2184-433X
AU - Nüßlein, J.
AU - Zorn, M.
AU - Altmann, P.
AU - Linnhoff-Popien, C.
PY - 2025
SP - 23
EP - 32
DO - 10.5220/0013086600003890
PB - SciTePress