loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Michał Bortkiewicz 1 ; Jakub Łyskawa 1 ; Paweł Wawrzyński 1 ; 2 ; Mateusz Ostaszewski 1 ; Artur Grudkowski 1 ; Bartłomiej Sobieski 1 and Tomasz Trzciński 1 ; 2 ; 3 ; 4 ; 5

Affiliations: 1 Warsaw University of Technology, Institute of Computer Science, Poland ; 2 IDEAS NCBR, Poland ; 3 Jagiellonian University, Poland ; 4 Tooploox, Poland ; 5 Ensavid, Poland

Keyword(s): Hierarchical Reinforcement Learning, Deep Reinforcement Learning, Control.

Abstract: Achieving long-term goals becomes more feasible when we break them into smaller, manageable subgoals. Yet, a crucial question arises: how specific should these subgoals be? Existing Goal-Conditioned Hierarchical Reinforcement Learning methods are based on lower-level policies aiming at subgoals designated by higher-level policies. These methods are sensitive to the proximity threshold under which the subgoals are considered achieved. Constant thresholds make the subgoals impossible to achieve in the early learning stages, easy to achieve in the late stages, and require careful manual tuning to yield reasonable overall learning performance. We argue that subgoal precision should depend on the agent’s recent performance rather than be predefined. We propose Adaptive Subgoal Required Distance (ASRD), a drop-in replacement method for subgoal threshold creation that considers the agent’s current lower-level capabilities for appropriate subgoals. Our results demonstrate that subgoal precis ion is essential for HRL convergence speed, and our method improves the performance of existing HRL algorithms. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.16.207.44

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Bortkiewicz, M.; Łyskawa, J.; Wawrzyński, P.; Ostaszewski, M.; Grudkowski, A.; Sobieski, B. and Trzciński, T. (2024). Subgoal Reachability in Goal Conditioned Hierarchical Reinforcement Learning. In Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART; ISBN 978-989-758-680-4; ISSN 2184-433X, SciTePress, pages 221-230. DOI: 10.5220/0012326200003636

@conference{icaart24,
author={Michał Bortkiewicz. and Jakub Łyskawa. and Paweł Wawrzyński. and Mateusz Ostaszewski. and Artur Grudkowski. and Bartłomiej Sobieski. and Tomasz Trzciński.},
title={Subgoal Reachability in Goal Conditioned Hierarchical Reinforcement Learning},
booktitle={Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2024},
pages={221-230},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012326200003636},
isbn={978-989-758-680-4},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - Subgoal Reachability in Goal Conditioned Hierarchical Reinforcement Learning
SN - 978-989-758-680-4
IS - 2184-433X
AU - Bortkiewicz, M.
AU - Łyskawa, J.
AU - Wawrzyński, P.
AU - Ostaszewski, M.
AU - Grudkowski, A.
AU - Sobieski, B.
AU - Trzciński, T.
PY - 2024
SP - 221
EP - 230
DO - 10.5220/0012326200003636
PB - SciTePress