Deep W-Networks: Solving Multi-Objective Optimisation Problems with Deep Reinforcement Learning

Jernej Hribar, Luke Hackett, Ivana Dusparic

2023

Abstract

In this paper, we build on advances introduced by the Deep Q-Networks (DQN) approach to extend the multiobjective tabular Reinforcement Learning (RL) algorithm W-learning to large state spaces. W-learning algorithm can naturally solve the competition between multiple single policies in multi-objective environments. However, the tabular version does not scale well to environments with large state spaces. To address this issue, we replace underlying Q-tables with DQN, and propose an addition of W-Networks, as a replacement for tabular weights (W) representations. We evaluate the resulting Deep W-Networks (DWN) approach in two widely-accepted multi-objective RL benchmarks: deep sea treasure and multi-objective mountain car. We show that DWN solves the competition between multiple policies while outperforming the baseline in the form of a DQN solution. Additionally, we demonstrate that the proposed algorithm can find the Pareto front in both tested environments.

Download


Paper Citation


in Harvard Style

Hribar J., Hackett L. and Dusparic I. (2023). Deep W-Networks: Solving Multi-Objective Optimisation Problems with Deep Reinforcement Learning. In Proceedings of the 15th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-623-1, pages 17-26. DOI: 10.5220/0011610300003393


in Bibtex Style

@conference{icaart23,
author={Jernej Hribar and Luke Hackett and Ivana Dusparic},
title={Deep W-Networks: Solving Multi-Objective Optimisation Problems with Deep Reinforcement Learning},
booktitle={Proceedings of the 15th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2023},
pages={17-26},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011610300003393},
isbn={978-989-758-623-1},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - Deep W-Networks: Solving Multi-Objective Optimisation Problems with Deep Reinforcement Learning
SN - 978-989-758-623-1
AU - Hribar J.
AU - Hackett L.
AU - Dusparic I.
PY - 2023
SP - 17
EP - 26
DO - 10.5220/0011610300003393