Authors: Mathieu Lelerre and Abdel-Illah Mouaddib

Affiliation: Université de Caen Normandie, France

ISBN: 978-989-758-201-1

Keyword(s): Behavior, Recognition, MDP, Reinforcement Learning.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence; Computational Intelligence; Evolution Strategies; Evolutionary Computing; Evolutionary Robotics and Intelligent Agents; Soft Computing

Abstract: The coordination between cooperative autonomous agents is mainly based on knowing or estimating the behavior policy of each other. Most approaches assume that agents estimate the policies of the others by considering the optimal ones. Unfortunately, this assumption is not valid for coordination between semi-autonomous agents, where an external entity can change the behavior of the agents in a non-optimal way. Such problems arise when the external entity is an operator guiding or tele-operating a system, and many factors such as stress, hesitation, and preferences can affect the operator's behavior. In such situations, recognizing the other agents' policies becomes harder than usual, since considering all situations of hesitation or stress is not feasible. In this paper, we propose an approach that uses online learning techniques to recognize and predict the future actions and behavior of such agents when they can follow any policy, including non-optimal ones, across different cases of hesitation and preference. The main idea of our approach is to initially estimate the policy as the optimal one and then update it according to the observed behavior to derive a new estimated policy. We present three learning methods for updating policies, show their stability and efficiency, and compare them with existing approaches.
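
The abstract's central idea, starting from the optimal policy and revising the estimate as behavior is observed, can be illustrated with a minimal sketch. This is not one of the paper's three update methods, which this page does not describe. It assumes a small tabular MDP, a simple count-based update, and hypothetical names (`value_iteration`, `PolicyEstimator`, `prior_weight`) chosen only for illustration.

```python
import numpy as np

def value_iteration(P, R, gamma=0.95, tol=1e-8):
    """Return Q[s, a] for an MDP with P[a, s, s'] transitions and R[s, a] rewards."""
    n_actions, n_states, _ = P.shape
    Q = np.zeros((n_states, n_actions))
    while True:
        V = Q.max(axis=1)
        Q_new = R + gamma * np.einsum("ast,t->sa", P, V)
        if np.abs(Q_new - Q).max() < tol:
            return Q_new
        Q = Q_new

class PolicyEstimator:
    """Assumed count-based estimator: the prior favors the optimal action."""

    def __init__(self, Q, prior_weight=2.0):
        n_states, _ = Q.shape
        greedy = np.zeros_like(Q)
        greedy[np.arange(n_states), Q.argmax(axis=1)] = 1.0
        # Pseudo-counts: the initial estimate is exactly the optimal policy.
        self.counts = prior_weight * greedy

    def observe(self, state, action):
        # Each observed (state, action) pair shifts the estimate.
        self.counts[state, action] += 1.0

    def policy(self):
        # Estimated action distribution per state.
        return self.counts / self.counts.sum(axis=1, keepdims=True)

    def predict(self, state):
        # Most likely next action in the given state.
        return int(self.counts[state].argmax())

# Toy MDP (an assumption): action 0 stays, action 1 switches state;
# only switching out of state 0 is rewarded.
P = np.array([[[1.0, 0.0], [0.0, 1.0]],
              [[0.0, 1.0], [1.0, 0.0]]])
R = np.array([[0.0, 1.0],
              [0.0, 0.0]])

est = PolicyEstimator(value_iteration(P, R))
before = est.predict(0)      # prediction from the optimal policy alone
for _ in range(3):
    est.observe(0, 0)        # operator repeatedly hesitates (stays) in state 0
after = est.predict(0)       # prediction after observing non-optimal behavior
```

The `prior_weight` parameter controls how many non-optimal observations are needed before the prediction departs from the optimal action; a larger value makes the estimate more conservative.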

CC BY-NC-ND 4.0

Paper citation in several formats:
Lelerre, M. and Mouaddib, A. (2016). Non-optimal Semi-autonomous Agent Behavior Policy Recognition. In Proceedings of the 8th International Joint Conference on Computational Intelligence - Volume 3: ECTA, (IJCCI 2016) ISBN 978-989-758-201-1, pages 193-200. DOI: 10.5220/0006054401930200

@conference{ecta16,
author={Mathieu Lelerre and Abdel{-}Illah Mouaddib},
title={Non-optimal Semi-autonomous Agent Behavior Policy Recognition},
booktitle={Proceedings of the 8th International Joint Conference on Computational Intelligence - Volume 3: ECTA, (IJCCI 2016)},
year={2016},
pages={193--200},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006054401930200},
isbn={978-989-758-201-1},
}

TY  - CONF
JO  - Proceedings of the 8th International Joint Conference on Computational Intelligence - Volume 3: ECTA, (IJCCI 2016)
TI  - Non-optimal Semi-autonomous Agent Behavior Policy Recognition
SN  - 978-989-758-201-1
AU  - Lelerre, M.
AU  - Mouaddib, A.
PY  - 2016
SP  - 193
EP  - 200
DO  - 10.5220/0006054401930200
ER  - 
