A CAUTIOUS APPROACH TO GENERALIZATION IN REINFORCEMENT LEARNING

Raphael Fonteneau; Susan A. Murphy; Louis Wehenkel; Damien Ernst

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

A CAUTIOUS APPROACH TO GENERALIZATION IN REINFORCEMENT LEARNING

Topics: Machine Learning; Uncertainty in AI

In Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, 64-73, 2010 , Valencia, Spain

Authors: Raphael Fonteneau ¹ ; Susan A. Murphy ² ; Louis Wehenkel ¹ and Damien Ernst ³

Affiliations: ¹ University of Liège, Belgium ; ² University of Michigan, United States ; ³ University of Liege, Belgium

Keyword(s): Reinforcement learning, Prior knowledge, Cautious generalization.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Computational Intelligence ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Soft Computing ; Symbolic Systems ; Uncertainty in AI

Abstract: In the context of a deterministic Lipschitz continuous environment over continuous state spaces, finite action spaces, and a finite optimization horizon, we propose an algorithm of polynomial complexity which exploits weak prior knowledge about its environment for computing from a given sample of trajectories and for a given initial state a sequence of actions. The proposed Viterbi-like algorithm maximizes a recently proposed lower bound on the return depending on the initial state, and uses to this end prior knowledge about the environment provided in the form of upper bounds on its Lipschitz constants. It thereby avoids, in way depending on the initial state and on the the prior knowledge, those regions of the state space where the sample is too sparse to make safe generalizations. Our experiments show that it can lead to more cautious policies than algorithms combining dynamic programming with function approximators. We give also a condition on the sample sparsity ensuring that, for a given initial state, the proposed algorithm produces an optimal sequence of actions in open-loop. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.157

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Fonteneau, R., Murphy, S. A., Wehenkel, L., Ernst and D. (2010). A CAUTIOUS APPROACH TO GENERALIZATION IN REINFORCEMENT LEARNING. In Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART; ISBN 978-989-674-021-4; ISSN 2184-433X, SciTePress, pages 64-73. DOI: 10.5220/0002726900640073

@conference{icaart10,
author={Raphael Fonteneau and Susan A. Murphy and Louis Wehenkel and Damien Ernst},
title={A CAUTIOUS APPROACH TO GENERALIZATION IN REINFORCEMENT LEARNING},
booktitle={Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2010},
pages={64-73},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002726900640073},
isbn={978-989-674-021-4},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - A CAUTIOUS APPROACH TO GENERALIZATION IN REINFORCEMENT LEARNING
SN - 978-989-674-021-4
IS - 2184-433X
AU - Fonteneau, R.
AU - Murphy, S.
AU - Wehenkel, L.
AU - Ernst, D.
PY - 2010
SP - 64
EP - 73
DO - 10.5220/0002726900640073
PB - SciTePress