Uncertainty-based Out-of-Distribution Classification in Deep Reinforcement Learning

Andreas Sedlmeier; Thomas Gabor; Thomy Phan; Lenz Belzner; Claudia Linnhoff-Popien

Topics: Autonomous Systems; Deep Learning; Mobile Agents; Neural Networks; Uncertainty in AI

In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, 522-529, 2020, Valletta, Malta

Authors: Andreas Sedlmeier¹; Thomas Gabor¹; Thomy Phan¹; Lenz Belzner² and Claudia Linnhoff-Popien¹

Affiliations: ¹ LMU Munich, Munich, Germany; ² MaibornWolff, Munich, Germany

Keyword(s): Uncertainty in AI, Out-of-Distribution Classification, Deep Reinforcement Learning.

Abstract: Robustness to out-of-distribution (OOD) data is an important goal in building reliable machine learning systems. As a first step towards a solution, we consider the problem of detecting such data in a value-based deep reinforcement learning (RL) setting. Modelling this problem as a one-class classification problem, we propose a framework for uncertainty-based OOD classification: UBOOD. It is based on the effect that an agent's epistemic uncertainty is reduced for situations encountered during training (in-distribution), and is thus lower than for unencountered (OOD) situations. Because the framework is agnostic towards the approach used for estimating epistemic uncertainty, it can be combined with different uncertainty estimation methods, e.g. approximate Bayesian inference methods or ensembling techniques. Evaluation shows that the framework produces reliable classification results when combined with ensemble-based estimators, while the combination with concrete dropout-based estimators fails to reliably detect OOD situations.
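The idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes an ensemble whose members each return a Q-value estimate for the same state-action pair, uses the variance across members as a stand-in for epistemic uncertainty, and fits a one-class decision threshold from in-distribution samples only. The function names (`epistemic_uncertainty`, `fit_threshold`, `is_ood`) and the 0.95 quantile are hypothetical choices made for illustration; the abstract does not specify how the threshold is chosen.

```python
import statistics
from typing import Sequence


def epistemic_uncertainty(q_estimates: Sequence[float]) -> float:
    """Disagreement between ensemble members (assumed proxy for
    epistemic uncertainty), measured as population variance."""
    return statistics.pvariance(q_estimates)


def fit_threshold(in_dist_uncertainties: Sequence[float],
                  quantile: float = 0.95) -> float:
    """Pick a threshold from in-distribution uncertainties only
    (one-class setting). The quantile is an illustrative assumption."""
    ordered = sorted(in_dist_uncertainties)
    index = min(len(ordered) - 1, int(quantile * len(ordered)))
    return ordered[index]


def is_ood(q_estimates: Sequence[float], threshold: float) -> bool:
    """Classify a situation as OOD when its uncertainty exceeds the threshold."""
    return epistemic_uncertainty(q_estimates) > threshold


# Toy ensemble outputs (3 members) for situations seen during training.
in_distribution = [
    [1.0, 1.1, 0.9],
    [2.0, 2.05, 1.95],
    [0.5, 0.5, 0.6],
]
threshold = fit_threshold([epistemic_uncertainty(q) for q in in_distribution])

familiar = [1.0, 1.05, 0.95]   # members agree closely
unfamiliar = [0.0, 3.0, -2.0]  # members disagree strongly

print(is_ood(familiar, threshold))
print(is_ood(unfamiliar, threshold))
```

Since the classifier only consumes a scalar uncertainty value, any other estimator could be substituted for `epistemic_uncertainty` without changing the rest of the sketch, which mirrors the estimator-agnostic design the abstract describes.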

CC BY-NC-ND 4.0

Paper citation in several formats:

Sedlmeier, A., Gabor, T., Phan, T., Belzner, L. and Linnhoff-Popien, C. (2020). Uncertainty-based Out-of-Distribution Classification in Deep Reinforcement Learning. In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART; ISBN 978-989-758-395-7; ISSN 2184-433X, SciTePress, pages 522-529. DOI: 10.5220/0008949905220529

@conference{icaart20,
author={Andreas Sedlmeier and Thomas Gabor and Thomy Phan and Lenz Belzner and Claudia Linnhoff{-}Popien},
title={Uncertainty-based Out-of-Distribution Classification in Deep Reinforcement Learning},
booktitle={Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART},
year={2020},
pages={522-529},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008949905220529},
isbn={978-989-758-395-7},
issn={2184-433X},
}

TY - CONF
JO - Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART
TI - Uncertainty-based Out-of-Distribution Classification in Deep Reinforcement Learning
SN - 978-989-758-395-7
IS - 2184-433X
AU - Sedlmeier, A.
AU - Gabor, T.
AU - Phan, T.
AU - Belzner, L.
AU - Linnhoff-Popien, C.
PY - 2020
SP - 522
EP - 529
DO - 10.5220/0008949905220529
PB - SciTePress