Let’s Do the Time Warp Again: Human Action Assistance for Reinforcement Learning Agents

Carter B. Burn; Frederick L. Crabbe; Rebecca Hwa

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Let’s Do the Time Warp Again: Human Action Assistance for Reinforcement Learning Agents

Topics: Cooperation and Coordination; Machine Learning; Mobile Agents

In Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, 92-100, 2021

Authors: Carter B. Burn ¹ ; Frederick L. Crabbe ¹ and Rebecca Hwa ²

Affiliations: ¹ Dept. of Computer Science, United States Naval Academy, Annapolis, MD, U.S.A. ; ² Dept. of Computer Science, University of Pittsburgh, Pittsburgh, PA, U.S.A.

Keyword(s): Human-assisted Reinforcement Learning, Action-advice, Time-warp.

Abstract: Reinforcement learning (RL) agents may take a long time to learn a policy for a complex task. One way to help the agent to convergence on a policy faster is by offering it some form of assistance from a teacher who already has some expertise on the same task. The teacher can be either a human or another computer agent, and they can provide assistance by controlling the reward, action selection, or state definition that the agent views. However, some forms of assistance might come more naturally from a human teacher than a computer teacher and vice versa. For instance, a challenge for human teachers in providing action selection is that because computers and human operate at different speed increments, it is difficult to translate what constitutes an action selection for a particular state in a human’s perception to that of the computer agent. In this paper, we introduce a system called Time Warp that allows a human teacher to provide action selection assistance to the agent during cr itical moments of the training for the RL agent. We find that Time Warp is able to help the agent develop a better policy in less time than an RL agent with no assistance and rivals the performance of computer teaching agents. Time Warp also is able to reach the results with only ten minutes of human training time. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.15.206.25

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Burn, C.; Crabbe, F. and Hwa, R. (2021). Let’s Do the Time Warp Again: Human Action Assistance for Reinforcement Learning Agents. In Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART; ISBN 978-989-758-484-8; ISSN 2184-433X, SciTePress, pages 92-100. DOI: 10.5220/0010258700920100

@conference{icaart21,
author={Carter B. Burn. and Frederick L. Crabbe. and Rebecca Hwa.},
title={Let’s Do the Time Warp Again: Human Action Assistance for Reinforcement Learning Agents},
booktitle={Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2021},
pages={92-100},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010258700920100},
isbn={978-989-758-484-8},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - Let’s Do the Time Warp Again: Human Action Assistance for Reinforcement Learning Agents
SN - 978-989-758-484-8
IS - 2184-433X
AU - Burn, C.
AU - Crabbe, F.
AU - Hwa, R.
PY - 2021
SP - 92
EP - 100
DO - 10.5220/0010258700920100
PB - SciTePress