Authors:
B. I. Lyons and J. Michael Herrmann
Affiliation:
Institute of Perception, Action and Behaviour, University of Edinburgh, 10 Crichton Street, Edinburgh, EH8 9AB, U.K.
Keyword(s):
Reinforcement Learning, Exploration-Exploitation Dilemma, Intrinsic Motivation, Self-Referential Learning, Empowerment, Autonomous Agents.
Abstract:
Reinforcement learning aims at maximising an external evaluative signal over a certain time horizon. If no reward is available within the time horizon, the agent faces an autonomous learning task, which can be used to explore, to gather information, and to bootstrap particular learning behaviours. We discuss how the agent can use its current representation of the value, of its state, and of the environment to produce autonomous learning behaviour in the absence of meaningful rewards. The family of methods introduced here is open to further development and research in the field of reflexive reinforcement learning.