Reflexive Reinforcement Learning: Methods for Self-Referential Autonomous Learning

B. Lyons, J. Herrmann


Reinforcement learning aims at maximising an external evaluative signal over a certain time horizon. If no reward is available within the time horizon, the agent faces an autonomous learning task which can be used to explore, to gather information, and to bootstrap particular learning behaviours. We discuss here how the agent can use its current representation of the value, of its state, and of the environment to produce autonomous learning behaviour in the absence of meaningful rewards. The family of methods introduced here is open to further development and research in the field of reflexive reinforcement learning.


Paper Citation