loading
Papers

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Hugo Tanzarella Teixeira and Celso Pascoli Bottura

Affiliation: State University of Campinas - UNICAMP, Brazil

ISBN: 978-989-758-122-9

Keyword(s): Machine Learning, Reinforcement Learning, Temporal Difference Learning, Value Function Approximation, Online Support Vector Machine.

Related Ontology Subjects/Areas/Topics: Informatics in Control, Automation and Robotics ; Intelligent Control Systems and Optimization ; Machine Learning in Control Applications

Abstract: This paper proposes a new algorithm for Temporal-Difference (TD) learning using online support vector regression. It benefits from the good generalization properties support vector regression (SVR) has, and also can do incremental learning and automatically track variation of environment with time-varying characteristics. Using the online SVR we can obtain good estimation of value function in TD learning in linear and nonlinear prediction problems. Experimental results demonstrate the effectiveness of the proposed method by comparison with others methods.

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.92.92.168

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Tanzarella Teixeira, H. and Pascoli Bottura, C. (2015). Temporal-Difference Learning - An Online Support Vector Regression Approach.In Proceedings of the 12th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO, ISBN 978-989-758-122-9, pages 318-323. DOI: 10.5220/0005572103180323

@conference{icinco15,
author={Hugo Tanzarella Teixeira. and Celso Pascoli Bottura.},
title={Temporal-Difference Learning - An Online Support Vector Regression Approach},
booktitle={Proceedings of the 12th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,},
year={2015},
pages={318-323},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005572103180323},
isbn={978-989-758-122-9},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,
TI - Temporal-Difference Learning - An Online Support Vector Regression Approach
SN - 978-989-758-122-9
AU - Tanzarella Teixeira, H.
AU - Pascoli Bottura, C.
PY - 2015
SP - 318
EP - 323
DO - 10.5220/0005572103180323

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.