Filter Learning from Deep Descriptors of a Fully Convolutional Siamese Network for Tracking in Videos

Hugo Chaves, Kevyn Ribeiro, André Brito, Hemerson Tacon, Marcelo Vieira, Augusto Cerqueira, Saulo Villela, Helena Maia, Darwin Concha, Helio Pedrini

Abstract

Siamese Neural Networks (SNNs) attracted the attention of the Visual Object Tracking community due to their relatively low computational cost and high efficacy to compare similarity between a reference and a candidate object to track its trajectory in a video over time. However, a video tracker that purely relies on an SNN might suffer from drifting due to changes in the target object. We propose a framework to take into account the changes of the target object in multiple time-based descriptors. In order to show its validity, we define long-term and short-term descriptors based on the first and the recent appearance of the object, respectively. Such memories are combined into a final descriptor that is the actual tracking reference. To compute the short-term memory descriptor, we estimate a filter bank through the usage of a genetic algorithm strategy. The final method has a low computational cost since it is applied through convolution operations along the tracking. According to the experiments performed in the widely used OTB50 dataset, our proposal improves the performance of an SNN dedicated to visual object tracking, being comparable to the state of the art methods.

Download


Paper Citation


in Harvard Style

Chaves H., Ribeiro K., Brito A., Tacon H., Vieira M., Cerqueira A., Villela S., Maia H., Concha D. and Pedrini H. (2020). Filter Learning from Deep Descriptors of a Fully Convolutional Siamese Network for Tracking in Videos.In Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, ISBN 978-989-758-402-2, pages 685-694. DOI: 10.5220/0008957606850694


in Bibtex Style

@conference{visapp20,
author={Hugo Chaves and Kevyn Ribeiro and André Brito and Hemerson Tacon and Marcelo Vieira and Augusto Cerqueira and Saulo Villela and Helena Maia and Darwin Concha and Helio Pedrini},
title={Filter Learning from Deep Descriptors of a Fully Convolutional Siamese Network for Tracking in Videos},
booktitle={Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP,},
year={2020},
pages={685-694},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008957606850694},
isbn={978-989-758-402-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP,
TI - Filter Learning from Deep Descriptors of a Fully Convolutional Siamese Network for Tracking in Videos
SN - 978-989-758-402-2
AU - Chaves H.
AU - Ribeiro K.
AU - Brito A.
AU - Tacon H.
AU - Vieira M.
AU - Cerqueira A.
AU - Villela S.
AU - Maia H.
AU - Concha D.
AU - Pedrini H.
PY - 2020
SP - 685
EP - 694
DO - 10.5220/0008957606850694