Supervised Person Re-ID based on Deep Hand-crafted and CNN Features

Salma Ksibi, Mahmoud Mejdoub, Chokri Ben Amar

2018

Abstract

Gaussian Fisher Vector (GFV) encoding is an extension of the conventional Fisher Vector (FV) that effectively discards the noisy background information by localizing the pedestrian position in the image. Nevertheless, GFV can only provide a shallow description of the pedestrian features. In order to capture more complex structural information, we propose in this paper a layered extension of GFV that we called LGFV. The representation is based on two nested layers that hierarchically refine the FV encoding from one layer to the next by integrating more spatial neighborhood information. Besides, we present in this paper a new rich multi-level semantic pedestrian representation built simultaneously upon complementary deep hand-crafted and deep Convolutional Neural Network (CNN) features. The deep hand-crafted feature is depicted by the combination of GFV mid-level features and high-level LGFV ones while a deep CNN feature is obtained by learning in a classification mode an effective embedding of the raw pedestrian pixels. The proposed deep hand-crafted features produce competitive accuracy with respect to the deep CNN ones without needing neither pre-training nor data augmentation, and the proposed multi-level representation further boosts the re-ID performance.

Download


Paper Citation


in Harvard Style

Ksibi S., Mejdoub M. and Ben Amar C. (2018). Supervised Person Re-ID based on Deep Hand-crafted and CNN Features. In Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018) - Volume 4: VISAPP; ISBN 978-989-758-290-5, SciTePress, pages 63-74. DOI: 10.5220/0006625400630074


in Bibtex Style

@conference{visapp18,
author={Salma Ksibi and Mahmoud Mejdoub and Chokri Ben Amar},
title={Supervised Person Re-ID based on Deep Hand-crafted and CNN Features},
booktitle={Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018) - Volume 4: VISAPP},
year={2018},
pages={63-74},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006625400630074},
isbn={978-989-758-290-5},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018) - Volume 4: VISAPP
TI - Supervised Person Re-ID based on Deep Hand-crafted and CNN Features
SN - 978-989-758-290-5
AU - Ksibi S.
AU - Mejdoub M.
AU - Ben Amar C.
PY - 2018
SP - 63
EP - 74
DO - 10.5220/0006625400630074
PB - SciTePress