An Efficient Dual Dimensionality Reduction Scheme of Features for Image Classification

Hai-Xia Long, Li Zhou, Qiang Zhang, Jing Zhang, Xiao-Guang Li

2016

Abstract

The statistical property of Bag of Word (BoW) model and spatial property of Spatial Pyramid Matching (SPM) are usually used to improve distinguishing ability of features by adding redundant information for image classification. But the increasing of the image feature dimension will cause “curse of dimensionality” problem. To address this issue, a dual dimensionality reduction scheme that combines Locality Preserving Projection (LPP) with the Principal Component Analysis (PCA) has been proposed in the paper. Firstly, LPP has been used to reduce the feature dimensions of each SPM and each dimensionality reduced feature vector is cascaded into a global vector. After that, the dimension of the global vector is reduced by PCA. The experimental results on four standard image classification databases show that, compared with the benchmark ScSPM( Sparse coding based Spatial Pyramid Matching), when the dimension of image features is reduced to only 5% of that of the baseline scheme, the classification performance of the dual dimensionality reduction scheme proposed in this paper still can be improved about 5%.

References

  1. Xie L, Tian Q, Wang M, et al. Spatial pooling of heterogeneous features for image classification. IEEE Transactions on Image Processing, 2014 (23): 1994- 2008.
  2. Yang J, Yu K, Gong Y, et al. Linear spatial pyramid matching using sparse coding for image classification. Computer Vision and Pattern Recognition, 2009: 1794-1801.
  3. Lazebnik S, Schmid C, Ponce J. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on. 2006, 2: 2169-2178.
  4. R Bellman. Adaptive Control Processes:A Guided Tour 1961.
  5. S. Gu, L. Zhang, W. Zuo, and X. Feng. Projective Dictionary Pair Learning for Pattern Classification. In NIPS 2014.
  6. Li L J, Su H, Fei-Fei L, et al. Object bank: A high-level image representation for scene classifica-tion & semantic feature sparsification. Advances in neural information processing systems. 2010: 1378-1386.
  7. Niyogi X. Locality preserving projections. Neural information processing systems. MIT, 2004, 16: 153.
  8. Zhang C, Xiao X, Pang J, et al. Beyond visual word ambiguity: Weighted local feature encoding with governing region. Journal of Visual Communication and Image Representation, 2014, 25(6): 1387-1398.
  9. Zhang C, Liang C, Pang J, et al. Undoing the codebook bias by linear transformation with sparsity and F-norm constraints for image classification. Pattern Recognition Letters, 2014, 45: 197-204.
  10. Lei B, Tan E L, Chen S, et al. Saliency-driven image classification method based on histogram mining and image score. Pattern Recognition, 2015, 48(8): 2567- 2580.
  11. Wang X, Ma J, Xu M. Image Classification Using Sparse Coding and Spatial Pyramid Matching. 2014 International Conference on e-Education, e-Business and Information Management. Atlantis Press, 2014.
  12. Yan S, Xu X, Xu D, et al. Image classification with densely sampled image windows and generalized adaptive multiple kernel learning. Cybernetics, IEEE Transactions on, 2015, 45(3): 395-404.
  13. Yang Y B, Zhu Q H, Mao X J, et al. Visual feature coding for image classification integrating dictionary structure. Pattern Recognition, 2015.
  14. Fei-Fei L, Fergus R, Perona P. Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. Computer Vision and Image Understanding, 2007, 106(1): 59-70.
  15. Lazebnik S, Schmid C, Ponce J. Semi-local affine parts for object recognition. British Machine Vision Conference (BMVC'04). 2004: 779-788.
  16. Griffin G, Holub A, Perona P. Caltech-256 object category dataset. California Institute of Technology (2007). Supplied as additional material tr. 5(6).
  17. Van Gemert J C, Geusebroek J M, Veenman C J, et al. Kernel codebooks for scene categorization. Computer Vision-ECCV 2008. Springer Berlin Heidelberg, 2008: 696-709.
  18. Wang J, Yang J, Yu K, et al. Locality-constrained linear coding for image classification. Computer Vision and Pattern Recognition (CVPR), 2010: 3360-3367.
  19. Luo Hui-lan, Guo Min-Jie, Kong Fan-Sheng. Image Classification Method by Combing Multi-feature and Sparse Coding. Pattern Recognition and Artificial Intelligence, 2014,27 (4): 345-355.
  20. Gao S, Tsang I W, Chia L T, et al. Local features are not lonely-Laplacian sparse coding for image classification. Computer Vision and Pattern Recognition (CVPR), 2010: 3555-3561.
  21. Gao S, Tsang I W, Chia L T. Sparse representation with kernel. Image Processing, IEEE Transactions on, 2013, 22(2): 423-434.
  22. Zhang C, Liu J, Tian Q, et al. Beyond visual features: A weak semantic image representation using exemplar classifiers for classification. Neuro-computing, 2013, 120: 318-324.
  23. Li L J, Su H, Fei-Fei L, et al. Object bank: A high-level image representation for scene classification & semantic feature sparsification. Advances in neural information processing systems. 2010: 1378-1386.
Download


Paper Citation


in Harvard Style

Long H., Zhou L., Zhang Q., Zhang J. and Li X. (2016). An Efficient Dual Dimensionality Reduction Scheme of Features for Image Classification . In Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2016) ISBN 978-989-758-175-5, pages 672-678. DOI: 10.5220/0005787506720678


in Bibtex Style

@conference{visapp16,
author={Hai-Xia Long and Li Zhou and Qiang Zhang and Jing Zhang and Xiao-Guang Li},
title={An Efficient Dual Dimensionality Reduction Scheme of Features for Image Classification},
booktitle={Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2016)},
year={2016},
pages={672-678},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005787506720678},
isbn={978-989-758-175-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2016)
TI - An Efficient Dual Dimensionality Reduction Scheme of Features for Image Classification
SN - 978-989-758-175-5
AU - Long H.
AU - Zhou L.
AU - Zhang Q.
AU - Zhang J.
AU - Li X.
PY - 2016
SP - 672
EP - 678
DO - 10.5220/0005787506720678