Hierarchical Feature Extraction using Partial Least Squares Regression and Clustering for Image Classification

Ryoma Hasegawa; Kazuhiro Hotta

doi:10.5220/0006254303900395

Hierarchical Feature Extraction using Partial Least Squares Regression and Clustering for Image Classification

Ryoma Hasegawa, Kazuhiro Hotta

2017

Abstract

In this paper, we propose an image classification method using Partial Least Squares regression (PLS) and clustering. PLSNet is a simple network using PLS for image classification and obtained high accuracies on the MNIST and CIFAR-10 datasets. It crops a lot of local regions from training images as explanatory variables, and their class labels are used as objective variables. Then PLS is applied to those variables, and some filters are obtained. However, there are a variety of local regions in each class, and intra-class variance is large. Therefore, we consider that local regions in each class should be divided and handled separately. In this paper, we apply clustering to local regions in each class and make a set from a cluster of all classes. There are some sets whose number is the number of clusters. Then we apply PLSNet to each set. By doing the processes, we obtain some feature vectors per image. Finally, we train SVM for each feature vector and classify the images by voting the result of SVM. Our PLSNet obtained 82.42% accuracy on the CIFAR-10 dataset. This accuracy is 1.69% higher than PLSNet without clustering and an attractive result of the methods without CNN.

References

Dudani, S, A., 1976. The distance-weighted k-nearestneighbor rule. In IEEE Transactions on Systems, Man, and Cybernetics.
Vapnik, V., 1998. Statistical learning theory, Wiley. New York.
Wold, H., 1985. Partial least squares, Wiley. New York.
Hasegawa, R. and Hotta, K., 2016. Plsnet: a simple network using partial least squares regression for image classification. In International Conference on Pattern Recognition.
Badrinarayanan, V., Kendall, A. and Cipolla, R., 2015. Segnet: a deep convolutional encoder-decoder architecture for image segmentation. In International Conference on Computer Vision.
Krizhevsky, A., Sutskever, I. and Hinton, G. E., 2012. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems 25.
Schwartz, W, R., Kembhavi, A. and Davis, L, S., 2009. Human detection using partial least squares analysis. In International Conference on Computer Vision.
Shelhamer, E., Long, J. and Darrell, T., 2015. Fully convolutional networks for semantic segmentation. In IEEE Conference on Computer Vision and Pattern Recognition.
Girshick, R., Donahue, J., Darrell, T. and Malik, J., 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In IEEE Conference on Computer Vision and Pattern Recognition.
He, K., Zhang, X., Ren, S. and Sun, J., 2014. Spatial pyramid pooling in deep convolutional networks for visual recognition. In European Conference on Computer Vision.
Lecun, Y., Bottou, L., Bengio, Y. and Haffner, P., 1998. Gradient-based learning applied to document recognition. In Proceedings of the IEEE.
Oquab, M., Bottou, L., Laptev, I. and Sivic, J., 2014. Learning and transferring mid-level image representations using convolutional neural networks. In IEEE Conference on Computer Vision and Pattern Recognition.
Taigman, Y., Yang, M., Ranzato, M. and Wolf, L., 2014. Deepface: closing the gap to human-level performance in face verification. In IEEE Conference on Computer Vision and Pattern Recognition.
Chan, T., Jia, K., Gao, S., Lu, J., Zeng, Z. and Ma, Y., 2014. Pcanet: a simple deep learning baseline for image classification? In IEEE Transactions on Image Processing.
Deng, J., Dong, W., Socher, R., Li, L., Li, K. and Fei-Fei, L., 2009. Imagenet: a large-scale hierarchical image database. In IEEE Conference on Computer Vision and Pattern Recognition.
Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R. and Fei-Fei, L., 2014. Large-scale video classification with convolutional neural networks. In IEEE Conference on Computer Vision and Pattern Recognition.
Xiao, T., Xu, Y., Yang, K., Zhang, J., Peng, Y. and Zhang, Z., 2015. The application of two-level attention models in deep convolutional neural network for finegrained image classification. In IEEE Conference on Computer Vision and Pattern Recognition.
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V. and Rabinovich, A., 2015. Going deeper with convolutions. In IEEE Conference on Computer Vision and Pattern Recognition.

Download

Paper Citation

in Harvard Style

Hasegawa R. and Hotta K. (2017). Hierarchical Feature Extraction using Partial Least Squares Regression and Clustering for Image Classification . In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, (VISIGRAPP 2017) ISBN 978-989-758-226-4, pages 390-395. DOI: 10.5220/0006254303900395

in Bibtex Style

@conference{visapp17,
author={Ryoma Hasegawa and Kazuhiro Hotta},
title={Hierarchical Feature Extraction using Partial Least Squares Regression and Clustering for Image Classification},
booktitle={Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, (VISIGRAPP 2017)},
year={2017},
pages={390-395},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006254303900395},
isbn={978-989-758-226-4},
}

in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, (VISIGRAPP 2017)
TI - Hierarchical Feature Extraction using Partial Least Squares Regression and Clustering for Image Classification
SN - 978-989-758-226-4
AU - Hasegawa R.
AU - Hotta K.
PY - 2017
SP - 390
EP - 395
DO - 10.5220/0006254303900395