
under Grant BK20131296, Grant BK20130639 and 
NSFC under Grant 61005051. The authors thank 
Lantmännen Danpo A/S for providing the chicken 
images. 
REFERENCES 
Alcantarilla, P., Bartoli, A. and A.Davison. KAZE 
Features. Proc. of ECCV, 214-227, 2012. 
Alcantarilla, P., Nuevo, J., Bartoli, A., Fast explicit 
diffusion for accelerated features in nonlinear scale 
spaces. Proc. of BMVC, 13.1-13.11, 2013. 
Battiti, R., Using Mutual Information for Selecting 
Features in Supervised Neural Net Learning. IEEE 
Trans. Neural Networks, 5(4):537-550, 1994.  
Bay, H., Ess, A., Tuytelaars, T, Van Gool, L., SURF: 
Speeded Up Robust Features. Computer Vision and 
Image Understanding, 110(3):346-359, 2008. 
Bo, L., Lai, K., Ren, X., Fox, D., Object Recognition with 
Hierarchical Kernel Descriptors. Proc. of CVPR, 
1:1729-1736, 2011. 
Bo, L., Ren, X., Fox, D., Kernel Descriptors for Visual 
Recognition. Proc. of NIPS, 244-252, 2010. 
Bo, L., Ren, X., Fox, D., Multipath sparse coding using 
hierarchical matching pursuit. Proc. of CVPR, 1:660-
667, 2013. 
Bo, L., Sminchisescu, C., Efficient Match Kernel between 
Sets of Features for Visual Recognition. Proc. of NIPS. 
1:135-143, 2009. 
Bosch, A., Zisserman, A., and Munoz, X., Image 
Classification using Random Forests and Ferns. Proc. 
of ICCV, 1:1-8, 2007. 
Boureau, Y.-L., Roux, N. L., Bach, F., Ponce, J., LeCun, 
Y., Ask the locals: Multi-way local pooling for image 
recognition. Proc. of ICCV, 1:2651–2658, 2011. 
Brown, G., Pocock, A., Zhao, M., Luján, M., Conditional 
likelihood maximisation: a unifying framework for 
information theoretic feature selection. The Journal of 
Machine Learning Research, 13(1):27-66, 2012. 
Cao, Y., Wang, C., Li, Z., Zhang, L., Spatial -bag-of-
features. Proc. of CVPR, 1:3352-3359, 2010. 
Ciresan, D., Meier, U., Schmidhuber, J., Multi-column 
Deep Neural Networks for Image Classification. Proc. 
of CVPR, 3642-3649, 2012. 
Dalal, N., Triggs, B., Histograms of oriented gradients for 
human detection. Proc. of CVPR, 1:886 -893, 2005. 
Everingham, M. L, Van Gool, C., Williams, K. I., Winn, 
J., and Zisserman, A., The pascal visual object classes 
(VOC) challenge. International Journal of Computer 
Vision, 88(2): 303–338, 2010. 
Feng, J., Ni, B., Tian, Q., Yan, S., Geometric p-norm 
feature pooling for image classification. Proc. of 
CVPR, 1:2697–2704, 2011. 
Gómez-Chova, L., Jenssen, R., Camps-Valls, G., Kernel 
Entropy Component Analysis for Remote Sensing 
Image Clustering. IEEE Geoscience and Remote 
Sensing Letters, 9(2):312-316, 2012. 
Goodfellow, I., Courville, A., Bengio, Y., Spike-and-Slab 
Sparse Coding for Unsupervised Feature Discovery, in 
NIPS Workshop on Challenges in Learning 
Hierarchical Models, 2011. 
Hellman, M.E., Raviv, J., Probability of error, 
equivocation, and the Chernoff bound. IEEE Trans. on 
Information Theory, 16:368–372, 1979. 
Hild II, K.E., Erdogmus, D., Principe, J.C., An Analysis of 
Entropy Estimators for Blind Source Separation. 
Signal Processing, 86(1):182-194, 2006. 
Hild II, K., Erdogmus, D., Torkkola, K., Principe, J., 
Feature Extraction Using Information-Theoretic 
Learning. IEEE Trans. Pattern Analysis and Machine 
Intelligence, 28(9):1385-1392, 2006. 
Jégou, H., Douze, M., Schmid, C., Packing bag-of-
features. Proc. of ICCV, 1:2357-2364, 2009. 
Jenssen, R., Kernel entropy component analysis. IEEE 
Trans. Pattern Analysis and Machine Intelligence, 
32(5):847–860, 2010. 
Jenssen, R., Eltoft, T., A new information theoretic 
analysis of sum-of-squared-error kernel clustering. 
Neurocomputing, 72(1-3):23-31, 2008. 
Jia, Y., Huang, C., Darrell, T., Beyond spatial pyramids: 
Receptive field learning for pooled image features. 
Proc. of CVPR, 1:3370–3377, 2012. 
Jiang, Z., Zhang, G., and Davis, L. S., Submodular 
dictionary learning for sparse coding. Proc. of CVPR, 
1:3418–3425, 2012. 
Kwak, N., Choi, C., Input Feature Selection by Mutual 
Information Based on Parzen Window. IEEE Trans. 
Pattern Analysis and Machine Intelligence, 
24(12):1667-1671, 2002. 
Lazebnik, S., Schmid, C., Ponce, J., Beyond bags of 
features: Spatial pyramid matching for recognizing 
natural scene categories. Proc. of CVPR, 1:2169-2178, 
2006. 
Le, Q., Ngiam, J., Chia, Z.C., Koh, P., Ng, A., Tiled 
convolutional neural networks. Proc. of NIPS, 1:1279-
1287, 2010. 
Leiva-Murillo, J., and Artes-Rodriguez, A., Information-
Theoretic Linear Feature Extraction based on Kernel 
Density Estimators: A Review. IEEE Trans. Systems, 
Man, and Cybernetics, Part C: Applications and 
Reviews, 42(6):1180-1189, 2012. 
Li, F., Fergus, R., and Perona, P., One-shot learning of 
object categories. IEEE Trans. Pattern Analysis and 
Machine Intelligence, 28(4):594–611, 2006. 
Liu, C., Shum, H., Kullback-Leibler boosting. Proc. of 
CVPR, 1:587-594, 2003. 
Liu, L., Wang, L., and Liu, X., In defense of soft-
assignment coding. Proc. of ICCV, 1:2486–2493, 
2011. 
Lowe, D., Distinctive image features from scale-invariant 
keypoints.  International Journal of Computer Vision, 
60(2):91-110, 2004. 
McCann, S., Lowe, D., Spatially local coding for object 
recognition. Proc. of ACCV, 2012. 
Ojala, T., Pietikäinen, M., Mäenpää, T., Multiresolution 
gray-scale and rotation invariant texture classification 
with local binary patterns. IEEE Trans. Pattern 
ICPRAM2015-InternationalConferenceonPatternRecognitionApplicationsandMethods
108