editors, Proceedings of the Fourteenth International
Conference on Artificial Intelligence and Statistics,
AISTATS 2011, Fort Lauderdale, USA, April 11-13,
2011, volume 15 of JMLR Proceedings, pages 215–
223. JMLR.org.
Cohen, T. and Welling, M. (2016). Group equivariant con-
volutional networks. In Balcan, M. and Weinberger,
K. Q., editors, Proceedings of the 33rd International
Conference on Machine Learning, ICML 2016, New
York City, NY, USA, June 19-24, 2016, volume 48 of
JMLR Workshop and Conference Proceedings, pages
2990–2999. JMLR.org.
Cohen, T. S. and Welling, M. (2017). Steerable CNNs.
In 5th International Conference on Learning Rep-
resentations, ICLR 2017, Toulon, France, April 24-
26, 2017, Conference Track Proceedings. OpenRe-
view.net.
Doersch, C., Gupta, A., and Efros, A. A. (2015). Unsuper-
vised visual representation learning by context predic-
tion. In 2015 IEEE International Conference on Com-
puter Vision, ICCV 2015, Santiago, Chile, December
7-13, 2015, pages 1422–1430. IEEE Computer Soci-
ety.
Donsker, M. D. and Varadhan, S. S. (1975). Asymptotic
evaluation of certain Markov process expectations for
large time, I. Communications on Pure and Applied
Mathematics, 28(1):1–47.
Dosovitskiy, A., Fischer, P., Springenberg, J. T., Riedmiller,
M. A., and Brox, T. (2016). Discriminative unsu-
pervised feature learning with exemplar convolutional
neural networks. IEEE Trans. Pattern Anal. Mach. In-
tell., 38(9):1734–1747.
Gidaris, S., Singh, P., and Komodakis, N. (2018). Unsuper-
vised representation learning by predicting image ro-
tations. In 6th International Conference on Learning
Representations, ICLR 2018, Vancouver, BC, Canada,
April 30 - May 3, 2018, Conference Track Proceed-
ings. OpenReview.net.
Heaton, J. (2018). Ian Goodfellow, Yoshua Bengio, and
Aaron Courville: Deep learning - The MIT Press, 2016,
800 pp, ISBN: 0262035618. Genet. Program. Evolv-
able Mach., 19(1-2):305–307.
Hinton, G. E., Krizhevsky, A., and Wang, S. D. (2011).
Transforming auto-encoders. In Honkela, T., Duch,
W., Girolami, M. A., and Kaski, S., editors, Artifi-
cial Neural Networks and Machine Learning - ICANN
2011 - 21st International Conference on Artificial
Neural Networks, Espoo, Finland, June 14-17, 2011,
Proceedings, Part I, volume 6791 of Lecture Notes in
Computer Science, pages 44–51. Springer.
Kingma, D. P. and Ba, J. (2015). Adam: A method for
stochastic optimization. In Bengio, Y. and LeCun,
Y., editors, 3rd International Conference on Learn-
ing Representations, ICLR 2015, San Diego, CA, USA,
May 7-9, 2015, Conference Track Proceedings.
Kingma, D. P. and Welling, M. (2013). Auto-encoding variational
Bayes. arXiv preprint arXiv:1312.6114.
Krizhevsky, A., Hinton, G., et al. (2009). Learning multiple
layers of features from tiny images.
Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2017). ImageNet
classification with deep convolutional neural
networks. Commun. ACM, 60(6):84–90.
Lenssen, J. E., Fey, M., and Libuschewski, P. (2018). Group
equivariant capsule networks. In Bengio, S., Wallach,
H. M., Larochelle, H., Grauman, K., Cesa-Bianchi,
N., and Garnett, R., editors, Advances in Neural
Information Processing Systems 31: Annual Con-
ference on Neural Information Processing Systems
2018, NeurIPS 2018, December 3-8, 2018, Montréal,
Canada, pages 8858–8867.
Lin, M., Chen, Q., and Yan, S. (2013). Network in network.
arXiv preprint arXiv:1312.4400.
Nguyen, X., Wainwright, M. J., and Jordan, M. I. (2010).
Estimating divergence functionals and the likelihood
ratio by convex risk minimization. IEEE Trans. Inf.
Theory, 56(11):5847–5861.
Noroozi, M. and Favaro, P. (2016). Unsupervised learning
of visual representations by solving jigsaw puzzles.
In Leibe, B., Matas, J., Sebe, N., and Welling, M.,
editors, Computer Vision - ECCV 2016 - 14th Euro-
pean Conference, Amsterdam, The Netherlands, Octo-
ber 11-14, 2016, Proceedings, Part VI, volume 9910
of Lecture Notes in Computer Science, pages 69–84.
Springer.
Noroozi, M., Pirsiavash, H., and Favaro, P. (2017). Repre-
sentation learning by learning to count. In IEEE Inter-
national Conference on Computer Vision, ICCV 2017,
Venice, Italy, October 22-29, 2017, pages 5899–5907.
IEEE Computer Society.
Poole, B., Ozair, S., van den Oord, A., Alemi, A., and
Tucker, G. (2019). On variational bounds of mu-
tual information. In Chaudhuri, K. and Salakhutdi-
nov, R., editors, Proceedings of the 36th International
Conference on Machine Learning, ICML 2019, 9-15
June 2019, Long Beach, California, USA, volume 97
of Proceedings of Machine Learning Research, pages
5171–5180. PMLR.
Qi, G. (2019). Learning generalized transformation equiv-
ariant representations via autoencoding transforma-
tions. CoRR, abs/1906.08628.
Qi, G., Zhang, L., Chen, C. W., and Tian, Q. (2019). AVT:
unsupervised learning of transformation equivariant
representations by autoencoding variational transfor-
mations. In 2019 IEEE/CVF International Confer-
ence on Computer Vision, ICCV 2019, Seoul, Korea
(South), October 27 - November 2, 2019, pages 8129–
8138. IEEE.
Schmidt, M., Roux, N. L., and Bach, F. R. (2017). Minimiz-
ing finite sums with the stochastic average gradient.
Math. Program., 162(1-2):83–112.
van den Oord, A., Li, Y., and Vinyals, O. (2018). Repre-
sentation learning with contrastive predictive coding.
CoRR, abs/1807.03748.
Wang, D. and Liu, Q. (2018). An optimization view on
dynamic routing between capsules. In 6th Interna-
tional Conference on Learning Representations, ICLR
2018, Vancouver, BC, Canada, April 30 - May 3, 2018,
Workshop Track Proceedings. OpenReview.net.
Zhang, L., Qi, G., Wang, L., and Luo, J. (2019). AET vs.
AED: unsupervised representation learning by auto-
encoding transformations rather than data. In IEEE
ICPRAM 2022 - 11th International Conference on Pattern Recognition Applications and Methods