Recognizing Buildings through Deep Learning: A Case Study on Half-timbered Framed Buildings in Calw City

Konstantinos Makantasis, Nikolaos Doulamis, Athanasios Voulodimos

Abstract

Automatic detection and recognition of specific types of urban buildings is extremely important for a variety of applications ranging from outdoor urban reconstruction to navigation. In this paper we propose a system for the automatic detection and recognition of urban buildings. Most of the existing work relies on the exploitation of handcrafted features for recognizing buildings. However, due to their complex structure it is rarely a priori known which features are important for the recognition task. Our method overcomes this drawback by exploiting a deep learning framework, based on convolutional neural networks, which automatically construct highly descriptive features directly from raw data. We evaluate the performance of our method on the recognition of half-timbered framed buildings in Calw city in Germany.

References

  1. Ali, H., Paar, G., and Paletta, L. (2007a). Semantic indexing for visual recognition of buildings. In 5th Int. Symp. on Mobile Mapping Technology. Citeseer.
  2. Ali, H., Seifert, C., Jindal, N., Paletta, L., and Paar, G. (2007b). Window detection in facades. In Image Analysis and Processing, 2007. ICIAP 2007. 14th International Conference on, pages 837-842. IEEE.
  3. Baatz, G., K öser, K., Chen, D., Grzeszczuk, R., and Pollefeys, M. (2010). Handling urban location recognition as a 2d homothetic problem. In European Conference on Computer Vision, pages 266-279. Springer.
  4. Bengio, Y., Lamblin, P., Popovici, D., and Larochelle, H. (2007). Greedy layer-wise training of deep networks. In Advances in NIPS 19, pages 153-160.
  5. Bres, S. and Tellez, B. (2006). Localisation and augmented reality for mobile applications in culture heritage. Lyon: INSA.
  6. Friedman, J. H., Bentley, J. L., and Finkel, R. A. (1977). An algorithm for finding best matches in logarithmic expected time. ACM Transactions on Mathematical Software (TOMS), 3(3):209-226.
  7. Hinton, G., Osindero, S., and Teh, Y. (2006). A fast learning algorithm for deep belief nets. Neural Computation, 18(7):1527-1554.
  8. Hinton, G. E. and Salakhutdinov, R. R. (2006). Reducing the dimensionality of data with neural networks. Science, 313(5786):504-507.
  9. Lowe, D. G. (2004). Distinctive image features from scaleinvariant keypoints. International Journal of Computer Vision, 60(2):91-110.
  10. Makantasis, K., Karantzalos, K., Doulamis, A., and Loupos, K. (2015). Deep learning-based man-made object detection from hyperspectral data. In International Symposium on Visual Computing, pages 717- 727. Springer.
  11. Mnih, V. and Hinton, G. E. (2012). Learning to label aerial images from noisy data. In Proceedings of the 29th International Conference on Machine Learning, ICML 2012,. icml.cc / Omnipress.
  12. Montavon, G., Orr, G., and Muller, K.-R. (2012). Neural Networks: Tricks of the Trade. Springer Berlin Heidelberg.
  13. M üller, P., Zeng, G., Wonka, P., and Van Gool, L. (2007). Image-based procedural modeling of facades. ACM Transactions on Graphics (TOG), 26(3):85.
  14. Olojede, A. and Suleman, H. (2015). Investigating image processing algorithms for navigating cultural heritage spaces using mobile devices. In International Conference on Asian Digital Libraries, pages 215-224. Springer.
  15. Schindler, G., Krishnamurthy, P., Lublinerman, R., Liu, Y., and Dellaert, F. (2008). Detecting and matching repeated patterns for automatic geo-tagging in urban environments. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pages 1-7. IEEE.
  16. Shechtman, E. and Irani, M. (2007). Matching local selfsimilarities across images and videos. In 2007 IEEE Conference on Computer Vision and Pattern Recognition, pages 1-8. IEEE.
  17. Singhal, A., Luo, J., and Zhu, W. (2003). Probabilistic spatial context models for scene content understanding. In Computer Vision and Pattern Recognition, 2003. Proceedings. 2003 IEEE Computer Society Conference on, volume 1, pages I-235. IEEE.
  18. Tell, D. and Carlsson, S. (2000). Wide baseline point matching using affine invariants computed from intensity profiles. In European Conference on Computer Vision, pages 814-828. Springer.
  19. Tell, D. and Carlsson, S. (2002). Combining appearance and topology for wide baseline matching. In European Conference on Computer Vision, pages 68-81. Springer.
  20. Wendel, A., Donoser, M., and Bischof, H. (2010). Unsupervised facade segmentation using repetitive patterns. In Joint Pattern Recognition Symposium, pages 51-60. Springer.
  21. Xiao, J., Fang, T., Zhao, P., Lhuillier, M., and Quan, L. (2009). Image-based street-side city modeling. ACM Transactions on Graphics (TOG), 28(5):114.
Download


Paper Citation


in Harvard Style

Makantasis K., Doulamis N. and Voulodimos A. (2017). Recognizing Buildings through Deep Learning: A Case Study on Half-timbered Framed Buildings in Calw City . In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: CVICG4CULT, ISBN 978-989-758-226-4, pages 444-450. DOI: 10.5220/0006347204440450


in Bibtex Style

@conference{cvicg4cult17,
author={Konstantinos Makantasis and Nikolaos Doulamis and Athanasios Voulodimos},
title={Recognizing Buildings through Deep Learning: A Case Study on Half-timbered Framed Buildings in Calw City},
booktitle={Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: CVICG4CULT,},
year={2017},
pages={444-450},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006347204440450},
isbn={978-989-758-226-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: CVICG4CULT,
TI - Recognizing Buildings through Deep Learning: A Case Study on Half-timbered Framed Buildings in Calw City
SN - 978-989-758-226-4
AU - Makantasis K.
AU - Doulamis N.
AU - Voulodimos A.
PY - 2017
SP - 444
EP - 450
DO - 10.5220/0006347204440450