Efficient Multi-kernel Ray Tracing for GPUs

Thomas Schiffer, Dieter W. Fellner

2014

Abstract

Images with high visual quality are often generated by a ray tracing algorithm. Although ray tracing is conceptually simple, designing an efficient mapping of its computations to massively parallel hardware architectures is a challenging task. In this paper we investigate the performance of state-of-the-art ray traversal algorithms for bounding volume hierarchies on GPUs and discuss their potential and limitations. Based on this analysis, we propose a novel ray traversal scheme called batch tracing. It decomposes the task into multiple kernels, each of which is designed for efficient parallel execution. Our algorithm achieves performance comparable to currently prevailing approaches and represents a promising avenue for future research.


Paper Citation


in Harvard Style

Schiffer T. and Fellner D. (2014). Efficient Multi-kernel Ray Tracing for GPUs. In Proceedings of the 9th International Conference on Computer Graphics Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2014) ISBN 978-989-758-002-4, pages 209-217. DOI: 10.5220/0004703502090217


in Bibtex Style

@conference{grapp14,
author={Thomas Schiffer and Dieter W. Fellner},
title={Efficient Multi-kernel Ray Tracing for GPUs},
booktitle={Proceedings of the 9th International Conference on Computer Graphics Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2014)},
year={2014},
pages={209-217},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004703502090217},
isbn={978-989-758-002-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Graphics Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2014)
TI - Efficient Multi-kernel Ray Tracing for GPUs
SN - 978-989-758-002-4
AU - Schiffer T.
AU - Fellner D.
PY - 2014
SP - 209
EP - 217
DO - 10.5220/0004703502090217