Lazy Work Stealing for Continuous Hierarchy Traversal on Deformable Bodies

Vinícius da Silva, Claudio Esperança, Ricardo Guerra Marroquim

2014

Abstract

This study presents the results of research in dynamic load balancing for Continuous Collision Detection (CCD) using Bounding Volumes Hierarchies (BVHs) on Graphics Processing Units (GPUs). Hierarchy traversal is a challenging problem for GPU computing, since the work load of traversal has a very dynamic nature. Current research resulted in methods to dynamically balance load as the traversal is evaluated. Unfortunately, current grid-based GPU computing interfaces are not well suited for this type of computing and load balancing code can generate excessive overhead. This work presents a novel algorithm to address some of the most glaring problems. The algorithm uses the new concept of lazy work stealing, which tries to get the most out of the parallel capabilities of GPUs by greedy work stealing and lazy work evaluation. Also, the algorithm is designed to augment shared memory usage per block and diminish CPU-GPU context exchange penalties.

References

  1. Arora, N. S., Blumofe, R. D., and Plaxton, C. G. (1998). Thread scheduling for multiprogrammed multiprocessors. In Proceedings of the tenth annual ACM symposium on Parallel algorithms and architectures, SPAA 7898, pages 119-129, New York, NY, USA. ACM.
  2. Cederman, D. and Tsigas, P. (2009). On sorting and load balancing on GPUs. SIGARCH Comput. Archit. News, 36(5):11-18.
  3. Curtis, S., Tamstorf, R., and Manocha, D. (2008). Fast collision detection for deformable models using representative-triangles. In Haines, E. and McGuire, M., editors, Proceedings of the 2008 Symposium on Interactive 3D Graphics, SI3D 2008, February 15-17, 2008, Redwood City, CA, USA, pages 61-69. ACM.
  4. Ericson, C. (2004). Real-Time Collision Detection (The Morgan Kaufmann Series in Interactive 3-D Technology) (The Morgan Kaufmann Series in Interactive 3D Technology). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.
  5. Gottschalk, S., Lin, M. C., and Manocha, D. (1996). Obbtree: a hierarchical structure for rapid interference detection. In Proceedings of the 23rd annual conference on Computer graphics and interactive techniques, SIGGRAPH 7896, pages 171-180, New York, NY, USA. ACM.
  6. Group, K. (2012). The opencl specification version: 1.2 document revision: 19.
  7. Heo, J.-P., Seong, J.-K., Kim, D., Otaduy, M. A., Hong, J.-M., Tang, M., and Yoon, S.-E. (2010). Fastcd: Fracturing-aware stable collision detection. In Popovic, Z. and Otaduy, M. A., editors, Proceedings of the 2010 Eurographics/ACM SIGGRAPH Symposium on Computer Animation, SCA 2010, Madrid, Spain, 2010, pages 149-158. Eurographics Association.
  8. Kim, D., Heo, J.-P., Huh, J., Kim, J., and Yoon, S.-E. (2009a). HPCCD: Hybrid parallel continuous collision detection using cpus and gpus. Computer Graphics Forum (Pacific Graphics).
  9. Kim, D., Heo, J.-P., Huh, J., Kim, J., and Yoon, S.-E. (2009b). Hpccd: Hybrid parallel continuous collision detection using cpus and gpus. Computer Graphics Forum (Pacific Graphics).
  10. Kim, D., Heo, J.-P., and Yoon, S.-e. (2009c). Pccd: parallel continuous collision detection. In SIGGRAPH 7809: Posters, SIGGRAPH 7809, pages 50:1-50:1, New York, NY, USA. ACM.
  11. Lauterbach, C., Mo, Q., and Manocha, D. (2010). gproximity: Hierarchical gpu-based operations for collision and distance queries. In Proceedings of Eurographics 2010.
  12. Ni, T. (2009). Directcompute.
  13. Nickolls, J. and Dally, W. J. (2010). The gpu computing era. IEEE Micro, 30(2):56-69.
  14. Provot, X. (1997). Collision and self-collision handling in cloth model dedicated to design garments. In Graphics Interface 97, pages 177-179.
  15. Sengupta, S., Harris, M., Zhang, Y., and Owens, J. D. (2007). Scan primitives for gpu computing. In Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware, GH 7807, pages 97-106, Aire-la-Ville, Switzerland, Switzerland. Eurographics Association.
  16. Tang, M., Curtis, S., Yoon, S.-E., and Manocha, D. (2008). Interactive continuous collision detection between deformable models using connectivity-based culling. In SPM 7808: Proceedings of the 2008 ACM symposium on Solid and physical modeling, pages 25-36, New York, NY, USA. ACM.
  17. Tang, M., Manocha, D., Lin, J., and Tong, R. (2011). Collision-streams: Fast GPU-based collision detection for deformable models. In I3D 7811: Proceedings of the 2011 ACM SIGGRAPH symposium on Interactive 3D Graphics and Games, pages 63-70.
  18. Tang, M., Manocha, D., and Tong, R. (2010a). Fast continuous collision detection using deforming nonpenetration filters. In Proceedings of the 2010 ACM SIGGRAPH symposium on Interactive 3D Graphics and Games, I3D 7810, pages 7-13, New York, NY, USA. ACM.
  19. Tang, M., Manocha, D., and Tong, R. (2010b). Mccd: Multi-core collision detection between deformable models using front-based decomposition. Graphical Models, 72(2):7-23.
  20. Teschner, M., Kimmerle, S., Zachmann, G., Heidelberger, B., Raghupathi, L., Fuhrmann, A., Cani, M.-P., Faure, F., Magnetat-Thalmann, N., and Strasser, W. (2004). Collision detection for deformable objects.
Download


Paper Citation


in Harvard Style

Silva V., Esperança C. and Guerra Marroquim R. (2014). Lazy Work Stealing for Continuous Hierarchy Traversal on Deformable Bodies . In Proceedings of the 9th International Conference on Computer Graphics Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2014) ISBN 978-989-758-002-4, pages 430-437. DOI: 10.5220/0004656904300437


in Bibtex Style

@conference{grapp14,
author={Vinícius da Silva and Claudio Esperança and Ricardo Guerra Marroquim},
title={Lazy Work Stealing for Continuous Hierarchy Traversal on Deformable Bodies},
booktitle={Proceedings of the 9th International Conference on Computer Graphics Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2014)},
year={2014},
pages={430-437},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004656904300437},
isbn={978-989-758-002-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Graphics Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2014)
TI - Lazy Work Stealing for Continuous Hierarchy Traversal on Deformable Bodies
SN - 978-989-758-002-4
AU - Silva V.
AU - Esperança C.
AU - Guerra Marroquim R.
PY - 2014
SP - 430
EP - 437
DO - 10.5220/0004656904300437