loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Patrik Goorts 1 ; Sammy Rogmans 2 ; Steven Vanden Eynde 3 and Philippe Bekaert 1

Affiliations: 1 Hasselt University - tUL - IBBT, Belgium ; 2 Hasselt University - tUL - IBBT and IMEC, Belgium ; 3 Lessius Hogeschool – Campus De Nayer, Belgium

Keyword(s): CUDA, GPGPU, Optimization principles, Visual computing, Fermi.

Related Ontology Subjects/Areas/Topics: Architecture and Protocols ; Distributed Multimedia Systems ; Image and Video Processing, Compression and Segmentation ; Multimedia ; Multimedia and Communications ; Multimedia Signal Processing ; Multimedia Systems and Applications ; Multimodal Signal Processing ; Performance Measurement and Evaluation, Qos. ; Telecommunications

Abstract: In this paper, we provide examples to optimize signal processing or visual computing algorithms written for SIMT-based GPU architectures. These implementations demonstrate the optimizations for CUDA or its successors OpenCL and DirectCompute. We discuss the effect and optimization principles of memory coalescing, bandwidth reduction, processor occupancy, bank conflict reduction, local memory elimination and instruction optimization. The effect of the optimization steps are illustrated by state-of-the-art examples. A comparison with optimized and unoptimized algorithms is provided. A first example discusses the construction of joint histograms using shared memory, where optimizations lead to a significant speedup compared to the original implementation. A second example presents convolution and the acquired results.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.118.184.237

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Goorts, P.; Rogmans, S.; Vanden Eynde, S. and Bekaert, P. (2010). PRACTICAL EXAMPLES OF GPU COMPUTING OPTIMIZATION PRINCIPLES. In Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2010) - SIGMAP; ISBN 978-989-8425-19-5, SciTePress, pages 46-49. DOI: 10.5220/0002990400460049

@conference{sigmap10,
author={Patrik Goorts. and Sammy Rogmans. and Steven {Vanden Eynde}. and Philippe Bekaert.},
title={PRACTICAL EXAMPLES OF GPU COMPUTING OPTIMIZATION PRINCIPLES},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2010) - SIGMAP},
year={2010},
pages={46-49},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002990400460049},
isbn={978-989-8425-19-5},
}

TY - CONF

JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2010) - SIGMAP
TI - PRACTICAL EXAMPLES OF GPU COMPUTING OPTIMIZATION PRINCIPLES
SN - 978-989-8425-19-5
AU - Goorts, P.
AU - Rogmans, S.
AU - Vanden Eynde, S.
AU - Bekaert, P.
PY - 2010
SP - 46
EP - 49
DO - 10.5220/0002990400460049
PB - SciTePress