Por favor, use este identificador para citar o enlazar este ítem:https://uvadoc.uva.es/handle/10324/75090
Título
Using Fermi Architecture Knowledge to Speed up CUDA and OpenCL Programs
Congreso
IEEE 10th International Symposium on Parallel and Distributed Processing with Applications (ISPA), 2012
Año del Documento
2012
Editorial
IEEE
Descripción Física
8 p.
Descripción
Producción Científica
Documento Fuente
IEEE 10th International Symposium on Parallel and Distributed Processing with Applications (ISPA), 2012, At: Leganés, Madrid, Spain, p. 617-624
Abstract
The NVIDIA graphics processing units (GPUs) are playing an important role as general purpose programming devices. The implementation of parallel codes to exploit the GPU hardware architecture is a task for experienced programmers. The threadblock size and shape choice is one of the most important user decisions when a parallel problem is coded. The threadblock configuration has a significant impact on the global performance of the program. While in CUDA parallel programming model it is always necessary to specify the threadblock size and shape, the OpenCL standard also offers an automatic mechanism to take this delicate decision. In this paper we present a study of these criteria for Fermi architecture, introducing a general approach for threadblock choice, and showing that there is considerable room for improvement in OpenCL automatic strategy.
Materias (normalizadas)
Informática
Materias Unesco
1203
3304
Palabras Clave
GPGPU, automatic code tuning, Fermi, CUDA, OpenCL
Patrocinador
This research is partly supported by the Ministerio de Industria, Spain (CENIT OCEANLIDER), MICINN (Spain) and the European Union FEDER (CAPAP-H3 network TIN2010- 12011-E, TIN2011-25639), and the HPC-EUROPA2 project (project number: 228398) with the support of the European Commission - Capacities Area - Research Infrastructures Initiative.
Version del Editor
Idioma
eng
Tipo de versión
info:eu-repo/semantics/publishedVersion
Derechos
openAccess
Aparece en las colecciones
Files in questo item