Understanding the Impact of CUDA Tuning Techniques for Fermi

Torres de la Sierra, Yuri; González Escribano, Arturo; Llanos Ferraris, Diego Rafael

doi:10.1109/HPCSim.2011.5999886

Título

Understanding the Impact of CUDA Tuning Techniques for Fermi

dc.contributor.author	Torres de la Sierra, Yuri
dc.contributor.author	González Escribano, Arturo
dc.contributor.author	Llanos Ferraris, Diego Rafael
dc.date.accessioned	2025-03-12T09:16:53Z
dc.date.available	2025-03-12T09:16:53Z
dc.date.issued	2011
dc.identifier.citation	2011 International Conference on High Performance Computing and Simulation (HPCS), Istanbul, Turkey, July 4-8 2011 Istambul, Turkey	es
dc.identifier.isbn	978-1-61284-383-4	es
dc.identifier.uri	https://uvadoc.uva.es/handle/10324/75305
dc.description	Producción Científica	es
dc.description.abstract	While the correctness of an NVIDIA CUDA program is easy to achieve, exploiting the GPU capabilities to obtain the best performance possible is a task for CUDA experienced programmers. Typical code tuning strategies, like choosing an appropriate size and shape for the thread blocks, programming a good coalescing, or maximize occupancy, are inter-dependent. Moreover, the choices are also dependent on the underlying architecture details, and the global-memory access pattern of the designed solution. For example, the size and shapes of threadblocks are usually chosen to facilitate encoding (e.g. square shapes), while maximizing the multiprocessors' occupancy. How ever, this simple choice does not usually provide the best performance results. In this paper we discuss important relations between the size and shapes of threadblocks, occupancy, global memory access patterns, and other Fermi architecture features, such as the configuration of the new transparent cache. We present an insight based approach to tuning techniques, providing lines to understand the complex relations, and to easily avoid bad tuning settings.	es
dc.format.extent	6 p.	es
dc.format.mimetype	application/pdf	es
dc.language.iso	eng	es
dc.rights.accessRights	info:eu-repo/semantics/openAccess	es
dc.subject	Informática	es
dc.subject.classification	GPU, Fermi, performance, code tuning	es
dc.title	Understanding the Impact of CUDA Tuning Techniques for Fermi	es
dc.type	info:eu-repo/semantics/conferenceObject	es
dc.identifier.doi	10.1109/HPCSim.2011.5999886	es
dc.relation.publisherversion	https://www.researchgate.net/publication/224255446_Understanding_the_Impact_of_CUDA_Tuning_Techniques_for_Fermi	es
dc.title.event	2011 International Conference on High Performance Computing and Simulation (HPCS)	es
dc.description.project	This research is partly supported by the Ministerio de Industria, Spain (CENIT MARTA, CENIT OASIS, CENITOCEANLIDER), Ministerio de Ciencia y Tecnología (CAPAP-H3 network, TIN2010-12011-E), and the HPC-EUROPA2 project (project number: 228398) with the support of the European Commission - Capacities Area - Research Infrastructures Initiative.	es
dc.type.hasVersion	info:eu-repo/semantics/publishedVersion	es
dc.subject.unesco	1203 Ciencia de Los Ordenadores	es
dc.subject.unesco	3304 Tecnología de Los Ordenadores	es

Fichier(s) constituant ce document

Nom:: weha.pdf
Taille:: 118.7Ko
Format:: PDF

Voir/Ouvrir

Ce document figure dans la(les) collection(s) suivante(s)

DEP41 - Comunicaciones a congresos, conferencias, etc. [104]

Afficher la notice abrégée