Mostrar el registro sencillo del ítem

dc.contributor.authorTorres de la Sierra, Yuri 
dc.contributor.authorGonzález Escribano, Arturo 
dc.contributor.authorLlanos Ferraris, Diego Rafael 
dc.date.accessioned2025-03-12T09:16:53Z
dc.date.available2025-03-12T09:16:53Z
dc.date.issued2011
dc.identifier.citation2011 International Conference on High Performance Computing and Simulation (HPCS), Istanbul, Turkey, July 4-8 2011 Istambul, Turkeyes
dc.identifier.isbn978-1-61284-383-4es
dc.identifier.urihttps://uvadoc.uva.es/handle/10324/75305
dc.descriptionProducción Científicaes
dc.description.abstractWhile the correctness of an NVIDIA CUDA program is easy to achieve, exploiting the GPU capabilities to obtain the best performance possible is a task for CUDA experienced programmers. Typical code tuning strategies, like choosing an appropriate size and shape for the thread blocks, programming a good coalescing, or maximize occupancy, are inter-dependent. Moreover, the choices are also dependent on the underlying architecture details, and the global-memory access pattern of the designed solution. For example, the size and shapes of threadblocks are usually chosen to facilitate encoding (e.g. square shapes), while maximizing the multiprocessors' occupancy. How ever, this simple choice does not usually provide the best performance results. In this paper we discuss important relations between the size and shapes of threadblocks, occupancy, global memory access patterns, and other Fermi architecture features, such as the configuration of the new transparent cache. We present an insight based approach to tuning techniques, providing lines to understand the complex relations, and to easily avoid bad tuning settings.es
dc.format.extent6 p.es
dc.format.mimetypeapplication/pdfes
dc.language.isoenges
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.subjectInformáticaes
dc.subject.classificationGPU, Fermi, performance, code tuninges
dc.titleUnderstanding the Impact of CUDA Tuning Techniques for Fermies
dc.typeinfo:eu-repo/semantics/conferenceObjectes
dc.identifier.doi10.1109/HPCSim.2011.5999886es
dc.relation.publisherversionhttps://www.researchgate.net/publication/224255446_Understanding_the_Impact_of_CUDA_Tuning_Techniques_for_Fermies
dc.title.event2011 International Conference on High Performance Computing and Simulation (HPCS)es
dc.description.projectThis research is partly supported by the Ministerio de Industria, Spain (CENIT MARTA, CENIT OASIS, CENITOCEANLIDER), Ministerio de Ciencia y Tecnología (CAPAP-H3 network, TIN2010-12011-E), and the HPC-EUROPA2 project (project number: 228398) with the support of the European Commission - Capacities Area - Research Infrastructures Initiative.es
dc.type.hasVersioninfo:eu-repo/semantics/publishedVersiones
dc.subject.unesco1203 Ciencia de Los Ordenadoreses
dc.subject.unesco3304 Tecnología de Los Ordenadoreses


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem