Por favor, use este identificador para citar o enlazar este ítem:https://uvadoc.uva.es/handle/10324/70416
Título
Optimizing an APSP implementation for NVIDIA GPUs using kernel characterization criteria
Año del Documento
2014
Editorial
Springer
Descripción
Producción Científica
Documento Fuente
The Journal of Supercomputing, Vol. 70, Issue 2, November 2014, pags. 786-798, ISSN 0920-8542
Resumen
During the last years, GPU manycore devices have demonstrated their usefulness to accelerate computationally intensive problems. Although arriving at a parallelization of a highly parallel algorithm is an affordable task, the optimization of GPU codes is a challenging activity. The main reason for this is the number of parameters, programming choices, and tuning techniques available, many of them related with complex and sometimes hidden architecture details. A useful strategy to systematically attack these optimization problems is to characterize the different kernels of the application, and use this knowledge to select appropriate configuration parameters. The All-Pair Shortest-Path (APSP) problem is a well-known problem in graph theory whose objective is to find the shortest paths between any pairs of nodes in a graph. This problem can be solved by highly parallel and computational intensive tasks, being a good candidate to be exploited by manycore devices. In this paper, we use kernel characterization criteria to optimize an APSP algorithm implementation for NVIDIA GPUs. Our experimental results show that the combined use of proper configuration policies, and the concurrent kernels capability of new CUDA architectures, leads to a performance improvement of up to 62 % with respect to one of the possible configurations recommended by CUDA, considered as baseline.
Materias (normalizadas)
Informática
Materias Unesco
1203 Ciencia de Los Ordenadores
3304 Tecnología de Los Ordenadores
Palabras Clave
APSP
Cache configuration
Concurrent kernel
GPU
Kernel characterization
Threadblock size
ISSN
0920-8542
Revisión por pares
SI
Patrocinador
This research has been partially supported by Ministerio de Economía y Competitividad (Spain) and ERDF program of the European Union: CAPAP-H4 network (TIN2011-15734-E), MOGECOPP project (TIN2011-25639); and Junta de Castilla y León (Spain) ATLAS project (VA172A12-2).
Version del Editor
Idioma
eng
Tipo de versión
info:eu-repo/semantics/publishedVersion
Derechos
openAccess
Aparece en las colecciones
Ficheros en el ítem