Please use this identifier to cite or link to this item: https://uvadoc.uva.es/handle/10324/74462
Title
Auto-tuned nested parallelism: A way to reduce the execution time of scientific software in NUMA systems
Document Year
2014
Publisher
Elsevier
Description
Scientific Production
Source Document
Parallel Computing, 2014, Volume 40, Issue 7, Pages 309-327
Abstract
The most computationally demanding scientific problems are solved on large parallel systems. In some cases these systems are Non-Uniform Memory Access (NUMA) multiprocessors made up of a large number of cores that share a hierarchically organized memory. The basic component of these scientific codes is often matrix multiplication, and the efficient development of other linear algebra packages relies directly on the matrix multiplication routine implemented in the BLAS library. The BLAS library is available as vendor-supplied packages or as free implementations. The latest versions of this library are multithreaded and can be used efficiently on multicore systems, but when they are called from inside parallel codes, the two levels of parallelism can interfere and degrade performance. In this work, an auto-tuning method is proposed to automatically select the optimal number of threads to use at each parallel level when multithreaded linear algebra routines are called from OpenMP parallel codes. The method is based on a simple but effective theoretical model of the execution time of the two-level routines. The methodology is applied to a two-level matrix–matrix multiplication and to different block matrix factorizations (LU, QR and Cholesky). Traditional schemes, which directly use the multithreaded BLAS routine dgemm, are compared with schemes that combine the multithreaded dgemm with OpenMP.
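The thread-selection idea described in the abstract can be illustrated with a minimal sketch. It assumes a machine with a fixed number of cores and enumerates every pair (OpenMP threads, BLAS threads) whose product fits on those cores, then picks the pair with the lowest modeled time. The function names, the cost formula, and the constants `t_flop`, `t_omp` and `t_blas` are all hypothetical placeholders chosen for illustration; they are not the model or the parameters used in the paper.

```python
def candidate_configs(total_cores):
    """Return all (omp_threads, blas_threads) pairs that fit on total_cores."""
    return [
        (p, q)
        for p in range(1, total_cores + 1)
        for q in range(1, total_cores + 1)
        if p * q <= total_cores
    ]


def modeled_time(n, omp_threads, blas_threads,
                 t_flop=1e-9, t_omp=5e-6, t_blas=2e-5):
    """Hypothetical cost model for an n x n matrix multiplication.

    Assumption (not taken from the paper): the 2*n^3 floating-point
    operations are divided evenly among all threads, and each level
    adds an overhead proportional to its own thread count.
    """
    compute = 2.0 * n**3 * t_flop / (omp_threads * blas_threads)
    overhead = t_omp * omp_threads + t_blas * blas_threads
    return compute + overhead


def select_threads(n, total_cores, model=modeled_time):
    """Pick the thread pair that minimizes the modeled execution time."""
    return min(candidate_configs(total_cores), key=lambda cfg: model(n, *cfg))


if __name__ == "__main__":
    for size in (10, 500, 4000):
        omp, blas = select_threads(size, total_cores=8)
        print(f"n={size}: OpenMP threads={omp}, BLAS threads={blas}")
```

In a real auto-tuner the constants would presumably be measured on the target machine at installation time, and the selected pair would then be applied through the OpenMP runtime and the BLAS library's own thread control before calling dgemm.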
Subjects (normalized)
Parallel Computing
Auto-Tuning
UNESCO Subjects
1203 Computer Science
3304 Computer Technology
Keywords
Auto-tuning
Linear Algebra
Performance Modeling
ISSN
0167-8191
Peer Review
Yes
Sponsor
This work is part of research project TIN2012-38341-C04-03, funded by the Ministerio de Economía (MINECO)
Publisher's Version
Rights Holder
Elsevier B.V.
Language
eng
Version Type
info:eu-repo/semantics/publishedVersion
Rights
restrictedAccess