dc.contributor.author | Cámara Moreno, Jesús | |
dc.contributor.author | Cuenca, Javier | |
dc.contributor.author | Giménez, Domingo | |
dc.contributor.author | García, Luis Pedro | |
dc.contributor.author | Vidal, Antonio M. | |
dc.date.accessioned | 2025-01-27T18:13:38Z | |
dc.date.available | 2025-01-27T18:13:38Z | |
dc.date.issued | 2014 | |
dc.identifier.citation | International Journal of Parallel Programming, 2014, Volume 42, Issue 3, Pages 408-434 | es |
dc.identifier.issn | 0885-7458 | es |
dc.identifier.uri | https://uvadoc.uva.es/handle/10324/74460 | |
dc.description | Scientific Production | es |
dc.description.abstract | The introduction of auto-tuning techniques in linear algebra shared-memory routines is analyzed. Information obtained during the installation of the routines is used at run time to make decisions that reduce the total execution time. The study is carried out with routines at different levels (matrix multiplication, LU and Cholesky factorizations, and routines for symmetric or general linear systems) and with calls to routines in the LAPACK and PLASMA libraries with multithreaded implementations. Medium NUMA and large cc-NUMA systems are used in the experiments. This variety of routines, libraries and systems allows us to draw general conclusions about the methodology to use for auto-tuning linear algebra shared-memory routines. Satisfactory execution times are obtained with the proposed methodology. | es |
dc.format.mimetype | application/pdf | es |
dc.language.iso | eng | es |
dc.publisher | Springer | es |
dc.rights.accessRights | info:eu-repo/semantics/restrictedAccess | es |
dc.subject | Parallel Computing | es |
dc.subject | Auto-Tuning | es |
dc.subject.classification | Linear Algebra Libraries | es |
dc.subject.classification | Empirical Installation | es |
dc.subject.classification | Shared-memory | es |
dc.subject.classification | Auto-tuning | es |
dc.title | Empirical Installation of Linear Algebra Shared-Memory Subroutines for Auto-Tuning | es |
dc.type | info:eu-repo/semantics/article | es |
dc.rights.holder | Springer Science Business Media New York | es |
dc.identifier.doi | 10.1007/s10766-013-0249-6 | es |
dc.relation.publisherversion | https://link.springer.com/article/10.1007/s10766-013-0249-6 | es |
dc.identifier.publicationfirstpage | 408 | es |
dc.identifier.publicationissue | 3 | es |
dc.identifier.publicationlastpage | 434 | es |
dc.identifier.publicationtitle | International Journal of Parallel Programming | es |
dc.identifier.publicationvolume | 42 | es |
dc.peerreviewed | Yes | es |
dc.description.project | This work has been partially funded by the Fundación Séneca, research project 08763/PI/08, PROMETEO/2009/013 of the Generalitat Valenciana, research project TIN2012-38341-C04-03 of the Ministerio de Educación y Ciencia, and the CAPAP-H Network (Computación de Altas Prestaciones sobre Arquitecturas Paralelas Heterogéneas) | es |
dc.identifier.essn | 1573-7640 | es |
dc.type.hasVersion | info:eu-repo/semantics/publishedVersion | es |
dc.subject.unesco | 1203 Computer Science | es |
dc.subject.unesco | 3304 Computer Technology | es |