Afficher la notice abrégée

dc.contributor.authorCastro Caballero, Manuel De 
dc.contributor.authorSantamaria Valenzuela, Inmaculada
dc.contributor.authorTorres de la Sierra, Yuri 
dc.contributor.authorGonzález Escribano, Arturo 
dc.contributor.authorLlanos Ferraris, Diego Rafael 
dc.date.accessioned2026-03-28T11:26:09Z
dc.date.available2026-03-28T11:26:09Z
dc.date.issued2023
dc.identifier.citationEPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUs, de Castro, M., Santamaria-Valenzuela, I., Torres, Y. et al. Journal of Supercomputing 79, 9409–9442 (2023).es
dc.identifier.issn0920-8542es
dc.identifier.urihttps://uvadoc.uva.es/handle/10324/83863
dc.descriptionProducción Científicaes
dc.description.abstractIterative stencil computations are widely used in numerical simulations. They present a high degree of parallelism, high locality and mostly-coalesced memory access patterns. Therefore, GPUs are good candidates to speed up their computation. However, the development of stencil programs that can work with huge grids in distributed systems with multiple GPUs is not straightforward, since it requires solving problems related to the partition of the grid across nodes and devices, and the synchronization and data movement across remote GPUs. In this work, we present EPSILOD, a high-productivity parallel programming skeleton for iterative stencil computations on distributed multi-GPUs, of the same or different vendors that supports any type of n-dimensional geometric stencils of any order. It uses an abstract specification of the stencil pattern (neighbors and weights) to internally derive the data partition, synchronizations and communications. Computation is split to better overlap with communications. This paper describes the underlying architecture of EPSILOD, its main components, and presents an experimental evaluation to show the benefits of our approach, including a comparison with another state-of-the-art solution. The experimental results show that EPSILOD is faster and shows good strong and weak scalability for platforms with both homogeneous and heterogeneous types of GPU.es
dc.format.mimetypeapplication/pdfes
dc.language.isoenges
dc.publisherSpringer Naturees
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.subjectInformáticaes
dc.subject.classificationDistributed memory, GPU, Heterogeneous, Stencil, Parallel skeletonses
dc.titleEPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUses
dc.typeinfo:eu-repo/semantics/articlees
dc.identifier.doi10.1007/s11227-022-05040-yes
dc.relation.publisherversionhttps://link.springer.com/article/10.1007/s11227-022-05040-yes
dc.identifier.publicationfirstpage9409es
dc.identifier.publicationissue9es
dc.identifier.publicationlastpage9442es
dc.identifier.publicationtitleThe Journal of Supercomputinges
dc.identifier.publicationvolume79es
dc.peerreviewedSIes
dc.description.projectThis work has been funded by the Consejería de Educación of Junta de Castilla y León, Ministerio de Economía, Industria y Competitividad of Spain, European Regional Development Fund (ERDF) program: Project PCAS (TIN2017-88614-R) and Project PROPHET-2 (VA226P20). This work was supported in part by grant TED2021-130367B-I00 funded by MCIN/AEI/10.13039/501100011033 and by “European Union NextGenerationEU/PRTR”. The authors thankfully acknowledges the computer resources at CTE-POWER and Minotauro and the technical support provided by Barcelona Supercomputing Center (RES-IM-2021-2-0005, RES-IM-2021-3-0024, RES-IM-2022-1-0014).es
dc.identifier.essn1573-0484es
dc.type.hasVersioninfo:eu-repo/semantics/publishedVersiones
dc.subject.unesco1203 Ciencia de Los Ordenadoreses
dc.subject.unesco3304 Tecnología de Los Ordenadoreses


Fichier(s) constituant ce document

Thumbnail

Ce document figure dans la(les) collection(s) suivante(s)

Afficher la notice abrégée