EPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUs

Castro Caballero, Manuel De; Santamaria Valenzuela, Inmaculada; Torres de la Sierra, Yuri; González Escribano, Arturo; Llanos Ferraris, Diego Rafael

doi:10.1007/s11227-022-05040-y

Title

EPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUs

dc.contributor.author	Castro Caballero, Manuel De
dc.contributor.author	Santamaria Valenzuela, Inmaculada
dc.contributor.author	Torres de la Sierra, Yuri
dc.contributor.author	González Escribano, Arturo
dc.contributor.author	Llanos Ferraris, Diego Rafael
dc.date.accessioned	2026-03-28T11:26:09Z
dc.date.available	2026-03-28T11:26:09Z
dc.date.issued	2023
dc.identifier.citation	EPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUs, de Castro, M., Santamaria-Valenzuela, I., Torres, Y. et al. Journal of Supercomputing 79, 9409–9442 (2023).	es
dc.identifier.issn	0920-8542	es
dc.identifier.uri	https://uvadoc.uva.es/handle/10324/83863
dc.description	Scientific Output	es
dc.description.abstract	Iterative stencil computations are widely used in numerical simulations. They present a high degree of parallelism, high locality and mostly-coalesced memory access patterns. Therefore, GPUs are good candidates to speed up their computation. However, the development of stencil programs that can work with huge grids in distributed systems with multiple GPUs is not straightforward, since it requires solving problems related to the partition of the grid across nodes and devices, and the synchronization and data movement across remote GPUs. In this work, we present EPSILOD, a high-productivity parallel programming skeleton for iterative stencil computations on distributed multi-GPU systems, with devices of the same or different vendors, that supports any type of n-dimensional geometric stencil of any order. It uses an abstract specification of the stencil pattern (neighbors and weights) to internally derive the data partition, synchronizations and communications. Computation is split to better overlap with communications. This paper describes the underlying architecture of EPSILOD and its main components, and presents an experimental evaluation to show the benefits of our approach, including a comparison with another state-of-the-art solution. The experimental results show that EPSILOD is faster and shows good strong and weak scalability for platforms with both homogeneous and heterogeneous types of GPU.	es
dc.format.mimetype	application/pdf	es
dc.language.iso	eng	es
dc.publisher	Springer Nature	es
dc.rights.accessRights	info:eu-repo/semantics/openAccess	es
dc.subject	Computer Science	es
dc.subject.classification	Distributed memory, GPU, Heterogeneous, Stencil, Parallel skeletons	es
dc.title	EPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUs	es
dc.type	info:eu-repo/semantics/article	es
dc.identifier.doi	10.1007/s11227-022-05040-y	es
dc.relation.publisherversion	https://link.springer.com/article/10.1007/s11227-022-05040-y	es
dc.identifier.publicationfirstpage	9409	es
dc.identifier.publicationissue	9	es
dc.identifier.publicationlastpage	9442	es
dc.identifier.publicationtitle	The Journal of Supercomputing	es
dc.identifier.publicationvolume	79	es
dc.peerreviewed	Yes	es
dc.description.project	This work has been funded by the Consejería de Educación of Junta de Castilla y León, Ministerio de Economía, Industria y Competitividad of Spain, European Regional Development Fund (ERDF) program: Project PCAS (TIN2017-88614-R) and Project PROPHET-2 (VA226P20). This work was supported in part by grant TED2021-130367B-I00 funded by MCIN/AEI/10.13039/501100011033 and by “European Union NextGenerationEU/PRTR”. The authors gratefully acknowledge the computer resources at CTE-POWER and Minotauro and the technical support provided by Barcelona Supercomputing Center (RES-IM-2021-2-0005, RES-IM-2021-3-0024, RES-IM-2022-1-0014).	es
dc.identifier.essn	1573-0484	es
dc.type.hasVersion	info:eu-repo/semantics/publishedVersion	es
dc.subject.unesco	1203 Computer Science	es
dc.subject.unesco	3304 Computer Technology	es

Files in this item

Name: epsilod.pdf
Size: 2.089 MB
Format: PDF

This item appears in the following collection(s)

DEP41 - Journal articles