<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-27T07:54:16Z</responseDate><request verb="GetRecord" identifier="oai:uvadoc.uva.es:10324/79376" metadataPrefix="qdc">https://uvadoc.uva.es/oai/request</request><GetRecord><record><header><identifier>oai:uvadoc.uva.es:10324/79376</identifier><datestamp>2025-12-15T09:23:03Z</datestamp><setSpec>com_10324_1165</setSpec><setSpec>com_10324_931</setSpec><setSpec>com_10324_894</setSpec><setSpec>col_10324_1335</setSpec></header><metadata><qdc:qualifieddc xmlns:qdc="http://dspace.org/qualifieddc/" xmlns:doc="http://www.lyncode.com/xoai" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xsi:schemaLocation="http://purl.org/dc/elements/1.1/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dc.xsd http://purl.org/dc/terms/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dcterms.xsd http://dspace.org/qualifieddc/ http://www.ukoln.ac.uk/metadata/dcmi/xmlschema/qualifieddc.xsd">
<dc:title>On the development of high-performance, multi-GPU applications on heterogeneous systems leveraging SYCL</dc:title>
<dc:creator>Andújar Muñoz, Francisco José</dc:creator>
<dc:creator>Carratalá Sáez, Rocío</dc:creator>
<dc:creator>Torres de la Sierra, Yuri</dc:creator>
<dc:creator>González Escribano, Arturo</dc:creator>
<dc:creator>Llanos Ferraris, Diego Rafael</dc:creator>
<dc:subject>Informática</dc:subject>
<dcterms:abstract>Computational platforms for high-performance scientific applications are increasingly heterogeneous, incorporating multiple GPU accelerators. However, differences in GPU vendors, architectures, and programming models challenge performance portability and ease of development. SYCL provides a unified programming approach, enabling applications to target NVIDIA and AMD GPUs simultaneously while offering higher-level abstractions for data and task management. This paper evaluates SYCL’s performance and development effort using the Finite Time Lyapunov Exponent (FTLE) calculation as a case study. We compare SYCL’s AdaptiveCpp (Ahead-Of-Time and Just-In-Time) and Intel oneAPI compilers, along with different data management strategies (Unified Shared Memory and buffers), against equivalent CUDA and HIP implementations. Our analysis considers single and multi-GPU execution, including heterogeneous setups with GPUs from different vendors. Results show that, while SYCL introduces additional development effort compared to native CUDA and HIP implementations, it enables multi-vendor portability with minimal performance overhead when using specific design options. Based on our findings, we provide development guidelines to help programmers decide when to use SYCL versus vendor-specific alternatives.</dcterms:abstract>
<dcterms:dateAccepted>2025-11-06T12:20:08Z</dcterms:dateAccepted>
<dcterms:available>2025-11-06T12:20:08Z</dcterms:available>
<dcterms:created>2025-11-06T12:20:08Z</dcterms:created>
<dcterms:issued>2026</dcterms:issued>
<dc:type>info:eu-repo/semantics/article</dc:type>
<dc:identifier>Journal of Parallel and Distributed Computing Volume 207, January 2026, 105188</dc:identifier>
<dc:identifier>0743-7315</dc:identifier>
<dc:identifier>https://uvadoc.uva.es/handle/10324/79376</dc:identifier>
<dc:identifier>10.1016/j.jpdc.2025.105188</dc:identifier>
<dc:identifier>1</dc:identifier>
<dc:identifier>207</dc:identifier>
<dc:identifier>24</dc:identifier>
<dc:identifier>Journal of Parallel and Distributed Computing</dc:identifier>
<dc:identifier>207</dc:identifier>
<dc:language>eng</dc:language>
<dc:relation>https://www.sciencedirect.com/science/article/pii/S0743731525001558</dc:relation>
<dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
<dc:publisher>Elsevier</dc:publisher>
</qdc:qualifieddc></metadata></record></GetRecord></OAI-PMH>