| dc.contributor.author | Carratalá-Sáez, Rocío | |
| dc.contributor.author | Torres de la Sierra, Yuri | |
| dc.contributor.author | Llanos Ferraris, Diego Rafael | |
| dc.contributor.author | González Escribano, Arturo | |
| dc.contributor.editor | ArXiv | es |
| dc.date.accessioned | 2026-03-28T12:03:55Z | |
| dc.date.available | 2026-03-28T12:03:55Z | |
| dc.date.issued | 2023 | |
| dc.identifier.citation | Open SYCL on heterogeneous GPU systems: A case of study, Rocío Carratalá-Sáez and Francisco J. andújar and Yuri Torres and Arturo Gonzalez-Escribano and Diego R. Llanos, ArXiv preprint 2310.06947, 2023. | es |
| dc.identifier.uri | https://uvadoc.uva.es/handle/10324/83868 | |
| dc.description | Producción Científica | es |
| dc.description.abstract | Computational platforms for high-performance scientic applications are becoming more heterogenous, including hardware accelerators such as multiple GPUs. Applications in a wide variety of scientic elds require an efcient and careful management
of the computational resources of this type of hardware to obtain the best possible performance. However, there are currently
different GPU vendors, architectures and families that can be found in heterogeneous clusters or machines. Programming with the
vendor provided languages or frameworks, and optimizing for specic devices, may become cumbersome and compromise portability to other systems. To overcome this problem, several proposals for high-level heterogeneous programming have appeared, trying to reduce the development eort and increase functional and performance portability, specically when using GPU hardware accelerators. This paper evaluates the SYCL programming model, using the Open SYCL compiler, from two different perspectives: The performance it offers when dealing with single or multiple GPU devices from the same or different vendors, and the development effort required to implement the code. We use as case of study the Finite Time Lyapunov Exponent calculation over two real-world scenarios and compare the performance and the development eort of its Open SYCL-based version against the equivalent versions that use CUDA or HIP. Based on the experimental results, we observe that the use of SYCL does not lead to a remarkable overhead in terms of the GPU kernels execution time. In general terms, the Open SYCL development eort for the host code is lower than that observed with CUDA or HIP. Moreover, the SYCL version can take advantage of both CUDA and AMD GPU devices simultaneously much easier than directly using the vendor-specic programming solutions. | es |
| dc.description.sponsorship | Departamento de Informática, Universidad de Valladolid | es |
| dc.format.mimetype | application/pdf | es |
| dc.language.iso | eng | es |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
| dc.subject | Informática | es |
| dc.subject.classification | Open SYCL, CUDA, HIP, Finite Time Lyapunov Exponent, Performance evauation, Development effort | es |
| dc.title | Open SYCL on heterogeneous GPU systems: A case of study | es |
| dc.type | info:eu-repo/semantics/preprint | es |
| dc.identifier.doi | 10.48550/arXiv.2310.06947 | es |
| dc.relation.publisherversion | https://arxiv.org/abs/2310.06947 | es |
| dc.description.project | This work was supported in part by the Spanish Ministerio de Ciencia e Innovaci´on and by the European Regional Development Fund (ERDF) program of the European Union, under Grant PID2022-142292NB-I00 (NATASHA Project); and in part by the Junta de Castilla y León - FEDER Grants, under Grant VA226P20 (PROPHET-2 Project), Junta de Castilla y León, Spain. This work was also supported in part by grant TED2021–130367B–I00, funded by European Union NextGenerationEU/ PRTR and byMCIN/AEI/10.13039/501100011033. This work has been also partially supported by NVIDIA Academic Hardware Grant Program. | es |
| dc.type.hasVersion | info:eu-repo/semantics/submittedVersion | es |
| dc.subject.unesco | 1203 Ciencia de Los Ordenadores | es |
| dc.subject.unesco | 3304 Tecnología de Los Ordenadores | es |