TY  - JOUR
AU  - Cámara Moreno, Jesús
AU  - Cuenca, Javier
AU  - Galindo, Víctor
AU  - Vicente, Arturo
AU  - Boratto, Murilo
PY  - 2024
SN  - 0920-8542
UR  - https://uvadoc.uva.es/handle/10324/75227
AB  - In this work, an automatic optimisation approach for parallel routines on multi-GPU systems is presented. Several inter-GPU communication libraries (such as CUDA-Aware MPI or NCCL) are used with a set of routines to perform the numerical...
LA  - eng
PB  - Springer
KW  - Autotuning
KW  - Communication libraries
KW  - Multi-GPU
KW  - Heterogeneous computing
TI  - An autotuning approach to select the inter-GPU communication library on heterogeneous systems
DO  - 10.1007/s11227-024-06794-3
ER  - 