Programming and Mapping for Mixed Heterogeneous Devices: The Case of Optical Flow

Alonso Pascual, Sergio; González Escribano, Arturo

doi:10.1109/PDP66500.2025.00033

Título

Programming and Mapping for Mixed Heterogeneous Devices: The Case of Optical Flow

dc.contributor.author	Alonso Pascual, Sergio
dc.contributor.author	González Escribano, Arturo
dc.date.accessioned	2025-05-16T11:27:51Z
dc.date.available	2025-05-16T11:27:51Z
dc.date.issued	2025
dc.identifier.citation	2025 33rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP), Turin, Italy, 2025, pp. 187-195, doi: 10.1109/PDP66500.2025.00033	es
dc.identifier.uri	https://uvadoc.uva.es/handle/10324/75764
dc.description	Producción Científica	es
dc.description.abstract	Current parallel systems are increasingly heterogeneous, mixing devices of different types and computing capabilities. Exploiting multiple different devices for the same application continues to be a challenge that ranges from technical problems related to synchronizing and communicating diverse devices to problems of load distribution and flexibility to adjust the computation to the platform resources. In this work, we study the problem of using and extending a heterogeneous portability layer to program and adapt HSOpticalFlow to heterogeneous platforms. HSOpticalFlow is a streaming application to estimate the apparent movement of objects in a sequence of images. It is a simple but characteristic example of the structure of applications based on multilevel ILS (Iterative Loop Stencil), also known as multi-grid methods, applied to a sequence of inputs. Starting from the original CUDA reference code, we present a methodology and programming techniques based on the Controller programming model to implement it as a pipeline among multiple devices. We discuss a technique to determine a proper work partition and mapping for a set of devices. This allows for building very efficient parallel solutions, using similar devices or taking advantage of devices with lower computing power, to reduce the load and increase the productivity of more powerful ones. We present the results of an experimental study using several GPUs of different vendors, architectures, and generations, showing that this solution allows combinations of devices to be efficiently exploited to improve performance. Specifically, the results include speedups of 1.91x using two NVIDIA A100 GPUs and 1.21x using one NVIDIA V100 GPU and one AMD WX9100 GPU, which is about 3x slower than the NVIDIA GPU for this application.	es
dc.format.extent	9 p.	es
dc.format.mimetype	application/pdf	es
dc.language.iso	eng	es
dc.publisher	IEEE	es
dc.rights.accessRights	info:eu-repo/semantics/openAccess	es
dc.subject	Informática	es
dc.subject.classification	Heterogeneous programming, Streaming, Parallel pipeline, Optical Flow	es
dc.title	Programming and Mapping for Mixed Heterogeneous Devices: The Case of Optical Flow	es
dc.type	info:eu-repo/semantics/conferenceObject	es
dc.identifier.doi	10.1109/PDP66500.2025.00033	es
dc.title.event	2025 33rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)	es
dc.description.project	This work is part of the action PID2022-142292NB-I00 (Knowledge Generation Project 2022), funded by MICI- U/AEI /10.13039/501100011033 and by FEDER, EU. Sup- port has also been received from the Investigo Program of the State Public Employment Service, Call for the Hiring of Research Personnel, financed by the European Union- NextGenerationEU. This research was partially supported by grants from NVIDIA and utilized an NVIDIA A100 GPU. It was also supported by EuroHPC Joint Undertaking for awarding us access to Leonardo at CINECA, Italy (project EHPC-DEV-2024D07-079)	es
dc.type.hasVersion	info:eu-repo/semantics/submittedVersion	es
dc.subject.unesco	1203.17 Informática	es
dc.subject.unesco	3304 Tecnología de Los Ordenadores	es

Files in this item

Name:: 25_PDP_hso.pdf
Size:: 591.9Kb
Format:: PDF

FilesOpen

This item appears in the following Collection(s)

DEP41 - Comunicaciones a congresos, conferencias, etc. [102]

Show simple item record