RT info:eu-repo/semantics/conferenceObject
T1 Automatic Data Layout at Multiple Levels for CUDA
A1 Torres de la Sierra, Yuri
A1 González Escribano, Arturo
A1 Llanos Ferraris, Diego Rafael
K1 Computer Science
K1 Data layout
K1 CUDA
K1 GPUs
K1 Heterogeneous systems
K1 1203 Computer Science
K1 3304 Computer Technology
AB Trasgo is a source-to-source compiler system that translates simple high-level specifications of parallel algorithms into lower-level native programs, generating the data-partition and communication details automatically. Hitmap is the run-time library used by the Trasgo back-ends for hierarchical tiling and mapping of arrays; it is currently built on top of the MPI message-passing interface. Hitmap includes a plug-in system for automatic data layouts. In this paper we extend Hitmap with a new type of data-layout technique suited to the CUDA parallel programming model. Combining it with the existing data-layout techniques allows data distributions to be generated, at multiple levels of parallelism, for GPU clusters. The new Hitmap version hides the details of the machine structure and thread management from the programmer, making it easy to generate programs with multiple levels of parallelism for heterogeneous systems. This work paves the way for a new back-end for the Trasgo compiler system that automatically generates CUDA programs.
PB Universidad de Salamanca
SN 978-84-613-5510-5
YR 2010
FD 2010
LK https://uvadoc.uva.es/handle/10324/71258
UL https://uvadoc.uva.es/handle/10324/71258
LA eng
NO CMMSE 2010 (Computational and Mathematical Methods in Science and Engineering), Almería, Spain, June 2010
NO Producción Científica
DS UVaDOC
RD 23-nov-2024