Por favor, use este identificador para citar o enlazar este ítem:http://uvadoc.uva.es/handle/10324/29139
Analysis of OpenACC Performance using Different Block Geometries
CMMSE 2017 Proceedings (Computational and Mathematical Methods in Science and Engineering)
Año del Documento
Universidad de Salamanca
CMMSE 2017 Proceedings (Computational and Mathematical Methods in Science and Engineering), Rota, Spain, july 2017
OpenACC is a parallel programming model for automatic parallelization of sequential code using compiler directives or pragmas. OpenACC is intended to be used with accelerators such as GPUs and Xeon Phi. The different implementations of the standard, although still in early development, are primarily focused on GPU execution. In this study, we analyze how the different OpenACC compilers available under certain premises behave when the clauses affecting the underlying block geometry implementation are modified. These clauses are the Gang number, Worker number, and Vector Size defined by the standard.
This research has been partially supported by MICINN (Spain) and ERDF program of the European Union: HomProg-HetSys project (TIN2014-58876-P), CAPAP-H5 network (TIN2014-53522-REDT), and COST Program Action IC1305: Network for Sustainable Ul- trascale Computing (NESUS).
Except where otherwise noted, this item's license is described as Attribution 4.0 International