<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-14T19:08:37Z</responseDate><request verb="GetRecord" identifier="oai:uvadoc.uva.es:10324/70435" metadataPrefix="edm">https://uvadoc.uva.es/oai/request</request><GetRecord><record><header><identifier>oai:uvadoc.uva.es:10324/70435</identifier><datestamp>2025-02-06T09:07:07Z</datestamp><setSpec>com_10324_1165</setSpec><setSpec>com_10324_931</setSpec><setSpec>com_10324_894</setSpec><setSpec>col_10324_1335</setSpec></header><metadata><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:doc="http://www.lyncode.com/xoai" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:ore="http://www.openarchives.org/ore/terms/" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:ds="http://dspace.org/ds/elements/1.1/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:edm="http://www.europeana.eu/schemas/edm/" xsi:schemaLocation="http://www.w3.org/1999/02/22-rdf-syntax-ns# http://www.europeana.eu/schemas/edm/EDM.xsd">
<edm:ProvidedCHO rdf:about="https://uvadoc.uva.es/handle/10324/70435">
<dc:creator>Torres de la Sierra, Yuri</dc:creator>
<dc:creator>González Escribano, Arturo</dc:creator>
<dc:creator>Llanos Ferraris, Diego Rafael</dc:creator>
<dc:date>2013</dc:date>
<dc:description>Scientific Production</dc:description>
<dc:description>The choice of thread-block size and shape is one of the most important user decisions when a parallel problem is coded for any CUDA architecture, because thread-block geometry has a significant impact on the overall performance of the program. Unfortunately, the programmer does not have enough information about the subtle interactions between this choice of parameters and the underlying hardware.
This paper presents uBench, a complete suite of micro-benchmarks that explores the performance impact of (1) the thread-block geometry choice criteria, and (2) the GPU hardware resources and configurations. Each micro-benchmark is designed to be as simple as possible, so that it isolates a single effect derived from the hardware and the thread-block parameter choice.
As an example of the capabilities of this benchmark suite, this paper presents an experimental evaluation and comparison of the Fermi and Kepler architectures. Our study reveals that, in spite of the new hardware details introduced by Kepler, the principles underlying the block geometry selection criteria are similar for both architectures.</dc:description>
<dc:format>application/pdf</dc:format>
<dc:identifier>https://uvadoc.uva.es/handle/10324/70435</dc:identifier>
<dc:language>eng</dc:language>
<dc:publisher>Springer</dc:publisher>
<dc:subject>Computer Science</dc:subject>
<dc:subject>1203 Computer Sciences</dc:subject>
<dc:subject>3304 Computer Technology</dc:subject>
<dc:title>uBench: exposing the impact of CUDA block geometry in terms of performance</dc:title>
<dc:type>info:eu-repo/semantics/article</dc:type>
<edm:type>TEXT</edm:type>
</edm:ProvidedCHO>
<ore:Aggregation rdf:about="https://uvadoc.uva.es/handle/10324/70435#aggregation">
<edm:aggregatedCHO rdf:resource="https://uvadoc.uva.es/handle/10324/70435"/>
<edm:dataProvider>UVaDOC. Repositorio Documental de la Universidad de Valladolid</edm:dataProvider>
<edm:isShownAt rdf:resource="https://uvadoc.uva.es/handle/10324/70435"/>
<edm:isShownBy rdf:resource="https://uvadoc.uva.es/bitstream/10324/70435/1/s11227-013-0921-z.pdf"/>
<edm:provider>Hispana</edm:provider>
<edm:rights rdf:resource="http://rightsstatements.org/vocab/CNE/1.0/"/>
</ore:Aggregation>
<edm:WebResource rdf:about="https://uvadoc.uva.es/bitstream/10324/70435/1/s11227-013-0921-z.pdf">
<edm:rights rdf:resource="http://rightsstatements.org/vocab/CNE/1.0/"/>
</edm:WebResource>
</rdf:RDF></metadata></record></GetRecord></OAI-PMH>