Mostrar el registro sencillo del ítem

dc.contributor.authorEstebanez, Alvaro
dc.contributor.authorLlanos, Diego R.
dc.contributor.authorOrden, David
dc.contributor.authorPalop, Belen
dc.date.accessioned2024-09-23T07:53:27Z
dc.date.available2024-09-23T07:53:27Z
dc.date.issued2022
dc.identifier.citationPLOS One, may 2022, ISSN 1932-6203.es
dc.identifier.issn1932-6203es
dc.identifier.urihttps://uvadoc.uva.es/handle/10324/70095
dc.descriptionProducción Científicaes
dc.description.abstractLoops are a rich source of parallelism. Unfortunately, many loops cannot be safely parallelized at compile time because the compiler is not able to guarantee that there will be no dependence violations. Thread-Level Speculation (TLS) techniques, either hardware or software-based, allow the parallel execution of non-analyzable loops, issuing the execution of blocks of consecutive iterations (called chunks) while a hardware or software monitor ensures that no dependence violations arise. If such a dependence violation occurs, the chunk that was fed with incorrect values is discarded and re-started, in order to consume the correct information. In the speculative execution of non-analyzable loops, it is very important to correctly choose the chunk size, because this choice dramatically affects the performance of the parallel execution. Bigger chunks imply less scheduling overheads, but smaller chunks allow fewer calculations to be discarded in the event of a dependence violation. To find a good chunk size is not a simple task, because loops may present dependencies that cannot be detected at compile time. In this paper, we present a comprehensive evaluation of different scheduling methods to estimate the optimal chunk size in the speculative execution of non-analyzable loops. This evaluation ranges from the simple, classical methods originally devised to achieve load balancing in loops with no dependencies, to methods that make some assumptions on the distribution pattern of dependencies, such as Meseta and Just-in-Time scheduling. We also propose and evaluate a general, more complex method called Moody Scheduling, that does not require a-priori assumptions to achieve the highest performance.es
dc.format.mimetypeapplication/pdfes
dc.language.isoenges
dc.publisherPlos ONEes
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.subjectInformáticaes
dc.titleOn the choice of the best chunk size for the speculative execution of loopses
dc.typeinfo:eu-repo/semantics/articlees
dc.identifier.doi10.1371/journal.pone.0267602es
dc.relation.publisherversionhttps://journals.plos.org/plosone/article?id=10.1371/journal.pone.0267602es
dc.identifier.publicationfirstpagee0267602es
dc.identifier.publicationissue5es
dc.identifier.publicationtitlePLOS ONEes
dc.identifier.publicationvolume17es
dc.peerreviewedSIes
dc.identifier.essn1932-6203es
dc.type.hasVersioninfo:eu-repo/semantics/publishedVersiones
dc.subject.unesco1203 Ciencia de Los Ordenadoreses
dc.subject.unesco3304 Tecnología de Los Ordenadoreses


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem