Unbalanced tree search on a manycore system using the GPI programming model

Bibliographic details
Main author: Machado, Rui
Publication date: 2011
Other authors: Lojewski, Carsten, Abreu, Salvador, Pfreundt, Franz-Josef
Document type: Article
Language: por
Source title: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Full text: http://hdl.handle.net/10174/4645
Abstract: Recent developments in computer architecture are moving towards systems with large core counts (Manycore), which expose more parallelism to applications. Some applications, termed irregular and unbalanced, demand dynamic and asynchronous load balancing to utilize the full performance of a Manycore system. For example, the recently established Graph500 benchmark targets such applications. The UTS benchmark characterizes the performance of such irregular and unbalanced computations with a tree-structured search space that requires continuous dynamic load balancing. GPI is a PGAS API that delivers the full performance of RDMA-enabled networks directly to the application. Its programming model focuses on one-sided asynchronous communication, overlapping computation and communication. In this paper we address the dynamic load balancing requirements of unbalanced applications using the GPI programming model. Using the UTS benchmark, we detail the implementation of a work stealing algorithm using GPI and present the performance results. Our performance evaluation shows significant improvements over the optimized MPI version, with a maximum performance of 9.5 billion nodes per second on 3072 cores.
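To make the load balancing idea in the abstract concrete, the following is a minimal single-process sketch of work stealing over an unbalanced tree. It is not the paper's GPI implementation and uses no GPI, RDMA, or MPI calls. The tree rule (`children`, `ROOT_FANOUT`, `MAX_DEPTH`), the round-robin scheduling, and the "steal half from the most loaded worker" policy are all assumptions chosen for illustration; they only mimic the UTS idea that a node's hash decides how many children it has.

```python
import hashlib
from collections import deque

ROOT_FANOUT = 16   # assumed: fixed number of children at the root
MAX_DEPTH = 8      # assumed: depth cutoff so the tree is finite


def children(node_id, depth):
    """Hypothetical UTS-like rule: the node's SHA-1 hash decides its child count."""
    if depth >= MAX_DEPTH:
        return []
    if depth == 0:
        count = ROOT_FANOUT
    else:
        count = hashlib.sha1(node_id).digest()[0] % 4   # 0..3 children
    return [hashlib.sha1(node_id + bytes([i])).digest() for i in range(count)]


def count_sequential(root):
    """Reference traversal: count every node with a single stack."""
    stack = [(root, 0)]
    total = 0
    while stack:
        node, depth = stack.pop()
        total += 1
        stack.extend((c, depth + 1) for c in children(node, depth))
    return total


def count_with_stealing(root, workers=4):
    """Simulate workers in round-robin; an idle worker steals half of a victim's queue."""
    queues = [deque() for _ in range(workers)]
    queues[0].append((root, 0))
    visited = [0] * workers
    steals = 0
    while any(queues):
        for w, q in enumerate(queues):
            if not q:
                # Idle worker: pick the most loaded queue as the victim.
                victim = max(range(workers), key=lambda v: len(queues[v]))
                take = len(queues[victim]) // 2
                for _ in range(take):
                    # Steal from the head (oldest work); the owner works at the tail.
                    q.append(queues[victim].popleft())
                if take:
                    steals += 1
                continue
            node, depth = q.pop()
            visited[w] += 1
            q.extend((c, depth + 1) for c in children(node, depth))
    return sum(visited), visited, steals


if __name__ == "__main__":
    total, per_worker, steals = count_with_stealing(b"root", workers=4)
    print("nodes:", total, "per worker:", per_worker, "steals:", steals)
    print("matches sequential count:", total == count_sequential(b"root"))
```

Stealing from the head of the victim's queue while the owner pops from the tail is a common design choice: older entries sit closer to the root and tend to carry larger subtrees, so one steal moves more work. In a distributed setting the steal would be a remote operation rather than a local `popleft`.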
id RCAP_6a7cda669525c72fe37d2d08488a48d0
oai_identifier_str oai:dspace.uevora.pt:10174/4645
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
dc.subject.por.fl_str_mv Distributed Computing
UTS
publishDate 2011
dc.date.none.fl_str_mv 2011-01-01T00:00:00Z
2012-01-31T00:32:46Z
2012-01-31
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10174/4645
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv Rui Machado and Carsten Lojewski and Salvador Abreu and Franz-Josef Pfreundt, Unbalanced tree search on a manycore system using the GPI programming model, Computer Science R&D, Vol. 26, No. 3-4, pp. 229-236, Springer, 2011.
http://www.springerlink.com/content/ppn27r4h1402vt37/fulltext.pdf
spa@di.uevora.pt
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Springer
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833592335846342656