Unbalanced tree search on a manycore system using the GPI programming model
| Autor(a) principal: | |
|---|---|
| Data de Publicação: | 2011 |
| Outros Autores: | , , |
| Tipo de documento: | Artigo |
| Idioma: | por |
| Título da fonte: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| Texto Completo: | http://hdl.handle.net/10174/4645 |
Resumo: | The recent developments in computer architectures progress towards systems with large core count (Manycore) which expose more parallelism to applications. Some applications named irregular and unbalanced applications demand a dynamic and asynchronous load balance implementation to utilize the full performance a Manycore system. For example, the recently established Graph500 benchmark aims at such applications. The UTS benchmark characterizes the performance of such irregular and unbalanced computations with a tree-structured search space that requires continuous dynamic load balancing. GPI is a PGAS API that delivers the full performance of RDMA-enabled networks directly to the application. Its programming model focuses the use of one-sided asynchronous communication, overlapping computation and communication. In this paper we address the dynamic load balancing requirements of unbalanced applications using the GPI programming model. Using the UTS benchmark, we detail the implementation of a work stealing algorithm using GPI and present the performance results. Our performance evaluation shows significant improvements when compared with the optimized MPI version with a maximum performance of 9.5 billion nodes per second on 3072 cores. |
| id |
RCAP_6a7cda669525c72fe37d2d08488a48d0 |
|---|---|
| oai_identifier_str |
oai:dspace.uevora.pt:10174/4645 |
| network_acronym_str |
RCAP |
| network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| repository_id_str |
https://opendoar.ac.uk/repository/7160 |
| spelling |
Unbalanced tree search on a manycore system using the GPI programming modelDistributed ComputingUTSThe recent developments in computer architectures progress towards systems with large core count (Manycore) which expose more parallelism to applications. Some applications named irregular and unbalanced applications demand a dynamic and asynchronous load balance implementation to utilize the full performance a Manycore system. For example, the recently established Graph500 benchmark aims at such applications. The UTS benchmark characterizes the performance of such irregular and unbalanced computations with a tree-structured search space that requires continuous dynamic load balancing. GPI is a PGAS API that delivers the full performance of RDMA-enabled networks directly to the application. Its programming model focuses the use of one-sided asynchronous communication, overlapping computation and communication. In this paper we address the dynamic load balancing requirements of unbalanced applications using the GPI programming model. Using the UTS benchmark, we detail the implementation of a work stealing algorithm using GPI and present the performance results. Our performance evaluation shows significant improvements when compared with the optimized MPI version with a maximum performance of 9.5 billion nodes per second on 3072 cores.Springer2012-01-31T00:32:46Z2012-01-312011-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/4645http://hdl.handle.net/10174/4645porRui Machado and Carsten Lojewski and Salvador Abreu and Franz-Josef Pfreundt, Unbalanced tree search on a manycore system using the GPI programming model, Computer Science R&D, Vol. 26, No. 3-4, Pag. 229-236, Springer, 2011.http://www.springerlink.com/content/ppn27r4h1402vt37/fulltext.pdfndndspa@di.uevora.ptndMachado, RuiLojewski, CarstenAbreu, SalvadorPfreundt, Franz-Josefinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-01-03T18:42:41Zoai:dspace.uevora.pt:10174/4645Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T11:53:47.483638Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
| dc.title.none.fl_str_mv |
Unbalanced tree search on a manycore system using the GPI programming model |
| title |
Unbalanced tree search on a manycore system using the GPI programming model |
| spellingShingle |
Unbalanced tree search on a manycore system using the GPI programming model Machado, Rui Distributed Computing UTS |
| title_short |
Unbalanced tree search on a manycore system using the GPI programming model |
| title_full |
Unbalanced tree search on a manycore system using the GPI programming model |
| title_fullStr |
Unbalanced tree search on a manycore system using the GPI programming model |
| title_full_unstemmed |
Unbalanced tree search on a manycore system using the GPI programming model |
| title_sort |
Unbalanced tree search on a manycore system using the GPI programming model |
| author |
Machado, Rui |
| author_facet |
Machado, Rui Lojewski, Carsten Abreu, Salvador Pfreundt, Franz-Josef |
| author_role |
author |
| author2 |
Lojewski, Carsten Abreu, Salvador Pfreundt, Franz-Josef |
| author2_role |
author author author |
| dc.contributor.author.fl_str_mv |
Machado, Rui Lojewski, Carsten Abreu, Salvador Pfreundt, Franz-Josef |
| dc.subject.por.fl_str_mv |
Distributed Computing UTS |
| topic |
Distributed Computing UTS |
| description |
The recent developments in computer architectures progress towards systems with large core count (Manycore) which expose more parallelism to applications. Some applications named irregular and unbalanced applications demand a dynamic and asynchronous load balance implementation to utilize the full performance a Manycore system. For example, the recently established Graph500 benchmark aims at such applications. The UTS benchmark characterizes the performance of such irregular and unbalanced computations with a tree-structured search space that requires continuous dynamic load balancing. GPI is a PGAS API that delivers the full performance of RDMA-enabled networks directly to the application. Its programming model focuses the use of one-sided asynchronous communication, overlapping computation and communication. In this paper we address the dynamic load balancing requirements of unbalanced applications using the GPI programming model. Using the UTS benchmark, we detail the implementation of a work stealing algorithm using GPI and present the performance results. Our performance evaluation shows significant improvements when compared with the optimized MPI version with a maximum performance of 9.5 billion nodes per second on 3072 cores. |
| publishDate |
2011 |
| dc.date.none.fl_str_mv |
2011-01-01T00:00:00Z 2012-01-31T00:32:46Z 2012-01-31 |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
| format |
article |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10174/4645 http://hdl.handle.net/10174/4645 |
| url |
http://hdl.handle.net/10174/4645 |
| dc.language.iso.fl_str_mv |
por |
| language |
por |
| dc.relation.none.fl_str_mv |
Rui Machado and Carsten Lojewski and Salvador Abreu and Franz-Josef Pfreundt, Unbalanced tree search on a manycore system using the GPI programming model, Computer Science R&D, Vol. 26, No. 3-4, Pag. 229-236, Springer, 2011. http://www.springerlink.com/content/ppn27r4h1402vt37/fulltext.pdf nd nd spa@di.uevora.pt nd |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.publisher.none.fl_str_mv |
Springer |
| publisher.none.fl_str_mv |
Springer |
| dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
| instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
| instacron_str |
RCAAP |
| institution |
RCAAP |
| reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
| repository.mail.fl_str_mv |
info@rcaap.pt |
| _version_ |
1833592335846342656 |