Towards dense linear algebra for hybrid GPU accelerated manycore systems
| Autor(a) principal: | |
|---|---|
| Data de Publicação: | 2008 |
| Outros Autores: | , |
| Tipo de documento: | Outros |
| Idioma: | eng |
| Título da fonte: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| Texto Completo: | https://hdl.handle.net/10316/11212 |
Resumo: | If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accelerators! This is happening today as accelerators, in particular Graphical Processing Units (GPUs), are steadily making their way into the high performance computing (HPC) world. We highlight the trends leading to the idea of hybrid manycore/GPU systems, and we present a set of techniques that can be used to e ciently program them. The presentation is in the context of Dense Linear Algebra (DLA), a major building block for many scienti c computing applications. We motivate the need for new algorithms that would split the computation in a way that would fully exploit the power that each of the hybrid components o ers. As the area of hybrid multicore/GPU computing is still in its infancy, we also argue for its importance in view of what future architectures may look like. We therefore envision the need for a DLA library similar to LAPACK but for hybrid manycore/GPU systems. We illustrate the main ideas with an LUfactorization algorithm where particular techniques are used to reduce the amount of pivoting, resulting in an algorithm achieving up to 388 GFlop/s for single and up to 99:4 GFlop/s for double precision factorization on a hybrid Intel Xeon (2x4 cores @ 2.33 GHz) { NVIDIA GeForce GTX 280 (240 cores @ 1.30 GHz) system. |
| id |
RCAP_193b4e6636586db27b8e11c21b1f3757 |
|---|---|
| oai_identifier_str |
oai:estudogeral.uc.pt:10316/11212 |
| network_acronym_str |
RCAP |
| network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| repository_id_str |
https://opendoar.ac.uk/repository/7160 |
| spelling |
Towards dense linear algebra for hybrid GPU accelerated manycore systemsHybrid computingDense linear algebraParallel algorithmsLU factorizationMulticore processorsGraphic process unitsAcceleratorsIf multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accelerators! This is happening today as accelerators, in particular Graphical Processing Units (GPUs), are steadily making their way into the high performance computing (HPC) world. We highlight the trends leading to the idea of hybrid manycore/GPU systems, and we present a set of techniques that can be used to e ciently program them. The presentation is in the context of Dense Linear Algebra (DLA), a major building block for many scienti c computing applications. We motivate the need for new algorithms that would split the computation in a way that would fully exploit the power that each of the hybrid components o ers. As the area of hybrid multicore/GPU computing is still in its infancy, we also argue for its importance in view of what future architectures may look like. We therefore envision the need for a DLA library similar to LAPACK but for hybrid manycore/GPU systems. We illustrate the main ideas with an LUfactorization algorithm where particular techniques are used to reduce the amount of pivoting, resulting in an algorithm achieving up to 388 GFlop/s for single and up to 99:4 GFlop/s for double precision factorization on a hybrid Intel Xeon (2x4 cores @ 2.33 GHz) { NVIDIA GeForce GTX 280 (240 cores @ 1.30 GHz) system.Centro de Matemática da Universidade de Coimbra2008info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/otherhttps://hdl.handle.net/10316/11212https://hdl.handle.net/10316/11212engPré-Publicações DMUC. 08-53 (2008)Baboulin, MarcDongarra, JackTomov, Stanimireinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2019-06-01T21:20:04Zoai:estudogeral.uc.pt:10316/11212Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T05:23:16.739846Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
| dc.title.none.fl_str_mv |
Towards dense linear algebra for hybrid GPU accelerated manycore systems |
| title |
Towards dense linear algebra for hybrid GPU accelerated manycore systems |
| spellingShingle |
Towards dense linear algebra for hybrid GPU accelerated manycore systems Baboulin, Marc Hybrid computing Dense linear algebra Parallel algorithms LU factorization Multicore processors Graphic process units Accelerators |
| title_short |
Towards dense linear algebra for hybrid GPU accelerated manycore systems |
| title_full |
Towards dense linear algebra for hybrid GPU accelerated manycore systems |
| title_fullStr |
Towards dense linear algebra for hybrid GPU accelerated manycore systems |
| title_full_unstemmed |
Towards dense linear algebra for hybrid GPU accelerated manycore systems |
| title_sort |
Towards dense linear algebra for hybrid GPU accelerated manycore systems |
| author |
Baboulin, Marc |
| author_facet |
Baboulin, Marc Dongarra, Jack Tomov, Stanimire |
| author_role |
author |
| author2 |
Dongarra, Jack Tomov, Stanimire |
| author2_role |
author author |
| dc.contributor.author.fl_str_mv |
Baboulin, Marc Dongarra, Jack Tomov, Stanimire |
| dc.subject.por.fl_str_mv |
Hybrid computing Dense linear algebra Parallel algorithms LU factorization Multicore processors Graphic process units Accelerators |
| topic |
Hybrid computing Dense linear algebra Parallel algorithms LU factorization Multicore processors Graphic process units Accelerators |
| description |
If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accelerators! This is happening today as accelerators, in particular Graphical Processing Units (GPUs), are steadily making their way into the high performance computing (HPC) world. We highlight the trends leading to the idea of hybrid manycore/GPU systems, and we present a set of techniques that can be used to e ciently program them. The presentation is in the context of Dense Linear Algebra (DLA), a major building block for many scienti c computing applications. We motivate the need for new algorithms that would split the computation in a way that would fully exploit the power that each of the hybrid components o ers. As the area of hybrid multicore/GPU computing is still in its infancy, we also argue for its importance in view of what future architectures may look like. We therefore envision the need for a DLA library similar to LAPACK but for hybrid manycore/GPU systems. We illustrate the main ideas with an LUfactorization algorithm where particular techniques are used to reduce the amount of pivoting, resulting in an algorithm achieving up to 388 GFlop/s for single and up to 99:4 GFlop/s for double precision factorization on a hybrid Intel Xeon (2x4 cores @ 2.33 GHz) { NVIDIA GeForce GTX 280 (240 cores @ 1.30 GHz) system. |
| publishDate |
2008 |
| dc.date.none.fl_str_mv |
2008 |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/other |
| format |
other |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10316/11212 https://hdl.handle.net/10316/11212 |
| url |
https://hdl.handle.net/10316/11212 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
Pré-Publicações DMUC. 08-53 (2008) |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.publisher.none.fl_str_mv |
Centro de Matemática da Universidade de Coimbra |
| publisher.none.fl_str_mv |
Centro de Matemática da Universidade de Coimbra |
| dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
| instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
| instacron_str |
RCAAP |
| institution |
RCAAP |
| reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
| repository.mail.fl_str_mv |
info@rcaap.pt |
| _version_ |
1833602338855583744 |