Towards dense linear algebra for hybrid GPU accelerated manycore systems

Baboulin, Marc; Dongarra, Jack; Tomov, Stanimire

Towards dense linear algebra for hybrid GPU accelerated manycore systems

Detalhes bibliográficos
Autor(a) principal:	Baboulin, Marc
Data de Publicação:	2008
Outros Autores:	Dongarra, Jack, Tomov, Stanimire
Tipo de documento:	Outros
Idioma:	eng
Título da fonte:	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Texto Completo:	https://hdl.handle.net/10316/11212
Resumo:	If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accelerators! This is happening today as accelerators, in particular Graphical Processing Units (GPUs), are steadily making their way into the high performance computing (HPC) world. We highlight the trends leading to the idea of hybrid manycore/GPU systems, and we present a set of techniques that can be used to e ciently program them. The presentation is in the context of Dense Linear Algebra (DLA), a major building block for many scienti c computing applications. We motivate the need for new algorithms that would split the computation in a way that would fully exploit the power that each of the hybrid components o ers. As the area of hybrid multicore/GPU computing is still in its infancy, we also argue for its importance in view of what future architectures may look like. We therefore envision the need for a DLA library similar to LAPACK but for hybrid manycore/GPU systems. We illustrate the main ideas with an LUfactorization algorithm where particular techniques are used to reduce the amount of pivoting, resulting in an algorithm achieving up to 388 GFlop/s for single and up to 99:4 GFlop/s for double precision factorization on a hybrid Intel Xeon (2x4 cores @ 2.33 GHz) { NVIDIA GeForce GTX 280 (240 cores @ 1.30 GHz) system.

Metadados do item

id	RCAP_193b4e6636586db27b8e11c21b1f3757
oai_identifier_str	oai:estudogeral.uc.pt:10316/11212
network_acronym_str	RCAP
network_name_str	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str	https://opendoar.ac.uk/repository/7160
spelling	Towards dense linear algebra for hybrid GPU accelerated manycore systemsHybrid computingDense linear algebraParallel algorithmsLU factorizationMulticore processorsGraphic process unitsAcceleratorsIf multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accelerators! This is happening today as accelerators, in particular Graphical Processing Units (GPUs), are steadily making their way into the high performance computing (HPC) world. We highlight the trends leading to the idea of hybrid manycore/GPU systems, and we present a set of techniques that can be used to e ciently program them. The presentation is in the context of Dense Linear Algebra (DLA), a major building block for many scienti c computing applications. We motivate the need for new algorithms that would split the computation in a way that would fully exploit the power that each of the hybrid components o ers. As the area of hybrid multicore/GPU computing is still in its infancy, we also argue for its importance in view of what future architectures may look like. We therefore envision the need for a DLA library similar to LAPACK but for hybrid manycore/GPU systems. We illustrate the main ideas with an LUfactorization algorithm where particular techniques are used to reduce the amount of pivoting, resulting in an algorithm achieving up to 388 GFlop/s for single and up to 99:4 GFlop/s for double precision factorization on a hybrid Intel Xeon (2x4 cores @ 2.33 GHz) { NVIDIA GeForce GTX 280 (240 cores @ 1.30 GHz) system.Centro de Matemática da Universidade de Coimbra2008info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/otherhttps://hdl.handle.net/10316/11212https://hdl.handle.net/10316/11212engPré-Publicações DMUC. 08-53 (2008)Baboulin, MarcDongarra, JackTomov, Stanimireinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2019-06-01T21:20:04Zoai:estudogeral.uc.pt:10316/11212Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T05:23:16.739846Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv	Towards dense linear algebra for hybrid GPU accelerated manycore systems
title	Towards dense linear algebra for hybrid GPU accelerated manycore systems
spellingShingle	Towards dense linear algebra for hybrid GPU accelerated manycore systems Baboulin, Marc Hybrid computing Dense linear algebra Parallel algorithms LU factorization Multicore processors Graphic process units Accelerators
title_short	Towards dense linear algebra for hybrid GPU accelerated manycore systems
title_full	Towards dense linear algebra for hybrid GPU accelerated manycore systems
title_fullStr	Towards dense linear algebra for hybrid GPU accelerated manycore systems
title_full_unstemmed	Towards dense linear algebra for hybrid GPU accelerated manycore systems
title_sort	Towards dense linear algebra for hybrid GPU accelerated manycore systems
author	Baboulin, Marc
author_facet	Baboulin, Marc Dongarra, Jack Tomov, Stanimire
author_role	author
author2	Dongarra, Jack Tomov, Stanimire
author2_role	author author
dc.contributor.author.fl_str_mv	Baboulin, Marc Dongarra, Jack Tomov, Stanimire
dc.subject.por.fl_str_mv	Hybrid computing Dense linear algebra Parallel algorithms LU factorization Multicore processors Graphic process units Accelerators
topic	Hybrid computing Dense linear algebra Parallel algorithms LU factorization Multicore processors Graphic process units Accelerators
description	If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accelerators! This is happening today as accelerators, in particular Graphical Processing Units (GPUs), are steadily making their way into the high performance computing (HPC) world. We highlight the trends leading to the idea of hybrid manycore/GPU systems, and we present a set of techniques that can be used to e ciently program them. The presentation is in the context of Dense Linear Algebra (DLA), a major building block for many scienti c computing applications. We motivate the need for new algorithms that would split the computation in a way that would fully exploit the power that each of the hybrid components o ers. As the area of hybrid multicore/GPU computing is still in its infancy, we also argue for its importance in view of what future architectures may look like. We therefore envision the need for a DLA library similar to LAPACK but for hybrid manycore/GPU systems. We illustrate the main ideas with an LUfactorization algorithm where particular techniques are used to reduce the amount of pivoting, resulting in an algorithm achieving up to 388 GFlop/s for single and up to 99:4 GFlop/s for double precision factorization on a hybrid Intel Xeon (2x4 cores @ 2.33 GHz) { NVIDIA GeForce GTX 280 (240 cores @ 1.30 GHz) system.
publishDate	2008
dc.date.none.fl_str_mv	2008
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/other
format	other
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://hdl.handle.net/10316/11212 https://hdl.handle.net/10316/11212
url	https://hdl.handle.net/10316/11212
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	Pré-Publicações DMUC. 08-53 (2008)
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Centro de Matemática da Universidade de Coimbra
publisher.none.fl_str_mv	Centro de Matemática da Universidade de Coimbra
dc.source.none.fl_str_mv	reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP
instname_str	FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv	info@rcaap.pt
_version_	1833602338855583744

Towards dense linear algebra for hybrid GPU accelerated manycore systems

Registros relacionados