Hadoop mapreduce tolerante a faltas bizantinas

Detalhes bibliográficos
Autor(a) principal: da Costa, Pedro Alexandre Reis Sá
Data de Publicação: 2011
Tipo de documento: Dissertação
Idioma: por
Título da fonte: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Texto Completo: http://hdl.handle.net/10451/13903
Resumo: MapReduce is often used to run critical jobs such as scientific data analysis. However, evidence in the literature shows that arbitrary faults do occur and can probably corrupt the results of MapReduce jobs. MapReduce runtimes like Hadoop tolerate crash faults, butnot arbitrary or Byzantine faults. In this work, it is presented a MapReduce algorithm andprototype that tolerate these faults. An experimental evaluation shows that the execution of a job with the implemented algorithm uses twice the resources of the original Hadoop,instead of the 3 or 4 times more that would be achieved with the direct application of common Byzantine fault-tolerance paradigms. It is believed that this cost is acceptable for critical applications that require that level of fault tolerance.
id RCAP_25a447d0c8cd4af9c4e17aca403db2bf
oai_identifier_str oai:repositorio.ulisboa.pt:10455/6786
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Hadoop mapreduce tolerante a faltas bizantinasHadoop MapReducearbitrary faultsreplicationByzantine Fault-ToleranceMapReduce is often used to run critical jobs such as scientific data analysis. However, evidence in the literature shows that arbitrary faults do occur and can probably corrupt the results of MapReduce jobs. MapReduce runtimes like Hadoop tolerate crash faults, butnot arbitrary or Byzantine faults. In this work, it is presented a MapReduce algorithm andprototype that tolerate these faults. An experimental evaluation shows that the execution of a job with the implemented algorithm uses twice the resources of the original Hadoop,instead of the 3 or 4 times more that would be achieved with the direct application of common Byzantine fault-tolerance paradigms. It is believed that this cost is acceptable for critical applications that require that level of fault tolerance.Pasin, MarceloRepositório da Universidade de Lisboada Costa, Pedro Alexandre Reis Sá2012-02-13T15:05:24Z20112011-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10451/13903porinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-17T13:11:58Zoai:repositorio.ulisboa.pt:10455/6786Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T02:37:23.290253Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Hadoop mapreduce tolerante a faltas bizantinas
title Hadoop mapreduce tolerante a faltas bizantinas
spellingShingle Hadoop mapreduce tolerante a faltas bizantinas
da Costa, Pedro Alexandre Reis Sá
Hadoop MapReduce
arbitrary faults
replication
Byzantine Fault-Tolerance
title_short Hadoop mapreduce tolerante a faltas bizantinas
title_full Hadoop mapreduce tolerante a faltas bizantinas
title_fullStr Hadoop mapreduce tolerante a faltas bizantinas
title_full_unstemmed Hadoop mapreduce tolerante a faltas bizantinas
title_sort Hadoop mapreduce tolerante a faltas bizantinas
author da Costa, Pedro Alexandre Reis Sá
author_facet da Costa, Pedro Alexandre Reis Sá
author_role author
dc.contributor.none.fl_str_mv Pasin, Marcelo
Repositório da Universidade de Lisboa
dc.contributor.author.fl_str_mv da Costa, Pedro Alexandre Reis Sá
dc.subject.por.fl_str_mv Hadoop MapReduce
arbitrary faults
replication
Byzantine Fault-Tolerance
topic Hadoop MapReduce
arbitrary faults
replication
Byzantine Fault-Tolerance
description MapReduce is often used to run critical jobs such as scientific data analysis. However, evidence in the literature shows that arbitrary faults do occur and can probably corrupt the results of MapReduce jobs. MapReduce runtimes like Hadoop tolerate crash faults, butnot arbitrary or Byzantine faults. In this work, it is presented a MapReduce algorithm andprototype that tolerate these faults. An experimental evaluation shows that the execution of a job with the implemented algorithm uses twice the resources of the original Hadoop,instead of the 3 or 4 times more that would be achieved with the direct application of common Byzantine fault-tolerance paradigms. It is believed that this cost is acceptable for critical applications that require that level of fault tolerance.
publishDate 2011
dc.date.none.fl_str_mv 2011
2011-01-01T00:00:00Z
2012-02-13T15:05:24Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10451/13903
url http://hdl.handle.net/10451/13903
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833601429852389376