DEDISbench: a benchmark for deduplicated storage systems

Bibliographic details
Main author: Paulo, João
Publication date: 2012
Other authors: Reis, Pedro; Pereira, José; Sousa, António Luís
Language: eng
Source title: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Full text: http://hdl.handle.net/1822/38817
Summary: Lecture Notes in Computer Science, 7566
id RCAP_4e92d8748e18e38496c7a67e6c043748
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/38817
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling DEDISbench: a benchmark for deduplicated storage systems
Deduplication
Storage
Benchmark
Cloud computing
Lecture Notes in Computer Science, 7566
Deduplication is widely accepted as an effective technique for eliminating duplicated data in backup and archival systems. Nowadays, deduplication is also becoming appealing in cloud computing, where large-scale virtualized storage infrastructures hold huge data volumes with a significant share of duplicated content. There have thus been several proposals for embedding deduplication in storage appliances and file systems, providing different performance trade-offs while targeting both user and application data, as well as virtual machine images. It is, however, hard to determine to what extent deduplication is useful in a particular setting and which technique will provide the best results. In fact, existing disk I/O micro-benchmarks are not designed for evaluating deduplication systems, as they follow simplistic approaches for generating the written data, which lead to unrealistic amounts of duplicates. We address this with DEDISbench, a novel micro-benchmark for evaluating the disk I/O performance of block-based deduplication systems. As the main contribution, we introduce the generation of a realistic duplicate distribution based on real datasets. Moreover, DEDISbench also allows simulating access hotspots and different load intensities for I/O operations. The usefulness of DEDISbench is shown by comparing it with the open-source disk I/O micro-benchmarks Bonnie++ and IOzone in assessing two open-source deduplication systems, Opendedup and Lessfs, using Ext4 as a baseline. As a secondary contribution, our results lead to novel insights into the performance of these file systems.
Springer
Universidade do Minho
Paulo, João
Reis, Pedro
Pereira, José
Sousa, António Luís
2012
2012-01-01T00:00:00Z
conference paper
info:eu-repo/semantics/publishedVersion
application/pdf
http://hdl.handle.net/1822/38817
eng
978-3-642-33614-0
0302-9743
10.1007/978-3-642-33615-7_9
http://link.springer.com/chapter/10.1007%2F978-3-642-33615-7_9
info:eu-repo/semantics/openAccess
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
2024-05-11T05:30:37Z
oai:repositorium.sdum.uminho.pt:1822/38817
Portal Agregador
ONG
https://www.rcaap.pt/oai/openaire
info@rcaap.pt
opendoar:https://opendoar.ac.uk/repository/7160
2025-05-28T15:20:50.456461
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
dc.title.none.fl_str_mv DEDISbench: a benchmark for deduplicated storage systems
author Paulo, João
dc.contributor.none.fl_str_mv Universidade do Minho
dc.contributor.author.fl_str_mv Paulo, João
Reis, Pedro
Pereira, José
Sousa, António Luís
dc.subject.por.fl_str_mv Deduplication
Storage
Benchmark
Cloud computing
publishDate 2012
dc.date.none.fl_str_mv 2012
2012-01-01T00:00:00Z
dc.type.driver.fl_str_mv conference paper
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/1822/38817
dc.language.iso.fl_str_mv eng
dc.relation.none.fl_str_mv 978-3-642-33614-0
0302-9743
10.1007/978-3-642-33615-7_9
http://link.springer.com/chapter/10.1007%2F978-3-642-33615-7_9
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Springer
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt