Two high-performance alternatives to ZLIB scientific-data compression
Main Author: | Almeida, Samuel |
---|---|
Publication Date: | 2014 |
Other Authors: | Oliveira, Vitor Serafim Pereira; Pina, António Manuel Silva; Melle-Franco, M. |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | http://hdl.handle.net/1822/31490 |
Summary: | ZLIB is used in diverse frameworks by the scientific community, both to reduce disk storage and to alleviate pressure on I/O. As it becomes a bottleneck on multi-core systems, higher-throughput alternatives must be considered, exploring parallelism and/or more effective compression schemes. This work provides a comparative study of the ZLIB, LZ4 and FPC compressors (serial and parallel implementations), focusing on compression ratio (CR), bandwidth and speedup. LZ4 provides very high throughput (decompressing at over 1 GB/s versus 120 MB/s for ZLIB), but its CR degrades by 5-10%. FPC also provides higher throughput than ZLIB, but its CR varies widely with the data. ZLIB and LZ4 can achieve almost linear speedups for some datasets, while the current implementation of parallel FPC provides little, if any, performance gain. For the ROOT dataset, LZ4 was found to provide higher CR, better scalability and lower memory consumption than FPC, thus emerging as a better alternative to ZLIB. |
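
The three metrics named in the summary (CR, bandwidth and speedup) can be illustrated with a minimal sketch. It is not the authors' benchmark: it assumes CR is defined as original size divided by compressed size, uses only Python's standard `zlib` module (LZ4 and FPC are not in the standard library, so they are left out), and runs on synthetic stand-in data rather than the ROOT dataset. The helper names, the chunk size and the worker count are all hypothetical choices made for illustration.

```python
import time
import zlib
from concurrent.futures import ThreadPoolExecutor


def compression_ratio(raw: bytes, packed: bytes) -> float:
    """CR as original size over compressed size (assumed definition)."""
    return len(raw) / len(packed)


def throughput_mb_s(n_bytes: int, seconds: float) -> float:
    """Bandwidth in MB/s, taking 1 MB as 10**6 bytes."""
    return n_bytes / 1e6 / seconds


def compress_chunks(raw: bytes, chunk_size: int, workers: int) -> list[bytes]:
    """Compress fixed-size chunks independently, using `workers` threads."""
    chunks = [raw[i:i + chunk_size] for i in range(0, len(raw), chunk_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(zlib.compress, chunks))


def decompress_chunks(packed: list[bytes]) -> bytes:
    """Reverse compress_chunks by decompressing and concatenating in order."""
    return b"".join(zlib.decompress(p) for p in packed)


def timed(fn, *args):
    """Return (result, elapsed seconds) for a single call."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


# Synthetic stand-in data: 1,000,000 bytes of repeating 32-bit integers.
raw = b"".join(i.to_bytes(4, "little") for i in range(1000)) * 250

serial, t_serial = timed(compress_chunks, raw, 64 * 1024, 1)
parallel, t_parallel = timed(compress_chunks, raw, 64 * 1024, 4)

cr = compression_ratio(raw, b"".join(serial))
print(f"CR: {cr:.2f}")
print(f"serial bandwidth: {throughput_mb_s(len(raw), t_serial):.1f} MB/s")
print(f"speedup with 4 workers: {t_serial / t_parallel:.2f}x")
```

Compressing chunks independently is what makes the work parallelisable, at the cost of a slightly lower CR than compressing the whole buffer at once. The measured speedup depends heavily on the machine, the chunk size and the data, so the printed figures should not be compared with those reported in the summary.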
id |
RCAP_a80e698ec3ad0eace56628c016fd3b52 |
---|---|
oai_identifier_str |
oai:repositorium.sdum.uminho.pt:1822/31490 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
This work is funded by National Funds through the FCT Fundação para a Ciência e a Tecnologia (Portuguese Foundation for Science and Technology) within project PEst-OE/EEI/UI0752/2014, UT Austin - Portugal FCT grant SFRH/BD/47840/2008, and the resources from the project SeARCH funded under contract CONC-REEQ/443/2005. We would also like to thank Nuno Castro and Rafael Silva for their contributions. |
dc.title.none.fl_str_mv |
Two high-performance alternatives to ZLIB scientific-data compression |
title |
Two high-performance alternatives to ZLIB scientific-data compression |
author |
Almeida, Samuel |
author_facet |
Almeida, Samuel; Oliveira, Vitor Serafim Pereira; Pina, António Manuel Silva; Melle-Franco, M. |
author_role |
author |
author2 |
Oliveira, Vitor Serafim Pereira; Pina, António Manuel Silva; Melle-Franco, M. |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
Universidade do Minho |
dc.contributor.author.fl_str_mv |
Almeida, Samuel; Oliveira, Vitor Serafim Pereira; Pina, António Manuel Silva; Melle-Franco, M. |
dc.subject.por.fl_str_mv |
Data Compression; Scientific Data; ROOT; Parallel Compression; ZLIB; LZ4; FPC; Science & Technology |
topic |
Data Compression; Scientific Data; ROOT; Parallel Compression; ZLIB; LZ4; FPC; Science & Technology |
publishDate |
2014 |
dc.date.none.fl_str_mv |
2014-11; 2014-11-01T00:00:00Z |
dc.type.driver.fl_str_mv |
conference paper |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/1822/31490 |
url |
http://hdl.handle.net/1822/31490 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
978-3-319-09146-4 (ISBN); 0302-9743 (ISSN); 10.1007/978-3-319-09147-1_45 (DOI) |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Springer International Publishing AG |
publisher.none.fl_str_mv |
Springer International Publishing AG |
dc.source.none.fl_str_mv |
reponame: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP); instname: FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia; instacron: RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833594881937768448 |