Two high-performance alternatives to ZLIB scientific-data compression

Bibliographic Details
Main Author: Almeida, Samuel
Publication Date: 2014
Other Authors: Oliveira, Vitor Serafim Pereira; Pina, António Manuel Silva; Melle-Franco, M.
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/1822/31490
Summary: ZLIB is used in diverse frameworks by the scientific community, both to reduce disk storage and to relieve pressure on I/O. As it becomes a bottleneck on multi-core systems, higher-throughput alternatives must be considered, exploiting parallelism and/or more effective compression schemes. This work provides a comparative study of the ZLIB, LZ4 and FPC compressors (serial and parallel implementations), focusing on compression ratio (CR), bandwidth and speedup. LZ4 provides very high throughput (decompressing at over 1 GB/s versus 120 MB/s for ZLIB), but its CR degrades by 5-10%. FPC also provides higher throughput than ZLIB, but its CR varies widely with the data. ZLIB and LZ4 can achieve almost linear speedups for some datasets, while the current implementation of parallel FPC provides little if any performance gain. For the ROOT dataset, LZ4 was found to provide higher CR, better scalability and lower memory consumption than FPC, thus emerging as a better alternative to ZLIB.
id RCAP_a80e698ec3ad0eace56628c016fd3b52
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/31490
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
Acknowledgements: This work is funded by National Funds through the FCT, Fundação para a Ciência e a Tecnologia (Portuguese Foundation for Science and Technology), within project PEst-OE/EEI/UI0752/2014, UT Austin - Portugal FCT grant SFRH/BD/47840/2008, and the resources from the project SeARCH funded under contract CONC-REEQ/443/2005. We would also like to thank Nuno Castro and Rafael Silva for their contributions.
dc.contributor.none.fl_str_mv Universidade do Minho
topic Data Compression
Scientific Data
ROOT
Parallel Compression
ZLIB
LZ4
FPC
Science & Technology
publishDate 2014
dc.date.none.fl_str_mv 2014-11
2014-11-01T00:00:00Z
dc.type.driver.fl_str_mv conference paper
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/1822/31490
dc.language.iso.fl_str_mv eng
dc.relation.none.fl_str_mv ISBN 978-3-319-09146-4
ISSN 0302-9743
DOI 10.1007/978-3-319-09147-1_45
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Springer International Publishing AG
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt