On the use of suffix arrays for memory-efficient Lempel-Ziv data compression

Detalhes bibliográficos
Autor(a) principal: J. Ferreira, Artur
Data de Publicação: 2009
Outros Autores: Oliveira, Arlindo L., Figueiredo, Mario
Idioma: eng
Título da fonte: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Texto Completo: http://hdl.handle.net/10400.21/17906
Resumo: The Lempel-Ziv 77 (LZ77) and LZ-Storer-Szymanski (LZSS) text compression algorithms use a sliding window over the sequence of symbols, with two sub-windows: the dictionary (symbols already encoded) and the look-ahead-buffer (LAB) (symbols not yet encoded). Binary search trees and suffix trees (ST) have been used to speedup the search of the LAB over the dictionary, at the expense of high memory usage [1]. A suffix array (SA) is a simpler, more compact data structure which uses (much) less memory [2,3] to hold the same information. The SA for a length m string is an array of integers ([1], ...[k], ...a[m]) that stores the lexicographic order of suffix k of the string; sub-string searching, as used in LZ77/LZSS, is done by searching the SA.
id RCAP_b65ad8ffb6eca83492c44ac8bd6dfd37
oai_identifier_str oai:repositorio.ipl.pt:10400.21/17906
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling On the use of suffix arrays for memory-efficient Lempel-Ziv data compressionLempel-Ziv 77 (LZ77)LZ-Storer-Szymanski (LZSS)The Lempel-Ziv 77 (LZ77) and LZ-Storer-Szymanski (LZSS) text compression algorithms use a sliding window over the sequence of symbols, with two sub-windows: the dictionary (symbols already encoded) and the look-ahead-buffer (LAB) (symbols not yet encoded). Binary search trees and suffix trees (ST) have been used to speedup the search of the LAB over the dictionary, at the expense of high memory usage [1]. A suffix array (SA) is a simpler, more compact data structure which uses (much) less memory [2,3] to hold the same information. The SA for a length m string is an array of integers ([1], ...[k], ...a[m]) that stores the lexicographic order of suffix k of the string; sub-string searching, as used in LZ77/LZSS, is done by searching the SA.IEEERCIPLJ. Ferreira, ArturOliveira, Arlindo L.Figueiredo, Mario2024-11-14T11:18:28Z2009-05-262009-05-26T00:00:00Zconference objectinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10400.21/17906eng978-0-7695-3592-01068-031410.1109/DCC.2009.50info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-02-12T10:38:54Zoai:repositorio.ipl.pt:10400.21/17906Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T20:07:37.040662Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv On the use of suffix arrays for memory-efficient Lempel-Ziv data compression
title On the use of suffix arrays for memory-efficient Lempel-Ziv data compression
spellingShingle On the use of suffix arrays for memory-efficient Lempel-Ziv data compression
J. Ferreira, Artur
Lempel-Ziv 77 (LZ77)
LZ-Storer-Szymanski (LZSS)
title_short On the use of suffix arrays for memory-efficient Lempel-Ziv data compression
title_full On the use of suffix arrays for memory-efficient Lempel-Ziv data compression
title_fullStr On the use of suffix arrays for memory-efficient Lempel-Ziv data compression
title_full_unstemmed On the use of suffix arrays for memory-efficient Lempel-Ziv data compression
title_sort On the use of suffix arrays for memory-efficient Lempel-Ziv data compression
author J. Ferreira, Artur
author_facet J. Ferreira, Artur
Oliveira, Arlindo L.
Figueiredo, Mario
author_role author
author2 Oliveira, Arlindo L.
Figueiredo, Mario
author2_role author
author
dc.contributor.none.fl_str_mv RCIPL
dc.contributor.author.fl_str_mv J. Ferreira, Artur
Oliveira, Arlindo L.
Figueiredo, Mario
dc.subject.por.fl_str_mv Lempel-Ziv 77 (LZ77)
LZ-Storer-Szymanski (LZSS)
topic Lempel-Ziv 77 (LZ77)
LZ-Storer-Szymanski (LZSS)
description The Lempel-Ziv 77 (LZ77) and LZ-Storer-Szymanski (LZSS) text compression algorithms use a sliding window over the sequence of symbols, with two sub-windows: the dictionary (symbols already encoded) and the look-ahead-buffer (LAB) (symbols not yet encoded). Binary search trees and suffix trees (ST) have been used to speedup the search of the LAB over the dictionary, at the expense of high memory usage [1]. A suffix array (SA) is a simpler, more compact data structure which uses (much) less memory [2,3] to hold the same information. The SA for a length m string is an array of integers ([1], ...[k], ...a[m]) that stores the lexicographic order of suffix k of the string; sub-string searching, as used in LZ77/LZSS, is done by searching the SA.
publishDate 2009
dc.date.none.fl_str_mv 2009-05-26
2009-05-26T00:00:00Z
2024-11-14T11:18:28Z
dc.type.driver.fl_str_mv conference object
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.21/17906
url http://hdl.handle.net/10400.21/17906
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 978-0-7695-3592-0
1068-0314
10.1109/DCC.2009.50
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv IEEE
publisher.none.fl_str_mv IEEE
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833598502589956096