Export Ready — 

On the support of versioning in distributed key-value stores

Bibliographic Details
Main Author: Vilaça, Ricardo
Publication Date: 2014
Other Authors: Oliveira, Rui Carlos Mendes de, Sutra, Pierre, Coelho, Fábio, Felber, Pascal, Rivière, Étienne, Pasin, Marcelo, Matos, Miguel, Schiavoni, Valerio
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/1822/37803
Summary: The ability to access and query data stored in multiple versions is an important asset for many applications, such as Web graph analysis, collaborative editing platforms, data forensics, or correlation mining. The storage and retrieval of versioned data requires a specific API and support from the storage layer. The choice of the data structures used to maintain versioned data has a fundamental impact on the performance of insertions and queries. The appropriate data structure also depends on the nature of the versioned data and the nature of the access patterns. In this paper we study the design and implementation space for providing versioning support on top of a distributed key-value store (KVS). We define an API for versioned data access supporting multiple writers and show that a plain KVS does not offer the necessary synchronization power for implementing this API. We leverage the support for listeners at the KVS level and propose a general construction for implementing arbitrary types of data structures for storing and querying versioned data. We explore the design space of versioned data storage ranging from a flat data structure to a distributed sharded index. The resulting system, \system, is implemented on top of an industrial-grade open-source KVS, Infinispan. Our evaluation, based on real-world Wikipedia access logs, studies the performance of each versioning mechanisms in terms of load balancing, latency and storage overhead in the context of different access scenarios.
id RCAP_c00be3fd8af029ceb496b295b99c1d4e
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/37803
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling On the support of versioning in distributed key-value storesVersioningKey-value storeListenersScience & TechnologyThe ability to access and query data stored in multiple versions is an important asset for many applications, such as Web graph analysis, collaborative editing platforms, data forensics, or correlation mining. The storage and retrieval of versioned data requires a specific API and support from the storage layer. The choice of the data structures used to maintain versioned data has a fundamental impact on the performance of insertions and queries. The appropriate data structure also depends on the nature of the versioned data and the nature of the access patterns. In this paper we study the design and implementation space for providing versioning support on top of a distributed key-value store (KVS). We define an API for versioned data access supporting multiple writers and show that a plain KVS does not offer the necessary synchronization power for implementing this API. We leverage the support for listeners at the KVS level and propose a general construction for implementing arbitrary types of data structures for storing and querying versioned data. We explore the design space of versioned data storage ranging from a flat data structure to a distributed sharded index. The resulting system, \system, is implemented on top of an industrial-grade open-source KVS, Infinispan. Our evaluation, based on real-world Wikipedia access logs, studies the performance of each versioning mechanisms in terms of load balancing, latency and storage overhead in the context of different access scenarios.Institute of Electrical and Electronics Engineers (IEEE)Universidade do MinhoVilaça, RicardoOliveira, Rui Carlos Mendes deSutra, PierreCoelho, FábioFelber, PascalRivière, ÉtiennePasin, MarceloMatos, MiguelSchiavoni, Valerio20142014-01-01T00:00:00Zconference paperinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/1822/37803engFelber P, Pasin M, Rivière E, Schiavoni V, Sutra P, Coelho F, Matos M, Oliveira R, Vilaça R. 2014. On the Support of Versioning in Distributed Key-Value Stores. 33rd IEEE International Symposium on Reliable Distributed Systems - SRDS.97814799558481060-985710.1109/SRDS.2014.35info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-05-11T07:24:26Zoai:repositorium.sdum.uminho.pt:1822/37803Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T16:26:07.332677Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv On the support of versioning in distributed key-value stores
title On the support of versioning in distributed key-value stores
spellingShingle On the support of versioning in distributed key-value stores
Vilaça, Ricardo
Versioning
Key-value store
Listeners
Science & Technology
title_short On the support of versioning in distributed key-value stores
title_full On the support of versioning in distributed key-value stores
title_fullStr On the support of versioning in distributed key-value stores
title_full_unstemmed On the support of versioning in distributed key-value stores
title_sort On the support of versioning in distributed key-value stores
author Vilaça, Ricardo
author_facet Vilaça, Ricardo
Oliveira, Rui Carlos Mendes de
Sutra, Pierre
Coelho, Fábio
Felber, Pascal
Rivière, Étienne
Pasin, Marcelo
Matos, Miguel
Schiavoni, Valerio
author_role author
author2 Oliveira, Rui Carlos Mendes de
Sutra, Pierre
Coelho, Fábio
Felber, Pascal
Rivière, Étienne
Pasin, Marcelo
Matos, Miguel
Schiavoni, Valerio
author2_role author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Universidade do Minho
dc.contributor.author.fl_str_mv Vilaça, Ricardo
Oliveira, Rui Carlos Mendes de
Sutra, Pierre
Coelho, Fábio
Felber, Pascal
Rivière, Étienne
Pasin, Marcelo
Matos, Miguel
Schiavoni, Valerio
dc.subject.por.fl_str_mv Versioning
Key-value store
Listeners
Science & Technology
topic Versioning
Key-value store
Listeners
Science & Technology
description The ability to access and query data stored in multiple versions is an important asset for many applications, such as Web graph analysis, collaborative editing platforms, data forensics, or correlation mining. The storage and retrieval of versioned data requires a specific API and support from the storage layer. The choice of the data structures used to maintain versioned data has a fundamental impact on the performance of insertions and queries. The appropriate data structure also depends on the nature of the versioned data and the nature of the access patterns. In this paper we study the design and implementation space for providing versioning support on top of a distributed key-value store (KVS). We define an API for versioned data access supporting multiple writers and show that a plain KVS does not offer the necessary synchronization power for implementing this API. We leverage the support for listeners at the KVS level and propose a general construction for implementing arbitrary types of data structures for storing and querying versioned data. We explore the design space of versioned data storage ranging from a flat data structure to a distributed sharded index. The resulting system, \system, is implemented on top of an industrial-grade open-source KVS, Infinispan. Our evaluation, based on real-world Wikipedia access logs, studies the performance of each versioning mechanisms in terms of load balancing, latency and storage overhead in the context of different access scenarios.
publishDate 2014
dc.date.none.fl_str_mv 2014
2014-01-01T00:00:00Z
dc.type.driver.fl_str_mv conference paper
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/1822/37803
url http://hdl.handle.net/1822/37803
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Felber P, Pasin M, Rivière E, Schiavoni V, Sutra P, Coelho F, Matos M, Oliveira R, Vilaça R. 2014. On the Support of Versioning in Distributed Key-Value Stores. 33rd IEEE International Symposium on Reliable Distributed Systems - SRDS.
9781479955848
1060-9857
10.1109/SRDS.2014.35
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers (IEEE)
publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers (IEEE)
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833595940192124929