Quantum tree-based planning

Bibliographic Details
Main Author: Sequeira, Andre
Publication Date: 2021
Other Authors: Santos, Luís Paulo, Barbosa, L. S.
Format: Article
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: https://hdl.handle.net/1822/78050
Summary: Reinforcement Learning is at the core of a recent revolution in Arti cial Intelligence. Simultaneously, we are witnessing the emergence of a new  eld: Quantum Machine Learning. In the context of these two major developments, this work addresses the interplay between Quantum Computing and Reinforcement Learning. Learning by interaction is possible in the quantum setting using the concept of oraculization of environments. The paper extends previous oracular instances to address more general stochastic environments. In this setting, we developed a novel quantum algorithm for near-optimal decision-making based on the Reinforcement Learning paradigm known as Sparse Sampling. The proposed algorithm exhibits a quadratic speedup compared to its classical counterpart. To the best of the authors' knowledge, this is the  first quantum planning algorithm exhibiting a time complexity independent of the number of states of the environment, which makes it suitable for large state space environments, where planning is otherwise intractable.
id RCAP_b4f90a8f75ff09d55f1393790ccca54a
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/78050
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Quantum tree-based planningQuantum computationquantum reinforcement learningsparse samplingPlanningHeuristic algorithmsQuantum computingReinforcement learningQubitEncodingQuantum algorithmCiências Naturais::Ciências da Computação e da InformaçãoScience & TechnologyReinforcement Learning is at the core of a recent revolution in Arti cial Intelligence. Simultaneously, we are witnessing the emergence of a new  eld: Quantum Machine Learning. In the context of these two major developments, this work addresses the interplay between Quantum Computing and Reinforcement Learning. Learning by interaction is possible in the quantum setting using the concept of oraculization of environments. The paper extends previous oracular instances to address more general stochastic environments. In this setting, we developed a novel quantum algorithm for near-optimal decision-making based on the Reinforcement Learning paradigm known as Sparse Sampling. The proposed algorithm exhibits a quadratic speedup compared to its classical counterpart. To the best of the authors' knowledge, this is the  first quantum planning algorithm exhibiting a time complexity independent of the number of states of the environment, which makes it suitable for large state space environments, where planning is otherwise intractable.This work is financed by National Funds through the Portuguese funding agency, FCT - Fundação para a Ciência e a Tecnologia, within project UIDB/50014/2020.IEEEUniversidade do MinhoSequeira, AndreSantos, Luís PauloBarbosa, L. S.20212021-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttps://hdl.handle.net/1822/78050engA. Sequeira, L. P. Santos and L. S. Barbosa, "Quantum Tree-Based Planning," in IEEE Access, vol. 9, pp. 125416-125427, 2021, doi: 10.1109/ACCESS.2021.3110652.2169-353610.1109/ACCESS.2021.3110652https://ieeexplore.ieee.org/document/9530390info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-05-11T06:13:08Zoai:repositorium.sdum.uminho.pt:1822/78050Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T15:45:00.771406Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Quantum tree-based planning
title Quantum tree-based planning
spellingShingle Quantum tree-based planning
Sequeira, Andre
Quantum computation
quantum reinforcement learning
sparse sampling
Planning
Heuristic algorithms
Quantum computing
Reinforcement learning
Qubit
Encoding
Quantum algorithm
Ciências Naturais::Ciências da Computação e da Informação
Science & Technology
title_short Quantum tree-based planning
title_full Quantum tree-based planning
title_fullStr Quantum tree-based planning
title_full_unstemmed Quantum tree-based planning
title_sort Quantum tree-based planning
author Sequeira, Andre
author_facet Sequeira, Andre
Santos, Luís Paulo
Barbosa, L. S.
author_role author
author2 Santos, Luís Paulo
Barbosa, L. S.
author2_role author
author
dc.contributor.none.fl_str_mv Universidade do Minho
dc.contributor.author.fl_str_mv Sequeira, Andre
Santos, Luís Paulo
Barbosa, L. S.
dc.subject.por.fl_str_mv Quantum computation
quantum reinforcement learning
sparse sampling
Planning
Heuristic algorithms
Quantum computing
Reinforcement learning
Qubit
Encoding
Quantum algorithm
Ciências Naturais::Ciências da Computação e da Informação
Science & Technology
topic Quantum computation
quantum reinforcement learning
sparse sampling
Planning
Heuristic algorithms
Quantum computing
Reinforcement learning
Qubit
Encoding
Quantum algorithm
Ciências Naturais::Ciências da Computação e da Informação
Science & Technology
description Reinforcement Learning is at the core of a recent revolution in Arti cial Intelligence. Simultaneously, we are witnessing the emergence of a new  eld: Quantum Machine Learning. In the context of these two major developments, this work addresses the interplay between Quantum Computing and Reinforcement Learning. Learning by interaction is possible in the quantum setting using the concept of oraculization of environments. The paper extends previous oracular instances to address more general stochastic environments. In this setting, we developed a novel quantum algorithm for near-optimal decision-making based on the Reinforcement Learning paradigm known as Sparse Sampling. The proposed algorithm exhibits a quadratic speedup compared to its classical counterpart. To the best of the authors' knowledge, this is the  first quantum planning algorithm exhibiting a time complexity independent of the number of states of the environment, which makes it suitable for large state space environments, where planning is otherwise intractable.
publishDate 2021
dc.date.none.fl_str_mv 2021
2021-01-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/1822/78050
url https://hdl.handle.net/1822/78050
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv A. Sequeira, L. P. Santos and L. S. Barbosa, "Quantum Tree-Based Planning," in IEEE Access, vol. 9, pp. 125416-125427, 2021, doi: 10.1109/ACCESS.2021.3110652.
2169-3536
10.1109/ACCESS.2021.3110652
https://ieeexplore.ieee.org/document/9530390
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv IEEE
publisher.none.fl_str_mv IEEE
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833595519285329920