PROGRAMAÇÃO DINÂMICA HEURÍSTICA DUAL E REDES DE FUNÇÕES DE BASE RADIAL PARA SOLUÇÃO DA EQUAÇÃO DE HAMILTON-JACOBI-BELLMAN EM PROBLEMAS DE CONTROLE ÓTIMO

Andrade, Gustavo Araújo de

Bibliographic details
Year of defense:	2014
Main author:	Andrade, Gustavo Araújo de
Advisor:	FONSECA NETO, João Viana da
Defense committee:	Serra, Ginalber Luiz de Oliveira
Document type:	Master's dissertation
Access type:	Open access
Language:	por
Defending institution:	Universidade Federal do Maranhão
Graduate program:	PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET
Department:	Engenharia
Country:	BR
Keywords in Portuguese:	Controle Ótimo; Aprendizagem por Reforço; Programação Dinâmica Aproximada; Programação Heurística Dual; Redes de Função de Base Radial
Keywords in English:	Optimal Control; Reinforcement Learning; Approximate Dynamic Programming; Dual Heuristic Programming; Radial Basis Function Neural Networks
CNPq knowledge area:	CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA
Access link:	http://tedebc.ufma.br:8080/jspui/handle/tede/517
Abstract:	The main objective of this work is to present the development of online learning algorithms for solving the algebraic Hamilton-Jacobi-Bellman equation. The concepts covered focus on developing a methodology for control systems, using techniques that aim to design online adaptive controllers that reject sensor noise, parametric variations, and modeling errors. Concepts of neurodynamic programming and reinforcement learning are discussed in order to design algorithms in which the context of a given operating point causes the control system to adapt and thus deliver performance that meets the design specifications. Methods are designed for online estimation of the adaptive critic, concentrating on techniques for estimating the gradient of the environment value function.
