CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA

Maciel, Allan James Ferreira

CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA

Detalhes bibliográficos
Ano de defesa:	2012
Autor(a) principal:	Maciel, Allan James Ferreira
Orientador(a):	FONSECA NETO, João Viana da
Banca de defesa:	Serra, Ginalber Luiz de Oliveira
Tipo de documento:	Dissertação
Tipo de acesso:	Acesso aberto
Idioma:	por
Instituição de defesa:	Universidade Federal do Maranhão
Programa de Pós-Graduação:	PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET
Departamento:	Engenharia
País:	BR
Palavras-chave em Português:	Programação Dinâmica Heurística Controle Multivariável Controle Ótimo Regulador Quadrático Linear Discreto Mínimos Quadrados Recursivos Controle Digital
Palavras-chave em Inglês:	Heuristic Dynamic Programming Multivariable Control Optimal Control Discrete Linear Quadratic Regulator Recursive Least Squares Digital Control
Área do conhecimento CNPq:	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Link de acesso:	http://tedebc.ufma.br:8080/jspui/handle/tede/494
Resumo:	The union of methodologies for optimal control and dynamics programming has stimulated the development of algorithms for realization of discrete control systems of the type linear quadratic regulator (DLQR). The methodology is based on reinforcement learning methods based on temporal differences and approximate dynamic programming. The proposed method combines the approach of the value function by method RLS (recursive least squares) and approximate policy iteration schemes heuristic dynamic programming (HDP). The approach is directed to the assessment of convergence of the solution DLQR and the heuristic weighting matrices 􀜳 and 􀜴 of the utility function associated with DLQR. The investigation of convergence properties related to consistency, persistent excitation and polarization of the RLS estimator is performed. The methodology involved in a project achievements online DLQR controllers and is evaluated in a fourth order multivariable dynamic system.

CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA

Registros relacionados