área de pesquisa Data Driven State Reconstruction of Dynamical System Based on Approximate Dynamic Programming and Reinforcement Learning Fábio Nogueira da Silva Tuning heuristics and convergence analysis of reinforcement learning algorithm for online data-based optimal control design