Summary
This report briefly explains the principles of dynamic programming. Only discrete-time systems are discussed. A method for online approximation of the optimal policy is then developed from the theory of dynamic programming.