Handbook of Learning and Approximate Dynamic ProgrammingJennie Si, Andrew G. Barto, Warren B. Powell, Don Wunsch
|
From inside the book
Page 635
... Electric Power System Applications of Optimization , Marcel Dekker Inc. , New York , 2001 . 3. A. A. Fouad and V. Vittal , Power Systems Transient Stability Analysis Using the Transient Energy Function Method , Prentice - Hall ...
... Electric Power System Applications of Optimization , Marcel Dekker Inc. , New York , 2001 . 3. A. A. Fouad and V. Vittal , Power Systems Transient Stability Analysis Using the Transient Energy Function Method , Prentice - Hall ...
Contents
Foreword | 1 |
Reinforcement Learning and Its Relationship to Supervised Learning | 47 |
ModelBased Adaptive Critic Designs | 65 |
Copyright | |
20 other sections not shown
Other editions - View all
Common terms and phrases
action actor adaptive critic agent algorithm analysis angle applications approach approximate behavior bound called changes chapter complex computational consider constraints continuous convergence cost decision defined depends derivatives described determine developed direct NDP discussed distribution dynamic programming effect equation error estimate example Figure formulation fuzzy given goal gradient IEEE implemented important improve initial input Intelligence involves iteration limited linear load Machine Markov measure methods minimize neural network nonlinear objective observed obtained operating optimal output parameters performance plant position possible power system presented probability problem Proc reference reinforcement learning represents respect reward robust sample shown shows simulation solution solve space stability step stochastic structure Table task techniques termination transition unit University update utility value function variables vector voltage weights