Handbook of Learning and Approximate Dynamic Programming
Jennie Si
From inside the book
Results 1-3 of 78
Page 247
... iteration portion of the λ-LSPE method with λ to the approximate value iteration (9.18) with weights w(i): one can view λ-LSPE as the approximate value iteration method (9.18) plus noise that asymptotically tends to 0. Note ...
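The snippet relates λ-LSPE to approximate value iteration with state weights w(i). A minimal sketch of that projected value iteration for a fixed policy, on a toy two-state chain with one linear feature and steady-state weights (the weights w(i) and equation (9.18) are the only elements taken from the snippet; every number below is invented):

```python
import numpy as np

# Toy 2-state Markov chain under a fixed policy (all numbers invented).
P = np.array([[0.9, 0.1],
              [0.2, 0.8]])      # transition probabilities
g = np.array([1.0, 2.0])        # one-stage costs
alpha = 0.95                    # discount factor
Phi = np.array([[1.0], [2.0]])  # one linear feature per state
w = np.array([2 / 3, 1 / 3])    # weights w(i): steady-state probabilities of P
W = np.diag(w)

# Approximate value iteration: project T(Phi r_k) onto span(Phi) in the
# w-weighted norm, i.e. r_{k+1} = (Phi' W Phi)^{-1} Phi' W T(Phi r_k).
r = np.zeros(1)
for _ in range(200):
    TJ = g + alpha * P @ (Phi @ r)  # Bellman operator applied to Phi r
    r = np.linalg.solve(Phi.T @ W @ Phi, Phi.T @ W @ TJ)

# r converges to the fixed point of the projected Bellman equation;
# lambda-LSPE behaves like this iteration plus asymptotically vanishing noise.
print(r[0])
```

With steady-state weights the projected Bellman operator is a contraction, so the iterates converge to the unique fixed point (here about 12.24).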
Page 325
... iteration algorithms for determining the optimal policy can be easily developed by combining Lemma 12.5.1 and Theorem 12.5.1. Roughly speaking, at the kth step with policy Lk, we set the policy for the next step (the (k+1)th ...
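The step described in the snippet, evaluating the current policy L_k and then setting the (k+1)th policy greedily, is standard policy iteration. A self-contained sketch on a made-up finite MDP (Lemma 12.5.1 and Theorem 12.5.1 are not reproduced; all numbers are invented for illustration):

```python
import numpy as np

# Hypothetical 2-state, 2-action MDP (all numbers invented).
n_states, n_actions = 2, 2
P = np.array([[[0.8, 0.2], [0.3, 0.7]],    # P[a, s, s'] transition probs
              [[0.5, 0.5], [0.9, 0.1]]])
g = np.array([[2.0, 1.0],                  # g[a, s]: one-stage cost
              [0.5, 3.0]])
alpha = 0.9                                # discount factor

policy = np.zeros(n_states, dtype=int)     # initial policy L_0
for k in range(50):
    # Policy evaluation: solve (I - alpha P_L) J = g_L for policy L_k
    P_L = np.array([P[policy[s], s] for s in range(n_states)])
    g_L = np.array([g[policy[s], s] for s in range(n_states)])
    J = np.linalg.solve(np.eye(n_states) - alpha * P_L, g_L)
    # Policy improvement: the (k+1)th policy is greedy w.r.t. J
    Q = g + alpha * np.einsum('ast,t->as', P, J)   # Q[a, s]
    new_policy = Q.argmin(axis=0)
    if np.array_equal(new_policy, policy):
        break                                      # policy is optimal
    policy = new_policy

print(policy, J)
```

On this toy instance the iteration stabilizes after one improvement step, which is typical for such small MDPs.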
Page 326
... iterations. The on-line policy iteration approach is a counterpart of the online gradient-based optimization approach presented in Section 12.4; the latter applies to parameterized systems and the former to systems within the MDP ...
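For an on-line variant in the MDP setting, a common sketch replaces exact policy evaluation with sample-by-sample updates while acting (nearly) greedily. The Q-learning-style loop below is one hypothetical illustration, not the chapter's algorithm; every number in it is invented:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 2-state, 2-action MDP (all numbers invented).
P = np.array([[[0.8, 0.2], [0.3, 0.7]],   # P[a, s, s'] transition probs
              [[0.5, 0.5], [0.9, 0.1]]])
g = np.array([[2.0, 1.0],                 # g[a, s]: one-stage cost
              [0.5, 3.0]])
alpha = 0.9

Q = np.zeros((2, 2))  # Q[a, s] estimates, updated online from one trajectory
s = 0
for t in range(20000):
    # Act with the current greedy policy, with occasional exploration
    a = rng.integers(2) if rng.random() < 0.1 else Q[:, s].argmin()
    s_next = rng.choice(2, p=P[a, s])
    # One online step: move Q[a, s] toward the sampled Bellman value
    target = g[a, s] + alpha * Q[:, s_next].min()
    Q[a, s] += (1.0 / (1 + t // 100)) * (target - Q[a, s])
    s = s_next

print(Q.argmin(axis=0))  # greedy policy extracted from the learned Q
```

The decaying step size plays the role of the learning rates in the gradient-based counterpart; here the updates act on a table of Q-values rather than on system parameters.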
Contents
Foreword | 1
Reinforcement Learning and Its Relationship to Supervised Learning | 47
Model-Based Adaptive Critic Designs | 65
Copyright
20 other sections not shown
Common terms and phrases
action network actor adaptive critic designs agent algorithm analysis angle applications approach approximate dynamic programming approximate LP backpropagation behavior Bellman equation BPTT chapter computational constraints control law control problems convergence cost critic network curse of dimensionality defined derivatives DHP neurocontroller direct NDP equation error estimate example Figure formulation function approximation fuzzy goal gradient helicopter Heuristic hierarchical IEEE Trans implemented improve initial input iteration learning algorithms learning rate linear programming load Lyapunov function Machine Learning Markov decision processes methods micro-alternator minimize module neural network node nonlinear operating optimal control optimal policy optimization problem output parameters Pareto optimal performance PI controller power system Proc Q-learning reinforcement learning reward robot Section simulation solve space stability stochastic structure supervised learning task techniques Theorem trajectory transition update Utility function value function variables vector voltage weights Werbos