Neuro-Dynamic Programming

by Dimitri P. Bertsekas and John Tsitsiklis

ISBN: 1-886529-10-8
Publication: September 1996, 512 pages, hardcover
Price: $69.00

This is the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.

Neuro-dynamic programming uses neural network approximations to overcome the "curse of dimensionality" and the "curse of modeling" that have been the bottlenecks to the practical application of dynamic programming and stochastic control to complex problems. The methodology allows systems to learn about their behavior through simulation, and to improve their performance through iterative reinforcement.

This book provides the first systematic presentation of the science and the art behind this exciting and far-reaching methodology.

The book develops a comprehensive analysis of neuro-dynamic programming algorithms, and guides the reader to their successful application through case studies from complex problem areas.

Review by George Cybenko for IEEE Computational Science and Engineering, May 1998:

"Neurodynamic Programming is a remarkable monograph that integrates a sweeping mathematical and computational landscape into a coherent body of rigorous knowledge. The topics are current, the writing is clear and to the point, the examples are comprehensive and the historical notes and comments are scholarly."

"In this monograph, Bertsekas and Tsitsiklis have performed a Herculean task that will be studied and appreciated by generations to come. I strongly recommend it to scientists and engineers eager to seriously understand the mathematics and computations behind modern behavioral machine learning."

Comments by Martin L. Puterman:

"I believe that Neuro-Dynamic Programming by Bertsekas and Tsitsiklis will have a major impact on operations research theory and practice over the next decade. The methods it presents will produce solution of many large scale sequential optimization problems that up to now have proved intractable. The book systematically provides countless new ideas, concepts and problems, and will stimulate considerable future research in operations research and computer science on this exciting and important subject."

Among its special features, the book:

Describes and unifies a large number of NDP methods, including several that are new
Describes new approaches to formulation and solution of important problems in stochastic optimal control, sequential decision making, and discrete optimization
Rigorously explains the mathematical principles behind NDP
Illustrates through examples and case studies the practical application of NDP to complex problems from optimal resource allocation, optimal feedback control, data communications, game playing, and combinatorial optimization
Presents extensive background and new research material on dynamic programming and neural network training

The authors are Professors of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology, and members of the US National Academy of Engineering. They have been awarded several prestigious awards, including the INFORMS John von Neumann Theory Prize, for the contributions of the research monographs "Neuro-Dynamic Programming" and "Parallel and Distributed Computation".

Pointers to Related Literature:

Research papers and other material by Dimitri P. Bertsekas
Research papers and other material by John N. Tsitsiklis

[Return to Athena Scientific Homepage]