正版保障 假一赔十 可开发票
¥ 58.74 6.6折 ¥ 89 全新
仅1件
作者[美]德梅萃·P.博赛卡斯(Dimitri P. Bertsekas) 著
出版社清华大学出版社
ISBN9787302599814
出版时间2021-08
装帧平装
开本16开
定价89元
货号11539122
上书时间2024-11-20
德梅萃 P.博塞克斯(Dimitri P. Bertseka),美国MIT终身教授,美国国家工程院院士,清华大学复杂与网络化系统研究中心客座教授。电气工程与计算机科学领域国际知名作者,著有《非线性规划》《网络优化》《凸优化》等十几本畅销教材和专著。
1 Introduction
1.1 Structure ofDynamic Programming Problems
1.2 Abstract Dynamic Programming Models
1.2.1 Problem Formulation
1.2.2 Monotonicity and Contraction Properties
1.2.3 Some Examples
1.2.4 Approximation Models-Projected and Aggregation Bellman Equations
1.2.5 Multistep Models-Temporal Difference and ProximalAlgorithms
1.3 Organizationofthe Book
1.4 Notes, Sources, and Exercises
2 Contractive Models
2.1 Bellman's Equation and Optimality Conditions
2.2 Limited Lookahead Policies
2.3 Value Iteration
2.4 Policylteration
2.4.1 Approximate Policylteration
2.4.2 Approximate Policy Iteration Where Policies Converge
2.5 Optimistic Policylteration and A-Policylteration
2.5.1 Convergence ofOptimistic Policylteration
2.5.2 Approximate Optimistic Policylteration
— 没有更多了 —
以下为对购买帮助不大的评价