£60.77

Reinforcement Learning & Optimal Control: Theorems, Proofs, and Python Implementations (Computational Mathematics Library)

Price data last checked 30 days ago - will refresh soon

It has never been this cheap. We have no record of a lower price.

£61 today · the same price on every one of the 61 days tracked

Price History & Forecast

Last 61 days • 61 data points (No recent data available)

[Price chart: historical price from 21 February 2026 to 22 April 2026; price axis from £57.73 to £63.81; latest price £60.77. Grey patches mark out-of-stock periods; cheaper prices sit lower on the chart.]

Price Distribution

Price distribution over 61 days • 1 price level

[Bar chart: days at each price; £61 for 61 days]

Price Analysis

Most common price: £61 (61 days, 100.0%)

Price range: £61 - £61

Price levels: 1 distinct price over 61 days

Description

A rigorous graduate-level reference unifying dynamic programming, stochastic control, and modern reinforcement learning. Twenty-four focused chapters build from measure-theoretic probability and operator theory to cutting-edge algorithms, with complete proofs, sharp assumptions, and tight error bounds.

Mathematical foundations. Borel-space MDPs, stochastic kernels, measurable policies, ergodicity, and trajectory measures; contraction mappings, span seminorms, and fixed-point theory.
Dynamic programming at scale. Value and policy iteration, modified/asynchronous variants, linear programming duality and occupancy measures, performance difference lemmas, finite-time stopping rules.
Long-run criteria. Average cost, the Poisson equation, ACOE, relative value iteration, Blackwell optimality, drift conditions for stability.
Episodic control. Stochastic shortest path models with proper policies, boundary conditions, and convergence without discounting.
Partial observability. Belief-MDPs, piecewise-linear convex value functions, Bayes filters, and stability of belief dynamics.
Continuous time. HJB PDEs, verification theorems, viscosity solutions, and convergent numerical schemes.
Linear systems. LQR and algebraic Riccati equations; LQG, Kalman filtering, and the separation principle.
Nonlinear optimal control. Differential dynamic programming, iLQR, and model predictive control with recursive feasibility and Lyapunov stability.
Exploration fundamentals. Stochastic and contextual bandits, UCB, Thompson sampling, and information-theoretic lower bounds.
Stochastic approximation. Robbins–Monro, ODE method, two-time-scale analysis, Polyak–Ruppert averaging, and Markovian noise.
Temporal-difference learning. TD(λ), LSTD, GTD-family methods, emphatic weighting, and off-policy stability.
Function approximation theory. Projected Bellman equations, MSPBE, fitted value and Q iteration, Rademacher complexity, concentrability, and Bellman rank.
Control algorithms. Q-learning, SARSA, double Q-learning, finite-time tabular rates, and divergence with approximation.
Policy optimization. Policy gradient theorem, variance reduction, natural gradients, trust-region methods, monotonic improvement guarantees.
Actor–critic and entropy regularization. GAE, PPO, SAC, mirror descent and primal–dual views, two-time-scale convergence.
Model-based RL. System identification, adaptive LQR, OFU and PSRL, regret bounds, Dyna-style planning, and simulation lemmas.
Offline evaluation and control. Importance sampling, doubly robust estimators, FQE/FQI, high-confidence bounds, and pessimism for reliability.
Safety and robustness. Constrained MDPs, Lagrangian methods, CVaR and risk envelopes, control barrier functions, robust and distributionally robust RL.
Multi-agent settings. Zero-sum and general-sum Markov games, Shapley operators, equilibrium computation, and decentralized learning dynamics.

Product Specifications

Format: Paperback
Domain: Amazon UK
Release Date: 08 November 2025
Listed Since: 08 November 2025

Barcode

No barcode data available

Similar Products You Might Like

Reinforcement Learning for Decision & Optimal Control - Springer
97% match
£61.49 13 Jan 2026

Frontiers Of Intelligent Control And Information Processing
97% match
World Scientific Publishing Company
£71.10 14 Jan 2026

Reinforcement Learning for Sequential Decision and Optimal Control
97% match
£76.78 13 Jan 2026

Reinforcement Learning and Dynamic Programming Using Function Approximators: 39 (Automation and Control Engineering)
97% match
CRC Press
£106.83 28 Feb 2026

Deep Reinforcement Learning Hands-On: Apply modern RL methods to practical problems of chatbots, robotics, discrete optimization, web automation, and more
97% match
Packt Publishing
£58.15 01 May 2026

Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning: 55 (Operations Research/Computer Science Interfaces Series, 55)
97% match
Springer
£102.76 08 Mar 2026

Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning: 55 (Operations Research/Computer Science Interfaces Series, 55)
97% match
Springer
£93.19 27 Feb 2026

Deep Reinforcement Learning and Its Industrial Use Cases: AI for Real-World Applications
97% match
£139.35 08 Jan 2026

Decision Making: Uncertainty, Imperfection, Deliberation and Scalability: 538 (Studies in Computational Intelligence, 538)
97% match
Springer
£76.14 08 Apr 2026

Nonlinear System I: Proceedings of the 14th World Congress, International Federation of Automatic Control, Beijing, P.R. China, 5-9 July 1999 (IFAC ... Volumes) (IFAC Proceedings Volumes, Volume F)
97% match
Elsevier
£100.00 11 Apr 2026

Birkhauser Stochastic Networked Control Systems Book
97% match
Birkhauser
£124.99 17 Apr 2026

Springer Security of Cyber-Physical Systems - Volume 396
97% match
Springer
£103.81 25 Apr 2026

Multi-Agent Reinforcement Learning: Foundations and Modern Approaches
97% match
MIT Press
£50.81 08 Feb 2026

Proceedings of 2021 Chinese Intelligent Systems Conference: Volume II: 804 (Lecture Notes in Electrical Engineering, 804)
96% match
Springer
£119.38 04 Apr 2026

Proceedings of 2021 Chinese Intelligent Systems Conference: Volume I: 803 (Lecture Notes in Electrical Engineering, 803)
96% match
Springer
£254.01 23 Feb 2026

Model-based Reinforcement Learning: A Survey (Foundations and Trends® in Machine Learning)
96% match
Out of Stock 04 Apr 2026

Proceedings of 2020 Chinese Intelligent Systems Conference: Volume I: 705 (Lecture Notes in Electrical Engineering, 705)
96% match
Springer
£203.06 20 Apr 2026

Recent Advances in Control and Filtering of Dynamic Systems with Constrained Signals: 170 (Studies in Systems, Decision and Control)
96% match
Springer
£102.04 12 May 2026

Self-Learning Optimal Control of Nonlinear Systems: Adaptive Dynamic Programming Approach: 103 (Studies in Systems, Decision and Control, 103)
96% match
Springer
£96.96 05 May 2026

Control of Complex Systems: Theory and Applications
96% match
Butterworth-Heinemann
£110.65 09 Mar 2026

Routledge - Earnings Management and Fintech-Driven Incentives
96% match
Routledge
£136.33 18 Apr 2026

Reinforcement and Systemic Machine Learning for Decision Making (IEEE Press Series on Systems Science and Engineering)
96% match
Wiley-IEEE Press
£91.69 21 Feb 2026

Practical Issues of Intelligent Innovations: 140 (Studies in Systems, Decision and Control, 140)
96% match
Springer
£107.73 14 Apr 2026

Recent Advances in Soft Computing and Cybernetics: 403 (Studies in Fuzziness and Soft Computing, 403)
96% match
Springer
£115.63 13 Apr 2026