Book Description
A broad introduction to algorithms for decision making under uncertainty, introducing the underlying mathematical problem formulations and the algorithms for solving them.
Automated decision-making systems or decision-support systems - used in applications that range from aircraft collision avoidance to breast cancer screening - must be designed to account for various sources of uncertainty while carefully balancing multiple objectives. This textbook provides a broad introduction to algorithms for decision making under uncertainty, covering the underlying mathematical problem formulations and the algorithms for solving them.
The book first addresses the problem of reasoning about uncertainty and objectives in simple decisions at a single point in time, and then turns to sequential decision problems in stochastic environments where the outcomes of our actions are uncertain. It goes on to address model uncertainty, when we do not start with a known model and must learn how to act through interaction with the environment; state uncertainty, in which we do not know the current state of the environment due to imperfect perceptual information; and decision contexts involving multiple agents. The book focuses primarily on planning and reinforcement learning, although some of the techniques presented draw on elements of supervised learning and optimization. Algorithms are implemented in the Julia programming language. Figures, examples, and exercises convey the intuition behind the various approaches presented.
This open book is licensed under a Creative Commons License (CC BY-NC-ND). You can download Algorithms for Decision Making ebook for free in PDF format (12.5 MB).
Table of Contents
Chapter 1
Introduction
PART I
Probabilistic Reasoning
Chapter 2
Representation
Chapter 3
Inference
Chapter 4
Parameter Learning
Chapter 5
Structure Learning
Chapter 6
Simple Decisions
PART II
Sequential Problems
Chapter 7
Exact Solution Methods
Chapter 8
Approximate Value Functions
Chapter 9
Online Planning
Chapter 10
Policy Search
Chapter 11
Policy Gradient Estimation
Chapter 12
Policy Gradient Optimization
Chapter 13
Actor-Critic Methods
Chapter 14
Policy Validation
PART III
Model Uncertainty
Chapter 15
Exploration and Exploitation
Chapter 16
Model-Based Methods
Chapter 17
Model-Free Methods
Chapter 18
Imitation Learning
PART IV
State Uncertainty
Chapter 19
Beliefs
Chapter 20
Exact Belief State Planning
Chapter 21
Offline Belief State Planning
Chapter 22
Online Belief State Planning
Chapter 23
Controller Abstractions
PART V
Multiagent Systems
Chapter 24
Multiagent Reasoning
Chapter 25
Sequential Problems
Chapter 26
State Uncertainty
Chapter 27
Collaborative Agents