📚 node [[bellman_equation]]

Bellman equation

Go back to the [[AI Glossary]]

#rl

In reinforcement learning, the following identity satisfied by the optimal Q-function:

The Q-function in reinforcement learning

Reinforcement learning algorithms apply this identity to create Q-learning via the following update rule:

The Bellman equation

Beyond reinforcement learning, the Bellman equation has applications to dynamic programming. See the Wikipedia entry for Bellman Equation.

📖 stoas
⥱ context