...
See the slides for more examples.
Getting a Handle
NB: Any specific action has a probability transition matrix of it's own,
Mathinline | ||||
---|---|---|---|---|
|
The Optimal State Value Function is the maximum state value function over all policies, i.e.
Mathinline | ||||
---|---|---|---|---|
|
...
See the slides for more examples.
NB: Any specific action has a probability transition matrix of it's own,
Mathinline | ||||
---|---|---|---|---|
|
The Optimal State Value Function is the maximum state value function over all policies, i.e.
Mathinline | ||||
---|---|---|---|---|
|