Path: blob/main/C3 - Unsupervised Learning, Recommenders, Reinforcement Learning/week3/optional-labs/State-action value function example.ipynb
7540 views
Kernel: Python 3
State Action Value Function Example
In this Jupyter notebook, you can modify the mars rover example to see how the values of Q(s,a) will change depending on the rewards and discount factor changing.
In [1]:
In [2]:
In [5]:
In [6]:
Out[6]:
In [ ]: