GridWorld
Enter a number between 5 and 9:
Generate Square
Discount (γ):
Step Reward:
Goal Reward:
Click on a cell to set up the start grid as
green
or the end grid as
red
. Next, you may optionally select up to
3
obstacles (
grey
).
Random Policy
Optimal Policy
Value Matrix
Policy Matrix