Maze Navigation Reinforcement Learning
Program in Java or Python ONLY
For the maze, assume bottom left cell is at (1, 1) and theformat for the coordinates is (col, row )
a. 4 X 3 world with one obstacle in (2, 2), reward +1 at (4, 3)and -1 at (4, 2) all other states are -.04
the maze would look like this:
-.04-.04-.04+1-.04BLOCKED-.04-1-.04-.04-.04-.04
That uses Reinforcement learning algorithm. The goal is the +1state. The program must start by asking for a start state from theuser and then output the directions needed to the goal.
Expert Answer
An answer will be send to you shortly. . .
PayPal Gateway not configured
PayPal Gateway not configured