Maze Navigation Reinforcement Learning
Program in Java or Python ONLY
For the maze, assume the bottom-left cell is at (1, 1) and the format for the coordinates is (col, row).
a. A 4 x 3 world with one obstacle at (2, 2), a reward of +1 at (4, 3), and a reward of -1 at (4, 2). All other states have a reward of -.04.
The maze would look like this:
-.04   -.04      -.04   +1
-.04   BLOCKED   -.04   -1
-.04   -.04      -.04   -.04
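One possible way to encode this world in Python is sketched below, mainly to pin down the (col, row) convention with (1, 1) at the bottom left. The names `BLOCKED`, `TERMINALS`, `STEP_REWARD`, `reward`, and `render` are illustrative choices, not part of the assignment.

```python
# Sketch of the 4 x 3 world; all names here are hypothetical.
COLS, ROWS = 4, 3
BLOCKED = {(2, 2)}                        # obstacle cell, given as (col, row)
TERMINALS = {(4, 3): 1.0, (4, 2): -1.0}   # +1 goal state and -1 state
STEP_REWARD = -0.04                       # reward of every other state


def reward(state):
    """Return the reward attached to a (col, row) state."""
    return TERMINALS.get(state, STEP_REWARD)


def render():
    """Build a text picture of the maze, top row first."""
    lines = []
    for row in range(ROWS, 0, -1):        # row 3 is printed first, row 1 last
        cells = []
        for col in range(1, COLS + 1):
            state = (col, row)
            cells.append("BLOCKED" if state in BLOCKED else f"{reward(state):+.2f}")
        lines.append(" ".join(f"{cell:>7}" for cell in cells))
    return "\n".join(lines)
```

Calling `print(render())` should reproduce the layout of the maze above, with row 3 at the top and row 1 at the bottom.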
Write a program that uses a reinforcement learning algorithm. The goal is the +1 state. The program must start by asking the user for a start state and then output the directions needed to reach the goal.
READ THE PROMPT. STOP SUBMITTING INCORRECT ANSWERS.
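A minimal sketch of one way to approach this, using tabular Q-learning in Python. The prompt does not name a specific algorithm or say whether moves are noisy, so several things here are assumptions: moves are deterministic, bumping into a wall or the obstacle leaves the agent in place, the reward is collected on entering a state, the -1 state ends an episode, and the hyperparameters (`alpha`, `gamma`, `epsilon`, `episodes`) are arbitrary choices. All function names are hypothetical.

```python
import random

COLS, ROWS = 4, 3
BLOCKED = {(2, 2)}
TERMINALS = {(4, 3): 1.0, (4, 2): -1.0}
STEP_REWARD = -0.04
ACTIONS = {"up": (0, 1), "down": (0, -1), "left": (-1, 0), "right": (1, 0)}


def step(state, action):
    """Deterministic move (an assumption); illegal moves leave the state unchanged."""
    dc, dr = ACTIONS[action]
    nxt = (state[0] + dc, state[1] + dr)
    if not (1 <= nxt[0] <= COLS and 1 <= nxt[1] <= ROWS) or nxt in BLOCKED:
        nxt = state
    return nxt, TERMINALS.get(nxt, STEP_REWARD)


def train(episodes=5000, alpha=0.1, gamma=0.9, epsilon=0.2, seed=0):
    """Learn a Q-table with epsilon-greedy tabular Q-learning."""
    rng = random.Random(seed)
    states = [(c, r) for c in range(1, COLS + 1) for r in range(1, ROWS + 1)
              if (c, r) not in BLOCKED]
    q = {(s, a): 0.0 for s in states for a in ACTIONS}
    starts = [s for s in states if s not in TERMINALS]
    for _ in range(episodes):
        state = rng.choice(starts)            # random start so every cell is learned
        while state not in TERMINALS:
            if rng.random() < epsilon:        # explore
                action = rng.choice(list(ACTIONS))
            else:                             # exploit the current estimate
                action = max(ACTIONS, key=lambda a: q[(state, a)])
            nxt, r = step(state, action)
            best_next = 0.0 if nxt in TERMINALS else max(q[(nxt, a)] for a in ACTIONS)
            q[(state, action)] += alpha * (r + gamma * best_next - q[(state, action)])
            state = nxt
    return q


def greedy_path(q, start, max_steps=50):
    """Follow the highest-valued action from start; returns (moves, final_state)."""
    state, moves = start, []
    while state not in TERMINALS and len(moves) < max_steps:
        action = max(ACTIONS, key=lambda a: q[(state, a)])
        moves.append(action)
        state, _ = step(state, action)
    return moves, state


def main():
    """Ask for a start state as 'col,row' and print the learned directions."""
    col, row = (int(v) for v in input("Start state as col,row: ").split(","))
    start = (col, row)
    if not (1 <= col <= COLS and 1 <= row <= ROWS) or start in BLOCKED:
        print("Invalid start state.")
        return
    moves, end = greedy_path(train(), start)
    print(" -> ".join(moves) if moves else "Already at a terminal state.")
    print("Reached", end)
```

Calling `main()` prompts for the start state, trains the table, and prints the directions. If the intended world has noisy moves (for example, the agent sometimes slipping sideways), `step` would need to sample the outcome instead of returning it deterministically.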