Error in distance states

So the way I was implementing the learning algorithm, I had two distance states and the closer the agent to an obstacle, larger the distance number was. This what is stated in the paper I am implementing.

The problem is: it does not work. When one converts the distance status into a state number, closer obstacles returns smaller numbers and therefore the agent gets closer to the obstacle. I changed my distance states. Here is an initial test.

working iteration

Does not work all the time which is strange though. I will make further tests.

Leave a comment