Taxi-v3 help. What is meant exactly by convergence of the algo, the highest reward and optimal action for every state?

You are here:
Go to Top