Iterative Policy Evaluation Algorithm in Python and OpenAI Gym - Reinforcement Learning Tutorial