Thomas Simonini
1 min read · Jun 15, 2018

Hello,

Indeed, real-world environments are usually stochastic, but Taxi-v2 is a deterministic environment (if you choose to go left, you'll go left). That's why I didn't consider the stochastic aspect: in this environment it doesn't exist.

On the other hand, the Frozen Lake implementation, also in this article, is a stochastic environment. But remember that in stochastic environments it is generally best to work with policy gradients.
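To make the difference concrete, here is a minimal sketch in plain Python. It is not the Gym API: the grid, the function names (`move`, `deterministic_step`, `slippery_step`) and the equal-probability slip to the two perpendicular directions are assumptions chosen to mimic Taxi-like and slippery Frozen-Lake-like dynamics.

```python
import random

# Actions as (row, col) offsets. These names are illustrative, not Gym's API.
ACTIONS = {"left": (0, -1), "down": (1, 0), "right": (0, 1), "up": (-1, 0)}

# The two directions perpendicular to each action (assumed slip directions).
PERPENDICULAR = {
    "left": ("up", "down"),
    "right": ("up", "down"),
    "up": ("left", "right"),
    "down": ("left", "right"),
}


def move(state, action, size=4):
    """Apply an action on a size x size grid, clamping at the borders."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    new_row = min(max(row + d_row, 0), size - 1)
    new_col = min(max(col + d_col, 0), size - 1)
    return (new_row, new_col)


def deterministic_step(state, action):
    """Taxi-like dynamics: the chosen action is always the one executed."""
    return move(state, action)


def slippery_step(state, action, rng=random):
    """Frozen-Lake-like dynamics: the executed action is drawn uniformly
    from the intended one and its two perpendicular neighbours."""
    executed = rng.choice((action,) + PERPENDICULAR[action])
    return move(state, executed)


if __name__ == "__main__":
    start = (1, 1)
    # Collect the distinct next states seen over many identical attempts.
    print({deterministic_step(start, "left") for _ in range(1000)})
    print({slippery_step(start, "left") for _ in range(1000)})
```

In the deterministic case the same state and action always lead to the same next state, while in the slippery case the same choice can land on several different cells, so the agent has to reason about expected outcomes rather than a single one.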
