Thomas Simonini
1 min readJun 7, 2018

--

Simple reason: this environment is stochastic, it means that the action you take will not all the time push you in the defined state.

“However, the ice is slippery, so you won’t always move in the direction you intend.”

--

--

Thomas Simonini

Developer Advocate 🥑 at Hugging Face 🤗| Founder Deep Reinforcement Learning class 📚 https://bit.ly/3QADz2Q |