Hi!. You better use policy gradient for your… | by Thomas Simonini

Could u also explain how the luna lander environment??
2
1
Heba Mazhar
Thomas Simonini
·Follow
1 min read·
May 30, 2018
--
Hi!
You better use policy gradient for your situation use my implement with cartpole and change the environment with lunar lander it will work. (You can also add a mini batch function).
https://github.com/simoninithomas/Deep_reinforcement_learning_Course/tree/master/Policy%20Gradients/Cartpole
--
--
Written by Thomas Simonini5.3K Followers
·221 Following
Developer Advocate 🥑 at Hugging Face 🤗| Founder Deep Reinforcement Learning class 📚 https://bit.ly/3QADz2Q |
No responses yet
Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams