PPO Agent Playing LunarLander-v2

This is a trained model of a PPO agent playing LunarLander-v2. Trained from scratch using PyTorch (CleanRL-style implementation).

Results

  • Mean Reward: 97.67 +/- 91.15
Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading

Evaluation results