This is a trained model of a PPO agent playing LunarLander-v2. Trained from scratch using PyTorch (CleanRL-style implementation).
-