deep-rl-course / ppo-LunarLander-v2
147 kB
sigma-bit-dot's picture
train ppo model with 1,000,000 time steps
6f66be1