hakancapuk/rl_learning
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
Branch: main
Path: rl_learning/ppo-LunarLander-v2 (147 kB)
1 contributor
History: 1 commit
Commit 383f2ec (verified) by hakancapuk, 11 months ago: this is the first test model for lunarlender (10000 steps)
Files (each last changed 11 months ago by the commit "this is the first test model for lunarlender (10000 steps)"):

_stable_baselines3_version: 7 Bytes, Safe
data: 14 kB
policy.optimizer.pth: 88.4 kB, xet, pickle. Detected Pickle imports (3): "torch.FloatStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
policy.pth: 43.8 kB, xet, pickle. Detected Pickle imports (3): "collections.OrderedDict", "torch.FloatStorage", "torch._utils._rebuild_tensor_v2"
pytorch_variables.pth: 864 Bytes, xet, pickle, Safe. Pickle imports: No problematic imports detected
system_info.txt: 263 Bytes, Safe
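
The "Detected Pickle imports" entries above name the globals a pickle stream asks Python to import when it is loaded. The following is a minimal sketch of how such a list could be produced with the standard-library `pickletools` module; it is not the Hub's actual scanner. The function name `list_pickle_imports` is hypothetical, the `STACK_GLOBAL` handling is a heuristic, and the function expects raw pickle bytes (a `.pth` file saved as a zip archive would, as an assumption, need its inner pickle extracted first).

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return 'module.name' strings referenced by a pickle stream.

    Hypothetical helper: walks the opcodes without executing the pickle.
    """
    imports = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Older protocols: arg is "module name", separated by one space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as strings earlier.
            # Heuristic: take the two most recent string arguments.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


if __name__ == "__main__":
    # Demo on an in-memory pickle rather than a downloaded model file.
    payload = pickle.dumps(collections.OrderedDict(a=1), protocol=2)
    print(sorted(list_pickle_imports(payload)))
```

Because the sketch only reads opcodes and never calls `pickle.load`, it can be run on untrusted files; the heuristic may miss globals whose names are fetched from the memo instead of pushed directly.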