Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

hakancapuk
/
rl_learning

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
Model card Files Files and versions
xet
Community
rl_learning / ppo-LunarLander-v2
147 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
hakancapuk's picture
hakancapuk
this is the first test model for lunarlender (10000 steps)
383f2ec verified 11 months ago
  • _stable_baselines3_version
    7 Bytes
    this is the first test model for lunarlender (10000 steps) 11 months ago
  • data
    14 kB
    this is the first test model for lunarlender (10000 steps) 11 months ago
  • policy.optimizer.pth

    Detected Pickle imports (3)

    • "torch.FloatStorage",
    • "collections.OrderedDict",
    • "torch._utils._rebuild_tensor_v2"

    What is a pickle import?

    88.4 kB
    xet
    this is the first test model for lunarlender (10000 steps) 11 months ago
  • policy.pth

    Detected Pickle imports (3)

    • "collections.OrderedDict",
    • "torch.FloatStorage",
    • "torch._utils._rebuild_tensor_v2"

    What is a pickle import?

    43.8 kB
    xet
    this is the first test model for lunarlender (10000 steps) 11 months ago
  • pytorch_variables.pth

    Pickle imports

    • No problematic imports detected

    What is a pickle import?

    864 Bytes
    xet
    this is the first test model for lunarlender (10000 steps) 11 months ago
  • system_info.txt
    263 Bytes
    this is the first test model for lunarlender (10000 steps) 11 months ago