ianshank/mousedroid-dual-stream-rssm

Trained weights for MouseDroid autonomous navigation system.

Components

Component File Description
RSSM World Model rssm/final.pt Recurrent State-Space Model
MCTS Policy Init mcts/policy_init.npz Warm-started PolicyMLP
BDI Belief bdi/belief.npz Belief encoder weights
BDI Desire bdi/desire.npz Desire encoder weights
BDI Intention bdi/intention.npz Intention predictor weights
BDI Affect bdi/affect.npz Affect estimator weights
Constitutional RL policy.npz, value.npz PPO policy + value networks

Training

Trained on Jetson Orin Nano (8 GB) using synthetic observation sequences.

Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading