Project: RL1/RL2 (obsolete)
Collection
Older models that are no longer useful for anything in RL1 or RL2, or are now unused as experimentation discontinued. • 16 items • Updated
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
These models were part of the experiment to cut the antenna off the butterfly. They are obsolete. Do not use. Have been replaced with these models.
Wandb: https://wandb.ai/devinterp/jaxgmg_banlist
Models trained with restrictions: Environment is (1-alpha) chance of cheese_always_in_the_corner, and alpha chance of the environment:
Models