TempControl PPO - task1
Trained with Stable-Baselines3 PPO on TempControl-OpenEnv.
Load & Run
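from stable_baselines3 import PPO
# task1_* is a placeholder: substitute the actual task1 module name under tasks/.
from tasks.task1_* import make_env

model = PPO.load("ppo_task1")  # reads ppo_task1.zip from the working directory
env = make_env()
obs, _ = env.reset()  # reset returns (observation, info)
action, _ = model.predict(obs)  # predict returns (action, recurrent state)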
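The snippet above predicts a single action. To run a full episode, a loop along the following lines may help. This is a minimal sketch, not part of the original repository. It assumes the environment follows the Gymnasium API, where `step` returns `(obs, reward, terminated, truncated, info)`. The helper name `run_episode` and its `max_steps` parameter are hypothetical.

```python
def run_episode(env, model, max_steps=1000, deterministic=True):
    """Roll out one episode and return (total_reward, steps_taken).

    Assumption: env is Gymnasium-style, i.e. reset() -> (obs, info) and
    step(action) -> (obs, reward, terminated, truncated, info).
    """
    obs, _ = env.reset()
    total_reward = 0.0
    steps = 0
    for _ in range(max_steps):
        # deterministic=True requests the policy's mode action instead of a sample.
        action, _ = model.predict(obs, deterministic=deterministic)
        obs, reward, terminated, truncated, _ = env.step(action)
        total_reward += float(reward)
        steps += 1
        # Stop on a terminal state or on a time-limit truncation.
        if terminated or truncated:
            break
    return total_reward, steps
```

With `model` and `env` created as shown earlier, `total_reward, steps = run_episode(env, model)` returns the episode return and its length. The `max_steps` cap guards against environments that never signal termination.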