How many epochs has this been trained for?
What I want to say is that the model I am training with the official agent does not reach the baseline here at all. What are the reasons?
· Sign up or log in to comment